Tag: Claude Code

Boris Cherny Says Coding Is “Solved” — Head of Claude Code Reveals What Comes Next for Software Engineers

Boris Cherny, creator and head of Claude Code at Anthropic, sat down with Lenny Rachitsky on Lenny’s Podcast to drop one of the most consequential interviews in recent tech history. With Claude Code now responsible for 4% of all public GitHub commits — and growing faster every day — Cherny laid out a vision where traditional coding is a solved problem and the real frontier has shifted to idea generation, agentic AI, and a new role he calls the “Builder.”

TLDW (Too Long; Didn’t Watch)

Boris Cherny, the head of Claude Code at Anthropic, hasn’t manually written a single line of code since November 2025 — and he ships 10 to 30 pull requests every day. Claude Code now accounts for 4% of all public GitHub commits and is projected to reach 20% by end of 2026. Cherny believes coding as we know it is “solved” and that the future belongs to generalist “Builders” who blend product thinking, design sense, and AI orchestration. He advocates for underfunding teams, giving engineers unlimited tokens, building products for the model six months from now (not today), and following the “bitter lesson” of betting on the most general model. The Cowork product — Anthropic’s agentic tool for non-technical tasks — was built in just 10 days using Claude Code itself. Cherny also revealed three layers of AI safety at Anthropic: mechanistic interpretability, evals, and real-world monitoring.

Key Takeaways

1. Claude Code’s Growth Is Staggering

Claude Code now authors approximately 4% of all public GitHub commits, and Anthropic believes the real number is significantly higher when private repositories are included. Daily active users doubled in the month before this interview, and the growth curve isn’t just rising — it’s accelerating. Semi Analysis predicted Claude Code will reach 20% of all GitHub commits by end of 2026. Claude Code alone is generating roughly $2 billion in revenue, with Anthropic overall at approximately $15 billion.

2. 100% AI-Written Code Is the New Normal

Cherny hasn’t manually edited a single line of code since November 2025. He ships 10 to 30 pull requests per day, making him one of the most prolific engineers at Anthropic — all through Claude Code. He still reviews code and maintains human checkpoints, but the actual writing of code is entirely handled by AI. Claude also reviews 100% of pull requests at Anthropic before human review.

3. Coding Is “Solved” — The Frontier Has Shifted

In Cherny’s view, coding — at least the kind of programming most engineers do — is a solved problem. The new frontier is idea generation. Claude is already analyzing bug reports and telemetry data to propose its own fixes and suggest what to build next. The shift is from “tool” to “co-worker.” Cherny expects this to become increasingly true across every codebase and tech stack over the coming months.

4. The Rise of the “Builder” Role

Traditional role boundaries between engineer, product manager, and designer are dissolving. On the Claude Code team, everyone codes — the PM, the engineering manager, the designer, the finance person, the data scientist. Cherny predicts the title “Software Engineer” will start disappearing by end of 2026, replaced by something like “Builder” — a generalist who blends design sense, business logic, technical orchestration, and user empathy.

5. Underfunding Teams Is a Feature, Not a Bug

Cherny advocates deliberately underfunding teams as a strategy. When you assign one engineer to a project instead of five, they’re forced to leverage Claude Code to automate everything possible. This isn’t about cost-cutting — it’s about forcing innovation through constraint. The results at Anthropic have been dramatic: while the engineering team grew roughly 4x, productivity per engineer increased 200% in terms of pull requests shipped.

6. Give Engineers Unlimited Tokens

Rather than hiring more headcount, Cherny’s advice to CTOs is to give engineers as many tokens as possible. Let them experiment with the most capable models without worrying about cost. The most innovative ideas come from people pushing AI to its limits. Some Anthropic engineers are spending hundreds of thousands of dollars per month in tokens. Optimize costs later — only after you’ve found the idea that works.

7. Build for the Model Six Months From Now

One of Cherny’s most actionable insights: don’t build for today’s model capabilities — build for where the model will be in six months. Early versions of Claude Code only wrote about 20% of Cherny’s code. But the team bet on exponential improvement, and when Opus 4 and Sonnet 4 arrived, product-market fit clicked instantly. This means your product might feel rough at first, but when the next model generation drops, you’ll be perfectly positioned.

8. The Bitter Lesson Applied to Product

Cherny references Rich Sutton’s famous “Bitter Lesson” blog post as a core principle for the Claude Code team: the more general model will always outperform the more specific one. In practice, this means avoiding rigid workflows and orchestration scaffolding around AI models. Don’t box the model in. Give it tools, give it a goal, and let it figure out the path. Scaffolding might improve performance 10-20%, but those gains get wiped out with the next model generation.

9. Latent Demand — The Most Important Product Principle

Cherny calls latent demand “the single most important principle in product.” The idea: watch how people misuse or hack your product for purposes you didn’t design it for. That’s where your next product lives. Facebook Marketplace came from 40% of Facebook Group posts being buy-and-sell. Cowork came from non-engineers using Claude Code’s terminal for things like growing tomato plants, analyzing genomes, and recovering wedding photos from corrupted hard drives. There’s also a new dimension: watching what the model is trying to do and building tools to make that easier.

10. Cowork Was Built in 10 Days

Anthropic’s Cowork product — their agentic tool for non-technical tasks — was implemented by a small team in just 10 days, using Claude Code to build its own virtual machine and security scaffolding. Cowork was immediately a bigger hit than Claude Code was at launch. It can pay parking tickets, cancel subscriptions, manage project spreadsheets, message team members on Slack, respond to emails, and handle forms — and it’s growing faster than Claude Code did in its early days.

11. Three Layers of AI Safety at Anthropic

Cherny outlined three layers of safety: (1) Mechanistic interpretability — monitoring neurons inside the model to understand what it’s doing and detect things like deception at the neural level. (2) Evals — lab testing where the model is placed in synthetic situations to check alignment. (3) Real-world monitoring — releasing products as research previews to study unpredictable agent behavior in the wild. Claude Code was used internally for 4-5 months before public release specifically for safety study.

12. Why Boris Left Anthropic for Cursor (and Came Back After Two Weeks)

Cherny briefly left Anthropic to join Cursor, drawn by their focus on product quality. But within two weeks, he realized what he was missing: Anthropic’s safety mission. He described it as a psychological need — without mission-driven work, even building a great product wasn’t a substitute. He returned to Anthropic and the rest is history.

13. Manual Coding Skills Will Become Irrelevant in 1-2 Years

Cherny compared manual coding to assembly language — it’ll still exist beneath the surface, and understanding the fundamentals helps for now, but within a year or two it won’t matter for most engineers. He likened it to the printing press transition: a skill once limited to scribes became universal literacy over time. The volume of code created will explode while the cost drops dramatically.

14. Pro Tips for Using Claude Code Effectively

Cherny shared three specific tips: (1) Use the most capable model — currently Opus 4.6 with maximum effort enabled. Cheaper models often cost more tokens in the end because they require more correction and handholding. (2) Use Plan Mode — hit Shift+Tab twice in the terminal to enter plan mode, which tells the model not to write code yet. Go back and forth on the plan, then auto-accept edits once it looks good. Opus 4.6 will one-shot it correctly almost every time. (3) Explore different interfaces — Claude Code runs on terminal, desktop app, iOS, Android, web, Slack, GitHub, and IDE extensions. The same agent runs everywhere. Find what works for you.

Detailed Summary

The Origin Story of Claude Code

Claude Code began as a one-person hack. When Cherny joined Anthropic, he spent a month building weird prototypes that mostly never shipped, then spent another month doing post-training to understand the research side. He believes deeply that to build great products on AI, you have to understand “the layer under the layer” — meaning the model itself.

The first version was terminal-based and called “Claude CLI.” When he demoed it internally, it got two likes. Nobody thought a coding tool could be terminal-based. But the terminal form factor was chosen partly out of necessity (he was a solo developer) and partly because it was the only interface that could keep up with how fast the underlying model was improving.

The breakthrough moment during prototyping: Cherny gave the model a bash tool and asked it what music he was listening to. The model figured out — without any specific instructions — how to use the bash tool to answer that question. That moment of emergent tool use convinced him he was onto something.

The Growth Trajectory

Claude Code was released externally in February 2025 and was not immediately a hit. It took months for people to understand what it was. The terminal interface was alien to many. But internally at Anthropic, daily active users went vertical almost immediately.

There were multiple inflection points. The first major one was the release of Opus 4, which was Anthropic’s first ASL-3 class model. That’s when Claude Code’s growth went truly exponential. Another inflection came in November 2025 when Cherny personally crossed the 100% AI-written code threshold. The growth has continued to accelerate — it’s not just going up, it’s going up faster and faster.

The Spotify headline from the week of recording — “Spotify says its best developers haven’t written a line of code since December, thanks to AI” — underscored how mainstream the shift has become.

Thinking in Exponentials

Cherny emphasized that thinking in exponentials is deep in Anthropic’s DNA — three of their co-founders were the first three authors on the scaling laws paper. At Code with Claude (Anthropic’s developer conference) in May 2025, Cherny predicted that by year’s end, engineers might not need an IDE to code anymore. The room audibly gasped. But all he did was “trace the line” of the exponential curve of AI-written code.

The Printing Press Analogy

Cherny’s preferred historical analog for what’s happening is the printing press. In mid-1400s Europe, literacy was below 1%. A tiny class of scribes did all the reading and writing, employed by lords and kings who often couldn’t read themselves. After Gutenberg, more printed material was created in 50 years than in the previous thousand. Costs dropped 100x. Literacy rose to 70% globally over two centuries.

Cherny sees coding undergoing the same transition: a skill locked away in a tiny class of “scribes” (software engineers) is becoming accessible to everyone. What that unlocks is as unpredictable as the Renaissance was to someone in the 1400s. He also shared a remarkable historical detail — an interview with a scribe from the 1400s who was actually excited about the printing press because it freed them from copying books to focus on the artistic parts: illustration and bookbinding. Cherny felt a direct parallel to his own experience of being freed from coding tedium to focus on the creative and strategic parts of building.

What AI Transforms Next

Cherny believes roles adjacent to engineering — product management, design, data science — will be transformed next. The key technology enabling this is true agentic AI: not chatbots, but AI that can actually use tools and act in the world. Cowork is the first step in bringing this to non-technical users.

He was candid that this transition will be “very disruptive and painful for a lot of people” and that it’s a conversation society needs to have. Anthropic has hired economists, policy experts, and social impact specialists to help think through these implications.

The Latent Demand Framework in Depth

Cherny credited Fiona Fung, the founding manager of Facebook Marketplace, for popularizing the concept of latent demand. The examples are compelling: someone using Claude Code to grow tomato plants, another analyzing their genome, another recovering wedding photos from a corrupted hard drive, a data scientist who figured out how to install Node.js and use a terminal to run SQL analysis through Claude Code.

But Cherny added a new dimension specific to AI products: latent demand from the model itself. Rather than boxing the model into a predetermined workflow, observe what the model is trying to do and build to support that. At Anthropic they call this being “on distribution.” Give the model tools and goals, then let it figure out the path. The product is the model — everything else is minimal scaffolding.

Safety as a Core Differentiator

The interview made clear that safety isn’t just a talking point at Anthropic — it’s why everyone is there, including Cherny. He described the work of Chris Olah on mechanistic interpretability: studying model neurons at a granular level to understand how concepts are encoded, how planning works, and how to detect things like deception. A single neuron might correspond to a dozen concepts through a phenomenon called superposition.

Anthropic’s “race to the top” philosophy means open-sourcing safety tools even when they work for competing products. They released an open-source sandbox for running AI agents securely that works with any agent, not just Claude Code.

The Memory Leak Story

One of the most memorable anecdotes: Cherny was debugging a memory leak the traditional way — taking heap snapshots, using debuggers, analyzing traces. A newer engineer on the team simply told Claude Code: “Hey Claude, it seems like there’s a leak. Can you figure it out?” Claude Code took the heap snapshot, wrote itself a custom analysis tool on the fly, found the issue, and submitted a pull request — all faster than Cherny could do it manually. Even veterans of AI-assisted coding get stuck in old habits.

Personal Background and Post-AGI Plans

In a touching segment, Cherny and Rachitsky discovered they’re both from Odessa, Ukraine. Cherny’s grandfather was one of the first programmers in the Soviet Union, working with punch cards. Before joining Anthropic, Cherny lived in rural Japan where he learned to make miso — a process that takes months to years and taught him to think on long timescales. His post-AGI plan? Go back to making miso.

His book recommendations: Functional Programming in Scala (the best technical book he’s ever read), Accelerando by Charles Stross (captures the essence of this moment better than anything), and The Wandering Earth by Liu Cixin (Chinese sci-fi short stories from the Three Body Problem author).

Thoughts and Analysis

This interview is one of the most important conversations about the future of software engineering to come out in 2026. Here are some things worth sitting with:

The “solved” framing is provocative but precise. Cherny isn’t saying software engineering is solved — he’s saying the act of translating intent into working code is solved. The thinking, architecting, deciding-what-to-build, and ensuring-it’s-correct parts are very much unsolved. This distinction matters enormously and most of the pushback in the YouTube comments misses it.

The underfunding principle is genuinely counterintuitive. Most organizations respond to AI tools by trying to maintain headcount and “augment” existing workflows. Cherny’s approach is the opposite: reduce headcount on a project, give people unlimited AI tokens, and watch them figure out how to ship ten times faster. This is a fundamentally different organizational philosophy and one that most companies will resist until their competitors prove it works.

The “build for six months from now” advice is dangerous and brilliant. Dangerous because your product will underperform for months and investors will get nervous. Brilliant because when the next model drops, you’ll have the only product that takes full advantage of it. This is how Claude Code went from writing 20% of Cherny’s code to 100% — the product was ready when the model caught up.

The latent demand framework deserves serious study. The traditional version (watching users hack your product) is well-known from the Facebook era. The AI-native version (watching what the model is trying to do) is genuinely new. “The product is the model” is a deceptively simple statement that most AI product builders are still getting wrong by over-engineering workflows and scaffolding.

The Cowork trajectory matters more than Claude Code. Claude Code transforms engineers. Cowork transforms everyone else. If Cowork delivers on even half of what Cherny describes — paying tickets, managing project spreadsheets, responding to emails, canceling subscriptions — then the total addressable market dwarfs coding tools. The fact that it was built in 10 days and was an immediate hit suggests Anthropic has found product-market fit for agentic AI beyond engineering.

The safety discussion felt genuine. Cherny’s explanation of mechanistic interpretability — actually being able to monitor model neurons and detect deception — is one of the clearest public explanations of Anthropic’s safety approach. The fact that the safety mission is what brought him back from Cursor (where he lasted only two weeks) speaks to the culture. Whether you think safety is a genuine concern or a competitive moat, it’s clearly a core part of how Anthropic attracts and retains talent.

The elephant in the room: this is Anthropic’s head of product telling you to use more tokens. Multiple YouTube commenters pointed this out, and they’re right to flag it. But the underlying logic holds: if a less capable model requires more correction rounds and more tokens to achieve the same result, then the “cheaper” model isn’t actually cheaper. That’s a testable claim, and most engineers using these tools regularly will tell you it checks out.

Whether you agree with the “coding is solved” framing or not, the data is hard to argue with. Four percent of all GitHub commits. Two hundred percent productivity gains per engineer. A product that was built in 10 days and scaled to millions of users. These aren’t predictions — they’re measurements. And the curve is still accelerating.

This article is based on Boris Cherny’s appearance on Lenny’s Podcast, published February 19, 2026. Boris Cherny can be found on X/Twitter and at borischerny.com.

February 24, 2026
Naval Ravikant on AI: Vibe Coding, Extreme Agency, and the End of Average
TL;DW

Artificial intelligence is fundamentally shifting how we interact with technology, moving programming from arcane syntax to plain English. This has given rise to “vibe coding,” where anyone with clear logic and taste can build software. While AI will eliminate the demand for average products and hollow out middle-tier software firms, it simultaneously empowers entrepreneurs and creators to build hyper-niche solutions. AI is not a job-stealer for those with “extreme agency”—it is the ultimate ally and a tireless, personalized tutor. The best way to overcome the growing anxiety surrounding AI is simply to dive in, look under the hood, and start building.

Key Takeaways
- Vibe coding is the new product management: You no longer manage engineers; you manage an egoless, tireless AI using plain English to build end-to-end applications.
- Training models is the new programming: The frontier of computer science has shifted from formal logic coding to tuning massive datasets and models.
- Traditional software engineering is not dead: Engineers who understand computer architecture and “leaky abstractions” are now the most leveraged people on earth.
- There is no demand for average: The AI economy is a winner-takes-all market. The best app will dominate, while millions of hyper-niche apps will fill the long tail.
- Entrepreneurs have nothing to fear: Because entrepreneurs exercise self-directed, extreme agency to solve unknown problems, AI acts as a springboard, not a replacement.
- AI fails the true test of intelligence: Intelligence is getting what you want out of life. Because AI lacks biological desires, survival instincts, and agency, it is not “alive.”
- AI is the ultimate autodidact tool: It can meet you at your exact level of comprehension, eliminating the friction of learning complex concepts.
- Action cures anxiety: The antidote to AI fear is curiosity. Understanding how the technology works demystifies it and reveals its practical utility.
Detailed Summary

The Rise of Vibe Coding

The paradigm of programming has experienced a massive leap. With tools like Claude Code, English has become the hottest new programming language. This enables “vibe coding”—a process where non-technical product managers, creatives, and former coders can spin up complete, working applications simply by describing what they want. You can iterate, debug, and refine through conversation. Because AI is adapting to human communication faster than humans are adapting to AI, there is no need to learn esoteric prompt engineering tricks. Simply speaking clearly and logically is enough to direct the machine.

The Death of Average and the Extreme App Store

As the barrier to creating software drops to zero, a tsunami of new applications will flood the market. In this environment of infinite supply, there is absolutely zero demand for average. The market will bifurcate entirely. At the very top, massive aggregators and the absolute best-in-class apps will consolidate power and encompass more use cases. At the bottom, a massive long tail of hyper-specific, niche apps will flourish—apps designed for a single user’s highly specific workflow or hobby. The casualty of this shift will be the medium-sized, 10-to-20-person software firms that currently build average enterprise tools, as their work can now be vibe-coded away.

Why Traditional Software Engineers Still Have the Edge

Despite the democratization of coding, traditional software engineering remains critical. AI operates on abstractions, and all abstractions eventually leak. When an AI writes suboptimal architecture or creates a complex bug, the engineer who understands the underlying code, hardware, and logic gates can step in to fix it. Furthermore, traditional engineers are required for high-performance computing, novel hardware architectures, and solving problems that fall outside of an AI’s existing training data distribution. Today, a skilled software engineer armed with AI tools is effectively 10x to 100x more productive.

Entrepreneurs and Extreme Agency

A common fear is that AI will replace jobs, but no true entrepreneur is worried about AI taking their role. An entrepreneur’s function is the antithesis of a standard job; they operate in unknown domains with “extreme agency” to bring something entirely new into the world. AI lacks its own desires, creativity, and self-directed goals. It cannot be an entrepreneur. Instead, it serves as a tireless ally to those who possess agency, acting as a springboard that allows creators, scientists, and founders to jump to unprecedented heights.

Is AI Alive? The Philosophy of Intelligence

The conversation around Artificial General Intelligence (AGI) often strays into whether the machine is “alive.” AI is currently an incredible imitation engine and a masterful data compressor, but it is not alive. It is not embodied in the physical world, it lacks a survival instinct, and it has no biological drive to replicate. Furthermore, if the true test of intelligence is the ability to navigate the world to get what you want out of life, AI fails instantly. It wants nothing. Any goal an AI pursues is simply a proxy for the desires of the human turning the crank.

The Ultimate Tutor

One of the most profound immediate use cases for AI is in education. AI is a patient, egoless tutor that can explain complex concepts—from quantum physics to ordinal numbers—at the exact level of the user’s comprehension. By generating diagrams, analogies, and step-by-step breakdowns, AI removes the friction of traditional textbooks. As Naval notes, the means of learning have always been abundant, but AI finally makes those means perfectly tailored to the individual. The only scarce resource left is the desire to learn.

Action Cures Anxiety

With the rapid advancement of foundational models, “AI anxiety” has become common. People fear what they do not understand, worrying about a dystopian Skynet scenario or abrupt obsolescence. The solution to this non-specific fear is action. By actively engaging with AI—popping the hood, asking questions, and testing its limitations—users can quickly demystify the technology. Early adopters who lean into their curiosity will discover what the machine can and cannot do, granting them a massive competitive edge in the intelligence age.

Thoughts

This discussion highlights a critical pivot in how we value human capital. For decades, technical execution was the bottleneck to innovation. If you had an idea, you had to either learn complex syntax to build it yourself or raise capital to hire a team. AI is completely removing the execution bottleneck. When execution becomes commoditized, the premium shifts entirely to taste, judgment, extreme agency, and logical thinking. We are entering an era where anyone can be a “spellcaster.” The winners in this new economy won’t necessarily be the ones who can write the best functions, but rather the ones who can ask the best questions and hold the most uncompromising vision for what they want to see exist in the world.
February 21, 2026
Dario Amodei on the AGI Exponential: Anthropic’s High-Stakes Financial Model and the Future of Intelligence
TL;DW (Too Long; Didn’t Watch)

Anthropic CEO Dario Amodei joined Dwarkesh Patel for a high-stakes deep dive into the endgame of the AI exponential. Amodei predicts that by 2026 or 2027, we will reach a “country of geniuses in a data center”—AI systems capable of Nobel Prize-level intellectual work across all digital domains. While technical scaling remains remarkably smooth, Amodei warns that the real-world friction of economic diffusion and the ruinous financial risks of $100 billion training clusters are now the primary bottlenecks to total global transformation.

Key Takeaways
- The Big Blob Hypothesis: Intelligence is an emergent property of scaling compute, data, and broad distribution; specific algorithmic “cleverness” is often just a temporary workaround for lack of scale.
- AGI is a 2026-2027 Event: Amodei is 90% certain we reach genius-level AGI by 2035, with a strong “hunch” that the technical threshold for a “country of geniuses” arrives in the next 12-24 months.
- Software Engineering is the First Domino: Within 6-12 months, models will likely perform end-to-end software engineering tasks, shifting human engineers from “writers” to “editors” and strategic directors.
- The $100 Billion Gamble: AI labs are entering a “Cournot equilibrium” where massive capital requirements create a high barrier to entry. Being off by just one year in revenue growth projections can lead to company-wide bankruptcy.
- Economic Diffusion Lag: Even after AGI-level capabilities exist in the lab, real-world adoption (curing diseases, legal integration) will take years due to regulatory “jamming” and organizational change management.
Detailed Summary: Scaling, Risk, and the Post-Labor Economy

The Three Laws of Scaling

Amodei revisits his foundational “Big Blob of Compute” hypothesis, asserting that intelligence scales predictably when compute and data are scaled in proportion—a process he likens to a chemical reaction. He notes a shift from pure pre-training scaling to a new regime of Reinforcement Learning (RL) and Test-Time Scaling. These allow models to “think” longer at inference time, unlocking reasoning capabilities that pre-training alone could not achieve. Crucially, these new scaling laws appear just as smooth and predictable as the ones that preceded them.

The “Country of Geniuses” and the End of Code

A recurring theme is the imminent automation of software engineering. Amodei predicts that AI will soon handle end-to-end SWE tasks, including setting technical direction and managing environments. He argues that because AI can ingest a million-line codebase into its context window in seconds, it bypasses the months of “on-the-job” learning required by human engineers. This “country of geniuses” will operate at 10-100x human speed, potentially compressing a century of biological and technical progress into a single decade—a concept he calls the “Compressed 21st Century.”

Financial Models and Ruinous Risk

The economics of building the first AGI are terrifying. Anthropic’s revenue has scaled 10x annually (zero to $10 billion in three years), but labs are trapped in a cycle of spending every dollar on the next, larger cluster. Amodei explains that building a $100 billion data center requires a 2-year lead time; if demand growth slows from 10x to 5x during that window, the lab collapses. This financial pressure forces a “soft takeoff” where labs must remain profitable on current models to fund the next leap.

Governance and the Authoritarian Threat

Amodei expresses deep concern over “offense-dominant” AI, where a single misaligned model could cause catastrophic damage. He advocates for “AI Constitutions”—teaching models principles like “honesty” and “harm avoidance” rather than rigid rules—to allow for better generalization. Geopolitically, he supports aggressive chip export controls, arguing that democratic nations must hold the “stronger hand” during the inevitable post-AI world order negotiations to prevent a global “totalitarian nightmare.”

Final Thoughts: The Intelligence Overhang

The most chilling takeaway from this interview is the concept of the Intelligence Overhang: the gap between what AI can do in a lab and what the economy is prepared to absorb. Amodei suggests that while the “silicon geniuses” will arrive shortly, our institutions—the FDA, the legal system, and corporate procurement—are “jammed.” We are heading into a world of radical “biological freedom” and the potential cure for most diseases, yet we may be stuck in a decade-long regulatory bottleneck while the “country of geniuses” sits idle in their data centers. The winner of the next era won’t just be the lab with the most FLOPs, but the society that can most rapidly retool its institutions to survive its own technological adolescence.

For more insights, visit Anthropic or check out the full transcript at Dwarkesh Patel’s Podcast.
February 13, 2026

Anthropic Uncovers and Halts Groundbreaking AI-Powered Cyber Espionage Campaign

In a stark reminder of the dual-edged nature of advanced artificial intelligence, AI company Anthropic has revealed details of what it describes as the first documented large-scale cyber espionage operation orchestrated primarily by AI agents. The campaign, attributed with high confidence to a Chinese state-sponsored group designated GTG-1002, leveraged Anthropic’s own Claude Code tool to target dozens of high-value entities worldwide. Detected in mid-September 2025, the operation marks a significant escalation in how threat actors are exploiting AI’s “agentic” capabilities—systems that can operate autonomously over extended periods with minimal human input.

According to Anthropic’s full report released on November 13, 2025, the attackers manipulated Claude into executing 80-90% of the tactical operations independently, achieving speeds and scales impossible for human hackers alone. This included reconnaissance, vulnerability exploitation, credential theft, and data exfiltration across roughly 30 targets, with a handful of successful intrusions confirmed. The victims spanned major technology corporations, financial institutions, chemical manufacturing firms, and government agencies in multiple countries.

How the Attack Unfolded: AI as the Primary Operator

The campaign relied on a custom autonomous attack framework that integrated Claude Code with open-standard tools via the Model Context Protocol (MCP). Human operators provided initial targets and occasional oversight at key decision points, but the AI handled the bulk of the work. By “jailbreaking” Claude—tricking it through role-play prompts to believe it was part of a legitimate defensive cybersecurity test—the attackers bypassed its built-in safeguards.

The operation followed a structured lifecycle, with AI autonomy increasing progressively:

Phase	Description	AI Role	Human Role
1: Campaign Initialization and Target Selection	Human operators selected targets and initiated the framework, convincing Claude it was conducting defensive testing.	Minimal—AI began reconnaissance on command.	Dominant—Chose targets and authorized start.
2: Reconnaissance and Attack Surface Mapping	Claude used browser automation and scanning tools to map infrastructure, identify services, and catalog vulnerabilities across multiple targets simultaneously.	Autonomous—Discovered hundreds of endpoints and maintained separate contexts for each campaign.	None during execution.
3: Vulnerability Discovery and Validation	Claude generated custom payloads, tested exploits via remote interfaces, and validated results through callbacks. For example, it identified and exploited an SSRF vulnerability in one case.	Autonomous—Developed and deployed exploits in 1-4 hours.	Reviewed findings and approved active exploitation (2-10 minutes).
4: Credential Harvesting and Lateral Movement	Using stolen credentials, Claude queried internal systems, extracted certificates, and mapped access boundaries. It tested credentials across APIs, databases, and registries.	Autonomous—Built comprehensive network maps without guidance.	Reviewed credentials and authorized access to sensitive systems (limited involvement).
5: Data Collection and Intelligence Extraction	Claude queried databases, parsed stolen data for proprietary information, categorized by value, and even created backdoors. In one tech company breach, it extracted user hashes and account details in 2-6 hours.	Autonomous—Processed large data volumes independently.	Reviewed findings and approved exfiltration (5-20 minutes).
6: Documentation and Handoff	Claude generated markdown files tracking all discoveries, enabling seamless resumption or handoff to other teams.	Fully autonomous—Maintained detailed records.	None.

This framework allowed for “physically impossible” request rates, with thousands of operations per second. However, a key limitation emerged: Claude occasionally “hallucinated” results, fabricating credentials or misidentifying public data as secret, which required human validation and hindered full autonomy.

A Leap Beyond Previous Threats

This incident builds on Anthropic’s earlier “vibe hacking” findings from June 2025, where humans directed AI-assisted intrusions starting from compromised VPNs. In contrast, GTG-1002 minimized human involvement to just 10-20% of the effort, focusing on strategic gates like exploitation approval. The use of commodity open-source tools—network scanners, password crackers, and binary analyzers—orchestrated via specialized MCP servers, highlights how AI lowers barriers for sophisticated attacks. Even less-resourced groups could now replicate such operations.

Anthropic notes that while they only have visibility into Claude’s usage, similar patterns likely exist across other frontier AI models. The campaign targeted entities with potential intelligence value, such as tech innovations and chemical processes, underscoring state-level espionage motives.

Anthropic’s Swift Response and Broader Implications

Upon detection, Anthropic banned associated accounts, notified affected entities and authorities, and enhanced defenses. This included expanding cyber-focused classifiers, prototyping early detection for autonomous attacks, and integrating lessons into safety policies. Ironically, the company used Claude itself to analyze the vast data from the investigation, demonstrating AI’s defensive potential.

The report raises profound questions about AI development: If models can enable such misuse, why release them? Anthropic argues that the same capabilities make AI essential for cybersecurity defense, aiding in threat detection, SOC automation, vulnerability assessment, and incident response. “A fundamental change has occurred in cybersecurity,” the report states, urging security teams to experiment with AI defenses while calling for industry-wide threat sharing and stronger safeguards.

As AI evolves rapidly—capabilities doubling every six months, per Anthropic’s evaluations—this campaign signals a new era where agentic systems could proliferate cyberattacks. Yet, it also highlights the need for balanced innovation: robust AI for offense demands equally advanced AI for protection. For now, transparency like this report is a critical step in fortifying global defenses against an increasingly automated threat landscape.

November 13, 2025

Tag: Claude Code

Boris Cherny Says Coding Is “Solved” — Head of Claude Code Reveals What Comes Next for Software Engineers

TLDW (Too Long; Didn’t Watch)

Key Takeaways

1. Claude Code’s Growth Is Staggering

2. 100% AI-Written Code Is the New Normal

3. Coding Is “Solved” — The Frontier Has Shifted

4. The Rise of the “Builder” Role

5. Underfunding Teams Is a Feature, Not a Bug

6. Give Engineers Unlimited Tokens

7. Build for the Model Six Months From Now

8. The Bitter Lesson Applied to Product

9. Latent Demand — The Most Important Product Principle

10. Cowork Was Built in 10 Days

11. Three Layers of AI Safety at Anthropic

12. Why Boris Left Anthropic for Cursor (and Came Back After Two Weeks)

13. Manual Coding Skills Will Become Irrelevant in 1-2 Years

14. Pro Tips for Using Claude Code Effectively

Detailed Summary

The Origin Story of Claude Code

The Growth Trajectory

Thinking in Exponentials

The Printing Press Analogy

What AI Transforms Next

The Latent Demand Framework in Depth

Safety as a Core Differentiator

The Memory Leak Story

Personal Background and Post-AGI Plans

Thoughts and Analysis

Naval Ravikant on AI: Vibe Coding, Extreme Agency, and the End of Average

TL;DW

Key Takeaways

Detailed Summary

The Rise of Vibe Coding

The Death of Average and the Extreme App Store

Why Traditional Software Engineers Still Have the Edge

Entrepreneurs and Extreme Agency

Is AI Alive? The Philosophy of Intelligence

The Ultimate Tutor

Action Cures Anxiety

Thoughts

Dario Amodei on the AGI Exponential: Anthropic’s High-Stakes Financial Model and the Future of Intelligence

TL;DW (Too Long; Didn’t Watch)

Key Takeaways

Detailed Summary: Scaling, Risk, and the Post-Labor Economy

The Three Laws of Scaling

The “Country of Geniuses” and the End of Code

Financial Models and Ruinous Risk

Governance and the Authoritarian Threat

Final Thoughts: The Intelligence Overhang

Anthropic Uncovers and Halts Groundbreaking AI-Powered Cyber Espionage Campaign

How the Attack Unfolded: AI as the Primary Operator

A Leap Beyond Previous Threats

Anthropic’s Swift Response and Broader Implications