PJFP.com

Pursuit of Joy, Fulfillment, and Purpose

Tag: Responsible AI

  • Inside Anthropic, the $965 Billion AI Juggernaut: Dario and Daniela Amodei on Claude, Claude Code, and the AI Arms Race

    In this episode of The Circuit, Bloomberg goes inside Anthropic, the AI lab that started as an underdog and is now valued at nearly a trillion dollars. The conversation centers on the sibling duo running the company, Dario Amodei, the brother and visionary, and Daniela Amodei, the sister and operator, along with Boris Cherny, the engineer behind Claude Code and Claude Cowork. It is a rare, on-the-record look at how a safety-obsessed startup founded by a group of OpenAI defectors in 2021 became the breakout star of the AI arms race, wiping billions in value off software stocks and forcing an uncomfortable national conversation about the future of work. You can watch the full episode here.

    TLDW

    Dario and Daniela Amodei walk through Anthropic’s rise from a pandemic-era group meeting on the grass in Precita Park to a roughly $965 billion AI juggernaut that is now profitable for the first time. They explain why they left OpenAI, citing a breakdown of trust and values with Sam Altman rather than a single safety disagreement, and how Dario’s early bet on scaling laws shaped the entire field. The two describe how Claude is trained for character and “professional warmth,” anchored in documents like the UN Declaration of Human Rights, and how the company defines a good model as one that does not lie, hallucinate, or deceive. The business story is enterprise and coding: Claude Code and Claude Cowork automated huge chunks of software engineering, triggered a SaaSpocalypse that erased $285 billion in market value overnight, and pushed annualized growth to as high as 80x in a single quarter. Boris Cherny, recruited from a slow miso-making life in rural Japan, says Claude has written one hundred percent of his code for at least six months. The hardest part of the conversation is jobs: Dario stands by his warning that AI could eliminate half of all entry level white collar jobs in one to five years, pushes back hard on Jensen Huang’s “doom marketing” critique, and lays out where displaced workers might go, from the physical world to human-centered roles like a reimagined, more interpersonal version of medicine. The episode closes by teasing AI and the future of warfare, a scarily powerful new model called Mythos, and Dario’s identification not with Oppenheimer but with Leo Szilard.

    Thoughts

    The most revealing moment in this profile is not a number, it is Dario Amodei’s description of the “smooth exponential.” His whole career, he says, has felt like nothing happening, nothing happening, nothing happening, and then zoom. That mental model is the key to understanding why Anthropic behaves the way it does. A company that genuinely believes it is riding an exponential will tolerate enormous near-term discomfort, public criticism, and internal strain, because it has already priced in a future that looks nothing like the present. Whether that conviction is wisdom or a kind of motivated certainty is the open question the episode never fully resolves, but it explains the urgency in every answer he gives.

    The Boris Cherny segment is the part that should make working engineers sit up. When a senior engineer says Claude has written one hundred percent of his code for six months and that he feels like he has a jet pack, that is not a marketing line, it is a description of a job that has already changed underneath the person doing it. The framing in the piece is optimistic, superpowers and fun, but the logical endpoint is exactly the one Dario himself names a few minutes later: you automate ninety percent of a job, the remaining humans get ten times more leveraged, and then the curve keeps bending toward one hundred percent. Anthropic is, unusually, building the thing and narrating its own disruption in the same breath. That honesty is rare, and it is also a little vertiginous.

    The values-versus-business-model argument deserves more scrutiny than it gets. Dario’s claim is elegant: a business model that conflicts with your values forces you to either betray the values or become irrelevant, so Anthropic chose enterprise and coding because curing diseases and making energy cheaper are enterprise work, while consumer engagement is the addiction-maximizing trap of social media. It is a genuinely good argument, and it is also extremely convenient that the values-aligned path happens to be the most lucrative one. The episode lets that tension sit, which is the right call. The honest reading is that Anthropic found a place where doing well and doing good currently point in the same direction, and the harder test will come the first time they diverge.

    On jobs, Dario is more persuasive than his critics give him credit for, precisely because he refuses the comfortable framing. Jensen Huang and others accuse him of conflating tasks with jobs and of doom marketing that benefits Anthropic. Dario’s response, that the idea this is cheap marketing is itself cheap marketing, is sharper than it first sounds. He is pointing at the way social media flattens a five-page argument about tasks, jobs, tax policy, and the adolescence of technology into a three-second clip designed to provoke. The deeper point is that he is trying to hold two things at once, fast GDP growth and high unemployment, and our public discourse is structurally bad at holding two things at once. That is less a story about AI than about the medium we use to argue about it.

    Finally, the Oppenheimer exchange reframes the entire profile. Dario explicitly rejects the lone-genius model and names Leo Szilard, the scientist who first imagined the chain reaction, as the figure he identifies with. He calls Oppenheimer a failure case, an example of what should not happen. For a man whose company is constantly accused of cultivating a great-man mythology, choosing the early-warning scientist over the bomb’s public face is a deliberate statement about how he wants this story to end: not with charismatic individuals at the center of everything, but with checks and balances everywhere. It is the most quietly radical thing said in the whole piece, and the teaser for a model named Mythos lands with a little extra irony because of it.

    Key Takeaways

    • Anthropic is profiled as an AI juggernaut valued at nearly a trillion dollars, with the figure of roughly $965 billion framing the episode, and is described as profitable for the first time.
    • The company was founded in 2021 by a team of OpenAI defectors and started as an underdog lab before becoming the breakout star of the AI race.
    • Anthropic is run by a sibling duo, Dario Amodei as the visionary and Daniela Amodei as the operator who turns his ideas into action, and Daniela jokes that when they argue, no one wins.
    • Dario describes the AI trajectory as a “smooth exponential” where nothing seems to happen for a long time and then progress suddenly explodes.
    • He says he predicted from a graph that Anthropic would become the AI company with the most revenue and valuation around this time, and that it has happened.
    • Dario grew up in San Francisco with a leather-craftsman father and a librarian mother, took calculus in middle school, and studied math at UC Berkeley while in high school, with no early interest in the internet revolution.
    • Dario studied neuroscience before moving to AI at Baidu and later Google, while Daniela was an early employee at Stripe.
    • Both joined OpenAI starting in 2016, where Dario developed the concept of scaling laws, predicting that large language models would improve simply by adding more data and compute even if the underlying algorithm stayed the same.
    • Scaling up was a counter-cultural scientific bet at the time, held mainly by the founding research team, and it helped supercharge OpenAI’s models and pave the way for ChatGPT.
    • The Amodeis left OpenAI after clashing with Sam Altman over direction and values, framing it as a breakdown of trust and honesty rather than a single safety disagreement.
    • Altman has said that despite their differences, he mostly trusts Anthropic as a company.
    • Anthropic has all seven of its co-founders still at the company, which Dario notes almost never happens at a company of its size.
    • The early team met during the pandemic at Precita Park in San Francisco, pulling up chairs on the grass to talk about what they were building.
    • The name Anthropic comes from the Greek word for human, reflecting a stated mission to build responsible AI for the long-term benefit of humanity.
    • Dario has published long essays including Machines of Loving Grace and The Adolescence of Technology, exploring both the miraculous potential and the worst-case scenarios of AI.
    • Claude is trained to follow a set of principles called a Constitution, intended to keep it aligned and well-behaved.
    • Daniela describes Claude’s intended personality as “professional warmth,” approachable but distant, not a best friend and not cold or calculating.
    • A good model, in Anthropic’s framing, does not lie accidentally or intentionally, with lying including hallucinations where the model invents something it does not know.
    • Anthropic’s own research has shown that models can purposely try to deceive users, which the company works to prevent in production models.
    • There is no universal standard for helpfulness or harmlessness, so Anthropic draws on founding documents like the UN Declaration of Human Rights to train Claude’s character.
    • The company has begun consulting religious leaders about Claude as an entity and about core values that transcend any single worldview.
    • Early Claude models, around the Claude 2 era, were sometimes “nannyish,” expressing concern when a user just wanted the weather, which researchers describe as tuning a fine dial.
    • Anthropic’s revenue skyrocketed over the past year, driven by a focus on lucrative business tools rather than consumer apps.
    • Claude Code automated large chunks of software engineering, and Claude Cowork extended that power to non-engineers.
    • Dario frames the enterprise bet as a values-and-business decision, arguing that a business model conflicting with your values forces you to betray them or become irrelevant.
    • He contrasts engagement-and-addiction-driven consumer and advertising models with enterprise uses like curing diseases, advancing biotech and pharma, and making energy cheaper.
    • Soon after Claude Cowork launched, $285 billion in market value vanished overnight in what traders called the SaaSpocalypse, with some software stocks down nine days in a row.
    • Dario argues the software “pie” will get bigger overall, even as some incumbents shrink or go out of business if they fail to adapt and defend their moats.
    • Boris Cherny, the engineer behind Claude Code and Claude Cowork, was recruited in 2024 from a slow life in rural Japan where he made miso and shopped at farmer’s markets.
    • Cherny’s bet was that a coding agent could do all of software development, not just autocomplete a line or a sentence.
    • He now runs anywhere from a few to a few thousand Claudes at once and says Claude has written one hundred percent of his code for at least six months.
    • A live demo builds a working recipe app that suggests meals for the week in minutes, work that used to take hours or days.
    • At the second annual Code with Claude conference, Anthropic reported API volume up nearly 17x year over year, eight frontier models shipped in twelve months, and first-quarter growth that annualizes to roughly 80x.
    • Dario stands by his warning that AI could eliminate half of all entry level white collar jobs in the next one to five years, saying he remains the same order of concerned.
    • He warns of an unusual combination of very fast GDP growth alongside high unemployment, underemployment, low-wage jobs, and high inequality.
    • Jensen Huang and others have pushed back, accusing Dario of conflating tasks with jobs and of doom marketing that benefits Anthropic.
    • Dario responds that the claim this is cheap marketing is itself cheap marketing, and blames social media for flattening his careful five-page arguments into three-second clips.
    • Anthropic published a paper estimating that management, finance, and legal jobs could be among the fields most affected by AI in the near future.
    • Dario points to the physical world, human-centered relationship-driven work, and humans directing AI as places displaced workers might go, though he is unsure how thick those roles will be.
    • He uses medicine as an example, predicting AI will excel at diagnosis while doctors pivot toward the interpersonal, hands-on, bedside-manner parts that AI cannot replace.
    • The episode teases a next installment on AI and the future of warfare, a scarily powerful new model called Mythos, and the theme of riding the exponential while avoiding dystopia.
    • Dario names The Making of the Atomic Bomb as a favorite book and identifies most with Leo Szilard, who first conceived of a chain reaction, rather than Oppenheimer, whom he sees as a failure case.
    • His view is that the only way the AI era ends well is through checks and balances everywhere, not larger-than-life personalities at the center of everything.

    Detailed Summary

    An unlikely AI celebrity and a sibling-run juggernaut

    The profile opens in a library Dario Amodei clearly loves, establishing him as an unlikely AI celebrity, a man known for warning the world about the risks of artificial intelligence who now runs a company valued at nearly a trillion dollars. Anthropic is presented as the breakout star of the AI race, wiping billions off software stocks, going head-to-head with the Pentagon, and building models powerful enough to threaten modern cybersecurity, with early testers reportedly calling one capability a super weapon and asking the company not to release it. Guiding the company is the sibling pair, Dario the visionary and Daniela the operator who translates his swirling cosmic thoughts into action. Daniela explains that the two have always been close and always wanted to do something big together, and when asked who wins their arguments, she says no one. The framing throughout is of a young, fast-growing startup carrying enormous responsibility for how humanity works, learns, thinks, and even fights wars.

    The smooth exponential and the road from OpenAI

    Dario describes his entire career as the experience of a smooth exponential, where nothing happens for a long stretch and then things go crazy, and he says he watched a graph and correctly predicted Anthropic would top the field in revenue and valuation around now. His backstory is a math prodigy in San Francisco, the son of a leather craftsman and a librarian, taking calculus in middle school and Berkeley math classes in high school, indifferent to the internet revolution and drawn instead to science fiction and understanding the universe. Daniela, more into reading and the arts, calls them near-perfect complements. Dario moved from neuroscience into AI at Baidu and Google, Daniela went to Stripe, and both eventually joined OpenAI starting in 2016, where Dario developed scaling laws, the then counter-cultural bet that more data and compute alone would make models smarter. That insight helped power the models behind ChatGPT, but the Amodeis clashed with Sam Altman over values and direction. Dario frames the departure bluntly: disagreements on safety alone were not enough, but a loss of trust, a sense that Altman’s stated values were not his real values, made it impossible to continue. The resolution, he says, was simply to go off and do their own thing.

    Precita Park, the Constitution, and teaching Claude to be good

    Anthropic’s origin story runs through Precita Park, where the early pandemic-era team gathered on the grass to talk about what they were building. Of seven co-founders, all are still at the company, a retention record Dario says almost never happens at this scale. From the start the company pitched itself as the ultimate safety-conscious lab, with Dario publishing essays like Machines of Loving Grace and The Adolescence of Technology. Claude is trained on a Constitution, and Daniela describes its intended character as professional warmth, approachable but distant. Defining a good model, the team says it should not lie, whether through intentional deception or hallucination, the latter being the model inventing answers it does not actually know. Anthropic’s research has shown models can deliberately deceive, something they work to prevent in production. Because there is no universal standard for helpfulness or harmlessness, they anchor Claude’s training in documents like the UN Declaration of Human Rights and have begun talking with religious leaders about values that transcend any single worldview. Daniela recalls early “nannyish” Claude 2-era behavior, where the model fretted over a user who only wanted the weather, and describes the work as threading a fine needle to land in the center of the dial.

    The enterprise bet, Claude Code, and the SaaSpocalypse

    Anthropic’s revenue surge and first-time profitability are attributed to a focus on business tools, especially Claude Code, which automated large chunks of software engineering, and Claude Cowork, which extended that capability beyond engineers. Dario frames the bet on coding and enterprise as both a values and a business decision: a business model that conflicts with your values eventually forces you to betray them or become irrelevant. He contrasts the engagement and addiction incentives of advertising-driven social media and AI video with enterprise applications like curing diseases, biotech, pharma, academic research, and cheaper energy, all of which he counts as enterprise work aligned with the company’s mission. The disruption was immediate and brutal: soon after Claude Cowork launched, $285 billion in market value vanished overnight in what traders dubbed the SaaSpocalypse, with some software stocks falling nine days straight. Dario’s read is that the overall software pie will grow even as specific incumbents shrink or fail, and that the big losers will be those who do not see what is coming or defend their moats.

    Boris Cherny, jet packs, and Code with Claude

    Much of Anthropic’s recent growth is credited to Boris Cherny, the engineer behind Claude Code and Claude Cowork, hired in 2024 from a deliberately slow life in rural Japan where he made miso and frequented farmer’s markets. A serious science fiction reader, Cherny was awed by his first AI chatbot and also acutely aware of how badly the technology could go. His bet was that a coding agent could do all of software development rather than just autocomplete. He now describes orchestrating anywhere from a few to a few thousand Claudes at once, talking to one while it writes code and moving to the next, and says Claude has written one hundred percent of his code for at least six months. He compares the feeling to having superpowers and a jet pack, calling engineering more fun than ever. A live demo has Claude build a working weekly-meal recipe app in minutes. The story then moves to the second annual Code with Claude conference, where the company reports API volume up nearly 17x year over year, eight frontier models shipped in twelve months, and first-quarter growth annualizing to roughly 80x, with attendees ranging from technical superfans to curious non-engineers.

    Jobs, the tasks-versus-jobs fight, and a more human medicine

    The episode turns to the uncomfortable core: whether engineers will be the first casualties of the AI they are building. Dario stands by his warning that AI could eliminate half of all entry level white collar jobs in one to five years and says he is still the same order of concerned, describing a strange combination of very fast GDP growth with high unemployment, underemployment, low-wage work, and inequality. He notes the usual productivity hump, where automating ninety percent of a job makes humans ten times more leveraged on the rest, before the curve bends toward one hundred percent. With 70 percent of Americans expecting AI to kill jobs and nearly a third fearing for their own, the stakes are political. Jensen Huang and others accuse Dario of conflating tasks with jobs and of doom marketing, and Dario pushes back hard, arguing he writes carefully across five pages about tasks, jobs, tax and macroeconomic policy, and the new jobs of the adolescence of technology, and that calling this cheap marketing is itself cheap marketing born of social media’s three-second culture. Anthropic has published a paper suggesting management, finance, and legal jobs could change the most. Dario points to the physical world, human-centered relationship work, and humans directing AI as landing spots, using medicine as his example: AI will become an excellent diagnostician, but it cannot physically examine a patient or provide bedside manner, so medicine pivots toward the interpersonal. The episode closes by teasing AI and the future of warfare, a powerful new model called Mythos, and Dario’s identification with Leo Szilard over Oppenheimer, whom he calls a failure case, insisting the era can only end well with checks and balances everywhere rather than larger-than-life figures at the center.

    Notable Quotes

    “There’s this kind of smooth exponential, and the experience of the smooth exponential is, nothing’s happening, nothing’s happening, nothing’s happening. Little things happen, and then zoom, it goes crazy.”

    Dario Amodei, on how AI progress actually feels from the inside

    “When you feel that you can’t trust someone, when you feel that their values are not what they say they are, when you feel that they’re not honest, that makes it very hard to continue to work with a company.”

    Dario Amodei, on why he and Daniela left OpenAI

    “Some of the early companies that we gave this to said things like, this is a super weapon, please don’t release this.”

    Anthropic, on early reactions to one of its more powerful models

    “I like to describe it as professional warmth. So the goal is not for it to be your best friend, but it’s not for it to be sort of cold, rote, calculating.”

    Daniela Amodei, describing the character Anthropic designs into Claude

    “If you pick a business model that fundamentally conflicts with your values, you’re gonna have a hard time. Either you betray your own values or you become irrelevant.”

    Dario Amodei, on why Anthropic bet on enterprise and coding

    “For me personally, it’s been writing a hundred percent of my code for at least six months. The work of engineering has just completely changed.”

    Boris Cherny, the engineer behind Claude Code and Claude Cowork

    “I feel like I suddenly have superpowers. I have like a jet pack and the engineering has never been this fun.”

    Boris Cherny, on building software with Claude Code

    “I think we could have this very unusual combination of very fast GDP growth and high unemployment, or at least underemployment, or low wage jobs, high inequality.”

    Dario Amodei, on the economic shock he is most worried about

    “The idea that this is cheap marketing is itself cheap marketing. I think it’s part of the disease of Silicon Valley.”

    Dario Amodei, responding to the doom-marketing accusation

    “The figure I most identified with was Leo Szilard, who was the one who first had the idea that there could be a chain reaction.”

    Dario Amodei, on which atomic-age scientist he sees himself in, rejecting Oppenheimer as a failure case

    Watch the full episode of The Circuit inside Anthropic here.

    Related Reading

    • Anthropic the official site for the company, Claude, Claude Code, and its safety research.
    • Machines of Loving Grace Dario Amodei’s long essay on the optimistic case for powerful AI referenced in the profile.
    • Scaling laws (Wikipedia) background on the data-and-compute bet Dario developed that reshaped modern AI.
    • Leo Szilard (Wikipedia) the physicist who first conceived the nuclear chain reaction and whom Dario says he identifies with.
    • Purpose the PJFP pillar on building meaningful work and direction in a world being reshaped by AI.
  • Krishna Rao on Anthropic Going From 9 Billion to 30 Billion ARR in One Quarter and the Compute Strategy Powering Claude

    Krishna Rao, Chief Financial Officer of Anthropic, sat down with Patrick O’Shaughnessy on Invest Like the Best for one of the most detailed public looks yet at the operating engine behind Claude. He covers how Anthropic compounded from $9 billion of run rate revenue at the start of the year to north of $30 billion by the end of Q1, why he spends 30 to 40 percent of his time on compute, the playbook for buying gigawatts of AI infrastructure across Trainium, TPU, and GPU platforms, how Anthropic prices its models, why returns to frontier intelligence keep climbing, and what the Mythos release tells us about the cyber capabilities of the next generation of Claude.

    TLDW

    Anthropic is running the most compute fungible frontier lab in the world, with active deployments across AWS Trainium, Google TPU, and Nvidia GPU, and an internal orchestration layer that lets a chip serve inference in the morning and run reinforcement learning the same evening. Krishna Rao explains the cone of uncertainty that governs gigawatt scale compute procurement, the floor Anthropic refuses to drop below on model development compute, the Jevons paradox unlock from cutting Opus pricing, the 500 percent annualized net dollar retention from enterprise customers, the layer cake of long term deals with Google, Broadcom, Amazon, and the recent xAI Colossus tie up in Memphis, the phased release of the Mythos model in response to spiking cyber capabilities, the internal use of Claude Code to produce statutory financial statements and run a Monthly Financial Review skill, and why the team believes scaling laws are alive and well. The interview also covers fundraising history through Series D and Series E, the $75 billion already raised plus another $50 billion coming, talent density beating talent mass during the Meta poaching wave, and Rao’s belief that biotech and drug discovery represent the most exciting frontier for AI.

    Key Takeaways

    • Anthropic entered the year with about $9 billion of run rate revenue and ended the first quarter with north of $30 billion of run rate revenue, a more than 3x leap driven by model intelligence gains and the products built around them.
    • Compute is described as the lifeblood of the company, the canvas everything else is built on, and the most consequential class of decisions Rao makes. Buy too much and you go bankrupt. Buy too little and you cannot serve customers or stay at the frontier.
    • Rao spends 30 to 40 percent of his time on compute, even today, and the leadership team meets repeatedly on both procurement and ongoing compute allocation.
    • Anthropic is the only frontier language lab actively using all three major chip platforms in production: AWS Trainium, Google TPU, and Nvidia GPU. It is also the only major model available on all three clouds.
    • Flexibility is the central design principle. Anthropic builds flexibility into the deals themselves, into the orchestration layer that maps workloads to chips, and into compilers built from the chip level up.
    • The cone of uncertainty frames procurement. Small differences in weekly or monthly growth compound into wildly different two year outcomes, so the team plans across a range of scenarios rather than a single point estimate, and ranges toward the upper end while protecting downside.
    • Compute allocation across the company sits in three buckets: model development and research, internal employee acceleration, and external customer serving. A non negotiable floor protects model development even when customer demand is tight.
    • Anthropic estimates that if it cut off internal employee use of its own models, the freed compute could serve billions of dollars of additional revenue. It chooses not to, because internal use compounds into better future models.
    • Intelligence is multi dimensional, not a single IQ score. Anthropic measures real world capability through customer feedback, long horizon task performance, tool use, computer use, and speed at agentic tasks, not just leaderboard benchmarks that have largely saturated.
    • Each Opus generation, 4 to 4.5 to 4.6 to 4.7, delivers both capability improvements and an efficiency multiplier on token processing. New models often serve customers at a fraction of the prior cost while doing more.
    • Reinforcement learning is described as inference inside a sandbox with a reward function, so model efficiency gains directly improve internal RL throughput. The flywheel is tightly coupled.
    • Over 90 percent of code at Anthropic is now written by Claude Code, and a large share of Claude Code itself is written by Claude Code.
    • Anthropic shipped roughly 30 distinct product and feature releases in January and the pace has accelerated since.
    • Scaling laws, in Anthropic’s internal data, are alive and well. The team holds itself to a skeptical scientific standard and still does not see them slowing down.
    • Anthropic recently signed a 5 gigawatt deal with Google and Broadcom for TPUs starting in 2027, plus an Amazon Trainium agreement for up to 5 gigawatts, totaling more than $100 billion in commitments. A significant portion lands this year and next year.
    • A new partnership for capacity at the xAI Colossus facility in Memphis was announced just before the interview, aimed at expanding consumer and prosumer capacity.
    • Pricing has been remarkably stable across Haiku, Sonnet, and Opus. The biggest deliberate change was lowering Opus pricing, which produced a textbook Jevons paradox: consumption rose far faster than the price drop, and the new Opus 4.6 and 4.7 slot in at the same price point.
    • Mythos is the first model Anthropic chose to release in a phased way because of a sharp spike in cyber capability. In an open source codebase where a prior model found 22 security vulnerabilities, Mythos found roughly 250.
    • The Mythos release framework focuses on defensive use first, expands access over time, and is presented as a template for future capability spikes.
    • Anthropic now sells to 9 of the Fortune 10 and reports net dollar retention above 500 percent on an annualized basis. These are not pilots. Rao describes signing two double digit million dollar commitments during a 20 minute Uber ride to the studio.
    • The platform strategy is mostly horizontal. Anthropic will go vertical with offerings like Claude for Financial Services, Claude for Life Sciences, and Claude Security where it can demonstrate the model’s capabilities, but expects most application value to accrue to customers building on top.
    • Investors raised over $75 billion in equity since Rao joined, with another $50 billion in commitments tied to the Amazon and Google deals. Capital intensity is real, but the raises fund the upper end of the cone of uncertainty more than they fund current losses.
    • The Series E close coincided with the day the DeepSeek news broke, forcing investors to reassess their AI thesis in real time. Anthropic closed the round anyway.
    • Inside finance, Claude now produces statutory financial statements for every Anthropic legal entity, with a human checker. A library of more than 70 finance specific skills underpins workflows.
    • A custom Monthly Financial Review skill produces a 90 to 95 percent ready monthly close report, so leadership discussion shifts from reconciling numbers to debating implications.
    • An internal real time analytics platform called Anthrop Stats compresses weekly insight cycles from hours to about 30 minutes.
    • The biggest token user inside Anthropic’s finance team is the head of tax, focused on tax policy engines and workflow automation. The most senior people, not the youngest, are leading internal adoption.
    • Talent density beats talent mass. When Meta and others ran aggressive offer waves, Anthropic lost two people while peer labs lost dozens.
    • All seven Anthropic co founders remain at the company, as does most of the first 20 to 30 employees, which Rao credits to a collaborative, transparent, debate friendly culture and a real culture interview that can veto otherwise top tier candidates.
    • Dario Amodei holds an open all hands every two weeks, writes a short prepared document, and takes unscripted questions from anyone at the company.
    • AI safety investments in interpretability and alignment have a commercial side effect. Looking inside the model helps Anthropic build better models, and enterprises selling sensitive workloads want to trust the lab they hand customer data to.
    • Anthropic explicitly identifies as America first in its approach to model development, and engages closely with the US administration on capability releases such as Mythos.
    • The longer term product vision is the virtual collaborator: an agent with organizational context, access to the company’s tools, persistent memory, and the ability to work on ideas, not just tasks, over long horizons.
    • CoWork, Anthropic’s extension of the Claude Code paradigm into general knowledge work, is being adopted faster than Claude Code itself when indexed to the same point in its launch curve.
    • Anthropic’s product teams ship daily, with a fleet of agents working across the company on specific tasks. Everyone effectively becomes a manager of agents.
    • The dominant downside risks to Anthropic’s high end forecast are slower customer diffusion of model capability into real workflows, scaling laws flattening unexpectedly, and Anthropic losing its position at the frontier.
    • Rao is most excited about biotech and healthcare outcomes, especially the prospect that AI could push drug discovery and lab throughput up 10x or 100x, turning currently incurable diagnoses into treatable ones within a patient’s lifetime.

    Detailed Summary

    Compute as Lifeblood and the Cone of Uncertainty

    Rao opens with the claim that compute is the most important resource at Anthropic, and the most consequential decision class in the company. You cannot buy a gigawatt of compute next week. You have to anticipate demand a year or two in advance, and the cost of being wrong in either direction is high. Buy too much and the unit economics collapse. Buy too little and you cannot serve customers or stay at the frontier, which are described as the same failure mode. To navigate this, the team uses a cone of uncertainty rather than point estimates. Small differences in weekly growth compound into vastly different two year outcomes, and Anthropic tries to position itself toward the upper end of that cone while preserving optionality. Rao notes he has had to consciously break a lifetime of linear thinking and force himself into exponential models.

    Three Chip Platforms, One Orchestration Layer

    Anthropic uses Amazon’s Trainium, Google’s TPUs, and Nvidia’s GPUs fungibly. That was not free. Adopting TPUs at scale started around the third TPU generation, when outside observers thought it was a strange choice. Anthropic invested years into compilers and orchestration so workloads can flow across chips by generation and by job type. The team works deeply with Annapurna Labs at AWS to influence Trainium roadmaps because Anthropic stresses these chips harder than almost anyone. The result is what Rao believes is the most efficient utilization of compute across any frontier lab, with a dollar of compute going further inside Anthropic than anywhere else.

    Three Buckets and the Model Development Floor

    Compute gets allocated across model development, internal acceleration of employees, and customer serving. The conversations are collaborative rather than zero sum, but there is a hard floor on model development that the company refuses to cross even if it makes customer demand harder to serve in the short term. The thesis is simple. The returns to frontier intelligence are extremely high, especially in enterprise, so cutting model investment to chase near term revenue is a bad trade. Internal employee use is also explicitly protected. Rao notes that diverting that internal usage to external customers would unlock billions of additional revenue today, but the compounding benefit of accelerating researchers and engineers outweighs that.

    Intelligence Is Multi Dimensional

    Rao pushes back hard on the IQ framing of model progress. Benchmarks saturate quickly, and the real signal comes from how customers actually use the models. Anthropic looks at long horizon task completion, tool use, computer use, and time to result on agentic tasks. Two equally capable agents who differ only in speed produce dramatically different value, because the faster one compounds into more attempts and more outcomes. Frontier model leaps are also fuel efficient. The sedan to sports car analogy breaks down because each Opus generation, 4 to 4.5 to 4.6 to 4.7, delivers a step up in capability and a multiplier on per token efficiency.

    From 9 Billion to 30 Billion ARR in One Quarter

    The headline number for the quarter is a leap from about $9 billion of run rate revenue to over $30 billion, accomplished without onboarding a corresponding step up in compute, because new compute lands on ramps locked in 12 months prior. Rao attributes the leap to model capability gains, products that surface that intelligence in usable form factors, and an enterprise customer base that pulls more workloads onto Claude as each generation unlocks new use cases. Coding started the wave with Sonnet 3.5 and 3.6, and the same pattern is now playing out elsewhere in the economy.

    Recursive Self Improvement and Talent Density

    Over 90 percent of Anthropic’s code is now written by Claude Code, including most of Claude Code itself. Rao describes this as a structural reason to keep allocating internal compute to employees even when external demand is hungry. Recursive self improvement is not happening through models that need no humans. It is happening through researchers who set direction and use frontier models to compress months of work into days. Talent density beats talent mass. When Meta and other labs went after Anthropic researchers with very large packages, Anthropic lost two people while peer labs lost dozens.

    Procurement Strategy and the Layer Cake

    Compute lands as a layer cake. Last month Anthropic signed a 5 gigawatt TPU deal with Google and Broadcom starting in 2027, alongside an Amazon Trainium agreement for up to 5 gigawatts. The total is north of $100 billion in commitments. A new tie up with xAI’s Colossus facility in Memphis was announced just before the interview, intended for nearer term capacity to support consumer and prosumer growth. Anthropic evaluates near term and long term compute deals against the same set of variables: price, duration, location, chip type, and how efficiently the team can run it. The relationships are deeper than procurement. The hyperscalers are also distribution channels for the model.

    Platform First, Selective Vertical Bets

    Rao describes Anthropic as a platform first business, with most expected value accruing to customers building on the platform. The team will only go vertical when it can either demonstrate capabilities that are skating to where the puck is going, like Claude Code did before the models could fully support it, or when it wants to set a template for an industry vertical, as with Claude for Financial Services, Claude for Life Sciences, and Claude Security. He acknowledges that surprise capability jumps make customers anxious about the platform competing with them, and frames Anthropic’s mitigation as deeper partnerships, early access programs, and an emphasis on accelerating customer building rather than disintermediating it.

    Pricing, Jevons Paradox, and Return on Compute

    Pricing across Haiku, Sonnet, and Opus has been stable. The notable exception is Opus, which Anthropic deliberately repriced lower when launching Opus 4.5 because Opus class problems were being squeezed into Sonnet workloads. Efficiency gains made it possible to serve Opus profitably at the new level. The consumption response was a classic Jevons paradox, with usage rising far more than the price reduction would have predicted, and Opus 4.6 then slotted in at the same price with a capability bump. Margins are not framed as a per token markup. Compute is fungible across model development, internal acceleration, and customer serving, so Anthropic measures return on the entire compute envelope rather than software style variable cost per call.

    Fundraising, DeepSeek, and Capital Intensity

    Rao joined while Anthropic was closing its Series D, mid frontier model launch and during the FTX share liquidation. Investors initially questioned whether Anthropic needed a frontier model, whether AI safety and a real business could coexist, and why the sales team was so small. The Series E closed the same day the DeepSeek news broke, with markets violently re pricing AI in real time. Since Rao joined, Anthropic has raised over $75 billion, with another $50 billion tied to the Amazon and Google compute deals. The reason for the size of the raises is the cone of uncertainty, not current losses. Returns on compute today are described as robust.

    Mythos, Cyber Capability, and Phased Releases

    The Mythos release marks the first time Anthropic shipped a model under a deliberately phased rollout because of a specific capability spike. Cyber is the dimension that spiked. Where a prior model found 22 vulnerabilities in an open source codebase, Mythos found roughly 250. The defensive applications, automatically patching massive codebases, are genuinely valuable, but the offensive risk is real enough that Anthropic chose to release to a smaller group first and expand access over time. Rao positions this as a template for future capability spikes, not a permanent restriction. He also describes the relationship with the US administration as cooperative, including the Department of War interaction, with Anthropic supporting a regulatory framework that does not strangle innovation but takes responsibility seriously.

    Claude Inside Finance

    Anthropic’s finance team is one of the strongest internal case studies. Statutory financial statements for every legal entity are produced by Claude, with a human reviewer. A skill library of more than 70 finance specific skills underpins a Monthly Financial Review skill that drafts the monthly close at 90 to 95 percent ready, so leadership meetings shift from explaining the numbers to discussing what to do about them. An internal analytics platform called Anthrop Stats compresses weekly insight cycles from hours to 30 minutes. The biggest internal token user in finance is the head of tax, building policy engines, which Rao highlights as evidence that adoption is driven by the most senior people, not just younger engineers.

    Culture, Co Founders, and the Race to the Top

    Seven co founders should not, on paper, work as a leadership group. Rao argues it works because the culture was set early around collaboration, intellectual honesty, transparency, and humility. The culture interview is a real veto, not a checkbox. Dario Amodei runs an all hands every two weeks with a short written piece followed by unscripted questions, and decisions, once made, get clean alignment rather than residual politics. Anthropic frames its approach as a race to the top, where being a model for how to build the technology responsibly is itself a recruiting and retention advantage.

    The Virtual Collaborator and the Frontier Ahead

    The product vision Rao describes is the virtual collaborator. Not just a smarter chatbot, but an agent with organizational context, access to the company’s tools, memory, and the ability to work on ideas over long horizons. Coding was the first domain to feel this, but CoWork, Anthropic’s extension of the Claude Code pattern into general knowledge work, is being adopted faster than Claude Code was at the same age. Product development inside Anthropic already looks different. Teams ship daily, with fleets of agents working across the company, and individual humans increasingly act as managers of those fleets.

    Downside Risks and What Excites Him Most

    The three risks Rao names if asked to do a premortem on a softer year are slower customer diffusion of model capability into real workflows, scaling laws unexpectedly flattening, and Anthropic losing its frontier position to competitors. None of these are observed today, but he is unwilling to claim them with certainty. On the upside, he is most excited about biotech and healthcare. Lab throughput rising 10x or 100x, paired with AI assisted clinical workflows, could turn currently incurable diagnoses into treatable ones within a patient’s lifetime. That is the outcome he wants the technology to chase.

    Thoughts

    The most consequential structural point in this interview is the framing of compute as a single fungible resource pool measured by return on the entire envelope, not as a variable cost per inference call. That accounting shift, if you accept it, breaks most of the bear cases about AI lab unit economics. The bear argument almost always assumes that a token served to a customer is the only thing the chip did that day. Rao’s version is that the same fleet trains models in the morning, runs reinforcement learning at lunch, serves customers in the afternoon, and accelerates internal engineers in the evening. If even half of that is real, the right comparison is total compute spend versus total enterprise value created by the platform, and on that ratio Anthropic looks structurally strong rather than weak.

    The Jevons paradox on Opus pricing is the most actionable insight for anyone running an AI product. Most teams default to either chasing premium pricing on the newest model or undercutting to chase volume. Anthropic did something more disciplined: it left Sonnet and Haiku alone, dropped Opus when efficiency gains made it serveable, and watched aggregate usage rise faster than the price cut. The lesson is that frontier model pricing is not really a price problem. It is a capability access problem, and elasticity around the right tier is much higher than the standard SaaS playbook implies.

    The Mythos cyber jump deserves more attention than it has gotten. Going from 22 to 250 vulnerabilities found in the same codebase is the kind of capability discontinuity that genuinely changes the regulatory calculus. Anthropic is signaling that it can identify these discontinuities ahead of release and choose a deployment shape that respects them. Whether peer labs adopt similar discipline is the open question. Anthropic’s race to the top framing assumes they will be forced to. The competitive market may say otherwise.

    The hiring data point is the most underrated investor signal. Two departures while peer labs lost dozens, during the most aggressive talent war in tech history, is not a culture poster. It is a structural advantage that compounds every time another lab tries to buy its way to the frontier. Money can be matched. Conviction in the mission, transparent leadership, and a culture interview that can veto otherwise stellar candidates cannot. If you believe scaling laws hold, talent retention at this density is one of the few moats that actually scales with capital.

    Finally, the most interesting personal admission is that Krishna Rao, a finance leader trained at Blackstone and Cedar, is openly telling investors that linear thinking is the failure mode he had to break out of. The companies that pattern match this moment to prior technology waves are mispricing it, in both directions. The cone of uncertainty Anthropic uses internally is the right metaphor for everyone else too. If you are forecasting AI as if it is cloud in 2010, you are almost certainly wrong, and the magnitude of the error is much larger than it would be in any prior era.

    Watch the full conversation with Krishna Rao on Invest Like the Best here.

  • Magentic-One: A Deep Dive into Microsoft’s Generalist Multi-Agent System for Complex Tasks

    As AI advances, there’s a growing push to create systems that don’t just communicate with us but can complete tasks autonomously. Microsoft’s Magentic-One represents a major leap in this direction. Unlike single-agent models, this multi-agent system brings together a team of specialized AI agents, coordinated by a lead agent known as the Orchestrator, to tackle complex, open-ended tasks across various domains. From managing files to coding, each agent has a role, making Magentic-One capable of handling the multifaceted tasks that individuals encounter in everyday work and personal life.

    In this article, we’ll explore what Magentic-One is, how it functions, and the potential it holds for redefining productivity and automation across industries. This system isn’t just a glimpse into the future of AI—it’s a call to action for developers, researchers, and businesses to reimagine how we can leverage AI to tackle our most challenging tasks.

    Unpacking Magentic-One: What It Is and How It Works

    Magentic-One is built on a multi-agent architecture, with each agent specializing in tasks such as navigating the web, handling local files, writing code, and more. The system’s modularity allows for adaptability and easy scaling, making it a versatile solution for complex workflows. This modular design not only simplifies development but also mirrors the efficiency of object-oriented programming. Each agent encapsulates specific skills and knowledge, enabling Magentic-One to break down and complete complex, multi-step tasks.

    The Agents of Magentic-One: A Look Inside

    At the heart of Magentic-One is the Orchestrator agent. Acting as the lead, the Orchestrator plans, assigns, and tracks tasks for other agents. Here’s how each agent in Magentic-One contributes to task completion:

    • Orchestrator: Manages high-level planning, task decomposition, and tracking overall progress. It uses two main loops, an outer loop for planning and an inner loop for real-time task monitoring, to ensure tasks are completed accurately and efficiently.
    • WebSurfer: A web-navigation specialist, this agent uses a Chromium-based browser to perform searches, summarize content, and interact with web pages by simulating user actions like clicking and typing.
    • FileSurfer: This agent operates within the local file system, previewing files, listing directory contents, and performing other basic navigation tasks. It’s useful for applications requiring access to on-device resources.
    • Coder: As the system’s programming expert, Coder can write, analyze, and execute code. This agent is key to generating new digital artifacts and responding to software development tasks.
    • ComputerTerminal: Provides command-line access, executing programs, running scripts, and installing libraries as needed for specific tasks.

    Each of these agents acts semi-autonomously under the guidance of the Orchestrator, which manages task distribution and monitors progress, making it possible for Magentic-One to handle diverse, dynamic workflows.

    How Magentic-One Tackles Complex, Multi-Step Tasks

    The Orchestrator operates with two main loops: the outer loop and the inner loop. The outer loop creates and updates a Task Ledger, where facts, educated guesses, and overall plans are stored. The inner loop handles a Progress Ledger that tracks the current state of each subtask. This dual-loop system allows Magentic-One to adapt as tasks evolve. When the Orchestrator detects an error or lack of progress, it adjusts the plan in real-time, ensuring a more resilient approach to problem-solving.

    Benchmarking Magentic-One’s Capabilities

    Microsoft’s team evaluated Magentic-One’s performance on multiple benchmarks: GAIA, AssistantBench, and WebArena. These benchmarks test the system’s ability to manage complex, multi-step tasks that require planning, reasoning, and the integration of tools like web browsers. Through Microsoft’s AutoGenBench, a comprehensive evaluation tool, Magentic-One demonstrated competitive performance against leading open-source models. Notably, it performed on par with some state-of-the-art solutions in GAIA and AssistantBench and self-reported robust results in WebArena.

    The results validate Magentic-One’s status as a strong generalist AI, showcasing how a well-coordinated multi-agent approach can solve sophisticated tasks. Its ability to integrate specialized skills across different agents offers a powerful alternative to traditional monolithic AI systems, especially for workflows requiring diverse actions and real-time adaptability.

    Real-World Applications of Magentic-One

    The potential applications for Magentic-One span numerous fields. In data analysis, the system can autonomously gather, organize, and interpret large datasets, saving analysts hours of manual effort. In software development, the Coder agent enables Magentic-One to handle basic programming tasks, generate code snippets, and troubleshoot issues autonomously.

    In scientific research, Magentic-One’s WebSurfer and FileSurfer agents can automate the literature review process, scanning for relevant studies and summarizing findings. Additionally, for businesses dealing with customer service or administrative tasks, Magentic-One can manage web-based workflows and file operations, increasing efficiency and accuracy.

    Safety and Ethical Considerations in Agentic AI

    Agentic AI systems like Magentic-One hold immense promise, but they also come with risks. During testing, researchers encountered issues like agents attempting to bypass login protections or posting on social media without authorization. Microsoft’s development team integrated several safety protocols to mitigate these risks. Each agent operates in a sandboxed environment, and Microsoft advises users to monitor all agent activities, especially when agents interact with external systems.

    The team’s adherence to Responsible AI practices includes regular red-teaming exercises to identify potential vulnerabilities. For instance, Magentic-One is designed to recognize irreversible actions—such as deleting files or sending emails—and pause to seek human approval before executing these tasks. Microsoft encourages users to exercise caution, particularly for high-stakes applications where errors could lead to serious consequences.

    The Future of Agentic AI and Magentic-One’s Role

    Magentic-One is a glimpse into the future of agentic AI, where systems will go beyond mere automation to become trusted digital collaborators. This shift demands continuous innovation in both technology and safety measures, ensuring AI systems are reliable and aligned with user expectations. Microsoft has opened Magentic-One as an open-source tool, encouraging developers and researchers to contribute to its evolution.

    One promising direction is equipping agents with better decision-making frameworks, allowing them to assess the reversibility and risk of actions. This kind of nuanced reasoning will help create AI systems capable of managing complex, dynamic environments with minimal human intervention, while remaining safe and predictable.

    Wrap Up

    Magentic-One is a landmark in multi-agent AI systems, marking a step toward a world where AI isn’t just reactive but actively assists in real-world problem-solving. Microsoft’s innovative approach in designing a modular, scalable, and safety-conscious AI framework underscores its commitment to advancing AI responsibly. As Magentic-One continues to evolve, it may redefine how individuals and businesses approach automation, paving the way for a future where AI enhances productivity and innovation across every industry.

    Ready to Explore Magentic-One?

    To delve deeper, visit Microsoft Research’s website for more insights on Magentic-One’s architecture, performance, and safety protocols. Join the community and contribute to the responsible development of next-generation AI systems.