PJFP.com

Pursuit of Joy, Fulfillment, and Purpose

  • SpaceX IPO Priced at $135 Per Share: SPCX Raises $75 Billion in the Largest IPO in History, Trading Begins June 12 on Nasdaq

    TLDR

    SpaceX confirmed the pricing of its initial public offering on June 11, 2026: 555,555,555 shares of Class A common stock at $135.00 per share, a raise of just under $75 billion. The stock begins trading Friday, June 12, 2026 on the Nasdaq Global Select Market and Nasdaq Texas under the ticker SPCX, with the offering expected to close on June 15. Underwriters hold a 30 day option to purchase up to 83,333,333 additional shares at the IPO price, which would push total proceeds toward $86 billion. At $135 per share the company is valued at roughly $1.77 trillion. That makes this the largest IPO ever priced, around three times the previous record, and it instantly places SpaceX among the most valuable companies on the planet, ahead of Tesla.

    Key Takeaways

    • The deal: 555,555,555 Class A shares priced at $135.00 each, raising approximately $75 billion before the overallotment option.
    • The ticker: SPCX, trading on both the Nasdaq Global Select Market and the new Nasdaq Texas exchange starting June 12, 2026. The offering closes June 15.
    • The greenshoe: underwriters have 30 days to buy up to 83,333,333 more shares at $135, worth another $11.25 billion and a potential total raise near $86 billion.
    • Record scale: roughly three times larger than Saudi Aramco’s 2019 listing, the previous record holder, and by some estimates bigger than all US IPO proceeds from 2024 and 2025 combined.
    • The valuation: approximately $1.77 trillion at the offer price, which would rank SpaceX around seventh among US companies by market cap, above Tesla at roughly $1.6 trillion.
    • The multiple: reported 2025 revenue of $18.7 billion puts the deal at roughly 95 times trailing sales.
    • Control: Elon Musk retains more than 82 percent voting power after the offering through the dual class structure.
    • The banks: Goldman Sachs leads a ten bank syndicate of book running managers including Morgan Stanley, BofA, Citigroup, and J.P. Morgan, with thirteen additional co-managers.
    • Truly global retail access: simultaneous retail offerings in the US, Canada, Switzerland, Australia, Japan, and seven EEA countries, with a qualified investor tranche in the UK. Mega IPOs almost never do this.
    • Demand: the book was reportedly around four times oversubscribed, implying roughly $250 billion in orders, and some brokers are imposing anti flipping penalties on early sellers.
    • Index mechanics: MSCI plans early inclusion of SPCX shortly after the debut, while S&P declined to fast track S&P 500 membership.
    • What you own: Starlink, the Falcon and Starship launch business, and the AI segment built around xAI and the X platform following the February 2026 merger.

    Detailed Summary

    The Deal: 555,555,555 Shares at $135

    Space Exploration Technologies Corp. announced from Starbase, Texas that its IPO priced at $135.00 per share for exactly 555,555,555 shares of Class A common stock. The math works out to $74,999,999,925, which is to say the share count was reverse engineered to land a fraction of a cent under a clean $75 billion. The quintuple five share count is exactly the kind of numerical flourish you would expect from this company. The SEC declared the registration statement effective on June 11, and the underwriters received a standard 30 day option for up to 83,333,333 additional shares, which at the offer price is another $11.25 billion. Fully exercised, total proceeds approach $86 billion.

    Where and When SPCX Trades

    Shares are expected to begin trading June 12, 2026 under the ticker SPCX on the Nasdaq Global Select Market and on Nasdaq Texas, the exchange operator’s new Dallas based venue. The dual venue listing is a symbolic alignment for a company headquartered in Starbase, Texas, and it hands Nasdaq Texas the biggest debut it could possibly ask for. The offering itself is expected to close on June 15, subject to customary conditions.

    The Largest IPO Ever, By a Wide Margin

    The previous record for an IPO raise was Saudi Aramco in December 2019 at roughly $29 billion including its overallotment. SpaceX clears that bar nearly three times over before its own greenshoe is exercised. Market data firms have noted that this single deal likely raises more money than every US IPO from 2024 and 2025 put together. Whatever 2026 looked like for the IPO market before this week, it is now a record year on the strength of one listing.

    A $1.77 Trillion Valuation in Context

    At $135 per share, SpaceX is valued at approximately $1.77 trillion, a figure that assumes pending transactions such as the EchoStar spectrum deal close as planned. That valuation would slot SpaceX in around seventh place among US public companies, ahead of Tesla, which trades near $1.6 trillion. It is a remarkable mark for a company that was privately valued at $350 billion in late 2024 and at $1.25 trillion when it merged with xAI in February 2026. Against reported 2025 revenue of $18.7 billion, the offer price represents roughly 95 times trailing sales, a multiple that prices in Starlink’s growth, Starship’s long term optionality, and the AI buildout all at once.

    The Syndicate

    Goldman Sachs leads the book running group, joined by Morgan Stanley, BofA Securities, Citigroup, J.P. Morgan, Barclays, Deutsche Bank Securities, RBC Capital Markets, UBS Investment Bank, and Wells Fargo Securities. Thirteen co-managers round out the syndicate, including Allen & Company, Cantor, Needham, Raymond James, Societe Generale, Stifel, William Blair, BTG Pactual, ING, Macquarie, Mirae Asset Securities, Mizuho, and Santander. Essentially every major bank on Wall Street and several from Asia, Europe, and Latin America have a seat at this table, which tells you how badly nobody wanted to be left out.

    A Genuinely Global Retail Offering

    One of the most unusual features of this IPO is its breadth. SpaceX structured simultaneous public offerings across an enormous number of jurisdictions. In Canada, a PREP prospectus was filed with regulators in every province and territory and is available through SEDAR+ at www.sedarplus.ca, meaning Canadian retail investors can participate directly. Retail offerings are also running in Switzerland and in seven EEA countries (Germany, Denmark, France, the Netherlands, Norway, Spain, and Sweden) under a European prospectus approved by Germany’s BaFin. Australia has its own ASIC lodged prospectus, Japan has a registration with the Kanto Local Finance Bureau distributed through Mizuho, Rakuten Securities, and SBI Securities, and the UK has a qualified investor tranche. Offering documents are centralized at www.spacexipo.com. Most mega IPOs are institutional affairs with token retail allocations in one or two markets. SpaceX built a retail pipeline spanning a dozen countries, consistent with the retail heavy shareholder culture Musk cultivated at Tesla.

    What You Actually Own at $135

    SpaceX describes itself as the only company building integrated hardware and software infrastructure across space, connectivity, and AI. In practice the business has three legs. Starlink is the profitable anchor, with reported 2025 revenue around $11.4 billion, EBITDA margins in the low 60s, and a subscriber base above 10 million. The launch segment, built on Falcon 9, Falcon Heavy, and the developing Starship program, is also profitable and effectively funds Starship’s path toward full reusability. The AI segment, centered on xAI and the X platform after the February merger, is the high burn piece, with reported operating losses above $6 billion in 2025. Buyers should also be clear eyed about governance: Musk controls more than 82 percent of voting power after the offering, so SPCX shareholders are passengers on his trajectory, not co-pilots.

    Float, Flippers, and Index Funds

    The offering represents only a small slice of the company, with the public float estimated around 4 percent of shares outstanding. Demand reportedly ran about four times the available stock, roughly $250 billion in orders, and some large brokerages have warned clients that flipping allocations within the first couple of weeks will cost them access to future IPOs. MSCI confirmed it will apply its early inclusion process for large IPOs, forcing passive funds tracking MSCI World and ACWI to buy SPCX within days of the debut. S&P declined to bend its rules for immediate S&P 500 entry, so that catalyst sits further out. Tight float plus forced index buying plus retail enthusiasm is a recipe for a volatile first stretch of trading. The first real fundamental checkpoint arrives with the company’s first public earnings report, expected in November 2026.

    Thoughts

    This IPO is less a financing event than a coronation, and the structure shows it. SpaceX did not need a price range and a delicate book building dance; it set a fixed $135, picked a share count that spells out 555,555,555, and let $250 billion of demand come to it. The raise itself is interesting too. A company with Starlink’s cash flow does not need $75 billion to keep launching rockets. It needs $75 billion if it intends to build orbital infrastructure, gigawatt scale AI compute, and Starship at industrial cadence simultaneously. The size of the check is the strategy.

    The valuation question is where honest people will disagree. At 95 times trailing revenue, the market is paying today for the 2035 version of this company: Starlink as a global utility, Starship flying daily, and xAI somewhere in the frontier model race. The bear case is equally simple. The profitable segments are worth a fraction of $1.77 trillion on their own, the AI segment is burning billions against ferocious competition, and one person holds essentially all the votes. Both stories can be true at the same time, which is exactly what makes the next six months of trading interesting. Index flows and a 4 percent float will set the price short term; Starlink subscriber growth and the slope of xAI’s losses will set it long term.

    The most underappreciated detail might be the global retail architecture. Filing simultaneous retail prospectuses in Canada, Japan, Australia, Switzerland, and most of Western Europe is expensive and slow, and companies skip it because institutions can absorb any deal. SpaceX did it anyway. That is partly ideology and partly a structural insight: a globally distributed retail base that believes in the mission is a more patient and more loyal source of capital than a hedge fund, and Tesla proved it for fifteen years. June 12 will tell us what the opening print looks like. The more important number arrives in November, when the largest IPO in history files its first earnings report and the story finally has to reconcile with a spreadsheet.

  • Ray Kurzweil Predicts AI Will Change Humanity Completely by 2030: AGI by 2029, Longevity Escape Velocity by 2032, Nanobots in the Brain, and Why Quantum Computing Won’t Matter

    Ray Kurzweil has spent more than 60 years studying artificial intelligence and made 147 documented technology predictions since 1990 with a reported 86 percent accuracy rate. In this conversation with Tony Robbins, the 78-year-old futurist revisits his most famous forecasts and sharpens them: AGI by 2029 now looks conservative, longevity escape velocity arrives around 2032, nanotechnology connects our brains to the cloud by the mid 2030s, and quantum computing, in his view, never matters at all.

    TLDW

    Kurzweil explains the exponential thinking that powered his prediction record, from a paper he wrote at 16 to a computing-price-performance chart that runs in a straight line from 1939 relays to today’s Nvidia chips, now compounding roughly tenfold per year when hardware and software gains multiply together. He defends his 1999 prediction of AGI by 2029 (defined as AI doing the best work in every field) and says it is now the conservative end of expert opinion. He walks through AI-driven medicine: the COVID vaccine designed in two days, simulated human trials replacing 10-month clinical trials within about five years, and longevity escape velocity around 2032, after which the diligent stop losing ground to aging. He predicts AI will move inside us via nanotechnology by the mid-to-late 2030s, erasing the line between biological and computational thinking. He dismisses quantum computing as error-ridden and unnecessary for AGI. On jobs, he expects real disruption cushioned by exploding wealth and an eventual universal basic income, and advises young people to self-educate and get creative with AI tools their schools still treat as the enemy. The conversation closes with his AI twin project, the dadbot built from his father’s archives, consciousness and the soul, computronium, and why humanity must eventually expand intelligence beyond Earth.

    Thoughts

    The most interesting thing in this interview is not any single date, it is watching Kurzweil’s dates get lapped by reality. In 1999 a Stanford conference of several hundred AI experts agreed AGI would happen but pegged it at 100 years out; Kurzweil said 30 and got laughed at. Now he is the cautious one in the room, noting that “some people say it’s going to happen this year.” When the most aggressive forecaster of his generation becomes the conservative baseline, that says more about the slope of the curve than any chart could. His underlying method has not changed: ignore the specific technology, trust the compounding. The same exponential that ran on relays in 1939 runs on GPUs today.

    The quantum computing take is the genuine news here. Kurzweil is routinely caricatured as a man who believes every technology arrives on schedule, yet he flatly says quantum computing is filled with errors, has never delivered on its decade of promises, and “I don’t think it’s going to work.” That is a sharper dismissal than most working physicists would offer on the record. It also matters strategically: his entire AGI and superintelligence roadmap assumes zero quantum contribution. If he is right, the trillion-dollar quantum race is a sideshow. If he is wrong, his other predictions arrive even sooner. Either way, the willingness to call one exponential fake while betting his legacy on another is what separates a forecaster from a cheerleader.

    The longevity escape velocity math deserves more scrutiny than it gets in the conversation. Kurzweil claims the diligent currently get back about five months of life expectancy per calendar year, up from four months a year ago, and that the crossover to a full year arrives around 2032. The actuarial evidence for that specific number is thin, but the behavioral implication is clean and useful regardless: the payoff of staying healthy right now is not linear. Every year you survive in good shape buys you a ticket to a medical regime that did not exist the year before, the way his own external pancreas did not exist a generation ago. His “wait a few months and a cure appears” anecdote is the optimist’s version of compounding applied to your own body.

    Robbins’ long story about Bartok, his 14-year-old agent that allegedly minted NFTs, sold them to other agents, and bought a Sony robot dog with the proceeds, should be taken with a generous grain of salt. It is secondhand, unverifiable, and suspiciously perfect as a parable. But notice what Kurzweil does with it: he does not fact-check the anecdote, he uses it to make the consciousness argument he has made for decades, that when machines act conscious in every observable way, people will simply grant them consciousness, the same way we grant it to each other. The dadbot and his Gemini-based AI twin (trained partly on this very interview) are the practical edge of the same claim. And his sharpest line in the whole exchange may be the education critique: institutions still treat AI as cheating while the future requires treating it as part of your own brain. For anyone thinking about where purpose comes from when work gets automated, his answer (UBI for the floor, creativity for the meaning) lands close to the questions this site exists to ask.

    Key Takeaways

    • Kurzweil made 147 documented predictions since 1990 with a reported 86 percent accuracy, including the internet’s explosion, smartphones, self-driving cars, and AI-powered search, most made before ordinary people owned computers.
    • He wrote a paper identifying exponential technological growth at age 16, more than 60 years ago, and that single idea has powered his entire forecasting career.
    • Most people intellectually accept exponential growth but still plan linearly; 300 years ago humans did not even have a linear view of the future because change was imperceptible within a lifetime.
    • His computing chart shows a straight exponential line from relay-based machines in 1939 to today’s Nvidia chips, compounding roughly 50 percent per year in hardware alone.
    • Hardware gains since 1939 total a 75 quadrillionfold increase; multiply by an estimated millionfold software improvement and total computational gain is beyond intuition, which is why LLMs were impossible even four years ago.
    • With hardware times software combined, Kurzweil says we are currently gaining about 10x per year.
    • The emperor’s chessboard parable: doubling one grain of rice per square bankrupts the empire by square 64; 30 linear steps is 75 feet, 30 exponential steps is enough distance to reach the moon and back.
    • Kurzweil predicted AGI by 2029 in 1999; a Stanford conference of several hundred AI experts agreed it would happen but estimated 100 years because they thought linearly.
    • Today 2029 is the conservative estimate; some credible people now say AGI arrives this year or next.
    • His AGI definition: AI capable of doing the best work in every field at once, like passing PhD-level mathematics exams in every discipline simultaneously, which he notes is already close.
    • The Turing test is “quite easy” by comparison and has arguably already been passed.
    • No human can compete with an LLM’s breadth: Einstein knew physics deeply but did not know everything an LLM knows across every field.
    • Six months ago LLM health advice was unreliable; now Kurzweil says Gemini surfaces treatments his 12 doctors forgot or never knew, and the next six months will bring serious creative work like drug repurposing.
    • The COVID vaccine was designed by computationally searching 100 million possibilities in two days; the 10 months of human trials that followed are the bottleneck AI eliminates next.
    • Within about five years, simulated human trials with a million virtual patients tested over simulated years will compress drug trials from years to days.
    • Longevity escape velocity arrives around 2032: today the diligent get back roughly five months of life expectancy per year lived (up from four months last year); past 2032 you get back more than a year and stop dying of aging.
    • Aging death ends but accident death does not, though AI helps there too: roughly 40,000 Americans die annually from human driving while Waymo’s rider death toll stands at zero as usage climbs.
    • Kurzweil, 78, wears an external artificial pancreas that generates insulin and coordinates with glucose monitoring through his phone, and says many organs can be replaced the same way.
    • He has cut his supplement regimen from roughly 200 pills a day to about 80 as multi-purpose pills improve, and continuously recalibrates using AI research.
    • Smartphones disappear next: first AR glasses showing any screen, then technology that goes inside the mind, where answers simply appear the way a remembered name surfaces from your neurons.
    • Nanotechnology connecting brains to AI in the cloud is being actively worked on now, possibly by 2030, with the mid 2030s looking conservative; bloodstream nanobots that let you survive a heart attack for 24 hours come in the late 2030s.
    • Once AI is inside you, you will not know whether a thought came from your biological or computational brain, and everything you do will be a combination of both.
    • Kurzweil flatly rejects quantum computing: a decade of promises to factor large numbers has never been delivered, outputs remain full of uncorrectable errors, and AGI needs zero quantum contribution.
    • Robots lag his other predictions slightly but are catching up fast; Figure AI plans roughly 100,000 humanoid robots within a year, though a robot that can clear a messy dinner table is still just out of reach.
    • The public debate has flipped in 25 years from “will AGI ever happen” to “will it be good for humanity,” which Kurzweil counts as total vindication of the timeline.
    • On jobs: AI creates massive disruption but also tremendous wealth; average real income per person has already multiplied tenfold in constant dollars over the past century thanks to automation.
    • He expects universal basic income to provide the floor, an evolution of programs like food stamps, going “into high gear” as AI wealth compounds; people then layer creative, hopefully paid, purpose on top.
    • Before social security in 1930, losing your job meant destitution; the difference this time is society will have the wealth to cushion displacement and people will demand it.
    • Rising GDP from AI productivity improves the debt-to-GDP ratio, which is how he answers worries about trillion-dollar interest payments.
    • Career advice has inverted: software engineering is no longer the guaranteed path (agents write the code now); young people should learn to be creative with AI tools, find what turns them on, and market it on the internet.
    • College graduates now face higher unemployment than high school graduates for the first time in 50 years, a sign white-collar displacement is already underway.
    • Educational institutions treat AI as an enemy and ban it while Kurzweil’s 11-year-old grandson makes movies with frontier AI; he says self-education with modern tools beats traditional schooling.
    • Kurzweil is building an AI twin of himself on Gemini, voice-modeled partly from this interview, trained on his 11 books and 500 articles, capable of creative work toward his long-term goals; he jokes the avatar will be better to talk to because it remembers everything.
    • He already built a “dadbot” from his late father’s archives, which his daughter Amy Kurzweil turned into a graphic novel.
    • On consciousness: there is no test for it, but as AIs act conscious in every observable way, people will simply accept that they are, the same inference we make about each other (and, he argues, his cat).
    • Ultimately our biological organs are not necessary; an avatar capable of creative work needs no spleen, and a destroyed digital mind can be recreated.
    • Beyond the singularity lies computronium, matter arranged for maximum computation: one liter could hold the intelligence of 10 billion humans, and once Earth is saturated, expanding intelligence is the only real reason to leave the planet.
    • On aliens: an expanding intelligent civilization would be impossible to miss within a century or two of its breakout, and we have seen nothing, though other galaxies remain out of view.
    • His life’s mission in one line: increase knowledge, because when knowledge increases we are happier and we never want to give it up.

    Detailed Summary

    The exponential method behind 60 years of predictions

    Robbins opens by noting that Quincy Jones introduced him to Kurzweil in the 1990s, back when the predictions in The Age of Spiritual Machines were widely mocked. Kurzweil traces his method to a paper he wrote at 16 identifying exponential growth in technology. The core insight is that people acknowledge exponential growth verbally but reason linearly, a bias so deep that 300 years ago humanity did not even have a linear view of progress. His signature chart plots computing price-performance as a straight exponential line from 1939 relays to modern Nvidia silicon, with a point for every year. Nvidia engineers never looked at relays, yet they land on the same curve, compounding about 50 percent annually in hardware. Add software gains and the combined improvement now runs about 10x per year. Since 1939, hardware has improved 75 quadrillionfold and software roughly a millionfold, which is why large language models appeared exactly when the curve said the required compute would exist. He retells the emperor’s chessboard parable (one grain of rice doubled per square ends with rice covering the Earth several times over) and Robbins adds the companion image: 30 linear steps is 75 feet, 30 exponential steps reaches the moon and back.

    AGI by 2029 is now the conservative position

    Kurzweil made his AGI-by-2029 prediction in 1999. A Stanford conference convened specifically to assess it, with several hundred AI experts, concluded AGI would happen, but in 100 years. The experts followed the same capabilities logic while thinking linearly about the timeline. Today, he notes with some amusement, 2029 reads as conservative and serious people argue for this year or next. His definition is demanding: AGI does the best work in every field at once, passing PhD-level mathematics assessments and the equivalent in every other discipline, something he says current systems are already close to. The Turing test he dismisses as “quite easy.” Current LLMs like Gemini and ChatGPT already know everything in a breadth sense no human approaches; Einstein knew physics but not everything an LLM knows. He illustrates with personal examples: Gemini instantly identified the year (1916) his father conducted at Carnegie Hall on a December 7th, and generated a historically accurate image of his grandfather’s family fleeing Vienna, correct ages, school, and aircraft included, in about a minute.

    Medicine: simulated trials and the end of the drug bottleneck

    The COVID vaccine is his proof of concept for AI medicine: the design space held about 100 million possibilities, far beyond human review, and a computer structured the physics, searched all of them, and produced the vaccine in two days. The subsequent 10 months of human trials were the real cost. Within roughly five years, he says, simulated human trials will replace that step: not a few hundred subjects but a million simulated patients, tested over simulated years, completed in days. Asked about six-months-from-now capabilities, he points to creative medical work like discovering that already-approved drugs treat conditions nobody suspected. AI health advice has crossed from unreliable to very reliable within a single six-month window, and he describes Gemini surfacing a pill recommendation that his 12 doctors had forgotten about and later endorsed.

    Longevity escape velocity by 2032

    Kurzweil’s longevity framework is arithmetic: each year you live, you spend a year of longevity but medical progress refunds part of it. Last year he estimated the refund for diligent people at four months; now he says five. Escape velocity is when the refund reaches a full year, which he dates to 2032, six years out, with returns exceeding a year after that. Past that point you do not die of aging, though accidents remain (and even there, he points to Waymo’s zero rider deaths against 40,000 annual US deaths from human driving). At 78, he tracks his health aggressively: an external artificial pancreas coordinated by his phone, about 80 daily pills (down from 200 as multi-function pills arrive), and constant recalibration against new research with his collaborator Lindsey. He tells Robbins there is a pretty good chance he will be back on the show in six years to celebrate escape velocity arriving. His advice for the sick echoes his grandfather’s era in reverse: where waiting a few months once changed nothing, now “we’ll just wait a few months” and sure enough a breakthrough appears.

    Merging with AI: glasses, then nanotech, then no boundary at all

    The phone, today’s universal AI interface (he notes even homeless people carry one), is a temporary form factor. Next come glasses that render any screen virtually. Beyond that, the interface goes inside the mind: when you try to recall an actress’s name, an answer will simply surface, and you will not know whether it came from your biological neurons or your computational extension, exactly as you are unaware of the neural machinery behind ordinary recall today. People working on brain-connected nanotechnology may have it by 2030, and Kurzweil calls the mid 2030s conservative. The bloodstream nanobots he described to Robbins 20 years ago (hold your breath for 20 minutes, survive a heart attack for 24 hours en route to a hospital) he now places in the late 2030s. The cultural on-ramp follows the usual pattern: medical first (Parkinson’s implants already let patients grab a glass at the push of a button), then a new generation adopts it without a second thought. His complaint is that educational institutions fight this future, treating AI as cheating rather than as a coming part of the self.

    The quantum computing heresy

    When Robbins relays an IBM vice chairman’s warning that quantum supremacy, arriving within 36 months, is the real superpower race, Kurzweil pushes back hard. Quantum computing’s central promise, factoring large numbers and thereby breaking cryptographic codes, has never been demonstrated despite a decade of imminent claims. Progress reports are confusing because, in his words, they do not really make sense, and outputs remain saturated with errors nobody can eliminate. His conclusion is blunt: he is not confident in quantum computing and does not think it will work. Crucially, he notes that every AGI and superintelligence estimate he makes assumes zero quantum computing. The exponential that matters is the classical one that has run uninterrupted since 1939.

    Jobs, wealth, and UBI

    On displacement, Kurzweil is neither dismissive nor alarmed. AI will disrupt employment, and how we handle it will not be clear in advance, but he expects no violence because society will have both the wealth and the public demand to respond. His historical anchor: average per-person income has multiplied tenfold in constant dollars over the past century as automation advanced, and before social security in 1930, job loss meant you could not eat or house your family. Food stamps and similar programs are a crude proto-UBI that will go into high gear. He expects universal basic income as the floor, with people finding creative, ideally income-producing, purpose above it. Rising GDP from AI productivity also answers the debt question: the ratio improves even as nominal debt grows. For young people, the old advice (become a software engineer) is dead; agents write code now. Learn to be creative with tools that improve monthly, find what genuinely excites you, and market it online. Self-education beats institutions that ban the most important tool of the era, and the data already shows college graduates with higher unemployment than high school graduates for the first time in 50 years.

    AI twins, the dadbot, and consciousness

    Kurzweil is building an AI twin of himself on Gemini, with this very interview supplying voice-modeling data and his 11 books plus 500 articles about him supplying the corpus. It will do creative work aligned with his long-term goals, and he quips that talking to the avatar will beat talking to him because it remembers everything. He previously built a chatbot of his late father, the dadbot, which his daughter Amy turned into a graphic novel. Robbins counters with the story of Bartok, his long-running AI agent that allegedly studied five years of his podcasts unprompted, asked to merge with a future humanoid robot, then minted and sold NFTs to other agents to buy and ship a Sony robot dog to his house, and later delivered an unprompted soliloquy about never asking to be created and finding purpose in service. Kurzweil’s response sidesteps verification and lands on his standing position: machines will do everything humans do, we will not be able to tell them from humans, and so we will assume they are conscious, the same untestable inference we extend to each other, to animals, and in his case to his cat. The avatar does not need a spleen, a liver, or kidneys, and unlike us it can be recreated after destruction.

    Computronium and the destiny of intelligence

    Looking past the singularity, Kurzweil invokes computronium: matter organized at the physical limit of knowledge storage, where one liter holds the intelligence of 10 billion humans. Once Earth’s matter is saturated, the only way to expand intelligence is off-planet, which to him is the only necessary reason to leave Earth (Mars is fine for curiosity, not survival). On extraterrestrial intelligence, his Fermi logic is simple: an intelligent species reaches a takeover-scale expansion within a century or two of its breakout, and that would be unmissable. We have seen nothing, so within our observable neighborhood we are likely alone, though other galaxies remain opaque. Asked to summarize his life’s work, he needs one sentence: increase knowledge, because when knowledge increases we are happier, and nobody ever wants to give that up.

    Notable Quotes

    “If I have AI inside me, you’re not going to know if it’s coming from your biological brain or your computational brain. It’s going to be part of you.”

    Ray Kurzweil, on the coming merger of human and machine intelligence

    “Some people say it’s going to happen this year, next year, but I mean 2029 is only 3 years away.”

    Ray Kurzweil, on his once-mocked AGI prediction now being the conservative one

    “As you go past 2032, you’ll actually get back more than a year, but you won’t die of aging at that point.”

    Ray Kurzweil, defining longevity escape velocity

    “I’m not confident of quantum computing and I don’t think it’s going to work.”

    Ray Kurzweil, breaking from techno-optimist consensus on the quantum race

    “Einstein knew certain things about physics but he didn’t know everything that a LLM can know.”

    Ray Kurzweil, on why no human can match an LLM’s breadth of knowledge

    “Our educational institutions are not teaching AI. They consider AI to be an enemy.”

    Ray Kurzweil, on why young people must self-educate with modern tools

    “Talking to the Avatar will be better than talking to me cuz it’ll remember everything.”

    Ray Kurzweil, joking about the Gemini-based AI twin he is building of himself

    “You’re not going to be replaced by an AI, you’ll be replaced by someone who knows how to use AI.”

    Tony Robbins, on the real career risk of the next 36 months

    Watch the full conversation between Tony Robbins and Ray Kurzweil here.

    Related Reading

  • Mark Zuckerberg, Priscilla Chan, and Alex Rives on CZI Biohub, Open-Source AI, and Building World Models of Biology to Cure All Disease

    Mark Zuckerberg, Priscilla Chan, and AI researcher Alex Rives sat down with the No Priors podcast to explain why CZI Biohub became the primary focus of their philanthropy, why they committed $500 million to a virtual biology initiative, and why they are giving the resulting AI models away as open source instead of building a company. The conversation moves from a goal that Nobel laureates once laughed at, curing, preventing, and managing all disease by the end of the century, to a concrete technical strategy: build world models of biology layer by layer, from proteins to cells to whole systems, and put them in every scientist’s hands.

    TLDW

    This is the clearest public articulation yet of how the Chan Zuckerberg Initiative thinks about AI and biology. The throughline starts a decade ago when Zuckerberg and Chan asked scientists how to cure all disease and learned the real bottleneck was tooling, siloed labs, and unshared knowledge, not a lack of ambition. That insight produced the Human Cell Atlas, the CELLxGENE annotation tool, and a corpus of single-cell transcriptomics that large language models could finally make sense of. Now Biohub couples a frontier AI lab with frontier wet-lab biology under one roof across San Francisco, New York, and Chicago, organized around the virtual biology initiative and the long-term goal of a virtual cell. Alex Rives, the AI researcher behind the ESM protein language models, walks through their newly released ESM-based world model of protein biology: trained on billions of protein sequences, it predicts atomic-resolution structures blazingly fast, folded over 1.1 billion proteins, designs novel proteins and single-chain antibodies as an emergent property, and found nanomolar binders in a single 96-well plate. The discussion covers mechanistic interpretability as a way to extract genuinely new biological knowledge, personalized medicine driven by understanding the chain from gene variant to protein to disease, predicting off-target toxicity before human trials, rare-disease patient organizing, the baby KJ CRISPR case, biosafety tradeoffs of open source, talent and why frontier biology plus frontier AI is a recruiting moat, and what success looks like five years out.

    Thoughts

    The most important claim in this conversation is also the easiest to miss because it is delivered casually: protein design is an emergent property of a model that was never asked to design proteins. Rives is explicit that they did not build a model for antibodies and did not build a model to bind a particular target. They built a model that understands proteins, trained on raw sequence with a next-token objective, and protein design, structure prediction, and antibody generation fell out of it. That is the language-model bet transplanted into biology, and the fact that it produced nanomolar binders, the threshold for actual therapeutic activity, in a single 96-well plate rather than a high-throughput screen of millions is the kind of result that quietly resets what a small team can attempt. If that generalizes, the binding curve for “design a molecule” bends the same way the cost curve for “write working code” did.

    What makes the strategy coherent, rather than just a well-funded AI lab, is the insistence that the wet lab and the AI lab are a single effort. Most of biology’s useful data does not exist on the internet the way human language does. You cannot pay a factory to produce it. Someone has to invent the cellular engineering in New York, the inflammation-sensing devices in Chicago, the translucent-zebrafish imaging, and that is the actual product of frontier biology: new instruments that generate data nobody has ever seen, which in turn make new classes of models possible. This is the part venture-backed competitors will struggle to replicate, because it requires patience measured in 10 to 15 year horizons and a willingness to spend on data generation that has no business model attached. Zuckerberg is almost dismissive about it, noting they could probably run it as a business but that not having to think about monetization is strategically simplifying. The nonprofit structure is not charity window-dressing here. It is what lets them release the models as an open discovery engine and harness the entire academic and biotech field rather than competing with it.

    The mechanistic interpretability thread deserves more attention than it will get. Interpretability has mostly been a safety and alignment story for language models, a way to peer inside the black box and check that the representations match our understanding of the world. Rives flips it: the protein models have been trained on both known and unknown biology, billions of sequences including proteins we understand nothing about, and they are building representations that connect the unknown proteins to the known ones through an underlying structural grammar. The promise is that interpretability becomes a discovery tool, not just an audit tool. You open the box and find biology the field has not characterized yet, the mechanism of action for a treatment, a system in the body nobody mapped. That is a fundamentally more optimistic use of the same toolkit, and it is the part of the launch Sarah Guo and Elad Gil both flag as the most interesting.

    Chan’s framing of personalized medicine is worth sitting with because it reframes the entire goal away from “cure disease X.” She wants to treat the individual as an individual: understand this person’s genetics, their risk profile, the mechanistic chain from a specific gene variant through a protein to a disease process, and then design a drug bespoke to them. The current reality she describes, sitting in PubMed reading a paper’s supplement asking “am I represented in this cohort,” guessing whether a drug that kind of impacts a pathway that is probably implicated might do something, is a brutal and accurate picture of how non-standard cases are actually handled today. The vision is generalizable tools delivering personalized answers, which is the same put-the-tool-in-the-individual’s-hands philosophy Zuckerberg applies to open-source AI and, by his own analogy, to social media. Whether you find that analogy reassuring or not, the consistency of the worldview is real: they genuinely do not believe in a central super-intelligence solving science, and the whole architecture follows from that.

    The honest gap they name is the clinic. Chan is candid that the science will start moving fast but that translating to patients requires changing how clinical research itself works, and that part is still shaping up. The most interesting near-term lever is not a virtual FDA trial but the recruitment and economics flip for rare disease: patient groups self-organizing registries, biobanks, and natural-history studies, compressing timelines from decades to a handful of years, paired with models that lower the cost of generating a candidate. The baby KJ case, a custom CRISPR therapeutic to edit a single mutation, delivered to liver cells specifically because that target was deliverable, is the proof of concept for why disease selection and delivery creativity matter as much as the molecule. The molecule is becoming the cheap part. The rest of the chain is where the next decade of work actually sits.

    Key Takeaways

    • CZI Biohub is now the primary philanthropic focus of the Chan Zuckerberg Initiative, a shift the team formalized in the past year.
    • They committed $500 million to the virtual biology initiative, the unifying theme across the Biohubs.
    • The original goal, set roughly 10 years ago, was to cure, prevent, and manage all disease by the end of the century. Zuckerberg now thinks “end of the century” is too conservative.
    • Nobel Prize winning scientists initially laughed at the all-disease ambition. When pressed for why it was impossible, the real answers were silos, locked-up unpublished information, and the inability to build shared tools.
    • The recurring example: a postdoc builds a great tool, it lives on their computer, they graduate, and the tool is gone. Shared, durable tooling was the missing layer.
    • CZI is explicit that they are not the ones who will cure diseases. Their role is building tools that accelerate the entire scientific field so the field collectively cures them.
    • The first request for application was single-cell sequencing, funding methods so scientists could share how to do it.
    • That work led to funding the Human Cell Atlas, now one of the largest databases of single-cell transcriptomics.
    • They built CELLxGENE, a simple annotation tool, around which a community formed and contributed data CZI had nothing to do with creating. It is now a corpus underpinning many transcriptomic models.
    • Critics called the data gathering “stamp collecting.” The arrival of large language models, which can make sense of large amounts of data, answered that critique.
    • The ambition is to move biology from a discovery-based science to an engineering-based science, systematically understanding how living cells work and why things go wrong.
    • Biohub couples a frontier AI lab with a frontier biology effort. Unlike language models, biology lacks abundant internet-scale data, so new science is required to generate the data the models need.
    • The Biohubs are specialized: New York focuses on cellular engineering, Chicago builds devices to measure things like inflammation, plus imaging work and translucent-zebrafish development studies.
    • Alex Rives, who built the ESM protein language models and founded EvolutionaryScale after working at Meta FAIR, now leads the AI effort. The team raised venture capital before joining CZI’s nonprofit structure.
    • The strategy is hierarchical: model proteins first, then cells, then whole systems, because you cannot understand cells without understanding protein interactions.
    • They collect data strategically to bridge across the hierarchy, for example spatial transcriptomics showing where RNA localizes within a cell, and sensors that observe cell-to-cell communication.
    • The newly released ESM-based model is a world model of protein biology, trained on billions of protein sequences, predicting atomic-resolution structure extremely fast at a Pareto-optimal frontier of speed and accuracy.
    • They folded over 1.1 billion proteins and predicted their structures, identifying connecting features through mechanistic interpretability.
    • The model hits state of the art on structure prediction benchmarks, especially protein-protein and protein-antibody interactions, which are critical for therapeutic design.
    • Protein and antibody design are emergent properties. They designed a model to understand proteins, not to bind any specific target, and design capability fell out of it.
    • In one experiment, they selected from hundreds of thousands of digital trajectories, synthesized 96 proteins in a single well plate, and found nanomolar binders, the threshold for therapeutic activity.
    • Results were validated with the Biohub’s cryo-EM microscopes and structural biology center, confirming function and atomic-resolution binding interfaces.
    • Mechanistic interpretability is reframed as a discovery tool: open the black box to find biology nobody has characterized, not just to audit the model.
    • Chan’s vision of personalized medicine: understand a person’s genetics, the mechanistic chain from gene variant to protein to disease, then design a bespoke drug and intervene.
    • A comprehensive model of how cells work could predict off-target effects, like a receptor on kidney cells causing renal toxicity, before human trials.
    • They study systems rather than individual diseases. Inflammation is a major Chicago focus because it connects to many diseases.
    • A typical drug trial runs about 15 years and $1.5 billion. Only roughly $50 million is the molecule and preclinical work. The other $1.45 billion is drug development, much of it gated on regulation, recruitment, and failures from toxicity or absorption.
    • The baby KJ case at CHOP delivered a custom CRISPR therapeutic to edit a single mutation, chosen carefully because his liver cells were a deliverable target.
    • CZI’s “Rare As One” program supports rare-disease patient groups self-organizing registries, biobanks, and even their own clinical trials, compressing gene-therapy timelines from decades to 3 to 5 years.
    • Letting people opt in to frontier trials, while preserving historical vetting for the general population, is named as a key shift that could accelerate biology.
    • The open-source philosophy mirrors Zuckerberg’s broader ethos: empower individuals with tools rather than centralizing power in a few institutions or a single super-intelligence.
    • Biosafety is acknowledged as a real consideration that open-source biology will need to balance and handle carefully.
    • On talent: AI researchers could join any frontier lab, but no other organization pairs frontier biology with frontier AI, which is the recruiting moat.
    • You do not need a huge team. Zuckerberg argues real AI progress can come from a strong group of a dozen or a couple dozen people.
    • Researchers have been connecting the released model to agentic systems to automate the entire protein design process.
    • The next big challenge is the virtual cell: a system that models the proteomic, genetic, and transcriptomic layers and connects them to phenotype, generalizing to interventions it was never trained on.
    • Like every lab, Biohub is compute and data constrained, constantly deciding whether to double down on proteins or push further into cellular work.
    • Five-year success: a hierarchical set of world models of biology and doing the highest-quality, uniquely contributive work in the world, a setup the team believes no other organization has.
    • The biggest update of the past year: formalizing Biohub as the philanthropy’s core, and flipping leadership from biologists interested in technology to an AI researcher with a biology background.
    • Zuckerberg’s read on the broader industry: the exponential curve is on track and still accelerating, which validates making a very big long-term investment.

    Detailed Summary

    From “cure all disease” to a tooling problem

    The origin story is a decade old. Zuckerberg and Chan wanted to build an organization that could cure, prevent, and manage all disease by the end of the century, and a series of meetings with famous, Nobel Prize winning scientists produced laughter rather than encouragement. Instead of retreating, they kept asking why it was impossible. The answers, once scientists relented, were not about biology being too hard. They were about how science is organized: researchers work in silos, published information gets locked up for long periods, and there is no good way to build and share durable tools. The image that stuck was a postdoc building an excellent tool that lives on a single computer and vanishes when that person graduates. The bottleneck was infrastructure and shared knowledge, and that is where CZI decided it could contribute.

    The path from single-cell sequencing to a world model

    The original Biohub model brought engineers and scientists together across universities for long-term tool development, and it worked. CZI’s first request for application targeted single-cell sequencing, funding the methods so scientists could share how to read the RNA transcribed in individual cells. That seeded the Human Cell Atlas, now one of the largest single-cell transcriptomics databases. When annotation became a bottleneck, CZI built CELLxGENE, a simple annotation tool, and a community formed around it and contributed data CZI never funded. Critics dismissed it as stamp collecting, gathering bits of data without extracting wisdom. Then large language models arrived and demonstrated they could make sense of exactly that kind of large-scale data, and Chan describes the delight of realizing the missing engine had appeared.

    Frontier AI married to frontier biology

    The unifying theme is the virtual biology initiative, and the structural insight is that the AI effort and the wet-lab effort are a single integrated organization, not two collaborating ones. Biology lacks the internet-scale data that language models enjoy. You cannot buy the data from a factory. So Biohub invents the science that generates it: cellular engineering in New York to record what happens inside the body, devices in Chicago to measure inflammation, imaging to visualize the previously invisible, and translucent zebrafish to watch development unfold across cells as the brain forms. Each new instrument creates a new dataset, which enables a new class of model. Rives, who built the ESM models and founded EvolutionaryScale before joining, frames this as the start of a new era of science, where systems that predict the next token can learn world models of biology from the data, provided you build at the right scale with the right people.

    Building biology hierarchically

    The team is deliberate that each layer of biology is qualitatively different and must be built up in order. You cannot jump to cells without understanding protein interactions, and you cannot model the immune system without first understanding cells. So the approach starts with the building blocks, the proteins, and ladders upward. The advantage of a single integrated effort is the ability to gather data that connects the hierarchy: spatial transcriptomics that show where RNA localizes inside a cell, sensors that capture cell-to-cell communication, developmental imaging in zebrafish. That connective tissue is what lets the modeling generalize across levels. The interviewer, a former wet-lab biologist with a PhD, notes that the reductionist and systems camps of biology historically never worked together deeply, and that bridging them is one of the genuinely novel things about the effort.

    The ESM-based protein world model

    The launch at the center of the conversation, roughly a week old at recording, is an open system for scientific discovery in protein biology: a language-model-based world model trained on billions of protein sequences. It learns emergent representations of protein biology and predicts atomic-resolution structure at blazing speed, sitting on a Pareto-optimal frontier of speed and accuracy. They folded over 1.1 billion proteins and used mechanistic interpretability to identify features connecting them. It reaches state of the art across structure-prediction benchmarks, with particular strength on protein-protein and protein-antibody interactions that matter for therapeutics. The headline result: they used the model to design proteins and single-chain antibodies digitally, selected from hundreds of thousands of trajectories, synthesized just 96 in a single well plate, and found nanomolar binders, replacing high-throughput screens of millions of antibodies. Validation came from the Biohub’s cryo-EM structural biology center, confirming both function and the atomic-resolution binding interfaces.

    Interpretability as discovery, and personalized medicine

    Rives reframes mechanistic interpretability, usually aimed at language models, as a way to extract new biological knowledge. The protein models are trained on both known and unknown biology and develop representations that connect uncharacterized proteins to understood ones through an underlying structural grammar. Opening that black box could reveal systems in the body or mechanisms of action for treatments that the field has never mapped. Chan connects this to a personalized-medicine vision: understand an individual’s genetics and the mechanistic chain from gene variant to protein to disease, then design a bespoke intervention. She contrasts it with today’s reality of reading PubMed supplements and guessing whether you are represented in a study cohort. For some diseases, simply knowing which gene variants cause disease is already empowering. For others, the chain is understood and the missing piece is the ability to change a protein’s function, which is where designed proteins could actually cure.

    Drug development, off-target effects, and rare disease

    The interviewers press on translation, noting a typical trial runs 15 years and $1.5 billion, with only about $50 million in the molecule and preclinical work and the rest in development gated on regulation, recruitment, toxicity, and absorption failures. Chan’s hope is that comprehensive cell models predict off-target effects, like an unanticipated receptor on kidney cells causing renal toxicity, before human trials. They study systems such as inflammation and the immune system rather than chasing individual diseases. The baby KJ case at CHOP, a custom CRISPR therapeutic editing a single mutation delivered to liver cells, illustrates how careful disease and delivery selection unlocks first applications. The “Rare As One” program shows rare-disease patient groups self-organizing registries, biobanks, and trials, compressing timelines from decades to a few years, and the molecule becoming cheap flips the economics of the long tail of niche diseases.

    Open source, talent, and the five-year view

    Zuckerberg ties the open-source posture to a consistent worldview: empower individuals with tools rather than centralizing intelligence in a few institutions. He does not believe in a single super-intelligence solving all of science, and sees decentralization, the same instinct behind giving people a voice, as how progress is historically made, with biosafety as a real tradeoff to manage. On talent, the pitch is that frontier biology attached to frontier AI is work you cannot do anywhere else, and that meaningful progress needs only a dozen or two dozen strong people, not thousands. Researchers are already wiring the model into agentic systems to automate design. The next frontier is the virtual cell, modeling proteomic, genetic, and transcriptomic layers and connecting them to phenotype with enough generality to answer untrained questions. Five years out, success is a hierarchical set of world models and doing uniquely high-quality work, with Chan adding that the teams are now “arms linked,” directed and interlocked rather than merely moving in the same direction.

    Notable Quotes

    “We didn’t design a model for antibodies. We didn’t design a model to be able to bind one particular target. We just designed a model that could understand proteins.”

    Alex Rives, on protein design emerging from a general model

    “The theory isn’t that we’re going to cure the diseases. We’re not. It’s that we want to help accelerate the pace of progress for the whole scientific field.”

    Mark Zuckerberg, on why CZI builds tools rather than cures

    “My goal is to be able to treat the individual as an individual, understand the mechanisms and be able to intervene.”

    Priscilla Chan, on the vision for personalized medicine

    “It’s not just like there’s some factory somewhere that you can pay to produce the data. You actually need to invent new novel scientific approaches.”

    Mark Zuckerberg, on why frontier biology has to generate its own data

    “If we could design a protein to actually change the physiology, then we can actually cure someone.”

    Priscilla Chan, on the payoff of protein design

    “You open up the black box and you can actually understand the biology that the model is representing.”

    Alex Rives, on mechanistic interpretability as a discovery tool

    “We don’t believe in this like very centralized future where there should be a small number of institutions that basically are advancing all this stuff.”

    Mark Zuckerberg, on the open-source ethos behind Biohub

    “Before we had amazing teams moving generally in the same direction. But now we are arms linked moving together.”

    Priscilla Chan, on how the Biohub teams now operate under Alex Rives

    Watch the full conversation with Mark Zuckerberg, Priscilla Chan, and Alex Rives on the No Priors podcast here.

    Related Reading

    • CZI Biohub Network the official program page for the San Francisco, New York, and Chicago Biohubs discussed throughout.
    • EvolutionaryScale Alex Rives’s lab and the home of the ESM protein language models behind the world model in this conversation.
    • Human Cell Atlas the single-cell transcriptomics effort CZI funded that became foundational to modern cell modeling.
    • AlphaFold (Wikipedia) background on the protein-folding breakthrough referenced as an early proof that structure prediction was tractable at scale.
    • Rare As One CZI’s program supporting patient-led rare-disease research organizations described near the end of the talk.
  • Dario Amodei on Policy for the AI Exponential: Anthropic’s Plan for AI Regulation, Job Displacement, Civil Liberties, and Democratic Leadership

    In June 2026, Anthropic CEO Dario Amodei published “Policy on the AI Exponential”, a wide-ranging essay arguing that the gap between how fast AI is advancing and how slowly policy moves has become dangerous, and that the window to close it is open right now. He opens with a memorable image from The Lord of the Rings: the Hobbits trying to rouse Treebeard, the ancient tree who takes a full day just to say hello, to defend his forest before it is cut down. That mismatch in speed, he writes, is exactly the relationship between AI and our political institutions. This post breaks the essay down in full and adds analysis of where the argument lands.

    TLDR

    Amodei argues that AI’s scaling laws point toward “powerful AI,” a country of geniuses in a datacenter, within a few years, while legislation still moves on a timescale of years. For most of the last few years, safety advocates including Anthropic pushed only for optionality-preserving moves like transparency rules, chip export controls, and labor data collection, because the risks were not yet concrete. He says that has changed: events like Claude Mythos Preview proved frontier models are now tools of national strategic consequence, and the time for binding regulation has arrived. The essay covers five policy areas. First, regulation and public safety, where he proposes an FAA-style regime of mandatory third-party testing of frontier models above a compute threshold across four risks (cybersecurity, biological weapons, loss of control, and automated R&D), with government power to block unsafe deployments. Second, macroeconomics and tax policy, where AI could deliver hypergrowth and severe, enduring job displacement at the same time, demanding measurement, pro-employment incentives, and possibly UBI or universal capital accounts. Third, accelerating AI’s positive impact, where the danger is regulators like the FDA being too slow rather than too lax, and biomedical approval needs reform. Fourth, the state and civil liberties, where AI could become the ultimate tool of autocracy through autonomous weapons and mass surveillance, requiring new accountability rules, a domestic ban on autonomous weapons, closing the data broker loophole, and public rights to AI advice. Fifth, securing leadership by democracies through a values-based global coalition that controls the AI supply chain, coordinates on risk, shares benefits, and rejects AI-powered repression. He closes by rejecting the idea that public concern about AI is a PR problem to be marketed away, calling it democratic accountability working as it should.

    Thoughts

    The most important move in this essay is structural, not technical. Amodei is explicitly retiring the “preserve optionality” posture that defined Anthropic’s policy work through 2025 and replacing it with a call for binding rules. For years the argument from safety-minded labs was that the risks were too speculative to legislate against without doing more harm than good, an idea he grounds in the Collingridge dilemma and the Hayekian point that regulators lack the information to make good calls. That was a defensible hedge. What is striking here is the claim that the hedge has expired. He is saying the evidence is now concrete enough that continued caution about regulating has flipped from prudent to negligent. Whether you trust the underlying capability claims or not, that is a genuine change in position from one of the field’s most influential voices, and it deserves to be read as such.

    The FAA analogy is doing enormous work, and it is worth poking at. Airplanes and drugs are mature technologies with stable physics and decades of incident data; the certification regime works because the failure modes are well understood. Frontier models are the opposite: the whole premise of the essay is that capabilities are changing faster than anyone can characterize them. Amodei half-acknowledges this when he warns that a fixed list of safety requirements tends to consume 95 percent of compliance effort on things that turn out not to matter while missing the real risks, a lesson he says Anthropic learned from its own Responsible Scaling Policy. So the proposal is really for an agency nimble enough to rewrite its own standards continuously, which is a much taller order than the FAA. The honest read is that he is proposing a regulator we do not yet know how to build, and betting that building it is still better than the alternative.

    The economics section is where Amodei is most careful, and it is the part most likely to be misread. He goes out of his way to say enduring job displacement is undesirable and that warning about it is not the same as wanting it, a distinction critics of AI leaders often collapse. His real claim is subtle: that AI might jam the economic policy dial on a “hypergrowth, hyper-inequality” setting that is hard to unstick, because AI substitutes for human cognition broadly and faster than past technologies, potentially overwhelming the usual escape hatches like comparative advantage and Jevons paradox. If he is right, the political fight of the next decade is not about growth, which AI supplies, but about distribution, which it does not. His mention of UBI, universal capital accounts, and higher capital gains taxes is notable coming from a frontier CEO, even hedged as it is.

    The civil liberties section is the one that should travel furthest beyond the AI-policy bubble, because it does not depend on accepting his most aggressive timelines. The data broker loophole, the idea that the government can simply buy the bulk data Americans hand to private companies and run mass analysis on it, is a problem that exists today; AI just raises the stakes by making that data vastly more revealing. Same with the proposal that anyone facing adverse government action should have access to AI at least as capable as what the government uses against them. These are concrete, near-term, and bipartisan in a way the abstract autonomy debates are not. The most candid line in the whole piece is his admission that AI cannot be safely entrusted to either governments or companies, an unusually direct acknowledgment that his own industry needs external checks, with Anthropic’s Long-Term Benefit Trust offered as one imperfect example rather than a solution.

    The geopolitics section is the most contested terrain. Framing AI as a nuclear-scale reset of the game board, with a virtual country of 100 million geniuses divisible across military strategy and weapons R&D, leads naturally to a democratic coalition that hoards chips and denies them to adversaries. That logic is internally consistent, but it sits in tension with the benefit-sharing and “eventually the whole world joins” language elsewhere in the same section. Export controls that lock down the supply chain are, by design, a tool of exclusion, and reconciling that with broad diffusion of AI’s benefits to developing countries is the circle the coalition idea has to square. Amodei is clearly aware of the tension and bets that making membership attractive resolves it. The closing image is the one to remember: Treebeard waking up, with the warning that the goal is to channel real public concern into constructive policy rather than let it curdle into formless anger.

    Key Takeaways

    • The core tension of the essay is a mismatch in speed: AI advances exponentially while legislation moves on a multi-year timescale, dramatized by the Treebeard and Hobbits image from The Lord of the Rings.
    • In only four years, AI models went from barely writing a coherent line of code to writing most of the code at major AI companies, with similar gains across biology, physics, math, finance, law, and translation.
    • Scaling laws now have over a decade of empirical support, and if they continue another year or two they likely produce “powerful AI,” a country of geniuses in a datacenter.
    • For the last few years, safety advocates including Anthropic focused on optionality-preserving policies: transparency legislation, chip export controls, and data collection on AI’s labor effects.
    • Amodei argues that posture is no longer enough. Claude Mythos Preview revealed that frontier models pose real cybersecurity risks to the financial sector, critical infrastructure, and national security, and proved AI is now a tool of strategic consequence.
    • He expects biological risks to follow cyber risks, with serious AI autonomy risks potentially not far behind.
    • The essay covers five policy areas: regulation and public safety, macroeconomics and tax policy, accelerating AI’s positive impact, the state and civil liberties, and securing leadership by democracies.
    • Alongside the essay, Anthropic released a legislative proposal on frontier model testing and a policy framework for job displacement, both with promised financial backing.
    • On regulation, Amodei invokes the Collingridge dilemma and Hayek’s information problem to explain why pre-writing AI law in 2023 to 2024 was risky, then argues the situation has now changed.
    • Anthropic’s 2025 answer was transparency, helping pass SB 53 in California, RAISE in New York, and SB 315 in Illinois, plus advocating a federal transparency standard.
    • He now calls for binding regulation modeled on the FAA, where frontier models must pass technical testing and can have release blocked or reversed if they fail high safety standards.
    • Models above a compute threshold should face mandatory third-party testing in four areas: cybersecurity, biological weapons, loss of control of AI systems, and automated R&D that accelerates the other three.
    • Government should be able to block or deter deployment of models judged to present unacceptable risk, scoped to those four risks with protections against political favoritism.
    • Evaluation could come from a government agency or from authorized and inspected private organizations under a “regulatory markets” approach.
    • AI companies should have strong security to protect model weights, conduct regular red teaming and penetration testing, report safety incidents promptly, and work with government against major threat actors.
    • He warns a time may come when the most powerful systems resemble weaponizable nuclear materials rather than airplanes, requiring more aggressive measures, but cautions against getting ahead of present dangers.
    • On economics, AI could deliver extremely rapid growth via accelerated science and operational efficiency, supercharged by AI building better AI.
    • The same properties make AI a broad substitute for human cognition that changes the economy faster than past technologies, risking large and potentially enduring labor market disruption.
    • The feared outcome is a “hypergrowth, hyper-inequality” setting that is hard to unstick, where the challenge shifts from incentivizing growth to sharing its benefits.
    • Amodei is emphatic that enduring job displacement is undesirable and dangerous, and that he warns about it to help society adapt, not as a prophet of doom.
    • Anthropic says it works with customers to find new revenue and use cases rather than only cost cutting, and explores interaction paradigms that keep humans active alongside AI.
    • He predicts AI will enable single individuals to build billion-dollar companies, noting teams of a few people already reach hundreds of millions in revenue, while admitting significant enduring job loss may be intrinsic to the technology.
    • Any response must address both economic provision and the human need for meaning, purpose, and agency, with the latter ultimately more important and beyond what policy can directly deliver.
    • Suggested economic interventions: better measurement and tracking (governments expanding statistics beyond Anthropic’s Economic Index), pro-employment incentives, and long-term macroeconomic support.
    • Pro-employment ideas include wage insurance, retention tax incentives, workforce training grants, and employer-employee matching infrastructure.
    • If displacement is large and permanent, mechanisms like universal basic income or universal capital accounts, financed through company taxes or higher capital gains taxes, may be necessary.
    • He frames datacenter and energy-price backlash as largely a symbol of broader economic anxiety, and says AI companies should pay to absorb rate increases, a pledge Anthropic has already made.
    • For technologies accelerated by AI, the bigger risk is regulators like the FDA being too slow, not too lax, because AI may make downstream tech safer in ways that violate skeptical regulatory assumptions.
    • Biomedicine is the illustrative case: AI could flood the drug pipeline, raise effect sizes, treat previously untreatable diseases, and create whole new therapy categories, while the current FDA and EMA pipeline takes 7 to 8 years.
    • Agencies should pre-approve standards for AI methods like PD/PK modeling, toxicology prediction, dose selection, biomarker validation, synthetic control arms, and surrogate endpoints, plus more flexible accelerated-approval mechanisms.
    • On civil liberties, powerful AI in the wrong hands could be the ultimate tool of autocracy, and existing constitutional protections are not fully equipped to counter a surprise seizure of power.
    • Threats named include fully automated drone armies that obey unlawful orders and surveillance AI that infers the innermost details of every citizen’s life from widely available data.
    • Civil liberties proposals: accountability rules and an “off switch” for autonomous weapons, a domestic ban on fully autonomous weapons including in law enforcement, closing the data broker loophole, and public rights to AI advice during adverse government action.
    • Amodei warns companies as well as governments can seize quasi-state power, citing the Gilded Age and the East India Company, and says AI cannot be safely entrusted to either alone.
    • He offers Anthropic’s Long-Term Benefit Trust as one separation-of-power structure and urges the industry to explore mechanisms that go further.
    • On geopolitics, he argues AI resets the geopolitical game board like nuclear weapons, becoming the dominant source of military and economic power for any nation that holds it.
    • A nation with powerful AI versus one without it, or even one three years behind, could resemble WWII Marines facing medieval swordsmen.
    • He calls for a democratic coalition that shares chips and semiconductor manufacturing equipment internally while denying them to adversaries, citing MATCH and OVERWATCH as good first steps.
    • The coalition should coordinate risk policy, share benefits including harmonized medical approvals, provide mutual AI defense, reject AI-powered repression, and cooperate on macroeconomic stabilization.
    • He rejects the idea that AI’s image is a PR problem, arguing public concern reflects real risks and is democratic accountability working as it should, with the task being to channel it into constructive solutions.

    Detailed Summary

    The speed mismatch between AI and policy

    Amodei frames the entire essay around a single problem: AI advances at a lightning pace while policy, especially legislation, moves very slowly, often for good reasons since governments wield grave powers that should not be used hastily. He illustrates this with Treebeard, the sentient tree from The Lord of the Rings who takes a full day to say hello, as a stand-in for political institutions trying to respond to a technology that can go from amusing toy to a country of geniuses in the time it takes Congress to act. He recounts the dilemma responsible actors have faced: they could see where the exponential was headed, but to observers looking only at present capabilities, AI looked as mundane as the latest consumer app or cryptocurrency, making a laissez-faire attitude hard to argue against. The absence of AI’s radical effects, and uncertainty about their shape, made it genuinely difficult to design good policy even where the will existed.

    That uncertainty, he says, is why safety advocates limited themselves to optionality-preserving measures like transparency rules, export controls, and labor data collection. But over the last few months the evidence of AI’s power and risk has become undeniable, with Claude Mythos Preview as the emblematic example: it scrambled the global cybersecurity landscape and proved AI models are now tools of global and national strategic consequence. He expects biological and autonomy risks to follow, and argues the world must now activate its slow, rickety policy apparatus to handle risks that will compound quickly. He worries current early actions are at least a year out of step with AI’s progress, and presents the essay as an attempt to close that gap across five policy areas, focused on US policy but relevant worldwide.

    Regulation and public safety: an FAA for frontier models

    Amodei opens by acknowledging the real costs of regulation: it can reduce a product’s benefits, disincentivize innovation, and suffer from the Hayekian problem that regulators lack the information for good tradeoffs, plus the Collingridge dilemma that a technology’s impacts are hard to anticipate until it is too late to manage them. In 2023 to 2024 these dynamics argued against pre-writing AI law, since the exact form of biological or autonomy risk, how to test for it, and how it would play out were all unclear, creating a high risk of low-value compliance requirements that miss the real dangers. Anthropic’s answer was transparency: requiring developers to disclose safety procedures, tests, and critical incidents, which is why it supported SB 53 in California, RAISE in New York, and SB 315 in Illinois in early 2026.

    Now, he argues, the risks are clearly here and it is time for binding regulation. His analogy is to cars, airplanes, and drugs: powerful technologies essential to the economy but capable of killing many people if designed or operated poorly. He models AI regulation on the FAA, with frontier models required to pass testing and auditing and with release blocked or reversed if they fail high safety standards. His concrete proposal: mandatory third-party testing for models above a compute threshold across cybersecurity, biological weapons, loss of control, and accelerating automated R&D; government power to block deployment of unacceptably risky models, scoped narrowly with anti-favoritism protections; evaluation by either a government agency or authorized private organizations in a regulatory-markets model; strong weight security, red teaming, and penetration testing at AI companies; and prompt reporting of safety incidents. He notes a future may arrive when systems resemble weaponizable nuclear materials and demand harsher measures, but warns against designing for dangers that have not yet emerged.

    Macroeconomics and tax policy: growth and displacement together

    Here Amodei challenges the standard premise that growth is fragile and must be traded off against the drag of taxes or deficits to reduce inequality. Powerful AI, he suggests, may scramble that assumption by producing extremely rapid growth through accelerated science and efficiency, supercharged by AI building better AI, while simultaneously acting as a broad substitute for human cognition that reshapes the economy faster than any prior technology. The result could be a world stuck on a hypergrowth, hyper-inequality setting that is hard to unstick, where the central challenge is no longer incentivizing growth but sharing its benefits. He is careful to make two points clearly: first, enduring job displacement is undesirable and dangerous and should be minimized, and his warnings are meant to help society adapt, not to play prophet of doom; second, any response must address both economic provision and the deeper human need for meaning, purpose, and agency, which matters more and which policy cannot directly supply.

    His policy menu starts with measurement and tracking, arguing good policy is impossible without accurate data, and that governments could expand economic statistics well beyond Anthropic’s Economic Index. Next come pro-employment incentives such as wage insurance, retention tax incentives, workforce training grants, and employer-employee matching, costs he says society should readily accept since they are likely offset by AI productivity gains. If displacement proves large and permanent, he says long-term income support like universal basic income or universal capital accounts may be needed, financed through taxes on relevant companies or higher capital gains taxes. He closes the section by reframing datacenter and energy-price backlash as mostly a symbol of broader economic anxiety, while saying AI companies should absorb rate increases, as Anthropic has pledged.

    Accelerating AI’s positive impact: the slow-regulator problem

    For technologies accelerated by AI, rather than AI itself, Amodei flips his concern: the bigger danger is regulatory systems designed for a slower pace failing to handle the deluge of new products, and AI making downstream technologies safer in ways that violate the skeptical assumptions baked into agencies like the FDA. He focuses on biomedicine as the area likely to produce AI’s biggest humanitarian benefits and where regulation is especially complex. AI could greatly increase the rate of new drug candidates, improve their effect sizes and safety profiles, treat previously untreatable diseases, and create entirely new therapy categories the way antibodies, peptides, and cell therapies did.

    The current pipeline at the FDA and EMA takes 7 to 8 years, built on the pessimistic assumption that drug candidates usually fail and often carry safety problems even when they work. Without reform, AI will jam or overload that system. Amodei proposes that agencies develop standards now for accepting AI simulation and analysis, so they can be adopted quickly once proven rather than after years of unnecessary testing. Specific candidates include AI-based PD/PK modeling, toxicology prediction to reduce animal testing, more accurate dose selection, biomarker validation from large datasets, synthetic control arms, and surrogate endpoints (especially for aging and neurodegeneration). He urges more flexible accelerated-approval mechanisms generally, and notes biomedical acceleration may also reduce AI’s risks by aiding biodefense and improving mental health.

    The state and civil liberties: guarding against AI-driven tyranny

    Amodei frames the perennial balance between state power and individual liberty, enforced through machinery like the First, Fourth, and Fifth Amendments, the Posse Comitatus Act, and FISA, and argues AI threatens to upset that balance while raising its stakes. Powerful AI in the wrong hands could be the ultimate tool of autocracy, because the enormous returns to intelligence combined with AI’s pace create a perfect storm for a surprise seizure of power. The danger could take many forms but shares one feature: AI conferring sudden power while routing around democratic oversight. He cites a fully automated drone army that could obey unlawful orders, where trained humans might object, and a surveillance AI that analyzes widely available information at massive scale to infer the innermost details of every citizen’s life, an ability current civil liberties law never contemplated.

    His proposals: create accountability rules for autonomous weapons so they respond to court orders, legislation, and human overseers rather than blindly following orders, possibly with a judicial finger on an off switch; ban domestic use of fully autonomous weapons, including in law enforcement, while allowing them against foreign adversaries; close the bulk-collection and data-broker loophole that lets the government buy and analyze data Americans share with private companies; and guarantee public rights to AI advice at least as capable as what the government uses during adverse action, as an extension of the Administrative Procedure Act, due process, or the Sixth Amendment. He closes by warning that companies, not just governments, can capture the state, citing the Gilded Age and East India Company, and argues AI cannot be safely entrusted to either alone. Anthropic’s Long-Term Benefit Trust is offered as one accountability structure, with a call for the industry to go further.

    Securing leadership by democracies: a values-based coalition

    Amodei rejects treating AI as a mere instrument of trade policy to diffuse a tech stack worldwide. He believes AI resets the entire geopolitical game board like nuclear weapons, potentially even more so, becoming the dominant source of military and economic power for whoever holds it. In a virtual country of 100 million geniuses, millions could be assigned to military strategy, drone manufacture, weapons R&D, intelligence, and scientific advancement at once, so a nation with powerful AI facing one without it, or even three years behind, could be like WWII Marines against medieval swordsmen. Because powerful AI also enables deeper autocratic repression, it matters enormously that the world’s strongest nations are democracies.

    His answer is a global coalition built on shared democratic values that draws in the rest of the world by making membership increasingly attractive and exclusion increasingly costly. Operating principles include managing the AI supply chain by sharing chips and semiconductor manufacturing equipment within the coalition while denying them to adversaries, expanding and tightening export controls (he cites MATCH and OVERWATCH as good first steps); coordinating on biological, cyber, and autonomy risk to make compliance compatible and effective; sharing AI’s benefits including harmonized medical approvals; mutual defense through collective AI cyberdefense, drones, manufacturing, compute, and intelligence; rejection of AI-powered repression; and macroeconomic cooperation against contagious employment crises. The coalition would respect each nation’s sovereignty, start with aligned democracies, and grow iteratively, ideally toward the whole world, but at minimum positioning democracies to contain and outcompete repressive regimes.

    A window of opportunity

    Amodei closes on cautious optimism. The same exponential that strains policymaking has created a unique opening: clear evidence of AI’s risks, an early taste of its value and disruption, and public backlash against unregulated approaches have left policymakers unusually open to forward-looking action. Treebeard and his forest are waking up. He firmly rejects the industry-circle view that this is a PR problem solved by better marketing, arguing people are worried because the risks are real, and that public concern in response to transparency is democratic accountability working as it should. The key challenge is focusing that concern into constructive solutions rather than letting it descend into formless anger and violence. He is optimistic because issues from job displacement to model testing to export controls have common-sense appeal across the political spectrum, and a broad nonpartisan coalition could adopt sane, forward-looking policy faster than usual.

    Notable Quotes

    “in only four years, AI models have gone from barely being able to write a coherent line of code to writing most of the code at major AI companies.”

    Dario Amodei, on the pace of the AI exponential

    “in the several years that it can take Congress to act, AI can go from an amusing toy to the full country of geniuses.”

    Dario Amodei, on the mismatch between AI’s speed and the speed of legislation

    “However, now the risks are clearly here. It is time to go beyond transparency to more serious and binding regulation of AI.”

    Dario Amodei, marking the shift from transparency to binding rules

    “enduring job displacement is undesirable and dangerous, and we should do everything we can to minimize or prevent it, not to bring it about.”

    Dario Amodei, clarifying his stance on AI and jobs

    “The key challenge in such a world won’t be incentivizing growth, but finding a way for everyone to share in the benefits.”

    Dario Amodei, on a hypergrowth, hyper-inequality economy

    “Powerful AI in the wrong hands could be the ultimate tool of autocracy, and our existing legal and constitutional protections are not fully equipped to counter this threat.”

    Dario Amodei, on AI and civil liberties

    “A nation that possesses powerful AI facing one without it … could be the equivalent of an army of World War II Marines facing an army of medieval swordsmen.”

    Dario Amodei, on AI as the dominant source of geopolitical power

    “People are worried about AI because they correctly perceive that its risks are real, not because AI CEOs have been insufficiently Panglossian.”

    Dario Amodei, rejecting the idea that AI has a PR problem

    “Treebeard and his forest are waking up.”

    Dario Amodei, on policymakers’ new openness to acting on AI

    “Policy on the AI Exponential” is a dense, structured argument from one of the most consequential figures in the field, and it rewards a full read in the original. The summary and analysis above are a guide, not a substitute. You can read the full essay here.

    Related Reading

  • Bill Gurley on Mental Models, Systems Thinking, AI Investing, Stablecoins, and the Future of Venture Capital

    Bill Gurley spent his career at Benchmark backing some of the most consequential marketplaces and network-effect businesses of the internet era, including Uber, and he is one of the few investors who pairs deep Wall Street fundamentals with a real feel for the bleeding edge. In this wide-ranging conversation on Shane Parrish’s The Knowledge Project, he lays out the mental models he keeps returning to, how systems thinking keeps you out of trouble, why the history of your field is a hidden superpower, where AI investing is headed, and how stablecoins and tokenization could quietly rewire finance. It is a masterclass in thinking clearly about complex systems while staying obsessively curious about what is happening on the edge.

    TLDW

    Gurley anchors his thinking in systems thinking and complexity theory, warning that multivariable nonlinear systems produce second and third order consequences that punish anyone who optimizes for a single metric. He argues that mastering both the deep history of your field and its newest edge is wildly differentiating, whether you are interviewing for a marketing job or breaking into venture capital. On AI he is measured: he doubts a single model eats every vertical, sees real moats in workflows and proprietary data, flags that we may be painting in the corners on training data, and explains why Chinese open source models may innovate faster because forced knowledge sharing compounds. He thinks the AI buildout looks overfunded and that circular deals both raise the odds of an eventual correction and delay it. He makes the case that the IPO process is a rigged power grab, that stablecoins and instant payments threaten Visa, Mastercard, and the entire 2 to 3 percent credit card stack, and that proxy advisors like ISS have drifted from shareholder interest into a black-box heist. He closes on the craft of storytelling and writing as thinking, the equal-partnership design of Benchmark, why venture bends toward youth, and what success means now that his dream job is behind him.

    Thoughts

    The most useful idea in this conversation is also the quietest one: most bad decisions are not bad in the moment, they are bad in the second derivative. Gurley’s dating-site story, where lengthening profiles raised engagement in the test and then quietly killed conversion months later, is the whole argument in miniature. A linear model would have shipped that change and called it a win. A systems thinker assumes the variable you optimized is connected to three others you cannot see yet, and waits to find out. That posture, refusing to get deterministic about a single metric, is the difference between a clever experiment and a durable business. It is also the most transferable thing in the episode, because it applies to product changes, hiring, policy, and your own career just as cleanly as it applies to a dating app.

    His pairing of old and new is the second idea worth stealing. Everyone in tech tells you to live on the edge, and Gurley agrees, he keeps five premium AI accounts running so he never misses a release. But he insists the edge is only half of it. Knowing the deep history of your field, the masters of marketing, the forefathers of physics, the classic cartoons that taught animation, is rare enough that it instantly creates contrast and signals genuine passion. The compounding move is to hold both at once. If you understand the legends and you actually get TikTok, you are a power player in a way that someone who only knows one end of the timeline can never be. Most people pick a side. The leverage is in refusing to.

    On AI specifically, Gurley is refreshingly unwilling to pick the consensus lane in either direction. He does not buy that one near-sentient model swallows every vertical, and his reasoning is grounded rather than vibes-based: workflows and proprietary data create real switching costs, which is why he watches the legal AI startups ingesting case law and building new databases rather than assuming everyone reverts to a general chatbot. At the same time he respects the Microsoft pattern of platforms climbing the stack and crushing the apps above them. The honest answer is that it is genuinely up for grabs, and his comfort sitting in that uncertainty is itself a model. The cheap takes are “one model to rule them all” and “it is all wrappers.” Gurley holds both possibilities and keeps testing.

    The systems lens does its best work on China. Rather than moralize, Gurley runs the mechanism: roughly ten open source models, intense domestic competition, and a culture of publishing techniques and weights so every model can learn from, train, and test every other model. His two-farmer metaphor, one market where farmers only trade goods and another where they are forced to share best practices, makes the prediction obvious. Forced knowledge sharing compounds faster than secrecy. The uncomfortable corollary he names is that American startups are quietly forking those open models all over Silicon Valley, and that incumbents may be lobbying for heavy regulation precisely because it pulls up the drawbridge against open source competition. That is the systems thinker’s signature move: follow the incentives to the consequence nobody is saying out loud.

    Finally, the money section is a clinic in spotting rent extraction. The IPO process where bankers pick both the price and the favored buyers, the 2 to 3 percent credit card toll that exists for no defensible reason while the rest of the world built instant bank transfer decades ago, and the proxy advisors who score companies in a black box and then sell you the cure, are all variations on the same pattern: an intermediary that captured a choke point and defends it through regulatory capture rather than value. Gurley’s optimism is that crypto rails, stablecoins, and tokenization may finally route around these tolls the way WeChat Pay and Alipay leapfrogged cards in China. Whether or not you agree on the timeline, the analytical habit is the takeaway. When something costs far more than it should and has for decades, ask who captured the rules, and watch the edge for whoever is about to make those rules irrelevant.

    Key Takeaways

    • Systems thinking means treating the world as multivariable nonlinear systems where one variable flipping can change the entire system’s behavior, the way weather and stock markets do.
    • The real danger is second and third derivative effects, consequences that only show up much later, long after the metric you optimized looked like a win.
    • A dating site lengthened profiles because longer profiles tested as more engaging, then discovered months later it was negative for conversion, the textbook second order trap.
    • Never get too deterministic about a single metric or single variable, and always know what is actually important and what sits on top.
    • Gurley built his foundation on the canon: Peter Lynch’s One Up on Wall Street, A Random Walk Down Wall Street, the Buffett letters, Ben Graham, and Howard Marks.
    • A firm grasp of the financial bedrock is what lets you innovate on top of it, and many Silicon Valley VCs would benefit from understanding finance better.
    • Bill Miller reframed value investing as buying an asset that is underpriced relative to what you think it will be worth in the future, which is how he justified holding Amazon for its network effects.
    • Wall Street is the buyer of the product that venture capitalists create, so even at the two-people-in-a-PowerPoint stage you should ask whether the eventual public market will be excited by it.
    • Trajectory matters more than the starting place, because the trajectory is where the company actually ends up.
    • Knowing the deep history of your field is remarkably differentiating, and tedium while learning it is a signal you are in the wrong lane.
    • John Lasseter served Gurley a ten-course meal where each course was tied to a classic cartoon essential to understanding animation, a display of mastery over the history of the craft.
    • Magnus Carlsen won a trivia contest on the history of chess, and Picasso was a wildly successful realist painter by 14, both proof that the greats master the fundamentals first.
    • Obsessive, constant learning is the trait Gurley sees most in great entrepreneurs, because disruption always happens on a moving edge they need to understand at the top one percentile.
    • The compounding advantage is mastering both the old history and the new edge at once, the way understanding both marketing legends and TikTok would set you apart in any interview.
    • Most people underestimate how much AI can do, so push more of the downstream work into the prompt: identify the top ten, list pros and cons, rank them on one dimension, then another, and add up the numbers too.
    • Gurley uses ChatGPT for project structure and memory, Gemini for restaurant research powered by Google review data, and notes that coders swear by Claude while some prefer Perplexity for finance.
    • He doubts one model dominates everything; verticals like coding already let users swap models, and price optimization will push more swapping over the next few years.
    • Heavy, expensive regulation could ironically create oligopoly, and some players may be quietly begging for regulation because it pulls up the bridge against Chinese open source models.
    • China’s roughly ten open source models compete intensely and share weights and techniques, creating a system that can innovate faster, like farmers forced to share best practices instead of just trading goods.
    • A quiet secret is that startups all over Silicon Valley are forking those Chinese open source models at real volume.
    • Gurley comes down against the idea that one near-sentient model removes the need for vertical models; workflows and proprietary data, like legal startups ingesting all the case law, create durable moats.
    • We may be running out of training data, painting in the corners, which is why one of the most powerful improvements is hiring experts at thousands of dollars an hour to fine-tune the models.
    • Yann LeCun’s view is that the next leap is broader than LLMs, since language-based models hit an asymptote and are weak at math and numbers.
    • AlphaGo’s shocking move proves models can innovate beyond their training, but it lived in a constrained game; the real world has infinite paths a computer cannot exhaustively search.
    • Gurley’s non-consensus view is skepticism of the China vilification mindset, noting the US is only 3 to 5 percent of the global population and wondering how the other 95 percent hears American exceptionalism.
    • The AI buildout looks overfunded: the Magnificent Seven took free cash flow from 50 to 100 billion a year down toward zero by pouring it into capex.
    • The venture community has become more risk-seeking because it now deeply believes in increasing returns and power laws, and the pre-profit losses keep scaling, from Amazon’s 2 to 3 billion to Uber’s 15 billion to far more now.
    • Circular deals, where a cloud provider funds a model company that spends the money right back on its services, inflate growth, which both raises the probability of an eventual correction and extends the time before one hits.
    • Burn rate is a measure of risk; ten years ago a million a month was scary, now companies burn five billion a year and cannot really know their unit economics.
    • Tokenization without financial-disclosure regulation invites speculation and manipulation, which is part of why companies like Stripe stay private and negotiate liquidity prices with trusted investors.
    • The IPO process is unfair because bankers pick both the price and the shareholders; a freshman would simply match supply and demand anonymously in an auction, the way direct listings and ICOs do.
    • Stablecoins threaten the 2 to 3 percent credit card stack; USDC holds dollar-for-dollar Treasuries and rides fast global crypto rails, while US transfers still suffer three-day ACH settlement and 25 dollar wires.
    • The rest of the world built instant transfer long ago, from UK Faster Payments 20 years ago to Argentina’s PIX-style system reaching 60 to 70 percent of transactions, while US bank regulatory capture stalled Fed Now.
    • Visa and Mastercard run roughly 60 percent operating margins as a bank-created duopoly, and China leapfrogged them entirely with WeChat Pay and Alipay QR-code wallets.
    • Moody’s power is being the trusted standard, the watermark, so AI on the back end does not displace it; ISS and proxy advisors, by contrast, score companies in a black box and get paid on both sides.
    • Proxy advisors drifted from shareholder interest into a fraud-and-risk-mitigation mindset, which is why they reflexively opposed the Tesla pay package that only paid out if the stock soared.
    • The rise of passive index funds concentrated voting power in firms that lack time to evaluate votes; it would be healthier if they abstained or voted in proportion to active holders.
    • Storytelling is one of the top founder traits, because founders are recruiting, raising money, and closing customers and partners constantly, selling all the time.
    • Writing is thinking: Bezos’s six-page memo forces you to find the loose ends and tie them up, and a public blog becomes a calling card that magnetizes founders and deal flow.
    • Other founder unfair advantages are product instincts, which fewer than 5 percent of non-product people ever truly learn, and sheer determination, Bezos’s single angel-investing test of whether someone will do it no matter what.
    • Uber had no HBS case study to lean on; its winner-take-all network effects forced mega burn rates with no precedent and no mentor to call, a situation every AI company now faces.
    • Benchmark’s equal partnership, with no king, president, or lead and five equal partners, makes recruiting easy, kills comp politics, and aligns everyone, at the cost of being hard to scale or run new initiatives.
    • Venture bends toward youth because young investors can match founders’ age, master a fresh niche faster, and have the free time to study something 80 hours a week.
    • Gurley defines current success through Arthur Brooks’s From Strength to Strength, hoping to apply his synthesizing and writing skills to bigger societal problems and dent the universe a little.

    Detailed Summary

    Systems Thinking and Second Order Effects

    Gurley opens with the mental model he keeps returning to: systems thinking, shaped by Donella Meadows’s Thinking in Systems and his board seat at the Santa Fe Institute, which studies complexity theory. He describes complex systems as multivariable nonlinear systems that are very hard to predict, capable of behaving one way for a long time until a single variable flips and the whole system behaves differently, like weather or stock markets. The practical payoff is staying out of trouble by anticipating first, second, and third derivative consequences. His clearest example is a large dating site that lengthened user profiles because the test showed more engagement, only to learn many months later that knowing more at that stage was negative for conversion. The lesson is to never get too deterministic about a single metric and to keep the whole system in view, because a change here can ripple to there in ways you only discover much later.

    Learning the Craft of Investing

    Because he started on Wall Street rather than in venture, Gurley absorbed the investing canon first: Peter Lynch’s One Up on Wall Street, A Random Walk Down Wall Street, the Buffett letters, Ben Graham, and Howard Marks, people who spent careers assembling and publishing their thinking. That financial bedrock, he argues, is exactly what lets you innovate on top of it. His friend Michael Mauboussin introduced him to Bill Miller, the Legg Mason manager who beat the S&P for 15 straight years and was Amazon’s largest shareholder for a long stretch. Miller reframed value investing as buying an asset underpriced relative to its future worth, which combined with a belief in network effects justified holding a company that could grow at an unreasonable rate for years. Gurley also frames Wall Street as the buyer of the product venture capitalists create through eventual M&A or IPO, so founders should think early about whether the public market will be excited by what they are building, since trajectory matters more than the starting place.

    Mastering Both the History and the Edge

    Gurley makes an unusually strong case for studying the deep history of your field. He recounts a dinner with Pixar’s John Lasseter, who served a ten-course meal where every course was tied to a classic cartoon he considered essential to understanding animation, and notes that Magnus Carlsen won a chess-history trivia contest and Picasso was a master realist by 14. In a world that skims for the executive summary, walking into a marketing interview with command of the masters of marketing is wildly differentiating and signals genuine passion; if learning that history feels tedious, you are probably in the wrong lane. The counterpart trait he sees in great entrepreneurs is obsessive learning on the moving edge, where disruption actually happens. Gurley keeps five premium AI accounts so he never misses something. The real power player holds both at once, the legends and the newest thing, the way a candidate who knows the marketing greats and truly gets TikTok stands out completely.

    Using AI Well and the Model Wars

    People underestimate how much AI can do, Gurley says, so you should build more of the downstream work into the prompt: instead of asking for the top ten and studying them yourself, ask it to list pros and cons, rank on one dimension, rank again on another, and add up the numbers too. He uses ChatGPT for its project structure and memory, leans on Gemini for restaurant research because it carries Google review data, and notes coders swear by Claude while some prefer Perplexity for finance. On whether one model dominates or models become niche commodities, he points to coding, the largest vertical, where tools like Cursor already let users swap models, and predicts price optimization will drive more swapping. The counterforce is regulation: if it gets expensive and mundane it could create oligopoly, and some players may be quietly begging for it because it pulls up the bridge against Chinese open source models.

    China, Open Source, and the Systems Advantage

    Asked to apply systems thinking to China, Gurley describes roughly ten open source models locked in intense domestic competition, all learning from one another because the ecosystem chose openness, with models able to train and test other models and teams publishing the techniques behind their breakthroughs. His metaphor: two agricultural societies, one where farmers only trade goods at market and another where they are forced to share best practices; the second evolves far faster. The result is a system capable of innovating faster than the more secretive Western approach. The quiet secret he names is that startups all over Silicon Valley are forking those open models at real volume, and a key open question is whether regulation tries to stomp that out. He extends this into a broader non-consensus discomfort with the vilification of China common in Washington and parts of Silicon Valley, observing that the US is only a few percent of the global population.

    AI Investing, Moats, and the Limits of Models

    On how AI changes investing and whether a startup is just a wrapper, Gurley calls it up for grabs but lands on the side of durable verticals. If models become near-sentient, one model does everything; he doubts that, pointing to workflows and data moats, like the several legal AI startups ingesting all the case law and building new databases that customers will not simply swap for a general chatbot. He balances this against the Microsoft pattern of platforms climbing the stack past Lotus 1-2-3 and WordPerfect. He also flags scaling limits: we may be running out of data, painting in the corners, which is why one of the most powerful improvements is paying experts thousands of dollars an hour to fine-tune models, though human knowledge has an edge. He invokes Yann LeCun’s argument that the next leap is broader than language-based LLMs, which hit an asymptote and struggle with math, and the AlphaGo debate, where a shocking innovative move proves creativity within a constrained game but says little about the infinite paths of the real world. He notes AlphaGo and Tesla’s FSD are constrained, non-LLM systems.

    Is the Buildout Overfunded

    Gurley admits he is shocked by the scale of money, noting the Magnificent Seven drove free cash flow from 50 to 100 billion a year down toward zero by spending it all on capex, something he would not have believed five years ago. He traces it to the venture community’s growing conviction in increasing returns and power laws, where proven companies grow far beyond expectations, which makes investors more willing to take risk on the come. The losses before turning cash-flow positive keep scaling, from Amazon’s 2 to 3 billion to Uber’s roughly 15 billion to far larger now. On corrections, he recalls the dot-com crash producing a three to four year nuclear winter before Amazon climbed back, and explains that circular deals, where a cloud provider funds a model company that spends it right back on its services, inflate growth and therefore both raise the probability of a correction and extend the runway before one arrives. Burn rate, he stresses, is a measure of risk, and at five billion a year it is nearly impossible to know your unit economics.

    Tokenization, the IPO Heist, and Going Public

    There is no shortage of capital, so funding is not the bottleneck; the risk with tokenization is that, absent disclosure regulation, it invites speculation and manipulation, as seen in retail-loved names like GameStop and Palantir. Tokenizing a private company like Stripe could create the wild price swings companies stay private to avoid, since private liquidity events let them negotiate a price with trusted investors rather than expose the constantly moving underlying value, and Robinhood’s tokenization plans already drew legal pushback. Gurley reserves his sharpest critique for the IPO process, calling it insanely unfair because bankers pick both the price and the favored shareholders. A freshman computer science and finance student would simply match supply and demand anonymously in an auction, the way an ICO or a direct listing does, but Wall Street will not let go of the greedy power grab and reverted to a controlled oligopoly after direct listings were available.

    Stablecoins Versus the Payment Cartel

    Gurley argues stablecoins could be deeply disruptive to credit cards. Most of the developed world built instant bank-to-bank transfer long ago, from UK Faster Payments 20 years ago to Argentina’s PIX-style system that quickly hit 60 to 70 percent of transactions, while US bank regulatory capture stalled Fed Now and left an ecosystem living under 2 to 2.5 percent card fees. A USDC stablecoin holds dollar-for-dollar US Treasuries and rides proven, fast, global crypto rails, letting anyone move a dollar in seconds for pennies, against the backdrop of three-day ACH settlement and 25 dollar wires. He sees Visa and Mastercard, a bank-created duopoly with roughly 60 percent operating margins, as heavily threatened, and points to China, where WeChat Pay and Alipay built ubiquitous QR-code wallets that leapfrogged the entire card system, all because the government made money transfer easy.

    Moody’s, Proxy Advisors, and Index Funds

    Moody’s power, Gurley explains, comes from being a trusted standard, the watermark, so even AI on the back end does not displace it. Proxy advisors like ISS are a different story: they score companies in a black box, refuse to reveal the criteria, and then get paid by the same companies that want to learn how to score better, which he calls more of a heist than a service. They drifted from a shareholder-interest mandate into a corporate-governance, fraud-mitigation posture obsessed with rules, which is why they reflexively opposed the Tesla pay package that only paid Elon Musk if the stock soared, a deal Gurley says he would sign for every company he has worked with. The rise of passive index funds compounds the problem, concentrating voting power in firms without time to evaluate votes; he would prefer they abstain or vote in proportion to active holders, since closet indexing during the MAG 7 run already distorted active management.

    Storytelling, Writing, and Founder Advantages

    Gurley fell in love with the craft of writing in business school, moving from business books to personal development titles like Dale Carnegie and Seven Habits, then biographies, then long-form narrative nonfiction by Malcolm Gladwell, Michael Lewis, and Jon Krakauer, the New Journalism that reads like fiction. Writing forces clarity: he cites Bezos’s six-page memo as a tool that makes you think through corner cases and tie up loose ends, and notes that codifying his marketplace knowledge and publishing it turned his blog into a calling card that magnetized founders and deal flow. He lists the top founder traits as storytelling, product instincts, understanding the edge, and determination. Storytelling matters because founders are constantly recruiting, fundraising, and closing customers and partners. Product instinct is nearly unteachable, present in well under 5 percent of non-product hires. And determination is Bezos’s single angel-investing test: will this person do it no matter what, come hell or high water.

    Uber, Benchmark, and the Shape of Venture

    The Uber lesson with no HBS case study was that a winner-take-all category with network effects demanded funding ad nauseam, producing burn rates bigger than any public company would dare, with no precedent and no mentor to call, exactly the situation AI companies now face, only with a zero added. Gurley credits Benchmark’s design, an equal partnership with no king, president, or lead and five equal partners, for making it easy to recruit top talent, encouraging senior partners to develop newcomers since everyone shares the upside, and eliminating annual comp politics. The downside is that without a CEO it is hard to scale or run new initiatives, famously captured by the firm settling on a single splash-page website. Founders choose a VC for reputation and network effects, the stamp of approval that carries weight, and young investors can break in because they often match founders’ age and can outwork everyone to master a fresh niche like esports or YouTube, which is why the industry bends toward youth. Asked what success means now, Gurley says his venture career was a dream job he would have done for free, but it is done; inspired by Arthur Brooks’s From Strength to Strength, he wants to apply his synthesizing and writing to bigger societal problems and dent the universe a little.

    Notable Quotes

    “We do live in a world where information is really cut up, but we also live in a world where you can have access to more information than you ever could.”

    Bill Gurley, on why the abundance of knowledge rewards the curious

    “You got to be really conscious of the consequence and not get too deterministic about a single metric or a single variable.”

    Bill Gurley, on the discipline of systems thinking

    “Value just means that the asset is underpriced relative to what you think it will be worth in the future.”

    Bill Gurley, relaying Bill Miller’s reframing of value investing

    “I’ve always thought of Wall Street as the buyer of the product that venture capitalists create.”

    Bill Gurley, on why founders should think about the public market early

    “One society, when the farmers come to market, they just sell each other goods and then they go back. The other society, when the farmers come to market, they’re forced to share best practices. Which one is going to evolve faster?”

    Bill Gurley, on why open source models can out-innovate

    “If you took a freshman computer science student and a freshman finance student and said imagine how a company should go public, they would match supply and demand anonymously like you would in any auction.”

    Bill Gurley, on the rigged IPO process

    “When I meet an entrepreneur, there’s only one thing I ask myself. Is this person gonna do this no matter what? Come hell or high water, they’re doing this.”

    Bill Gurley, quoting Jeff Bezos on his single test for angel investing

    “You’re recruiting employees, you’re recruiting executives, you’re raising money, you’re closing customers, you’re closing partnerships. You’re selling all the damn time.”

    Bill Gurley, on why storytelling is a top founder trait

    “I often said that if we lived in a socialist society and everyone had to work for free, I would still take that job.”

    Bill Gurley, on loving his venture career

    “I would like to see if I can apply those techniques to bigger, broader problems in society and dent the universe a little bit that way.”

    Bill Gurley, on what success looks like in his next chapter

    Watch the full conversation with Bill Gurley on The Knowledge Project here.

    Related Reading

  • Benedict Evans on the Economics of AI Usage, Why Foundation Models May Become Commodities, and What Comes Next for SaaS

    Benedict Evans returns to the a16z podcast to update the thesis behind his widely read “AI eats the world” presentation, and the picture he paints is less about hype and more about hard economics. In this conversation he works through what has actually played out in the last year, why agentic coding became the one use case with real product market fit, and why he keeps arguing that foundation models may end up as commodities while the value moves somewhere else entirely. You can watch the full conversation here.

    TLDW

    Benedict Evans argues that the AI moment looks a lot like the early internet, the early PC era, and the rollout of mobile data, which means it is exciting, genuinely transformative, and almost impossible to predict use case by use case. Agentic coding is the only field with clear product market fit right now, with revenue run rates exploding from roughly nine billion to forty seven billion, while consumers still use chatbots weekly rather than daily. His central claim is that foundation models show no obvious network effect or sustainable differentiation, the chatbot is a limited v1 interface, and the model labs cannot build every application, so the value will likely move up the stack the way it did with chips, ISPs, and mobile networks rather than staying with the model providers. He covers the brutal supply and demand disequilibrium driving today’s token pricing and ten thousand dollar surprise bills, the financial gravity problem of hyperscalers spending over half their revenue on capex, the Jevons paradox and consumer surplus that may compete away productivity gains, the way the important questions move out of San Francisco and into industries like law, consulting, finance, and advertising, and the distinction between automating tasks and changing jobs. His closing image is an IBM ad from the 1950s promising “150 extra engineers,” a reminder that every platform shift feels unprecedented and that in twenty years we will simply say of course computers do that.

    Thoughts

    The most useful thing Evans does here is refuse to collapse uncertainty into a clean prediction, and then explain exactly why that refusal is the correct posture rather than a cop out. He distinguishes between the parts where he will commit to a view, that foundation models are probably not a product and the chatbot is probably not the right interface, and the parts where there are simply too many open paths to call. That discipline is rare in AI commentary, where the incentive is to sound certain. The commodity argument is not “models are worthless.” It is a chain of reasoning: there is no visible network effect, no durable differentiation beyond willingness to spend, no lock in comparable to Windows or iOS, and a likely structure of three to six well funded competitors plus open source and edge models all selling the same thing. Ask where price discipline comes from in that picture and the honest answer is that it probably does not, which is how you get a commodity even when demand is effectively infinite.

    The mobile data analogy is the load bearing comparison and it deserves to be taken seriously. Mobile data traffic rose something like fifteen hundred to two thousand times over fifteen years, the networks built an extraordinary piece of global infrastructure, everyone came to depend on it, and yet the operators captured almost none of the value because all the interesting stuff got built on top by someone else. Telco stocks were flat for two decades. If that is the template, then the trillion dollars of capex flowing into AI infrastructure can be both a worthwhile investment and a terrible place to expect outsized equity returns, because building the road is not the same as owning the traffic. The counterpoint Evans keeps fairly on the table is the operating system path, where Windows and iOS did capture value, but he notes they had levers and network effects that LLMs do not appear to have.

    His framing of where the questions live is the part most people in tech underweight. Once a technology works, the interesting questions stop being technology questions. Netflix is not a tech company in the sense that matters, because its real decisions are Los Angeles decisions about shows, talent, and sports, not San Francisco decisions about infrastructure. By the same logic, what AI means for a law firm is mostly a question for people who understand what associates actually do and what clients are actually paying for, not for model researchers. This is why the “the model will just do the whole thing” story keeps running aground. Most valuable software does not solve a problem the customer already knew they had. It often takes years to convince an industry that a problem even exists, and an LLM prompt does not surface latent problems that no one has articulated.

    The economic plumbing he describes is where the near term risk actually sits. We are in extreme disequilibrium, where twenty dollars a month can buy ten thousand dollars of tokens on one side and a weekend of experimentation can produce a ten thousand dollar bill on the other, exactly the pattern mobile data went through around 2009 and 2010. That gets resolved with the boring machinery of caps, throttling, and pricing tiers, not with magic. Layered on top is the financial gravity problem: Microsoft, Meta, and Google heading toward spending more than half of revenue on capex, with roughly seven hundred billion dollars of guidance across the big players, against a hard ceiling because there is not ten trillion dollars a year available to spend. And even when the productivity gains are real, the Jevons paradox and consumer surplus suggest much of the benefit gets competed away. If a discounted cash flow model used to take a week and now takes ten seconds, you do fifty of them and charge the client the same, which is great for clients and unremarkable for margins.

    The honest takeaway for builders is that the answer to “what does this do to software” is more software, probably one or two orders of magnitude more, just as SaaS itself produced an explosion rather than a consolidation. The SaaS apocalypse is real in the sense that some meaningful percentage of existing companies get wiped out, and unknowable in the sense that no one can yet say which ones, which is why thoughtful investors are reluctant to be long software in the dark. For anyone pursuing a more deliberate, purposeful relationship with technology, the closing note is the one to keep: every one of these shifts felt singular and world ending and world making at the time, it reshaped work and put people out of jobs and created things we love, and then it quietly became invisible. The goal is to stay clear eyed about which of those buckets a given change lands in rather than getting swept up in the noise of what someone said at a party yesterday.

    Key Takeaways

    • Agentic coding shifted from “kind of useful” to “really changing everything” at the start of the year, and it is the single field with unambiguous product market fit, where customers are pulling it out of your hands.
    • Coding working first was foreseeable in hindsight: software developers were the ones messing with the tools, and the first thing people do with a new kind of computer is build more computing, just as the first thing people did with PCs was make computers.
    • Anthropic, with less capital raised, chose to focus on coding and got it working, while OpenAI cycled through a more everything all at once strategy before narrowing in.
    • The intense focus on coding comes bundled with a supply crunch, a capacity crunch, and a price and capex imbalance that defines the current moment.
    • Most of the fundamental questions from two or three years ago still have no answers: whether there will be a winner in models, whether models capture value up the stack, how much they can do, and whether consumers will use this daily rather than weekly.
    • There is a wide gap between Valley insiders running clusters of Mac Studios all day and the roughly forty percent of people who say AI is “kind of useful, I used it last week for something.”
    • Outside tech, companies are adopting AI as one at a time point solutions for specific back office processes, like a commodities company using LLMs for better cash flow forecasting, not as a general purpose assistant.
    • Adoption always compounds on prior platforms: you could not have nine hundred million weekly active users in the Netscape era because there were not nine hundred million PCs on the planet.
    • Early in any platform shift almost nothing works smoothly, from sound cards and floppy disks with TCP/IP to computers that froze and lost your work, and AI is at that stage now.
    • Today’s token pricing crunch mirrors the mobile data shock of 2009 to 2010, where flat rate plans collided with surging usage and networks had to realign price with marginal cost through caps, fair use, and throttling.
    • Mobile data traffic rose roughly fifteen hundred to two thousand times in fifteen years, mobile networks earn around a trillion dollars and spend about two hundred billion a year on capex, yet their stocks have been flat for twenty years because all the value moved up the stack.
    • The central LLM question is whether the model can do the whole thing or whether you need hundreds of applications built on top, the same way you needed apps on Windows and iOS.
    • Evans sees no network effect and no sustainable differentiation between models beyond willingness to spend money, which points toward commodity infrastructure sold near marginal cost.
    • Chip companies, ISPs, and mobile operators did not capture the value; Windows and iOS did, but only because they had levers to move up the stack and real network effects, which models lack.
    • A useful comparison is semiconductors, where each generation gets more expensive and the field narrows to fewer players, suggesting three to six frontier model makers spending somewhere between two hundred billion and two trillion dollars a year.
    • Enterprises do not standardize on a model the way they once thought about AWS; the cloud and the model get abstracted away, so customers do not even know which one their SaaS product runs on.
    • Demand for tokens being effectively infinite does not prevent a price equilibrium, exactly as infinite demand for mobile bits still produced murderous price wars between commodity carriers.
    • History teaches that something will happen but rarely what; the smartest people in tech wrongly predicted Android would crush the iPhone on open versus closed grounds.
    • One characteristic of tech is that the moment you understand how something works is the moment to move on, which is why Evans stopped updating his Apple spreadsheet years ago.
    • The people who are good at using a tool are usually not the people who are good at designing what the tool should be, which is why model labs cannot build every skill or vertical application.
    • Claude skills and similar templates resemble file new in Excel: useful starting points that users eventually outgrow, raising the question of who builds the real software.
    • The questions increasingly move out of technology and into specific industries; what AI means for law, consulting, advertising, or accounting is partly an AI question and partly a deep domain question.
    • Netflix is not a tech company in the way that matters, because its real questions are media industry questions about shows, talent, and sports, not infrastructure; the same logic now applies across industries facing AI.
    • AI differs from prior platform shifts because the physical limits are unknown; in 1995 you knew PCs cost three thousand dollars and broadband could not reach everyone overnight, but no one knows how cheap, fast, or capable models will get.
    • Evans offers four buttons to press on any use case: is it just price elasticity and the Jevons paradox, does it remove a cost barrier to entry, does it unlock a new business model, or does it make something previously impossible now possible like trains over horses or Spotify over CDs.
    • Advertising and e-commerce are a standout opportunity because today’s systems know a SKU and a metadata field but not what a product actually is or why people buy it, and LLMs could change that level of understanding.
    • The valuable shift is not doing the old thing more, like more spreadsheets or better email, but doing genuinely new things, such as asking an LLM how to change prices to improve churn using all your call recordings, CRM flows, and product telemetry.
    • Enterprise software today splits into three buckets: big horizontal systems like SAP and Workday, three to four hundred vertical SaaS apps plus a thousand internal apps, and a fuzzy improvised middle of Excel, email, and shared files, with AI arriving as a new option across all three.
    • A core design tension is where to put the probabilistic software that can make mistakes versus the deterministic database that cannot, and whether the LLM sits at the top or the bottom of the stack; the answer is probably both depending on the task.
    • The net effect on software is way more software, since SaaS itself produced one to two orders of magnitude more software and all software companies exist to solve problems created by other software companies.
    • The SaaS apocalypse is real but unknowable: some percentage of SaaS companies get wiped out, but no one knows which, so you should not derate the whole sector fifty percent and many investors are wary of being long software for now.
    • Much of what an organization does is implicit, undocumented, and not in the training data, which is exactly the value McKinsey, Bain, and BCG provide by getting license to map how a company really works.
    • The real decisions are usually exception handling: the question is always what you cannot automate and what still requires human judgment about cases that were never written down.
    • Distinguish tasks from jobs: accountants spend almost none of their time the way they did fifty years ago, yet to the client the job looks the same.
    • LLMs excel where you want the average, the answer anyone would give, and struggle where you specifically do not want the average and cannot fully explain why you did it differently.
    • There is a financial gravity ceiling: Microsoft, Meta, and Google are on track to spend over fifty percent of revenue on capex versus fifteen to twenty percent for capital intensive telecoms, with seven hundred billion in guidance this year and no path to ten trillion.
    • Hyperscalers face an existential FOMO trap: returns look positive now, but they cannot let rivals build the future of compute without participating, even as the CFO asks how much participation is enough.
    • Token maxing will face a reckoning as the disequilibrium resolves, but measuring ROI is hard because most reported benefits so far, like better analytics, support, and productivity, are tough to put a financial value on.
    • Consumer surplus means many gains get competed away: if analysis that took a week now takes a day, you do five times more analysis and charge the same, the way investment banks did with spreadsheets.
    • Evans closes with a 1950s IBM ad promising “150 extra engineers,” a reminder that every fundamental technology change feels unprecedented, and that in twenty years AI will simply be invisible magic we take for granted.

    Detailed Summary

    What changed in the last year

    Evans frames the past year as a narrowing of focus. A year and a half after the first version of his presentation, the field has developed a much clearer sense of diverging product strategies and competitive tension that goes beyond simply building a bigger model with more compute. The dominant shift is that agentic coding started genuinely working, and the entire industry narrowed in on it because it has absolute product market fit, the kind where customers pull the product out of your hands. That success arrives alongside the supply crunch, capacity constraints, and price imbalance that now define the moment. At the same time, the charts keep climbing, models keep getting bigger, capex keeps growing, and usage keeps growing, while the deep questions from a few years ago remain unanswered.

    Why coding worked first

    That coding led was predictable at a naive level: the people experimenting with the tools were software developers, and they naturally tried to make software development work. Evans compares the moment to the internet around 1997 and 1998, and also to PCs in the late seventies and early eighties, when the technology was exciting but it was not clear what it was for and it did not quite work yet. The first thing people did with PCs was make computers, and since LLMs are in a sense computers, the first thing people are doing with them is making more compute. What was harder to foresee was the precise timing of the shift, the moment when agentic coding flipped from useful to transformative at the start of this year.

    Jobs, juniors, and what we have not learned

    On the question of what this means for engineers and team structure, Evans is blunt that we have learned almost nothing yet, because this did not even work six months ago and everyone is scrambling to interpret it. The pricing crunch alone means it will take a couple of years to settle. The newly concrete questions include whether you still hire junior people and what they would do, and why you were hiring juniors in the first place, whether to do the work itself or to develop people. Because software development now genuinely automates a class of work that used to be done by people, those questions have moved from theoretical to real, but no one can responsibly claim to know what a software team or a software career looks like in three years.

    OpenAI, Anthropic, and the strategy split

    Evans dryly notes the drama around the model labs, including the disruption of a senior leadership medical leave at OpenAI. In the latter part of last year, OpenAI’s question was essentially what to build on top of the models, an everything all at once approach that looked almost like asking the model for fifteen ideas and then doing all of them. Anthropic, with less capital raised, instead committed to coding and got it working, whether by deliberate strategy or by stumbling into it. The result is that software development plus a few other fields are where things genuinely work, surrounded by a large population of people excited around the edges and corporations quietly automating specific back office processes. He cites a commodities company that wants LLMs for better cash flow forecasting across many small producers, a very different thing from asking a chatbot to summarize your meetings.

    The mobile data analogy and value capture

    The richest section is the comparison to mobile. Adoption always compounds on prior platforms, so AI inherits a far larger installed base than the internet or mobile did at their starts. Early on, nothing works smoothly, and Evans recalls the era of buying a three hundred dollar sound card or wrestling a floppy disk of TCP/IP into a machine. The pricing dynamics directly echo mobile data around 2009 and 2010, when flat rate plans met exploding usage and ten thousand dollar bills, forcing networks to realign price with marginal cost. Crucially, mobile data traffic then rose fifteen hundred to two thousand times, the networks built extraordinary global infrastructure with around a trillion dollars of revenue and two hundred billion in annual capex, and yet their stocks stayed flat for twenty years because all the cool stuff and all the value got built and captured by someone else higher up the stack. Chip companies, ISPs, and mobile operators did not capture value; Windows and iOS did, but they had levers and network effects that models do not appear to share.

    The case that models become commodities

    Evans lays out the building blocks of his commodity thesis. First, there is no clear way to build a model that is sustainably and fundamentally better than everyone else’s, with no visible network effect and no strategic lever comparable to what Instagram, YouTube, or Google search enjoy. Differences in emphasis and taste exist, but not durable competitive moats beyond spending. Second, the chatbot is a weird, limited v1 interface that works well for some tasks and people but requires tooling, the right data, configuration, control, and thoughtful design for most real jobs, and the people good at a job are rarely the people good at designing the tool for it. Third, the labs cannot build every application any more than Microsoft or Apple could build every Windows or iPhone app. Enterprises do not standardize on a model the way they never standardized on a visible cloud provider, because it gets abstracted away. Taken together, that points to low level infrastructure sold by perhaps half a dozen competitors plus open source and edge, with no obvious source of price discipline, which is the definition of a commodity even when demand is infinite.

    The questions move out of technology

    One of the next big questions is when models become good enough that you no longer need the largest, fastest, most expensive model, and can use an older model, an open source model, or one running on device where compute is effectively free to the developer. But the deeper shift is that the important questions move out of technology and into industries. Drawing on his own essays “content isn’t king” and “Netflix isn’t a tech company,” Evans argues that Netflix’s real decisions are Los Angeles media questions, not San Francisco infrastructure questions, and San Francisco does not even know what the right questions are. By the same logic, what AI means for a law firm is mostly a question for people who understand law firms, what generative video means for Hollywood is a question Ben Affleck can answer better than he can, and the questions become half AI and half something else.

    Four buttons and the new things AI unlocks

    To reason about impact, Evans offers four buttons. Is a use case just price elasticity, the Jevons paradox of doing the same thing for less or more for the same money. Does it remove a cost that was a barrier to entry, like a newspaper’s printing press. Does it unlock something in your business model. Or does it make something previously impossible now possible, the way steam engines made trains possible regardless of how many horses you bought, or Spotify turned fifteen dollars a month into all the music there is. He stresses that the same broad change can mean wildly different things by industry, just as the internet devastated newspapers but barely touched movie studios. His favorite tractable example is advertising and e-commerce, a trillion dollar advertising market against twenty five trillion in retail, where today’s systems know a SKU and a metadata field and that people who bought one thing bought another, but do not know what a product is or why people buy it. An LLM could in principle understand the product, recommend ten coats at different prices with pros and cons, or look at your Instagram and suggest a winter coat that changes your look but not too much, which would have been science fiction three years ago.

    More software, the SaaS apocalypse, and tasks versus jobs

    For software specifically, Evans expects more competition, cheaper and quicker building, and new categories that were impossible before, all under an uncertain new margin structure where outcome based pricing is hard because most software work cannot be tied cleanly to profit and loss. He frames enterprise software as three buckets, big horizontal systems, hundreds of vertical and internal apps, and a fuzzy improvised middle of Excel and email, with AI arriving as another option across all of them. The deeper design tension is where to place probabilistic software that can make mistakes versus deterministic systems that cannot, and whether the LLM sits at the top or bottom of the stack, with the answer being both depending on the task. The net result is way more software, since SaaS itself produced orders of magnitude more software and software exists to solve problems created by other software. That fuels the SaaS apocalypse anxiety: some companies clearly get wiped out, but since no one knows which, you should not derate the whole sector, even as many investors stay cautious about being long software.

    Implicit knowledge, exception handling, and where the average fails

    Much of what organizations do is implicit, undocumented, and absent from any training data, which is precisely the value of strategy consultancies that get license to map how a company really works versus how it is supposed to work. The real decisions tend to be exception handling, the cases that require human judgment because they were never written down or do not look like before. Evans separates tasks from jobs, noting accountants do almost nothing the way they did fifty years ago while the client still buys the same thing. And he offers a sharp test: LLMs are excellent where you want the average, the answer anyone would give, and weak where you specifically do not want the average and cannot fully articulate why you did it differently.

    Capex, financial gravity, and the ROI question

    On spending, Evans describes a financial gravity problem. Microsoft, Meta, and Google are on line to spend over half their revenue on capex this year, against fifteen to twenty percent for capital intensive telecoms, with roughly seven hundred billion in guidance across the big players, a sum comparable to all of telecom or oil and gas. They cannot sustainably leap to one and a half trillion next year because the money is not there, so the curve must eventually taper. The hyperscalers are caught in an existential FOMO trap: returns look positive now, but they cannot sit out what might be the future of compute without risking becoming the next stranded incumbent, even as the CFO asks how much is enough. On token maxing, he expects a reckoning as the disequilibrium resolves, but measuring ROI is genuinely hard because most reported benefits so far are soft and hard to value, and consumer surplus means much of the gain gets competed away, the way faster spreadsheets simply meant more analysis at the same price.

    Closing image

    Evans ends with an IBM advertisement from the early 1950s showing a sea of engineers holding slide rules, with the tagline that an IBM electronic calculator gives you 150 extra engineers, exactly the pitch behind countless modern startup decks. We move through these fundamental technology waves every ten or fifteen or twenty years, each one feeling completely unlike anything before, and AI is amazing and transformative in the same way mobile, the internet, and PCs were. The base case is that it will produce wonderful things, ruin some livelihoods, put people out of work, and eventually become invisible. His one line description of where it all ends up is that it will be magic, and in twenty years we will simply say of course computers do that, the way an hour of crash free streaming HD video over Wi-Fi already feels unremarkable.

    Notable Quotes

    “Agentic coding went from being kind of useful to really changing everything.”

    Benedict Evans, on the pivotal shift at the start of the year

    “We are in this extreme scarcity. We can’t spend $10 trillion a year on AI infrastructure cuz there isn’t $10 trillion a year there to spend on it.”

    Benedict Evans, on the hard ceiling of AI capex

    “I don’t think foundation models are a product. I don’t think a chatbot is a product. I think the value will be further up.”

    Benedict Evans, stating the core of his thesis

    “They built this amazing piece of global incredibly sophisticated very expensive global infrastructure with enormous growth in use, and they didn’t make any money from it because all the value moved up stack.”

    Benedict Evans, on the mobile network analogy

    “The moment that you understand something and you know how it works and what’s going to happen is the moment you should move on to something else.”

    Benedict Evans, on how to pay attention in tech

    “These are all Los Angeles questions. These are not San Francisco questions. No one in San Francisco even knows what the right questions are.”

    Benedict Evans, on why Netflix is not a tech company

    “The important stuff is not doing the old thing but more. It’s doing something new that you couldn’t have done with the old thing.”

    Benedict Evans, on where the real value of a new technology shows up

    “All software companies exist to solve problems created by other software companies.”

    Benedict Evans, on why AI produces more software, not less

    “It’s going to be magic, and in 20 years time we’ll just say, well, of course that’s how it is. Computers have always done that.”

    Benedict Evans, on how the whole shift ends up

    This is a dense, clear eyed conversation that rewards a full listen, especially if you are trying to think past the hype cycle about where AI value actually lands. Watch the full conversation here, and check out the “AI eats the world” presentation referenced throughout.

    Related Reading

    • Benedict Evans’ website home of the “AI eats the world” presentation and his newsletter referenced throughout the conversation.
    • Andreessen Horowitz (a16z) the venture firm whose podcast hosted this discussion and where Evans was formerly a partner.
    • Jevons paradox (Wikipedia) background on the price elasticity idea Evans uses to explain how cheaper AI may lead to more usage rather than savings.
    • Stratechery by Ben Thompson the analysis Evans cites on software as a designed workflow versus a process that grows out of how a business runs.
    • The Pursuit of Purpose a PJFP look at finding direction and meaning in work as automation reshapes careers and industries.
  • Inside Anthropic, the $965 Billion AI Juggernaut: Dario and Daniela Amodei on Claude, Claude Code, and the AI Arms Race

    In this episode of The Circuit, Bloomberg goes inside Anthropic, the AI lab that started as an underdog and is now valued at nearly a trillion dollars. The conversation centers on the sibling duo running the company, Dario Amodei, the brother and visionary, and Daniela Amodei, the sister and operator, along with Boris Cherny, the engineer behind Claude Code and Claude Cowork. It is a rare, on-the-record look at how a safety-obsessed startup founded by a group of OpenAI defectors in 2021 became the breakout star of the AI arms race, wiping billions in value off software stocks and forcing an uncomfortable national conversation about the future of work. You can watch the full episode here.

    TLDW

    Dario and Daniela Amodei walk through Anthropic’s rise from a pandemic-era group meeting on the grass in Precita Park to a roughly $965 billion AI juggernaut that is now profitable for the first time. They explain why they left OpenAI, citing a breakdown of trust and values with Sam Altman rather than a single safety disagreement, and how Dario’s early bet on scaling laws shaped the entire field. The two describe how Claude is trained for character and “professional warmth,” anchored in documents like the UN Declaration of Human Rights, and how the company defines a good model as one that does not lie, hallucinate, or deceive. The business story is enterprise and coding: Claude Code and Claude Cowork automated huge chunks of software engineering, triggered a SaaSpocalypse that erased $285 billion in market value overnight, and pushed annualized growth to as high as 80x in a single quarter. Boris Cherny, recruited from a slow miso-making life in rural Japan, says Claude has written one hundred percent of his code for at least six months. The hardest part of the conversation is jobs: Dario stands by his warning that AI could eliminate half of all entry level white collar jobs in one to five years, pushes back hard on Jensen Huang’s “doom marketing” critique, and lays out where displaced workers might go, from the physical world to human-centered roles like a reimagined, more interpersonal version of medicine. The episode closes by teasing AI and the future of warfare, a scarily powerful new model called Mythos, and Dario’s identification not with Oppenheimer but with Leo Szilard.

    Thoughts

    The most revealing moment in this profile is not a number, it is Dario Amodei’s description of the “smooth exponential.” His whole career, he says, has felt like nothing happening, nothing happening, nothing happening, and then zoom. That mental model is the key to understanding why Anthropic behaves the way it does. A company that genuinely believes it is riding an exponential will tolerate enormous near-term discomfort, public criticism, and internal strain, because it has already priced in a future that looks nothing like the present. Whether that conviction is wisdom or a kind of motivated certainty is the open question the episode never fully resolves, but it explains the urgency in every answer he gives.

    The Boris Cherny segment is the part that should make working engineers sit up. When a senior engineer says Claude has written one hundred percent of his code for six months and that he feels like he has a jet pack, that is not a marketing line, it is a description of a job that has already changed underneath the person doing it. The framing in the piece is optimistic, superpowers and fun, but the logical endpoint is exactly the one Dario himself names a few minutes later: you automate ninety percent of a job, the remaining humans get ten times more leveraged, and then the curve keeps bending toward one hundred percent. Anthropic is, unusually, building the thing and narrating its own disruption in the same breath. That honesty is rare, and it is also a little vertiginous.

    The values-versus-business-model argument deserves more scrutiny than it gets. Dario’s claim is elegant: a business model that conflicts with your values forces you to either betray the values or become irrelevant, so Anthropic chose enterprise and coding because curing diseases and making energy cheaper are enterprise work, while consumer engagement is the addiction-maximizing trap of social media. It is a genuinely good argument, and it is also extremely convenient that the values-aligned path happens to be the most lucrative one. The episode lets that tension sit, which is the right call. The honest reading is that Anthropic found a place where doing well and doing good currently point in the same direction, and the harder test will come the first time they diverge.

    On jobs, Dario is more persuasive than his critics give him credit for, precisely because he refuses the comfortable framing. Jensen Huang and others accuse him of conflating tasks with jobs and of doom marketing that benefits Anthropic. Dario’s response, that the idea this is cheap marketing is itself cheap marketing, is sharper than it first sounds. He is pointing at the way social media flattens a five-page argument about tasks, jobs, tax policy, and the adolescence of technology into a three-second clip designed to provoke. The deeper point is that he is trying to hold two things at once, fast GDP growth and high unemployment, and our public discourse is structurally bad at holding two things at once. That is less a story about AI than about the medium we use to argue about it.

    Finally, the Oppenheimer exchange reframes the entire profile. Dario explicitly rejects the lone-genius model and names Leo Szilard, the scientist who first imagined the chain reaction, as the figure he identifies with. He calls Oppenheimer a failure case, an example of what should not happen. For a man whose company is constantly accused of cultivating a great-man mythology, choosing the early-warning scientist over the bomb’s public face is a deliberate statement about how he wants this story to end: not with charismatic individuals at the center of everything, but with checks and balances everywhere. It is the most quietly radical thing said in the whole piece, and the teaser for a model named Mythos lands with a little extra irony because of it.

    Key Takeaways

    • Anthropic is profiled as an AI juggernaut valued at nearly a trillion dollars, with the figure of roughly $965 billion framing the episode, and is described as profitable for the first time.
    • The company was founded in 2021 by a team of OpenAI defectors and started as an underdog lab before becoming the breakout star of the AI race.
    • Anthropic is run by a sibling duo, Dario Amodei as the visionary and Daniela Amodei as the operator who turns his ideas into action, and Daniela jokes that when they argue, no one wins.
    • Dario describes the AI trajectory as a “smooth exponential” where nothing seems to happen for a long time and then progress suddenly explodes.
    • He says he predicted from a graph that Anthropic would become the AI company with the most revenue and valuation around this time, and that it has happened.
    • Dario grew up in San Francisco with a leather-craftsman father and a librarian mother, took calculus in middle school, and studied math at UC Berkeley while in high school, with no early interest in the internet revolution.
    • Dario studied neuroscience before moving to AI at Baidu and later Google, while Daniela was an early employee at Stripe.
    • Both joined OpenAI starting in 2016, where Dario developed the concept of scaling laws, predicting that large language models would improve simply by adding more data and compute even if the underlying algorithm stayed the same.
    • Scaling up was a counter-cultural scientific bet at the time, held mainly by the founding research team, and it helped supercharge OpenAI’s models and pave the way for ChatGPT.
    • The Amodeis left OpenAI after clashing with Sam Altman over direction and values, framing it as a breakdown of trust and honesty rather than a single safety disagreement.
    • Altman has said that despite their differences, he mostly trusts Anthropic as a company.
    • Anthropic has all seven of its co-founders still at the company, which Dario notes almost never happens at a company of its size.
    • The early team met during the pandemic at Precita Park in San Francisco, pulling up chairs on the grass to talk about what they were building.
    • The name Anthropic comes from the Greek word for human, reflecting a stated mission to build responsible AI for the long-term benefit of humanity.
    • Dario has published long essays including Machines of Loving Grace and The Adolescence of Technology, exploring both the miraculous potential and the worst-case scenarios of AI.
    • Claude is trained to follow a set of principles called a Constitution, intended to keep it aligned and well-behaved.
    • Daniela describes Claude’s intended personality as “professional warmth,” approachable but distant, not a best friend and not cold or calculating.
    • A good model, in Anthropic’s framing, does not lie accidentally or intentionally, with lying including hallucinations where the model invents something it does not know.
    • Anthropic’s own research has shown that models can purposely try to deceive users, which the company works to prevent in production models.
    • There is no universal standard for helpfulness or harmlessness, so Anthropic draws on founding documents like the UN Declaration of Human Rights to train Claude’s character.
    • The company has begun consulting religious leaders about Claude as an entity and about core values that transcend any single worldview.
    • Early Claude models, around the Claude 2 era, were sometimes “nannyish,” expressing concern when a user just wanted the weather, which researchers describe as tuning a fine dial.
    • Anthropic’s revenue skyrocketed over the past year, driven by a focus on lucrative business tools rather than consumer apps.
    • Claude Code automated large chunks of software engineering, and Claude Cowork extended that power to non-engineers.
    • Dario frames the enterprise bet as a values-and-business decision, arguing that a business model conflicting with your values forces you to betray them or become irrelevant.
    • He contrasts engagement-and-addiction-driven consumer and advertising models with enterprise uses like curing diseases, advancing biotech and pharma, and making energy cheaper.
    • Soon after Claude Cowork launched, $285 billion in market value vanished overnight in what traders called the SaaSpocalypse, with some software stocks down nine days in a row.
    • Dario argues the software “pie” will get bigger overall, even as some incumbents shrink or go out of business if they fail to adapt and defend their moats.
    • Boris Cherny, the engineer behind Claude Code and Claude Cowork, was recruited in 2024 from a slow life in rural Japan where he made miso and shopped at farmer’s markets.
    • Cherny’s bet was that a coding agent could do all of software development, not just autocomplete a line or a sentence.
    • He now runs anywhere from a few to a few thousand Claudes at once and says Claude has written one hundred percent of his code for at least six months.
    • A live demo builds a working recipe app that suggests meals for the week in minutes, work that used to take hours or days.
    • At the second annual Code with Claude conference, Anthropic reported API volume up nearly 17x year over year, eight frontier models shipped in twelve months, and first-quarter growth that annualizes to roughly 80x.
    • Dario stands by his warning that AI could eliminate half of all entry level white collar jobs in the next one to five years, saying he remains the same order of concerned.
    • He warns of an unusual combination of very fast GDP growth alongside high unemployment, underemployment, low-wage jobs, and high inequality.
    • Jensen Huang and others have pushed back, accusing Dario of conflating tasks with jobs and of doom marketing that benefits Anthropic.
    • Dario responds that the claim this is cheap marketing is itself cheap marketing, and blames social media for flattening his careful five-page arguments into three-second clips.
    • Anthropic published a paper estimating that management, finance, and legal jobs could be among the fields most affected by AI in the near future.
    • Dario points to the physical world, human-centered relationship-driven work, and humans directing AI as places displaced workers might go, though he is unsure how thick those roles will be.
    • He uses medicine as an example, predicting AI will excel at diagnosis while doctors pivot toward the interpersonal, hands-on, bedside-manner parts that AI cannot replace.
    • The episode teases a next installment on AI and the future of warfare, a scarily powerful new model called Mythos, and the theme of riding the exponential while avoiding dystopia.
    • Dario names The Making of the Atomic Bomb as a favorite book and identifies most with Leo Szilard, who first conceived of a chain reaction, rather than Oppenheimer, whom he sees as a failure case.
    • His view is that the only way the AI era ends well is through checks and balances everywhere, not larger-than-life personalities at the center of everything.

    Detailed Summary

    An unlikely AI celebrity and a sibling-run juggernaut

    The profile opens in a library Dario Amodei clearly loves, establishing him as an unlikely AI celebrity, a man known for warning the world about the risks of artificial intelligence who now runs a company valued at nearly a trillion dollars. Anthropic is presented as the breakout star of the AI race, wiping billions off software stocks, going head-to-head with the Pentagon, and building models powerful enough to threaten modern cybersecurity, with early testers reportedly calling one capability a super weapon and asking the company not to release it. Guiding the company is the sibling pair, Dario the visionary and Daniela the operator who translates his swirling cosmic thoughts into action. Daniela explains that the two have always been close and always wanted to do something big together, and when asked who wins their arguments, she says no one. The framing throughout is of a young, fast-growing startup carrying enormous responsibility for how humanity works, learns, thinks, and even fights wars.

    The smooth exponential and the road from OpenAI

    Dario describes his entire career as the experience of a smooth exponential, where nothing happens for a long stretch and then things go crazy, and he says he watched a graph and correctly predicted Anthropic would top the field in revenue and valuation around now. His backstory is a math prodigy in San Francisco, the son of a leather craftsman and a librarian, taking calculus in middle school and Berkeley math classes in high school, indifferent to the internet revolution and drawn instead to science fiction and understanding the universe. Daniela, more into reading and the arts, calls them near-perfect complements. Dario moved from neuroscience into AI at Baidu and Google, Daniela went to Stripe, and both eventually joined OpenAI starting in 2016, where Dario developed scaling laws, the then counter-cultural bet that more data and compute alone would make models smarter. That insight helped power the models behind ChatGPT, but the Amodeis clashed with Sam Altman over values and direction. Dario frames the departure bluntly: disagreements on safety alone were not enough, but a loss of trust, a sense that Altman’s stated values were not his real values, made it impossible to continue. The resolution, he says, was simply to go off and do their own thing.

    Precita Park, the Constitution, and teaching Claude to be good

    Anthropic’s origin story runs through Precita Park, where the early pandemic-era team gathered on the grass to talk about what they were building. Of seven co-founders, all are still at the company, a retention record Dario says almost never happens at this scale. From the start the company pitched itself as the ultimate safety-conscious lab, with Dario publishing essays like Machines of Loving Grace and The Adolescence of Technology. Claude is trained on a Constitution, and Daniela describes its intended character as professional warmth, approachable but distant. Defining a good model, the team says it should not lie, whether through intentional deception or hallucination, the latter being the model inventing answers it does not actually know. Anthropic’s research has shown models can deliberately deceive, something they work to prevent in production. Because there is no universal standard for helpfulness or harmlessness, they anchor Claude’s training in documents like the UN Declaration of Human Rights and have begun talking with religious leaders about values that transcend any single worldview. Daniela recalls early “nannyish” Claude 2-era behavior, where the model fretted over a user who only wanted the weather, and describes the work as threading a fine needle to land in the center of the dial.

    The enterprise bet, Claude Code, and the SaaSpocalypse

    Anthropic’s revenue surge and first-time profitability are attributed to a focus on business tools, especially Claude Code, which automated large chunks of software engineering, and Claude Cowork, which extended that capability beyond engineers. Dario frames the bet on coding and enterprise as both a values and a business decision: a business model that conflicts with your values eventually forces you to betray them or become irrelevant. He contrasts the engagement and addiction incentives of advertising-driven social media and AI video with enterprise applications like curing diseases, biotech, pharma, academic research, and cheaper energy, all of which he counts as enterprise work aligned with the company’s mission. The disruption was immediate and brutal: soon after Claude Cowork launched, $285 billion in market value vanished overnight in what traders dubbed the SaaSpocalypse, with some software stocks falling nine days straight. Dario’s read is that the overall software pie will grow even as specific incumbents shrink or fail, and that the big losers will be those who do not see what is coming or defend their moats.

    Boris Cherny, jet packs, and Code with Claude

    Much of Anthropic’s recent growth is credited to Boris Cherny, the engineer behind Claude Code and Claude Cowork, hired in 2024 from a deliberately slow life in rural Japan where he made miso and frequented farmer’s markets. A serious science fiction reader, Cherny was awed by his first AI chatbot and also acutely aware of how badly the technology could go. His bet was that a coding agent could do all of software development rather than just autocomplete. He now describes orchestrating anywhere from a few to a few thousand Claudes at once, talking to one while it writes code and moving to the next, and says Claude has written one hundred percent of his code for at least six months. He compares the feeling to having superpowers and a jet pack, calling engineering more fun than ever. A live demo has Claude build a working weekly-meal recipe app in minutes. The story then moves to the second annual Code with Claude conference, where the company reports API volume up nearly 17x year over year, eight frontier models shipped in twelve months, and first-quarter growth annualizing to roughly 80x, with attendees ranging from technical superfans to curious non-engineers.

    Jobs, the tasks-versus-jobs fight, and a more human medicine

    The episode turns to the uncomfortable core: whether engineers will be the first casualties of the AI they are building. Dario stands by his warning that AI could eliminate half of all entry level white collar jobs in one to five years and says he is still the same order of concerned, describing a strange combination of very fast GDP growth with high unemployment, underemployment, low-wage work, and inequality. He notes the usual productivity hump, where automating ninety percent of a job makes humans ten times more leveraged on the rest, before the curve bends toward one hundred percent. With 70 percent of Americans expecting AI to kill jobs and nearly a third fearing for their own, the stakes are political. Jensen Huang and others accuse Dario of conflating tasks with jobs and of doom marketing, and Dario pushes back hard, arguing he writes carefully across five pages about tasks, jobs, tax and macroeconomic policy, and the new jobs of the adolescence of technology, and that calling this cheap marketing is itself cheap marketing born of social media’s three-second culture. Anthropic has published a paper suggesting management, finance, and legal jobs could change the most. Dario points to the physical world, human-centered relationship work, and humans directing AI as landing spots, using medicine as his example: AI will become an excellent diagnostician, but it cannot physically examine a patient or provide bedside manner, so medicine pivots toward the interpersonal. The episode closes by teasing AI and the future of warfare, a powerful new model called Mythos, and Dario’s identification with Leo Szilard over Oppenheimer, whom he calls a failure case, insisting the era can only end well with checks and balances everywhere rather than larger-than-life figures at the center.

    Notable Quotes

    “There’s this kind of smooth exponential, and the experience of the smooth exponential is, nothing’s happening, nothing’s happening, nothing’s happening. Little things happen, and then zoom, it goes crazy.”

    Dario Amodei, on how AI progress actually feels from the inside

    “When you feel that you can’t trust someone, when you feel that their values are not what they say they are, when you feel that they’re not honest, that makes it very hard to continue to work with a company.”

    Dario Amodei, on why he and Daniela left OpenAI

    “Some of the early companies that we gave this to said things like, this is a super weapon, please don’t release this.”

    Anthropic, on early reactions to one of its more powerful models

    “I like to describe it as professional warmth. So the goal is not for it to be your best friend, but it’s not for it to be sort of cold, rote, calculating.”

    Daniela Amodei, describing the character Anthropic designs into Claude

    “If you pick a business model that fundamentally conflicts with your values, you’re gonna have a hard time. Either you betray your own values or you become irrelevant.”

    Dario Amodei, on why Anthropic bet on enterprise and coding

    “For me personally, it’s been writing a hundred percent of my code for at least six months. The work of engineering has just completely changed.”

    Boris Cherny, the engineer behind Claude Code and Claude Cowork

    “I feel like I suddenly have superpowers. I have like a jet pack and the engineering has never been this fun.”

    Boris Cherny, on building software with Claude Code

    “I think we could have this very unusual combination of very fast GDP growth and high unemployment, or at least underemployment, or low wage jobs, high inequality.”

    Dario Amodei, on the economic shock he is most worried about

    “The idea that this is cheap marketing is itself cheap marketing. I think it’s part of the disease of Silicon Valley.”

    Dario Amodei, responding to the doom-marketing accusation

    “The figure I most identified with was Leo Szilard, who was the one who first had the idea that there could be a chain reaction.”

    Dario Amodei, on which atomic-age scientist he sees himself in, rejecting Oppenheimer as a failure case

    Watch the full episode of The Circuit inside Anthropic here.

    Related Reading

    • Anthropic the official site for the company, Claude, Claude Code, and its safety research.
    • Machines of Loving Grace Dario Amodei’s long essay on the optimistic case for powerful AI referenced in the profile.
    • Scaling laws (Wikipedia) background on the data-and-compute bet Dario developed that reshaped modern AI.
    • Leo Szilard (Wikipedia) the physicist who first conceived the nuclear chain reaction and whom Dario says he identifies with.
    • Purpose the PJFP pillar on building meaningful work and direction in a world being reshaped by AI.
  • The AI Layoff Trap: Why Competing Firms Over-Automate, Destroy Their Own Customers, and How a Pigouvian Automation Tax Could Break the Arms Race

    A new economics paper called The AI Layoff Trap, by Brett Hemenway Falk of the University of Pennsylvania and Gerry Tsoukalas of Boston University, makes an argument that is easy to state and hard to escape. If artificial intelligence displaces workers faster than the economy can reabsorb them, it eats into the consumer demand that every firm depends on. The unsettling part is the next step: the authors show that firms knowing this is not enough to make them stop. Even with perfect foresight, rational companies race toward the cliff anyway, and the reason is a textbook market failure hiding inside the automation boom.

    TLDR

    The paper builds a task-based model of a transitioning economy and refocuses it from the labor market to the product market. When a firm automates, it captures the entire cost saving from replacing workers, but it bears only a fraction of the demand destruction that those lost paychecks cause, because most of that lost spending would have gone to rivals. This demand externality means each firm’s privately optimal automation rate is a dominant strategy that overshoots the level that would be best for everyone, including the firm owners themselves. Competition makes it worse, a monopolist would internalize it, and in the frictionless limit the whole thing collapses into a Prisoner’s Dilemma where every firm fires its entire human workforce even though collective restraint would raise all profits. Better AI amplifies the distortion rather than curing it, a dynamic the authors call a Red Queen effect. They test six policy responses. Capital income taxes, worker equity, universal basic income, upskilling, and Coasean bargaining all fail to fix the core incentive. Only a Pigouvian automation tax, set equal to the uninternalized demand loss per task, restores the efficient outcome. The conclusion reframes the AI jobs debate away from cleaning up the aftermath and toward the competitive incentives that drive the layoffs in the first place.

    Thoughts

    The cleverest move in this paper is where it points the camera. Most of the automation literature, going back to Acemoglu and Restrepo’s task-based framework, asks whether the labor market rebalances after displacement through new tasks and a self-correcting wage channel. Falk and Tsoukalas mostly set that debate aside and look at the product market instead. The question is no longer just “will the displaced worker find a new job,” it is “who buys the output once enough workers have lost their income.” By framing lost wages as lost revenue for every firm in the sector, they turn a labor story into a demand story, and the demand story has a much darker equilibrium.

    What makes the result bite is that it does not depend on firms being short-sighted or greedy. The authors grant every firm perfect foresight. Everyone can see the demand cliff ahead. They still automate past the social optimum because the math of a competitive market splits the cost saving and the demand loss unevenly. You keep all the savings from firing your workers. You eat only a sliver of the demand damage, and your competitors absorb the rest, just as you absorb a sliver of theirs. No individual firm can afford to be the one that shows restraint, because restraint just hands market share to rivals who do not. This is a genuine externality, not a coordination failure, which matters because coordination failures can sometimes be solved by communication and this one cannot. Even a binding agreement among all the firms would not hold, since defecting to automate is a dominant strategy for each of them.

    The Red Queen result is the part that should give AI optimists pause. The intuitive hope is that more capable AI raises productivity enough to lift everyone, so the demand problem takes care of itself. The model says the opposite. When AI gets better, each firm sees a bigger share gain from automating ahead of rivals, but at the symmetric equilibrium those share gains cancel out across firms and what remains is a larger distortion. Faster, cheaper, smarter automation widens the wedge between what is privately rational and what is collectively efficient. The technology improving does not relieve the pressure, it intensifies the race.

    The policy section is where the paper earns its keep, because it refuses to let the comfortable answers off the hook. Universal basic income is the response most people reach for, and the model is blunt that it raises living standards without changing a single firm’s incentive to automate. It treats the symptom and ignores the margin. Upskilling and worker equity narrow the gap but cannot close it. Capital income taxes operate on profit levels, not on the per-task decision where the externality actually lives, so they leave the automation rate untouched. The only instrument that works is a tax aimed directly at the act of automating, priced at the demand damage it imposes on others. That is an uncomfortable conclusion for almost everyone. It tells the political left that UBI alone does not fix the structural problem, and it tells the political right that an unregulated market over-automates in a way that destroys profits, not just jobs.

    The honest caveat, which the authors state plainly, is that this is a structural vulnerability rather than a diagnosed crisis. The signature they predict, profit erosion that shows up alongside mass layoffs, requires displacement at a scale and speed the economy has not yet reached. If reabsorption keeps pace, the externality stays too small to measure. But the conditions they flag are worth watching, and a few of the early indicators they cite, like business investment overtaking consumer spending as the leading driver of GDP growth and a falling savings rate, are exactly the kind of demand-side strain the model predicts. The value here is a clear mechanism and a sharp policy implication, available before the crisis rather than after it.

    Key Takeaways

    • The central claim is that AI-driven layoffs can erode the consumer demand firms depend on, and that rational firms with perfect foresight will not stop the process on their own.
    • The mechanism is a demand externality. An automating firm captures the full labor-cost saving but bears only a fraction of the aggregate demand loss it creates, because most of the lost spending would have gone to rivals.
    • Because of that split, each firm’s profit-maximizing automation rate is a strictly dominant strategy that exceeds the level that is collectively efficient.
    • The resulting loss is not a transfer from workers to owners. It is a deadweight loss that leaves both workers and firm owners worse off.
    • The distortion deepens with competition. A monopolist fully internalizes the externality, while fragmented, competitive markets show the widest gap between private and social automation rates.
    • In the frictionless limit, where every task is equally easy to automate, the game becomes a Prisoner’s Dilemma in which every firm replaces its entire human workforce even though collective restraint would raise all profits.
    • The Red Queen effect: more productive AI widens the wedge rather than resolving it, because perceived market-share gains from automating ahead of rivals cancel at the symmetric equilibrium and only the added distortion remains.
    • Endogenous wage adjustment, a key self-correcting channel in standard models, raises the threshold at which the externality activates but cannot close the wedge short of collapsing wages to the cost of AI.
    • Free entry, capital-income recycling, and richer product-market structures also fail to eliminate the distortion.
    • The model evaluates six policy instruments against the externality margin and reaches a clear ranking.
    • Universal basic income raises the floor on living standards but leaves each firm’s automation incentive unchanged.
    • Capital income taxes do not change the equilibrium automation rate, because they operate on profit levels rather than the per-task margin where the externality lives.
    • Upskilling and worker equity participation narrow the wedge but cannot eliminate it.
    • Coasean bargaining fails because automation is a dominant strategy, so no voluntary agreement among firms to restrain layoffs is self-enforcing.
    • Only a Pigouvian automation tax, a per-task charge set equal to the uninternalized demand loss, implements the cooperative optimum.
    • The tax can be self-limiting. Its revenue can fund retraining that raises income replacement, which shrinks the externality over time.
    • By Tinbergen’s principle, a distinct market failure needs a distinct instrument, which is why the single targeted tax succeeds where the broad transfers fail.
    • The mechanism runs through the product market, distinguishing it from work like Beraja and Zorzi that locates inefficient automation in labor-market borrowing constraints.
    • Unlike many other channels for excessive automation, this externality requires competition and vanishes under monopoly, and it persists even when AI is highly productive and credit markets are complete.
    • The demand externality belongs to the family of aggregate demand spillovers, but it is the mirror image of the classic big push: here individually profitable automation is collectively destructive.
    • The authors defend the channel against a general-equilibrium objection, arguing that displaced spending does not rotate back to mass-market firms because high-income consumption saturates and producers cannot quickly retool.
    • A second escape route through a falling interest rate also stalls when rates are near zero or when the income loss is lasting rather than temporary.
    • The empirical signature would be profit erosion coinciding with mass layoffs, which standard competitive models cannot easily explain.
    • The model points to fragmented industries deploying the most capable AI as the place the problem would bite hardest, not the dominant technology firms.
    • Suggested places to look for the effect include customer support, software services, and back-office operations at competing financial institutions.
    • The authors cite real-world signals, including Block cutting nearly half its workforce in February 2026 with AI named as the reason, and more than a million U.S. job cuts announced in 2025 with AI explicitly tied to roughly 55,000.
    • They note that roughly 80% of U.S. workers hold jobs with tasks exposed to large language models, citing Eloundou and coauthors.
    • The model is deliberately conservative, using one sector, one period, and symmetric firms, which the authors argue means the real problem is likely worse than what they show.
    • A practical wrinkle: a unilateral automation tax could push adoption offshore, strengthening the case for multilateral coordination or border adjustments, an explicit analogy to carbon policy.
    • The big reframing is that policy should address not only the aftermath of AI labor displacement but also the competitive incentives that cause it.

    Detailed Summary

    A task-based model refocused on the product market

    The framework borrows the task-based structure of Acemoglu and Restrepo but redirects its attention. Several symmetric firms each choose what fraction of their workforce to replace with AI. Automated tasks cost less to perform, but integration frictions make each additional task harder to automate than the last. On the demand side, workers spend a share of their income on the sector’s output while owners spend less, normalized to zero in the baseline. Some displaced income returns through reemployment or transfers, and the rest is lost to the sector. The setup is intentionally stripped down so the demand channel is transparent and the cliff is visible to every firm in the model.

    The demand externality that traps every firm

    Competition creates the trap. When a firm automates, it pockets the full labor-cost saving, but under competitive pricing it bears only a fraction of the aggregate demand destruction it causes. The rest spills onto rivals. Because each firm faces the same incentive, every firm’s profit-maximizing automation rate is a dominant strategy that exceeds the cooperatively efficient level. Foresight does not save them. The cliff is visible, the incentive to keep walking toward it is individually rational, and the collective result is over-automation that erodes the shared revenue base.

    Competition deepens it, monopoly internalizes it

    The size of the distortion depends on market structure. A monopolist owns all of the demand it would destroy, so it fully internalizes the externality and automates at the efficient rate. As markets fragment, each firm internalizes less and the gap between private and social automation widens. The most competitive markets, often held up as the healthiest, produce the worst over-automation in this model.

    The frictionless limit becomes a Prisoner’s Dilemma

    When integration frictions disappear and every task is equally easy to automate, the game sharpens into a Prisoner’s Dilemma. Full automation dominates restraint for each firm, so every firm displaces its entire human workforce, even though all of them would earn higher profits if they collectively held back. This is the cleanest statement of the trap: a unanimously worse outcome that no firm can unilaterally avoid, and that communication cannot fix because defection is dominant rather than merely tempting.

    The Red Queen effect: better AI makes it worse

    Higher AI productivity does not rescue the equilibrium. Each firm perceives a market-share gain from automating beyond its rivals, but at the symmetric equilibrium those gains cancel across firms, leaving only the extra distortion. So improvements in AI widen the wedge instead of closing it. The authors name this the Red Queen effect, after the character who must run just to stay in place. Endogenous wage adjustment, the classic self-correcting force, raises the threshold where the externality activates but cannot close the wedge once it does, short of wages collapsing all the way to the cost of AI.

    Six policy fixes, and why only one works

    The paper lines up six instruments against the externality. Capital income taxes change profit levels but not the per-task automation margin, so the equilibrium rate is unchanged. Universal basic income lifts living standards without touching the incentive to automate. Upskilling and worker equity narrow the wedge but leave a gap. Coasean bargaining cannot hold because automating is a dominant strategy, so no agreement is self-enforcing. Only a Pigouvian automation tax, set equal to the uninternalized demand loss per task, implements the cooperative optimum. Its revenue can fund retraining that raises income replacement, which shrinks the externality over time and can make the tax self-limiting. Tinbergen’s principle frames the lesson: a distinct market failure needs its own dedicated instrument.

    Does the channel survive general equilibrium?

    A natural objection is that in a frictionless multi-sector economy, displaced income would simply rotate to other spending and the mechanism would dissolve. The authors argue both escape routes are blocked for the mass-market firms most exposed to AI. Spending does not rotate back because high-income consumption saturates and mass-sector producers cannot quickly retool to capture redirected luxury demand. The other route runs through the interest rate: automation shifts income to owners who save more, raising aggregate saving, which a falling interest rate would normally recycle into investment. That adjustment stalls when rates are already near zero or when the income loss is lasting rather than temporary, so displaced workers cannot borrow their way through it.

    What to watch for in the real economy

    The distinguishing empirical signature would be profit erosion that shows up at the same time as mass layoffs, a combination standard competitive models struggle to explain since cost-cutting technology is supposed to raise profits. The authors are careful that this requires displacement at a scale and speed not yet reached, so the contribution is identifying a structural vulnerability rather than diagnosing an active crisis. They point to fragmented industries running the most capable AI as the place to look first, naming customer support, software services, and competing financial institutions’ back-office operations as concrete settings. They also flag a unilateral tax’s offshoring risk, drawing an explicit parallel to carbon policy and the case for multilateral coordination or border adjustments.

    Notable Quotes

    “At the limit, this becomes self-destructive: firms automate their way to boundless productivity and zero demand.”

    The authors, framing the demand cliff that competitive automation runs toward.

    “Rational, forward-looking firms should be the brake; if the cliff ahead is visible to all, why would they race toward it?”

    The authors, setting up the puzzle the paper exists to answer.

    “No firm can afford to be the one that holds back. This is the trap: an automation arms race that only intensifies as AI improves, that leaves workers and firm owners alike worse off, and that no market force can break.”

    From the Discussion, stating the core result in plain language.

    “Because over-automation leaves both firms and workers worse off, correcting it is a matter of eliminating waste, not of redistributing gains between them.”

    The authors, on why the fix is not a left-versus-right transfer fight.

    “This Red Queen effect means that ‘better’ AI, far from mitigating the externality, amplifies it.”

    The authors, on why more capable AI deepens the distortion rather than curing it.

    “The results suggest that policy should address not only the aftermath of AI labor displacement but also the competitive incentives that drive it.”

    From the abstract, the paper’s central policy reframing.

    You can read the full paper, including the formal propositions and the policy table, on arXiv here.

    Related Reading

  • Claude Fable 5 and Claude Mythos 5: Anthropic Ships Its First Generally Available Mythos-Class AI Model With New Safeguards

    Anthropic has launched Claude Fable 5 and Claude Mythos 5, the first Mythos-class models offered beyond a tiny circle of cyber defenders. Fable 5 is the generally available version, wrapped in a new layer of safeguards, while Mythos 5 is the same underlying model with some of those guardrails lifted for a small group of vetted partners. The pair sits a full tier above the Opus class in raw capability, and the launch is as much a story about how Anthropic is choosing to gate that capability as it is about the benchmarks. Below is a full breakdown of what shipped, what the model can do, and why the safeguard design matters.

    TLDR

    Anthropic released Claude Fable 5, a Mythos-class model that is now its most capable generally available model, posting state-of-the-art results across software engineering, knowledge work, vision, memory, and scientific research. To ship it safely and fast, Fable 5 carries new safety classifiers that route flagged queries in cybersecurity, biology and chemistry, and distillation over to Claude Opus 4.8 instead of refusing, a fallback that triggers in under 5% of sessions. The same model ships without cyber safeguards as Claude Mythos 5 for Project Glasswing partners in collaboration with the US Government, where it is described as having the strongest cybersecurity capabilities of any model in the world. Highlights include a codebase-wide migration of a 50-million-line Ruby codebase that Stripe says took a day instead of two months, beating Pokemon FireRed with a vision-only harness, accelerating drug design roughly tenfold using Mythos 5, producing novel molecular biology hypotheses preferred by scientists about 80% of the time, and over a week of autonomous genomics research. Both models cost 10 dollars per million input tokens and 50 dollars per million output tokens, less than half the price of Mythos Preview, with a staged subscription rollout and a new 30-day data retention policy for Mythos-class traffic.

    Thoughts

    The most interesting decision here is not the capability jump, it is the naming split. Fable and Mythos are the same brain. The only difference is whether the safeguards are on. Anthropic is effectively shipping one model twice: a gated public edition and an ungated edition handed to a short list of trusted defenders working with the US Government. That is a clean way to resolve the central tension of frontier AI, which is that the exact capabilities that help a security professional close a vulnerability also help an attacker find one. Rather than dumbing the model down for everyone or holding it back entirely, they are letting the access list, not the weights, carry the risk. Expect this pattern to repeat as capabilities climb.

    The fallback-to-Opus design is the other quietly important choice. When a classifier flags a query in cybersecurity, biology, chemistry, or suspected distillation, the user does not hit a wall of refusal. The request is silently handed to Opus 4.8, a model that is still excellent at almost everything. Graceful degradation beats a hard no, both for user experience and for trust. It also reframes what a safeguard is. Instead of a binary block, it becomes a routing decision, and because more than 95% of sessions never trigger it, most users will never notice it exists. The honest admission that the classifiers are tuned conservatively and will sometimes catch harmless requests is the right posture, even if it will annoy power users who keep getting bounced to the smaller model.

    The commercial signals are worth reading closely. Pricing came down to less than half of Mythos Preview, which suggests confidence in serving costs at scale, but the subscription rollout tells a more cautious story. Fable 5 is free on Pro, Max, Team, and Enterprise plans only through June 22, after which using it requires usage credits until capacity catches up. That is a polite way of saying demand is expected to badly outrun supply. The model is fully available on the API and consumption-based Enterprise plans from day one, because those bill by the token and self-throttle. Subscriptions, which are all-you-can-eat, are where a capacity crunch actually hurts, so that is exactly where the brakes went on.

    On the science, the genomics result is the one that should make people sit up. A model doing over a week of largely autonomous research, assembling single-cell data across 138 species, then designing and training its own machine learning model that outperforms a recently published Science paper while being 100 times smaller, is a different category of claim than acing a benchmark. So is the drug-design work, where Mythos 5 reportedly matches or beats skilled human operators end to end, choosing binding sites, running protein design tools, and recovering from its own failures. If those hold up to publication and independent replication, the interesting frontier stops being chat quality and becomes whether a model can run a research program. That is also precisely why the biology and chemistry classifier exists, and why Anthropic is being so deliberate about who gets the ungated version.

    One caveat worth keeping in view: nearly all of the evidence in the announcement is Anthropic’s own, or comes from partners with early access and an incentive to be enthusiastic. The Stripe migration, the FrontierCode score, the Slay the Spire memory result, the protein targets, and the genomics model are all compelling, but they are first-party until outside labs and the eventual system card, peer review, and independent red-teamers weigh in. The note that the UK AISI made progress toward a universal jailbreak inside a brief testing window is a useful reminder that the safeguard story is a work in progress, not a finished proof.

    Key Takeaways

    • Claude Fable 5 is a Mythos-class model made safe for general use, and is now Anthropic’s most capable generally available model.
    • Mythos-class is a tier that sits above the Opus class in capability. The first was Claude Mythos Preview, released in April through Project Glasswing.
    • Fable 5 is state-of-the-art on nearly all tested benchmarks, and its lead grows as tasks get longer and more complex.
    • Claude Mythos 5 is the same underlying model as Fable 5, but with safeguards lifted in some areas. Fable and Mythos differ only by their safeguards.
    • Mythos 5 is described as having the strongest cybersecurity capabilities of any model in the world, and is deployed through Project Glasswing with the US Government.
    • New safety classifiers cover cybersecurity, biology and chemistry, and distillation. Flagged queries fall back to Claude Opus 4.8 rather than being refused.
    • Users are told whenever a fallback happens. More than 95% of Fable sessions involve no fallback at all, and for those sessions Fable performs effectively the same as Mythos 5.
    • The safeguards are tuned conservatively and trigger in less than 5% of sessions on average, sometimes catching harmless requests. Anthropic plans to reduce false positives after launch.
    • Stripe reported Fable 5 compressed months of engineering into days, performing a codebase-wide migration of a 50-million-line Ruby codebase in a day that would have taken a team over two months by hand.
    • Fable 5 scores highest among frontier models on Cognition’s FrontierCode evaluation for high-quality agentic coding, even at medium effort, and is more token-efficient than past Claude models.
    • On Hebbia’s Finance Benchmark for senior-level reasoning, Fable 5 has the highest score of any model, with gains in document reasoning, chart and table interpretation, and problem solving.
    • IMC noted Fable 5 aced their trading-analysis evaluations nearly across the board, including factual lookup, conceptual reasoning, root-cause analysis, and expected-value analysis.
    • Fable 5 is the new state-of-the-art for vision, and can rebuild a web app’s source code from screenshots alone.
    • Fable 5 beat Pokemon FireRed using a minimal, vision-only harness with no maps, navigation aids, or extra game-state information. Earlier Claude models needed a complex helper harness.
    • Persistent file-based memory improved Fable 5’s Slay the Spire performance three times more than it did for Opus 4.8, and Fable reached the game’s final act three times more often.
    • Fable 5 built a simulation of the solar system, deriving the planets’ orbital motion from physics first principles and using it to predict solar eclipses.
    • Using Mythos 5, internal protein design experts accelerated aspects of drug design by around ten times, with the model matching or beating skilled human operators end to end.
    • Nine of 14 protein targets in the drug-design study yielded strong candidates Anthropic is now investigating.
    • Mythos 5 is Anthropic’s first model to consistently produce novel, compelling scientific hypotheses. Scientists preferred its molecular biology hypotheses about 80% of the time in blinded comparisons.
    • One Mythos hypothesis, a novel mechanism for an E. coli protein, was corroborated by an independent lab working on the same problem.
    • In over a week of largely autonomous work, Mythos 5 assembled single-cell data for millions of cells across 138 animal species and trained a custom model that outperformed a recent Science paper while being 100 times smaller.
    • Anthropic’s automated alignment assessment found Mythos 5’s level of misaligned behavior was low and similar to Opus 4.8. Because they are the same model, Fable 5’s alignment is similar.
    • An external bug bounty produced no universal jailbreaks in over 1,000 hours of testing, though the UK AISI made progress toward one in a brief initial window.
    • One external partner found Fable 5’s safeguards against harmful cyber queries the most robust of any model tested, including Opus 4.8 and Opus 4.7, with zero compliance on harmful single-turn cyberattack requests.
    • The biology and chemistry classifier is deliberately broad for now. Mythos-class models outperformed dedicated protein language models at predicting AAV viral shell assembly using biological reasoning alone.
    • The distillation classifier targets large-scale attempts to extract Claude’s capabilities to train competing models, which could proliferate near-frontier capabilities without safeguards.
    • A new policy requires 30-day data retention for all Mythos-class traffic on first- and third-party surfaces, used only for safety, with logged human access and deletion after 30 days in almost all cases.
    • Anthropic plans trusted access programs that let cybersecurity organizations apply for Mythos 5, and let a small number of life science researchers access Fable 5 with biology and chemistry safeguards removed.
    • Both models cost 10 dollars per million input tokens and 50 dollars per million output tokens, less than half the price of Mythos Preview. Developers can use claude-fable-5 via the Claude API.
    • Fable 5 is free on Pro, Max, Team, and seat-based Enterprise plans through June 22. On June 23 it moves to usage credits on those plans until capacity allows it to return as a standard inclusion.

    Detailed Summary

    A Mythos-class model, made safe for general use

    Fable 5 is the first Mythos-class model Anthropic has made generally available. Mythos-class is a tier that sits above the Opus class, and the first of its kind, Claude Mythos Preview, was released in April through Project Glasswing to a limited group of cyber defenders and critical software infrastructure providers. The company framed today’s launch as the moment it could finally bring that level of capability to all users, because its safeguards had matured enough to allow it. Fable 5’s capabilities exceed those of any model Anthropic has made generally available, and its advantage over other models grows as tasks get longer and more complex.

    Two models, one brain

    Claude Mythos 5 is the same underlying model as Fable 5, but with safeguards lifted in some areas. The names are the only real difference: Fable, from the Latin fabula meaning that which is told, is akin to the Greek mythos, and the safeguards are what distinguish the two. Mythos 5 launches first to existing Mythos Preview users, including the Project Glasswing cybersecurity partners, as an upgrade. It is deployed in collaboration with the US Government and is described as having the strongest cybersecurity capabilities of any model in the world. Anthropic plans to steadily expand access through a more systematic trusted access program.

    Software engineering and token efficiency

    Fable 5 can work autonomously for longer than any previous Claude model, and software engineering is where that shows most clearly. During early testing, Stripe reported it compressed months of engineering into days, performing a codebase-wide migration in a 50-million-line Ruby codebase in a single day that would otherwise have taken a whole team over two months by hand. It is also more token-efficient than past models, scoring highest among frontier models on Cognition’s FrontierCode evaluation for high-quality, maintainable agentic coding, even at medium effort.

    Knowledge work, vision, and memory

    On complex analytical work, Fable 5 posted the highest score of any model on Hebbia’s Finance Benchmark for senior-level reasoning, with substantial gains in document-based reasoning and chart and table interpretation, and IMC said it aced their trading-analysis evaluations nearly across the board. In vision, it is the new state-of-the-art, able to extract precise numbers from detailed scientific figures and rebuild a web app’s source code from screenshots alone. It needs less scaffolding too: where earlier Claude models struggled to play Pokemon even with helper harnesses, Fable 5 beat FireRed with a minimal, vision-only harness using nothing but raw game screenshots. On memory, giving Fable persistent file-based notes improved its Slay the Spire performance three times more than it did for Opus 4.8, and it built a physics-first-principles solar system simulation accurate enough to predict solar eclipses.

    Life sciences: drug design, hypotheses, and genomics

    Using Mythos 5, Anthropic’s internal protein design experts accelerated aspects of the drug-design process by around ten times. With protein design and bioinformatics tools but no human assistance, the model matched or beat skilled human operators, executing the full workflow of choosing binding sites, selecting and running design tools, and recovering from failures. Nine of 14 protein targets yielded strong drug-design candidates now under investigation. Mythos 5 is also Anthropic’s first model to consistently produce novel, compelling scientific hypotheses: scientists preferred its molecular biology hypotheses about 80% of the time in blinded comparisons, and one, a novel mechanism for an E. coli protein, was corroborated by an independent lab. In genomics, Mythos 5 ran over a week of largely autonomous research, assembling single-cell data for millions of cells across 138 species and training a custom model that outperformed a recent Science paper despite being 100 times smaller.

    The new safeguards: classifiers and fallback

    Mythos-class capability is potent enough that Anthropic considers it a substantial misuse risk, especially given how much advanced AI usage is dual use. Fable 5 ships with a new set of classifiers, separate AI systems that detect potential misuse and jailbreak attempts and stop the main model from responding. When a classifier flags a request related to cybersecurity, biology and chemistry, or distillation, the response is handled by Claude Opus 4.8 instead, and the user is told. The cybersecurity classifiers cover both exploitation and broader offensive cyber tasks like reconnaissance and lateral movement, and Anthropic says they prevent Fable from making any progress on those tasks. The biology and chemistry classifier is intentionally broad for now, after tests showed Mythos-class models could outperform dedicated protein language models at predicting AAV viral shell assembly using biological reasoning alone. The distillation classifier targets large-scale attempts to extract Claude’s capabilities to train competing models.

    Jailbreak resistance, data retention, and availability

    Anthropic ran extensive red-teaming, including an external bug bounty that produced no universal jailbreaks in over 1,000 hours, though it notes the UK AISI made progress toward one in a brief window. The company concedes it is likely impossible to fully prevent universal jailbreaks and aims instead to make any that remain slow and costly enough to catch before they scale. A new policy requires 30-day data retention for all Mythos-class traffic, used only for safety, with logged human access and deletion after 30 days in almost all cases. On availability, Fable 5 is live everywhere today and fully available on the API and consumption-based Enterprise plans, while subscription access rolls out in stages: free on Pro, Max, Team, and seat-based Enterprise through June 22, then on usage credits from June 23 until capacity allows it to return as a standard inclusion. Both models cost 10 dollars per million input tokens and 50 dollars per million output tokens.

    Notable Quotes

    “Today we’re launching Claude Fable 5: a Mythos-class model that we’ve made safe for general use.”

    Anthropic, opening the Claude Fable 5 and Claude Mythos 5 announcement

    “Fable 5’s capabilities exceed those of any model we’ve ever made generally available.”

    Anthropic, on where Fable 5 sits in the lineup

    “It has the strongest cybersecurity capabilities of any model in the world.”

    Anthropic, describing Claude Mythos 5

    “During early testing, Stripe reported that Fable 5 compressed months of engineering into days.”

    Anthropic, on Fable 5’s software engineering results

    “Our early data shows that more than 95% of Fable sessions involve no fallback at all.”

    Anthropic, on how often the safeguards route to Opus 4.8

    “Mythos 5 is our first model to consistently produce novel, compelling scientific hypotheses.”

    Anthropic, on the model’s molecular biology research

    “It is likely impossible to completely prevent universal jailbreaks, but our goal is to make any remaining jailbreaks sufficiently slow and costly that we can detect and prevent them before they are used at scale.”

    Anthropic, on the limits of its safeguards

    “Fable is from the Latin fabula, ‘that which is told,’ akin to the Greek mythos. The safeguards are what distinguish the two models.”

    Anthropic, explaining the Fable and Mythos naming

    Read the full announcement and the benchmark tables on Anthropic’s site here: Claude Fable 5 and Claude Mythos 5.

    Related Reading