PJFP.com

Pursuit of Joy, Fulfillment, and Purpose

Tag: AI memory

Thomas Laffont of Coatue on the $4 Trillion AI IPO Wave: SpaceX, Anthropic, OpenAI, and Why the New Unicorn Economy Is Healthier
Thomas Laffont, co-founder of the $55 billion hedge fund Coatue Management, made his All-In Podcast premiere with a data-dense walk through what he calls a once-in-a-generation moment for the unicorn economy. In front of Chamath Palihapitiya, Jason Calacanis, David Sacks, and David Friedberg, he argued that a roughly $4 trillion wave of private value is about to hit the public markets, led by SpaceX, Anthropic, and OpenAI, and that the new AI-driven unicorn economy is actually healthier than the one that came before it. You can watch the full presentation and Q&A on YouTube.

TLDW

Laffont presents Coatue’s slide deck on the state of the unicorn economy and argues it has rebalanced after the excesses of 2021. The average unicorn is up about 70 percent since September 2024, AI keeps taking a bigger share of all fundraising, and the model has shifted from many small unicorns to fewer companies each raising far more, with funding per unicorn up roughly 5x since 2021. He introduces a “Magnificent 8” private index (SpaceX, Stripe, Anthropic, Databricks, Revolut, ByteDance, Anduril, and more) worth nearly $4 trillion that has crushed the public Mag 7, then shows that exits are finally thawing as SpaceX heads to an IPO in weeks and Anthropic confidentially files its S1. He lays out Coatue’s “CODE” framework for why SpaceX gets more valuable the more it launches, a counterintuitive finding that the odds of a 10x actually rise as companies get bigger (31 percent for $100 billion-plus centicorns), the explosive revenue ramp of OpenAI and Anthropic past Workday, ServiceNow, Adobe, Salesforce, and now the hyperscalers, a three-pillar map of where AI revenue comes from (consumer, ads, enterprise), and the AI memory thesis. The Q&A with Chamath and Calacanis digs into the power law, K-shaped outcomes, whether these valuations are disconnected from reality, the public market as the great antiseptic, and what happens when trillions in private value finally recycles back through GPs and LPs.

Thoughts

The most useful idea in the talk is not the $4 trillion headline, it is the cohort-health chart. Laffont splits unicorns into eras and shows that the pre-2021 cohort was healthy, roughly 80 percent had raised again or exited 20 quarters after minting, while the giant 2021 ZIRP cohort of 479 companies is stuck with under 20 percent doing either. That single comparison reframes the whole AI boom. The bullish read is that the 2024 AI cohort is small, concentrated, and cash-generative, so it looks more like the healthy pre-ZIRP group than the 2021 hangover. The bearish read is that we are watching the same movie with bigger numbers, and the test only comes when these companies face public markets. Laffont is honest that we do not yet know which cohort the AI class resembles, and that intellectual humility is what makes the deck credible rather than promotional.

The SpaceX “CODE” framework is the sharpest analytical move of the presentation. Most people would assume a launch business gets cheaper per launch as it scales. Laffont shows the opposite, the market pays more per launch as cadence rises, and explains it as a phase change in business quality: from one-time government launch revenue, to a single recurring-revenue constellation, to multiple constellations, to a platform with optional upside in space data centers, the moon, and Mars. It is a clean way to think about any company that climbs from a project business to a platform business, and it applies far beyond rockets. The lesson for investors is that valuation can rationally expand even as unit economics look like they should compress, because the nature of the revenue underneath is changing.

The counterintuitive 10x odds finding deserves more attention than it got in the room. Conventional wisdom says the bigger you are, the harder it is to grow, so a $100 billion company should be less likely to 10x than a $10 billion one. Coatue’s data says the reverse: centicorns have a 31 percent shot at a 10x, far higher than the 8 percent a unicorn has at becoming a decacorn. Laffont’s explanation is a filtering mechanism, every step up validates a compounding advantage and durability of earnings, so survivors are increasingly the kind of business that keeps compounding. This is essentially a quantitative restatement of quality investing, and it is the intellectual backbone of the LP strategy the besties tease out, just buy whoever reaches $100 billion and hold.

Where the argument gets genuinely contested is valuation, and the panel does not let it slide. The pushback that “these are not fake companies” is true and important, OpenAI and Anthropic are growing faster than any software company in history, and Anthropic reportedly had a profitable month. But growth and reality do not settle the question of price when you are paying 50 to 100 times revenue for trillion-dollar private companies, as Bill Ackman pointed out earlier in the day. Laffont’s answer is the most grounded thing he says all session: the public market is the great antiseptic, it will not care about anyone’s slide deck, and he wants to see these names withstand short sellers and skeptics. That is the right posture. The deck is a thesis, not a verdict, and the verdict arrives roughly six months and one day after the IPOs, once passive flows and supply have washed through.

The closing thread, that almost every sector is being transformed at once and we still do not have superintelligence, is the part worth sitting with. The risk in a presentation this bullish is treating the trend as destiny. The value is in the framing tools Laffont hands you, cohort health, phase-change business quality, the filtering odds, the three revenue pillars, and the antiseptic of public scrutiny. Use those to interrogate each name rather than to buy the index on faith, and the talk earns its premiere billing.

Key Takeaways
- Coatue Management is one of the most successful hedge funds of the last two decades with about $55 billion under management, and is raising roughly another billion dollars specifically to invest in AI.
- The unicorn economy is up about 70 percent on average since September 2024, and the public market has made a similar move up over the same period.
- The unicorn economy’s share of the NASDAQ rose significantly after 2015 but has plateaued in recent years, reflecting strong performance from public companies.
- AI keeps increasing its wallet share of all venture fundraising, multiple years in a row now.
- The composition of funding has changed. The unicorn “factory” peaked in the ZIRP era of 2021 and has normalized at a much lower level since.
- Funding per unicorn has increased roughly 5x since 2021. There are fewer unicorns, and each one is raising more.
- Cohort health, pre-ZIRP group: of about 73 unicorns, 20 quarters after minting roughly 80 percent had either raised a new round or exited, which is healthy.
- Cohort health, 2021 group: of about 479 unicorns, 20 quarters in, fewer than 20 percent had exited or raised again. Far larger cohort, far worse outcomes.
- The open question is which cohort the new 2024 AI cohort will resemble.
- Funding is concentrating: the top 10 companies capture a large share, and it is a small number of AI companies, not all of them, with Anthropic and OpenAI raising massive rounds.
- Laffont proposes a “Magnificent 8” private index: SpaceX, Stripe, Anthropic, Databricks, Revolut, ByteDance, Anduril, and more, spanning internet, AI, fintech, and space tech.
- That private index represents almost $4 trillion of value and has crushed the traditional public Mag 7, with almost every name outperforming.
- Exits are thawing. 2026 is on a good trend for cash returned versus consumed, not quite 2021 levels, with half a year still to go.
- That trend does not yet include three imminent liquidity events: SpaceX (IPO expected in weeks) and Anthropic (confidentially filed its S1), whose combined value could exceed the prior decade of exits combined.
- The ecosystem is far more balanced than when Laffont first presented at the 2024 All-In Summit, when it was consuming much more cash than it returned.
- OpenAI and Anthropic revenue growth is unlike anything previously seen. Starting from January 2025, they passed Workday, then ServiceNow, then Adobe, then Salesforce, and are now bigger than Google Cloud and Azure.
- On current forecasts, that revenue could pass AWS by the end of the year and exceed all of Microsoft by 2028.
- Hyperscalers are not sitting still. The largest companies in the world are funding the disruption, investing unprecedented sums to enable the ChatGPT moment.
- The SpaceX “CODE” framework: the number one driver correlated to SpaceX’s valuation is cadence of launches, and valuation per launch rises as launches increase.
- Why per-launch value rises: business quality improves through phases, pre-constellation (one-time government revenue), initial ramp (one recurring-revenue constellation), scale (multiple constellations), and platform (space data centers, moon and Mars optionality).
- Anthropic in particular is scaling like no company seen across the PC, internet, or mobile eras.
- Counterintuitive 10x odds: a unicorn has about an 8 percent chance of becoming a decacorn, a decacorn has 8 to 13 percent odds of reaching $100 billion, but a centicorn ($100 billion-plus) has a 31 percent chance of a 10x.
- Value creation has accelerated. It typically takes years to go from $500 billion to $1 trillion in market cap, yet recently three companies did it in one year and two did it in a matter of weeks.
- Cerebras is the counterexample of slow success: years of dark periods and no new capital developing its technology, then a massive OpenAI contract that quintupled the company’s value ahead of its IPO.
- Semiconductors are on a generational run, with the sector dramatically outperforming the index since the 2024 All-In Summit.
- AI memory thesis: the more an AI system knows about you, the more useful it is, so memory per user could quintuple, which helps explain recent moves in memory companies.
- Where the revenue is: the AI ecosystem is roughly $140 billion today, about $300 billion this year, and is expected to double in 2027.
- Three revenue pillars: consumer (subscribers times ARPU), ads (about a quarter of Meta and Google ads are AI-enabled today, heading toward 100 percent and roughly $150 billion), and enterprise (tools like Claude Code and Codex inside businesses).
- Disruption is hitting every sector: software, telco (Starlink-powered global phone calls), semis, energy (data centers reshaping Pennsylvania’s grid), auto (Ferrari’s electric and autonomous stumble), and consumer (GLP-1s reshaping food, alcohol, and wellness).
- Final takeaways: the new unicorn economy is healthier thanks to AI, winners are compounding faster so the cost of not owning a winner is higher than ever, disruption is everywhere, and we do not even have superintelligence yet.
- In the Q&A, both Anthropic and OpenAI publicly say they want to be public, and big outcomes now look likely to become liquid within roughly a 12-month window.
- The valuation pushback: these are not fake companies, they generate substantial revenue at scale and grow faster than anything before, and Anthropic reportedly even had a profitable month.
- The public market is framed as the great equalizer and antiseptic, but with passive buying the true price discovery may not land on day one, more like six months and a day after listing.
- A floated LP strategy: wait for whoever reaches $100 billion and concentrate capital there as the least brittle, quickest-return bet, tempered by the warning that valuations are disconnecting from any historical metric (50x to 100x revenue).
- An open risk: with so much capital, OpenAI and Anthropic could rationally start a price war, the way ride-sharing and food-delivery players once did, though heavy infrastructure spend complicates it.
Detailed Summary

The unicorn economy has rebalanced after 2021

Laffont opens by reframing a market many assume is frothy. The average unicorn is up about 70 percent since September 2024, and the public market has tracked a similar climb, so private and public value are moving together rather than diverging. The unicorn economy’s share of the NASDAQ rose sharply after 2015 and then plateaued, which he reads as a sign of how strong public companies have become. Underneath the headline, the structure of funding has changed. The 2021 ZIRP era was a unicorn factory that minted enormous numbers of companies, and that machine has since normalized to a much lower level. The result is a barbell: fewer new unicorns, but each raising far more, with funding per unicorn up roughly 5x since 2021. AI sits at the center of this, taking a steadily larger share of all venture dollars for several years running.

Cohort health is the real story

The deck’s most important slide measures the health of the ecosystem by cohort. The pre-ZIRP cohort, about 73 unicorns, looks healthy: 20 quarters after becoming unicorns, roughly 80 percent had either raised a new round or exited. The 2021 cohort tells the opposite story. It is enormous, about 479 unicorns, and 20 quarters in, fewer than 20 percent had raised again or exited. That contrast sets up the central question of the talk. A new 2024 cohort of AI companies is forming, and no one yet knows whether it will resemble the healthy pre-ZIRP group or the bloated, stuck 2021 group. Laffont’s framing leans optimistic because the AI cohort is small and concentrated, but he is careful not to declare the answer.

The Magnificent 8 and a $4 trillion private index

Funding is not just flowing to AI, it is flowing to a handful of AI names, with the top 10 capturing a large share and Anthropic and OpenAI raising the biggest rounds. From this concentration Laffont builds a private index he half-jokingly calls the Magnificent 8, a number he expects to shrink as companies go public. The members span sectors: SpaceX, Stripe, Anthropic, Databricks, Revolut, ByteDance, and Anduril, covering internet, AI, fintech, and space tech. He says he would be comfortable owning that index for the next decade-plus. Collectively it represents almost $4 trillion of value and has outperformed the public Mag 7, with nearly every constituent beating that benchmark.

Exits are thawing and a wall of liquidity is coming

One of Laffont’s recurring concerns at past summits has been balance: the unicorn economy is great at consuming cash, but a healthy ecosystem must also return it. On that score 2026 is trending well, not quite 2021, but solid with half a year left. Crucially, that figure does not yet include three imminent events. SpaceX is expected to go public within weeks, and Anthropic confidentially filed its S1 the day of the talk. Adding those up, just a few companies could deliver more liquidity than the prior ten years combined. The takeaway is that the ecosystem that was dangerously out of balance in 2024 is now meaningfully more balanced, and improving.

The revenue ramp past the hyperscalers

The growth rates of OpenAI and Anthropic, Laffont argues, are unlike anything previously seen. Charting from January 2025, the leading AI labs passed Workday, then ServiceNow, then Adobe by year end, then Salesforce by January, and are now bigger than Google Cloud and Azure. On forecast, that revenue could surpass AWS by the end of the year and exceed all of Microsoft by 2028. He stresses that the hyperscalers are not passive bystanders, they are actively funding the disruption, pouring unprecedented capital into enabling the change that began with the ChatGPT moment.

The SpaceX CODE framework

Laffont devotes real time to how Coatue thinks about SpaceX. The single factor most correlated with SpaceX’s valuation is cadence of launches, which is intuitive for a launch business. The surprise is that valuation per launch has risen rather than fallen as cadence climbed. His explanation, the CODE framework, is that the quality of the business model improves the more SpaceX launches. In phase one, pre-constellation, you are simply proving rockets, with a few government customers and lumpy, unpredictable one-time revenue. In the initial ramp you stand up a constellation, which is an end market and a recurring-revenue business that grows with every satellite and subscriber. At scale you operate multiple constellations, and Laffont expects companies, governments, and militaries to want to own their own. Ultimately it becomes a platform, with new businesses layered on top, from space data centers to the optionality of the moon and Mars.

Counterintuitive odds and the speed of value creation

Coatue bucketed companies and asked the odds of a 10x within each. A unicorn has roughly an 8 percent chance of becoming a decacorn. A decacorn has 8 to 13 percent odds of reaching $100 billion. But a centicorn, $100 billion or more, has a 31 percent chance of a 10x, counting both public and private companies. The bigger you are, the better your odds, which inverts intuition. Laffont pairs this with the sheer speed of recent value creation. Going from $500 billion to $1 trillion in market cap normally takes years, yet three companies did it in a single year and two did it in a matter of weeks. He also offers Cerebras as the patient counterexample, a chip company that endured years of dark periods and no new capital before a massive OpenAI contract quintupled its value ahead of IPO, part of a broader generational run for semiconductors.

AI memory and where the revenue actually comes from

A throughline from the day’s other speakers is that the more an AI knows about you, the more useful it is, from your restaurant preferences to your work context. Laffont turns that into a thesis: memory per user could quintuple based on what these systems require, which helps explain recent moves in memory companies. He then tackles the most contested question, where is the revenue. He sizes the AI ecosystem at about $140 billion today, roughly $300 billion this year, and doubling in 2027, built on three pillars. Consumer is subscribers times ARPU. Ads are the pillar people forget, with about a quarter of Meta and Google ads already AI-enabled and penetration heading toward 100 percent, a roughly $150 billion opportunity. Enterprise is the breakthrough category, exemplified by tools like Claude Code and Codex operating inside businesses.

Every sector is being transformed at once

What makes this era different, Laffont says, is that nearly every sector is being transformed simultaneously. Software is obvious, but look at telco, where he believes Starlink will soon power a device that lets you make a phone call anywhere on earth, attacking the global telco and broadband profit pool with a better product. Compute is driving massive change in semis, data centers are reshaping the energy equation in places like Pennsylvania, and the auto business is being upended, as Ferrari’s stumble introducing electric and autonomous technology showed. In consumer, GLP-1 drugs are profoundly changing consumption of food and alcohol and the broader focus on wellness. His takeaways close the loop: the new unicorn economy is healthier thanks to AI, winners are compounding faster so the cost of missing them is higher than ever, disruption is everywhere, and superintelligence has not even arrived yet.

The Q&A: power law, valuation, and the public market test

Chamath and Jason Calacanis press Laffont on what this means for allocators. The recurring theme is the power law and K-shaped outcomes, with gains consolidating into a small number of companies. The positive side, Laffont notes, is that outcomes are enormous and increasingly liquid within a 12-month window, and both Anthropic and OpenAI say they want to be public. The hard part is valuation. The besties cite Bill Ackman’s framing that investors are making venture bets on trillion-dollar companies at 50 to 100 times revenue. Laffont’s pushback is that these are not fake companies, they generate substantial revenue at scale and grow faster than anything before, and Anthropic reportedly had a profitable month. But he embraces the discipline ahead: the public market is the great antiseptic and will not care about anyone’s presentation, though with heavy passive buying, true price discovery may take roughly six months and a day rather than landing on day one. Asked whether the compounding is a market inefficiency or survivor bias, he declines to over-read a small sample, noting that Anthropic before Claude Code was a completely different company than after. The conversation closes on what happens when trillions recycle from GPs to LPs, the case for simply owning whoever crosses $100 billion, the risk of everyone crowding into three names, and the possibility of an eventual OpenAI versus Anthropic price war.

Notable Quotes

“So we have fewer unicorns that are each raising more.”
Thomas Laffont, summarizing how funding per unicorn has risen roughly 5x since 2021

“The reason is that the quality of SpaceX’s business model increases the more you launch.”
Thomas Laffont, explaining the CODE framework and why valuation per launch rises with cadence

“The winners are compounding faster than ever, which means the costs of not being in a winner are higher than ever.”
Thomas Laffont, on the central risk of a power-law market

“And by the way, we don’t even have super intelligence yet.”
Thomas Laffont, closing his takeaways on how early the transformation still is

“These are companies generating substantial revenue at scale that are growing faster than anything we’ve ever seen.”
Thomas Laffont, pushing back on the idea that AI valuations rest on fake companies

“It will be the great antiseptic. It will not care about my presentation.”
Thomas Laffont, on the public market as the ultimate test for SpaceX, OpenAI, and Anthropic

“Anthropic pre-cloud code was a completely different company than post cloud code.”
Thomas Laffont, on why he won’t over-read a small sample of hyper-compounders

“The power law rules our lives. All the great gains are being consolidated into small numbers of companies.”
An All-In host, framing the Q&A on concentration in private markets

This is a curated set of highlights. To hear the full presentation, the slide walkthrough, and the complete Q&A with Chamath and Jason Calacanis, watch the full conversation here.

Related Reading
- Coatue Management. Primary source for Thomas Laffont’s firm and the technology investing strategy behind the deck.
- The All-In Podcast. The show and summit where Laffont made this premiere presentation.
- Power law (Wikipedia). Background on the distribution Laffont and the hosts say governs venture and public-market returns.
- The Magnificent Seven (Wikipedia). The public-market benchmark Laffont’s private “Magnificent 8” index is measured against.
- Cerebras Systems. The AI chipmaker Laffont cites as the slow-grind IPO that was eventually transformed by a major OpenAI contract.
June 4, 2026
Tim Ferriss, Chris Williamson, and George Mack Go Down the Rabbit Hole: Japanese Immersion, Memory and Forgetting, Brain Stimulation, AI Interfaces, and the Search for Meaning
This is the third installment of the freewheeling “Rabbit Hole” roundtable from Chris Williamson’s Modern Wisdom, and the cast is stacked: Tim Ferriss, writer George Mack, and the founder behind the ambient-AI app Sky (who posts as @signull). It is a sprawling, two-and-a-half-hour conversation that jumps from why Americans never adopted WhatsApp to whether Tim dreams in Japanese, then keeps tunneling into deeper ground: how language shapes thought, why forgetting is a feature, the frontier of brain stimulation, what the next computing interface looks like, and the search for meaning in a world where AI keeps removing scarcity. You can watch the full conversation on YouTube here.

TLDW

The group opens on language: the etymology of “soon,” Malay and Indonesian reduplication, the Sapir-Whorf idea that language shapes thought, and Tim Ferriss recounting how a year of total immersion in a Japanese high school at fifteen made him fluent, with a detour into why adults can learn languages faster than the myth suggests. From there they move into the mind itself, aphantasia versus hyperphantasia, eidetic memory, and the underrated advantages of forgetting, which loops into AI memory, hallucination as a form of confabulation, and the unreliability of eyewitness testimony. A long middle section, anchored by Packy McCormick’s essay “Riding the Leopard,” wrestles with meaning in a post-scarcity world, drawing on Viktor Frankl, Joseph Campbell, Nick Bostrom, and the Dawkins versus Hirsi Ali debate about whether comforting beliefs are rational if they work. Tim then walks through the most concrete material in the episode: his use of accelerated TMS, the one-day protocol, the stellate ganglion block, and why the chemical-imbalance theory of depression is largely debunked. They close on the next interface (ambient AI, camera-equipped AirPods, the post-app phone, Apple’s wait-and-win strategy), a riff on Britain versus America, and the rise of AI-assisted looks-maxing. The throughline, stated and restated, is that friction and scarcity are where meaning and value actually come from.

Thoughts

For a conversation that looks like pure chaos, one idea holds it together: friction is where meaning lives, and modern technology is a machine for removing friction. They route the point through Nick Bostrom (the traits we admire in people exist because we have to negotiate a scarce, resistant world), through dating apps and DoorDash (frictionless access cheapens the thing you get), and through chess (still meaningful precisely because there is an opponent pushing back, even though engines crush every human). It reframes the AI-and-meaning panic in a useful way. The danger is not that AI deletes meaning, it is that it makes meaning harder to reach, the same way a calorie-dense food environment does not outlaw health but quietly makes it the harder path. If that is right, the work ahead is less about stopping the technology and more about deliberately reintroducing resistance.

The most original riff is the treatment of forgetting as a feature rather than a defect, and then turning that lens on AI. Humans prune memory by salience, holding onto the vivid and the painful and letting the middle fade. Current AI memory systems do not prune, so when you stuff a model’s context full of stored “facts” you get noise and forced, spurious connections. The group notes that AI hallucination is really just machine confabulation, and that humans confabulate constantly, the Grenfell Tower “baby caught from the tower” false memory and the general unreliability of eyewitness testimony being the proof. The practical takeaway for anyone building AI products is counterintuitive and correct: the hard problem is not storage, it is principled forgetting.

Tim Ferriss’s neuromodulation segment is the most concrete and quietly radical part of the episode. The claim worth sitting with is that the chemical-imbalance theory of depression is largely debunked, and the frontier has moved to circuit-level intervention: accelerated TMS, a neuroplasticity agent like d-cycloserine taken beforehand, and a “one-day protocol” that took him from an eight or nine on anxiety and rumination down to a one, with lifelong insomnia resolved. Two honest caveats keep it credible rather than salesy. It does not always work (he is candid that several rounds failed), and the side effects are real (rebound symptoms, temporary anhedonia). The economics are a clean illustration of a pattern that recurs through the whole conversation: roughly thirty thousand dollars out of pocket today is how the unit cost eventually falls to something insurers and ordinary patients can afford, the same arc that electric cars and the first copy-and-paste-less iPhones traveled.

The meaning-and-religion exchange is where the conversation is most alive, and most revealing about where this cohort has landed. The Dawkins versus Ayaan Hirsi Ali anecdote crystallizes it: a man “optimizing for rationality while ignoring effectiveness,” pressing someone on whether the stone literally moved on the third day, when that someone’s life was demonstrably saved by the belief. Their tentative conclusion, that comforting delusions may be permissible when the measurable outcomes (health, community, longevity, a sense of meaning) are real, would have been near-heresy in the New Atheist moment of fifteen years ago and is now close to consensus among exactly these kinds of people. Whether you buy it or not, it is a sharp barometer of how far the cultural wind has shifted, and it pairs neatly with George Mack’s point that you cannot invalidate a whole framework with a single counterexample the way you can in mathematics.

Key Takeaways
- Americans never adopted WhatsApp largely because the US had free SMS early, while Brits paid per text, which is also why a generation grew up compressing messages into 160 characters.
- The word “soon” was the Anglo-Saxon word for “now.” Because people kept saying “soon” and not acting, the language invented “now” to replace it, and “now” is already drifting the same way (“now now” in South Africa, similar constructions in Latin America).
- Malay and Indonesian use reduplication instead of plurals (table-table, orang-orang meaning men, the root of orangutan, “man of the forest”), a small example of how different languages carve up the world differently.
- The Sapir-Whorf hypothesis and Wittgenstein’s line, “the limits of my language are the limits of my world,” frame a recurring theme: we assume we shape language, but language also shapes us, including, some speakers report, having a different personality in a different language.
- Tim Ferriss became fluent in Japanese through total immersion as a fifteen-year-old exchange student, taking physics and world history in Japanese, helped by the fact that it was pre-smartphone so there was no English escape hatch.
- Adults can often learn languages faster than children, not slower. Children seem faster mainly because they have no choice and are forced into immersion. Adults already have the conceptual scaffolding (grammar, abstraction, the subjunctive) that a three-year-old lacks.
- Density of practice beats frequency. Learning a language one hour a week is like trying to learn tennis once a month. The Michel Thomas method and Nassim Taleb’s joke (“the best way to learn Russian is to go into a Russian jail”) both point at intensity and stakes.
- People differ radically in how they think. Aphantasia is the inability to visualize (some people only think in words), while others cannot think in words at all and only in images. The “imagine an apple” test reveals where you sit on that spectrum.
- An overdeveloped memory can be counter-evolutionary past a point. Hyperthymesia makes it hard to let go of grievances and slights, and there are real, underrated advantages to forgetting.
- Forgetting is the hard, missing piece in AI memory. Systems store facts but have no pruning of salience, so loading lots of “memories” into context produces noise and spurious connections rather than wisdom.
- AI hallucination is best understood as machine confabulation, and humans confabulate constantly. The Grenfell Tower “baby dropped and caught” story spread through multiple eyewitnesses and turned out to be a collective false memory once physicists questioned it.
- Memory is bound to place. One participant had to move neighborhoods after a breakup because every coffee shop and corner replayed the relationship, echoing an Alain de Botton observation that a beautiful memory becomes the sharpest source of pain if the relationship ends.
- Phantom phone vibrations are real and documented. Years of notifications Pavlovian-condition your body to feel buzzes that are not there, evidence of how deeply the device has wired itself into your nervous system.
- You can train visual memory. Tools include “Drawing on the Right Side of the Brain,” gesture drawing with short timed poses, and learning to see specifics (the six local tree species) instead of the generic label “tree.” Attention and labels, not just raw acuity, drive perception.
- The smartphone is described as a “black mirror.” There is data suggesting people with fewer mirrors at home self-report as happier, and “Zoom face” drove a surge in cosmetic surgery during the pandemic as people watched themselves on camera all day.
- Packy McCormick’s essay “Riding the Leopard” anchors the meaning discussion. A reader who analyzed more than 200 sci-fi novels found that the most common unsolved problem in post-scarcity worlds is meaning (59% of books), with identity next at 17%.
- Viktor Frankl’s framing recurs: “as the struggle for survival has subsided, the question has emerged, survival for what?” Ever more people have the means to live but no meaning to live for.
- Nick Bostrom’s point (from his “solved world” work) is that almost everything we value in other people, discipline, prudence, good judgment, honesty, exists because we must negotiate a scarce world. Remove the scarcity and those values risk a strange “weightlessness.”
- The precautionary principle cuts both ways: humans are very good at forecasting problems and very bad at forecasting the solutions that billions of people will eventually invent for those problems.
- Chess is the optimistic counterexample to “AI removes all purpose.” Engines beat every human, yet people, including Magnus Carlsen, still love playing, because meaning needs resistance, not victory.
- There is a real resurgence in religion, including the ascendant Latin Mass, conducted in a language the congregation does not speak. The group debates whether “comforting delusions” are actually rational if religious people are measurably happier, healthier, and longer-lived.
- The Dawkins versus Ayaan Hirsi Ali exchange is held up as someone “optimizing for rationality while ignoring effectiveness,” and you cannot disprove a whole framework with a single counterexample the way you can in math.
- Tim Ferriss is now far more focused on neuromodulation than psychedelics. Accelerated TMS, paired with a plasticity agent and refined into a “one-day protocol,” took him from an eight or nine on anxiety and rumination to a one, and resolved decades of insomnia.
- The chemical-imbalance theory of depression and anxiety is, by his account, thoroughly debunked. You are not depressed simply because of low serotonin, which is part of why SSRIs come with off-target side effects and poor off-ramping plans.
- The stellate ganglion block (SGB) acts like a hard reset of the nervous system. Tim measured a roughly 30% jump in HRV on his Whoop that held for months. It is used aggressively for PTSD in soldiers.
- Psychedelics reopen critical-period plasticity windows (research associated with Gul Dolen) for two to three weeks afterward, which is powerful for relearning but also means whatever habits you instill in that window can stick hard. The brain is “Play-Doh warmed in the microwave.”
- Most consumer vagus-nerve stimulators are “bunk” because they do not hit the nerve correctly (the target near the ear is the cymba concha). Kevin Tracey’s book “The Great Nerve” is cited as the credible source, and devices like gammaCore are FDA-cleared for migraine.
- Hard safety warning: do not DIY brain stimulation. Hit the wrong target and you can make symptoms much worse. Use a reputable clinic.
- Sequencing is everything, in TMS, in language learning, and in habit change. Most mistakes are sequencing mistakes. Pick the right domino to tip first and everything downstream gets easier.
- The next interface is unsettled. Candidates include camera-equipped AirPods, a “Her”-style earpiece, a glanceable agentic home screen (the Sky app), and OpenAI’s Jony Ive collaboration. Elon Musk’s bet is that apps disappear and the phone generates whatever you need on demand.
- Apple’s strategy is to never be first but to be best, letting other companies fund the R&D and split-test the market (MP3 players before iPod, smartphones before iPhone, wireless earbuds before AirPods), backed by a war chest and roughly 20 billion dollars a year from Google.
- Both smartphone hardware and AI models feel like they are hitting diminishing returns in noticeable user experience, after a long stretch (iPhone 5 to 12) of obvious leaps.
- If the UK were a US state it would rank first in many quality-of-life metrics (life expectancy, low homicide, healthcare coverage, paid leave) and 51st in GDP per capita. Scott Galloway’s line: America is the best place to earn money, Europe the best place to spend it.
- A fast, real-world AI win: uploading photos of a years-long skin condition to Gemini, which correctly identified it as fungal and recommended ketoconazole shampoo after doctors had failed. Photo-based self-diagnosis is becoming a major consumer use case, as is AI-assisted “looks-maxing” and Facetune-style editing.
- Tim’s recent long-form essay, “The Self-Help Trap: What I Learned After 20 Years of Improving Myself,” is on tim.blog, and George Mack’s book recommendations live at highagency.com/books.
Detailed Summary

Does Tim Ferriss dream in Japanese? Immersion and learning as an adult

The episode’s title question gets a real answer. Tim Ferriss says he runs on an English interface but became genuinely fluent in Japanese as a fifteen-year-old exchange student, after misunderstanding that “Japanese lessons” meant all his lessons (physics, world history) would be taught in Japanese. Total immersion plus a pre-smartphone world with no way to retreat into English did the work, and when he came home it took about a month to switch back, waking up and speaking Japanese to his mother. The group challenges the myth that children learn languages faster than adults: kids appear faster only because they are forced into immersion and have no mortgage and no job to distract them. Adults arrive with conceptual scaffolding, grammar, abstraction, the ability to grasp a counterfactual subjunctive, that a three-year-old simply does not have. The real variable is density of practice, which is why a six-week immersion can beat a year of weekly classes, and why the Michel Thomas method and Nassim Taleb’s “learn Russian in a Russian jail” both lean on intensity.

Language shapes thought: etymology and Sapir-Whorf

The opening stretch is a love letter to etymology. “Soon” was once the Anglo-Saxon word for “now,” and degraded over generations as people said it without acting, forcing the invention of “now,” which is itself now drifting. Malay and Indonesian double nouns rather than pluralize them (table-table, and orang-orang, men, giving us orangutan, “man of the forest”). These are small doors into the Sapir-Whorf hypothesis and Wittgenstein’s claim that the limits of your language are the limits of your world. The group treats the idea that language shapes us, not only the reverse, as easy to dismiss and probably true, citing friends who feel they have a different personality or can access different thoughts in Italian or Swedish.

Two ways of thinking, and the praise of forgetting

From language they move to cognition. People differ dramatically: some have aphantasia and cannot picture an apple at all, thinking only in words, while others cannot think in words and only in images, one friend reportedly visualizing a staircase to count. Tim places himself far toward hyper-visual memory, able to recall the floor plan of nearly every restaurant he has been in. But the group keeps returning to the underrated value of forgetting. An overdeveloped memory, hyperthymesia, makes it hard to release grievances and slights, which may be counter-evolutionary past a point. The athletic version is the “yips,” where you have to learn to process a mistake on film and then discard it rather than ruminate.

When memory becomes a feature: AI, hallucination, and false memory

The forgetting thread maps directly onto AI. The founder building the Sky app notes that it is now trivial to have AI extract and store a fact, but there is no pruning of salience, no built-in sense that something is no longer relevant, so passing many stored memories into context produces noise and forced connections. AI hallucination, the group argues, is just machine confabulation, and humans confabulate all the time. The vivid example is the Grenfell Tower fire, where multiple eyewitnesses “remembered” a baby being dropped from the tower and caught, a story that fell apart once physicists ran the numbers, an illustration that eyewitness testimony and human memory are themselves hallucinated reconstructions.

Attention, phones, and the black mirror

Phones get treated as both nervous-system extension and liability. Phantom vibrations are real and documented, a Pavlovian artifact of years of haptic notifications. The smartphone is a “black mirror,” and the group cites data suggesting fewer mirrors at home correlate with higher self-reported happiness, plus the pandemic “Zoom face” surge in cosmetic surgery. Tim describes running no social media, no vibrate, and no ringer on his phone with no felt loss of being informed, and a wider complaint that screens are now so ambient (five screens on a treadmill, a video wall, subtitles everywhere) that going screen-free requires active effort.

Riding the leopard: meaning in a post-scarcity world

Tim reads from Packy McCormick’s essay “Riding the Leopard,” which opens with a parade of AI funding announcements and the deflating question, “who gives a damn, why do we care?” before pivoting to a reader, in remission from stage-four cancer, who analyzed more than 200 sci-fi novels and found that the dominant unsolved problem in post-scarcity worlds is meaning. The piece quotes Viktor Frankl on survival giving way to “survival for what,” and takes its title from Joseph Campbell’s image of Dionysus riding the leopard without being torn apart, living with composure atop overwhelming energy. The group widens it with Nick Bostrom’s argument that the human traits we prize exist only because we negotiate a scarce world, so removing scarcity creates a values “weightlessness,” and David Deutsch’s counter that problems are infinite and soluble.

Friction, resistance, and the cocktail-party question

The most coherent conclusion is that meaning requires friction. Chess stays meaningful despite unbeatable engines because there is still resistance. Capitalism’s genius and its cost is removing friction, dating apps turning people into a swipeable catalog, DoorDash delivering a bathing suit in thirty minutes, and that frictionlessness tends to cheapen the thing delivered. The “what do you do?” cocktail-party question gets dissected as a very Western tic that ties identity to craft and productivity. Winston Churchill becomes the case study: a man who nearly died countless times, believed he was preserved for a purpose, fought his “black dog” depression, and laid 200 bricks a day just to stay occupied.

Religion, rationality, and comforting delusions

The meaning question leads into the religion revival, including the surging Latin Mass conducted in a language nobody in the pews speaks. They revisit the Jordan Peterson and Sam Harris debates about whether a secular population can build a durable moral code from first principles, and the Dawkins versus Ayaan Hirsi Ali exchange, where Dawkins challenged the literal resurrection while Hirsi Ali described religion saving her from a suicidal low. The verdict offered is that Dawkins was “optimizing for rationality while ignoring effectiveness,” and that if comforting beliefs reliably produce better health, community, and meaning, calling them irrational starts to look like the irrational move. George Mack adds the logical point that you cannot void an entire framework with a single counterexample the way you can in mathematics.

Rewiring the brain: TMS, the one-day protocol, and neuromodulation

Tim delivers the episode’s most concrete material. He describes years of generalized anxiety, OCD, and rumination he now traces partly to Lyme disease and chronic neuroinflammation, and his use of accelerated TMS (intermittent theta-burst stimulation) targeting specific circuits identified via fMRI. Paired with a neuroplasticity agent, the antibiotic d-cycloserine, dissolved in the mouth beforehand, the treatment evolved into a “one-day protocol” that took him from an eight or nine to a one and ended decades of insomnia. He is careful to caveat: he is not a doctor, it has not worked every time (five or six attempts), and side effects include rebound symptoms, occasional insomnia, and temporary anhedonia. The broader claim is that the chemical-imbalance theory of depression is largely debunked, and that real innovation here, as with electric cars and early iPhones, starts with wealthy early adopters overpaying (around 30 thousand dollars out of pocket) until cost and throughput improve. He names Jonathan Downar as a leading researcher and is involved with a device company, Ampa, built around the one-day protocol.

Psychedelics, plasticity windows, and the stellate ganglion block

Adjacent to TMS, Tim explains that psychedelics (and MDMA) appear to reopen critical-period plasticity for two to three weeks afterward, work associated with researcher Gul Dolen, which is promising for stroke recovery or relearning but dangerous if you instill bad habits while the brain is malleable. He recounts a two-sided stellate ganglion block (SGB) with Matt Cook, essentially a hard reset of the nervous system that produced a roughly 30% increase in HRV on his Whoop that held for months, and is used aggressively for PTSD in soldiers. After years funding psychedelic science, he says he has done almost none in the last three years because neuromodulation has been that compelling, while warning that psychedelics are “nuclear power for the psyche,” not suitable for everyone.

The vagus nerve, real and fake

On vagus-nerve stimulation, Tim’s verdict is that most consumer devices are bunk because they do not hit the nerve in the right place (the ear target is the cymba concha, and many heavily funded products miss it). He points to Kevin Tracey, author of “The Great Nerve,” as the credible scientist, explains the “inflammatory reflex” and its relevance to rheumatoid arthritis and autoimmune conditions, and notes that gammaCore (the prescription version of Truvaga) is FDA-cleared for migraine, with SetPoint Medical’s implant another route. A migraine-with-aura sufferer in the group provides the real-world test case.

The next interface and Apple’s wait-and-win game

The future-of-computing thread argues the real AI device has not been invented yet. Candidates include camera-equipped AirPods, a glanceable agentic home screen (the Sky app’s pitch is surfacing what you need so you doom-scroll less), a “Her”-style always-on earpiece, subvocalization sensors that read intended speech, and OpenAI’s secretive hardware with Jony Ive. Elon Musk’s bet is that apps vanish and the phone simply generates what you need on demand, which is plausible now that people use ChatGPT or Claude for tasks that used to need dedicated apps. Apple’s counter-move is its classic one: never first, always best, letting rivals fund the R&D (MP3 players, smartphones, wireless earbuds all predate Apple’s versions), backed by a war chest and roughly 20 billion dollars a year from Google. Both phone hardware and AI models, the group feels, are now delivering diminishing perceptible gains.

Britain, America, and the image economy

The closing tangents include George Mack’s viral chart showing that if the UK were a US state it would rank first in many quality-of-life measures and 51st in GDP per capita, with Scott Galloway’s summary that America is the best place to earn money and Europe the best place to spend it. They land on AI as an everyday tool: uploading photos of a stubborn skin condition to Gemini, which diagnosed it as fungal and recommended ketoconazole shampoo where doctors had failed, and the booming use of AI for “looks-maxing,” facial analysis, and Facetune-style editing, with writer Freya India’s reporting that young women now compete to be the one holding the phone so they control the edit. Tim signs off pointing to his “Self-Help Trap” essay on tim.blog, George to highagency.com/books, and the Sky founder to the app’s growing wait list.

Notable Quotes

“The reason that people mistakenly believe that kids learn faster is because the kids have no choice. The kids have no mortgage. The kids have no job.”
On why adults can actually learn languages faster than children

“It’s the Wittgenstein quote of, the limits of my world are the limits of my language. And we think that we shape language, but language shapes us.”
George Mack, introducing the Sapir-Whorf thread

“There are some tremendous advantages to forgetting.”
Tim Ferriss, on why an overdeveloped memory can be counter-evolutionary

“As the struggle for survival has subsided, the question has emerged, survival for what? Ever more people today have the means to live but no meaning to live for.”
Viktor Frankl, quoted by Tim Ferriss reading from Packy McCormick’s essay “Riding the Leopard”

“Everything that we value in other humans can be refined down to the fact that you need to negotiate with a world that is scarce.”
Summarizing Nick Bostrom’s argument about values in a solved world

“What you see is a guy who is playing a game of optimizing for rationality whilst ignoring effectiveness.”
On Richard Dawkins challenging Ayaan Hirsi Ali’s faith despite the outcomes it produced

“There’s very few things that I can think of that are meaningful that are also totally frictionless or just there is no challenge in it.”
On why meaning depends on resistance, from the chess and dating-app discussion

“The general chemical imbalance theory of depression or anxiety is pretty much thoroughly debunked at this point. You’re not depressed because you have low serotonin levels by and large.”
Tim Ferriss, on the shift from serotonin models to circuit-level neuromodulation

“A lot of innovation starts with people with money spending way too much money. That’s true with electric cars, it’s true with Uber, it’s true with the early generation iPhones.”
Tim Ferriss, on how expensive early treatments like accelerated TMS eventually scale

These are short, curated pulls from a long conversation, not a transcript. For the full context, including the brain-stimulation walkthrough and the meaning debate, watch the full episode on YouTube here.

Related Reading
- Tim Ferriss (tim.blog) primary source for his newsletter, his “Self-Help Trap” essay, and his writing on neuromodulation and TMS.
- Man’s Search for Meaning by Viktor Frankl the source of the “survival for what” idea that anchors the post-scarcity discussion.
- The Sapir-Whorf hypothesis (Wikipedia) background on the claim that language shapes thought, which opens the conversation.
- Packy McCormick’s Not Boring home of “Riding the Leopard,” the essay Tim Ferriss reads from on meaning and post-scarcity.
- The vagus nerve (Wikipedia) context for the neuromodulation thread and Kevin Tracey’s work on the inflammatory reflex in “The Great Nerve.”
June 1, 2026
Krishna Rao on Anthropic Going From 9 Billion to 30 Billion ARR in One Quarter and the Compute Strategy Powering Claude
Krishna Rao, Chief Financial Officer of Anthropic, sat down with Patrick O’Shaughnessy on Invest Like the Best for one of the most detailed public looks yet at the operating engine behind Claude. He covers how Anthropic compounded from $9 billion of run rate revenue at the start of the year to north of $30 billion by the end of Q1, why he spends 30 to 40 percent of his time on compute, the playbook for buying gigawatts of AI infrastructure across Trainium, TPU, and GPU platforms, how Anthropic prices its models, why returns to frontier intelligence keep climbing, and what the Mythos release tells us about the cyber capabilities of the next generation of Claude.

TLDW

Anthropic is running the most compute fungible frontier lab in the world, with active deployments across AWS Trainium, Google TPU, and Nvidia GPU, and an internal orchestration layer that lets a chip serve inference in the morning and run reinforcement learning the same evening. Krishna Rao explains the cone of uncertainty that governs gigawatt scale compute procurement, the floor Anthropic refuses to drop below on model development compute, the Jevons paradox unlock from cutting Opus pricing, the 500 percent annualized net dollar retention from enterprise customers, the layer cake of long term deals with Google, Broadcom, Amazon, and the recent xAI Colossus tie up in Memphis, the phased release of the Mythos model in response to spiking cyber capabilities, the internal use of Claude Code to produce statutory financial statements and run a Monthly Financial Review skill, and why the team believes scaling laws are alive and well. The interview also covers fundraising history through Series D and Series E, the $75 billion already raised plus another $50 billion coming, talent density beating talent mass during the Meta poaching wave, and Rao’s belief that biotech and drug discovery represent the most exciting frontier for AI.

Key Takeaways
- Anthropic entered the year with about $9 billion of run rate revenue and ended the first quarter with north of $30 billion of run rate revenue, a more than 3x leap driven by model intelligence gains and the products built around them.
- Compute is described as the lifeblood of the company, the canvas everything else is built on, and the most consequential class of decisions Rao makes. Buy too much and you go bankrupt. Buy too little and you cannot serve customers or stay at the frontier.
- Rao spends 30 to 40 percent of his time on compute, even today, and the leadership team meets repeatedly on both procurement and ongoing compute allocation.
- Anthropic is the only frontier language lab actively using all three major chip platforms in production: AWS Trainium, Google TPU, and Nvidia GPU. It is also the only major model available on all three clouds.
- Flexibility is the central design principle. Anthropic builds flexibility into the deals themselves, into the orchestration layer that maps workloads to chips, and into compilers built from the chip level up.
- The cone of uncertainty frames procurement. Small differences in weekly or monthly growth compound into wildly different two year outcomes, so the team plans across a range of scenarios rather than a single point estimate, and ranges toward the upper end while protecting downside.
- Compute allocation across the company sits in three buckets: model development and research, internal employee acceleration, and external customer serving. A non negotiable floor protects model development even when customer demand is tight.
- Anthropic estimates that if it cut off internal employee use of its own models, the freed compute could serve billions of dollars of additional revenue. It chooses not to, because internal use compounds into better future models.
- Intelligence is multi dimensional, not a single IQ score. Anthropic measures real world capability through customer feedback, long horizon task performance, tool use, computer use, and speed at agentic tasks, not just leaderboard benchmarks that have largely saturated.
- Each Opus generation, 4 to 4.5 to 4.6 to 4.7, delivers both capability improvements and an efficiency multiplier on token processing. New models often serve customers at a fraction of the prior cost while doing more.
- Reinforcement learning is described as inference inside a sandbox with a reward function, so model efficiency gains directly improve internal RL throughput. The flywheel is tightly coupled.
- Over 90 percent of code at Anthropic is now written by Claude Code, and a large share of Claude Code itself is written by Claude Code.
- Anthropic shipped roughly 30 distinct product and feature releases in January and the pace has accelerated since.
- Scaling laws, in Anthropic’s internal data, are alive and well. The team holds itself to a skeptical scientific standard and still does not see them slowing down.
- Anthropic recently signed a 5 gigawatt deal with Google and Broadcom for TPUs starting in 2027, plus an Amazon Trainium agreement for up to 5 gigawatts, totaling more than $100 billion in commitments. A significant portion lands this year and next year.
- A new partnership for capacity at the xAI Colossus facility in Memphis was announced just before the interview, aimed at expanding consumer and prosumer capacity.
- Pricing has been remarkably stable across Haiku, Sonnet, and Opus. The biggest deliberate change was lowering Opus pricing, which produced a textbook Jevons paradox: consumption rose far faster than the price drop, and the new Opus 4.6 and 4.7 slot in at the same price point.
- Mythos is the first model Anthropic chose to release in a phased way because of a sharp spike in cyber capability. In an open source codebase where a prior model found 22 security vulnerabilities, Mythos found roughly 250.
- The Mythos release framework focuses on defensive use first, expands access over time, and is presented as a template for future capability spikes.
- Anthropic now sells to 9 of the Fortune 10 and reports net dollar retention above 500 percent on an annualized basis. These are not pilots. Rao describes signing two double digit million dollar commitments during a 20 minute Uber ride to the studio.
- The platform strategy is mostly horizontal. Anthropic will go vertical with offerings like Claude for Financial Services, Claude for Life Sciences, and Claude Security where it can demonstrate the model’s capabilities, but expects most application value to accrue to customers building on top.
- Investors raised over $75 billion in equity since Rao joined, with another $50 billion in commitments tied to the Amazon and Google deals. Capital intensity is real, but the raises fund the upper end of the cone of uncertainty more than they fund current losses.
- The Series E close coincided with the day the DeepSeek news broke, forcing investors to reassess their AI thesis in real time. Anthropic closed the round anyway.
- Inside finance, Claude now produces statutory financial statements for every Anthropic legal entity, with a human checker. A library of more than 70 finance specific skills underpins workflows.
- A custom Monthly Financial Review skill produces a 90 to 95 percent ready monthly close report, so leadership discussion shifts from reconciling numbers to debating implications.
- An internal real time analytics platform called Anthrop Stats compresses weekly insight cycles from hours to about 30 minutes.
- The biggest token user inside Anthropic’s finance team is the head of tax, focused on tax policy engines and workflow automation. The most senior people, not the youngest, are leading internal adoption.
- Talent density beats talent mass. When Meta and others ran aggressive offer waves, Anthropic lost two people while peer labs lost dozens.
- All seven Anthropic co founders remain at the company, as does most of the first 20 to 30 employees, which Rao credits to a collaborative, transparent, debate friendly culture and a real culture interview that can veto otherwise top tier candidates.
- Dario Amodei holds an open all hands every two weeks, writes a short prepared document, and takes unscripted questions from anyone at the company.
- AI safety investments in interpretability and alignment have a commercial side effect. Looking inside the model helps Anthropic build better models, and enterprises selling sensitive workloads want to trust the lab they hand customer data to.
- Anthropic explicitly identifies as America first in its approach to model development, and engages closely with the US administration on capability releases such as Mythos.
- The longer term product vision is the virtual collaborator: an agent with organizational context, access to the company’s tools, persistent memory, and the ability to work on ideas, not just tasks, over long horizons.
- CoWork, Anthropic’s extension of the Claude Code paradigm into general knowledge work, is being adopted faster than Claude Code itself when indexed to the same point in its launch curve.
- Anthropic’s product teams ship daily, with a fleet of agents working across the company on specific tasks. Everyone effectively becomes a manager of agents.
- The dominant downside risks to Anthropic’s high end forecast are slower customer diffusion of model capability into real workflows, scaling laws flattening unexpectedly, and Anthropic losing its position at the frontier.
- Rao is most excited about biotech and healthcare outcomes, especially the prospect that AI could push drug discovery and lab throughput up 10x or 100x, turning currently incurable diagnoses into treatable ones within a patient’s lifetime.
Detailed Summary

Compute as Lifeblood and the Cone of Uncertainty

Rao opens with the claim that compute is the most important resource at Anthropic, and the most consequential decision class in the company. You cannot buy a gigawatt of compute next week. You have to anticipate demand a year or two in advance, and the cost of being wrong in either direction is high. Buy too much and the unit economics collapse. Buy too little and you cannot serve customers or stay at the frontier, which are described as the same failure mode. To navigate this, the team uses a cone of uncertainty rather than point estimates. Small differences in weekly growth compound into vastly different two year outcomes, and Anthropic tries to position itself toward the upper end of that cone while preserving optionality. Rao notes he has had to consciously break a lifetime of linear thinking and force himself into exponential models.

Three Chip Platforms, One Orchestration Layer

Anthropic uses Amazon’s Trainium, Google’s TPUs, and Nvidia’s GPUs fungibly. That was not free. Adopting TPUs at scale started around the third TPU generation, when outside observers thought it was a strange choice. Anthropic invested years into compilers and orchestration so workloads can flow across chips by generation and by job type. The team works deeply with Annapurna Labs at AWS to influence Trainium roadmaps because Anthropic stresses these chips harder than almost anyone. The result is what Rao believes is the most efficient utilization of compute across any frontier lab, with a dollar of compute going further inside Anthropic than anywhere else.

Three Buckets and the Model Development Floor

Compute gets allocated across model development, internal acceleration of employees, and customer serving. The conversations are collaborative rather than zero sum, but there is a hard floor on model development that the company refuses to cross even if it makes customer demand harder to serve in the short term. The thesis is simple. The returns to frontier intelligence are extremely high, especially in enterprise, so cutting model investment to chase near term revenue is a bad trade. Internal employee use is also explicitly protected. Rao notes that diverting that internal usage to external customers would unlock billions of additional revenue today, but the compounding benefit of accelerating researchers and engineers outweighs that.

Intelligence Is Multi Dimensional

Rao pushes back hard on the IQ framing of model progress. Benchmarks saturate quickly, and the real signal comes from how customers actually use the models. Anthropic looks at long horizon task completion, tool use, computer use, and time to result on agentic tasks. Two equally capable agents who differ only in speed produce dramatically different value, because the faster one compounds into more attempts and more outcomes. Frontier model leaps are also fuel efficient. The sedan to sports car analogy breaks down because each Opus generation, 4 to 4.5 to 4.6 to 4.7, delivers a step up in capability and a multiplier on per token efficiency.

From 9 Billion to 30 Billion ARR in One Quarter

The headline number for the quarter is a leap from about $9 billion of run rate revenue to over $30 billion, accomplished without onboarding a corresponding step up in compute, because new compute lands on ramps locked in 12 months prior. Rao attributes the leap to model capability gains, products that surface that intelligence in usable form factors, and an enterprise customer base that pulls more workloads onto Claude as each generation unlocks new use cases. Coding started the wave with Sonnet 3.5 and 3.6, and the same pattern is now playing out elsewhere in the economy.

Recursive Self Improvement and Talent Density

Over 90 percent of Anthropic’s code is now written by Claude Code, including most of Claude Code itself. Rao describes this as a structural reason to keep allocating internal compute to employees even when external demand is hungry. Recursive self improvement is not happening through models that need no humans. It is happening through researchers who set direction and use frontier models to compress months of work into days. Talent density beats talent mass. When Meta and other labs went after Anthropic researchers with very large packages, Anthropic lost two people while peer labs lost dozens.

Procurement Strategy and the Layer Cake

Compute lands as a layer cake. Last month Anthropic signed a 5 gigawatt TPU deal with Google and Broadcom starting in 2027, alongside an Amazon Trainium agreement for up to 5 gigawatts. The total is north of $100 billion in commitments. A new tie up with xAI’s Colossus facility in Memphis was announced just before the interview, intended for nearer term capacity to support consumer and prosumer growth. Anthropic evaluates near term and long term compute deals against the same set of variables: price, duration, location, chip type, and how efficiently the team can run it. The relationships are deeper than procurement. The hyperscalers are also distribution channels for the model.

Platform First, Selective Vertical Bets

Rao describes Anthropic as a platform first business, with most expected value accruing to customers building on the platform. The team will only go vertical when it can either demonstrate capabilities that are skating to where the puck is going, like Claude Code did before the models could fully support it, or when it wants to set a template for an industry vertical, as with Claude for Financial Services, Claude for Life Sciences, and Claude Security. He acknowledges that surprise capability jumps make customers anxious about the platform competing with them, and frames Anthropic’s mitigation as deeper partnerships, early access programs, and an emphasis on accelerating customer building rather than disintermediating it.

Pricing, Jevons Paradox, and Return on Compute

Pricing across Haiku, Sonnet, and Opus has been stable. The notable exception is Opus, which Anthropic deliberately repriced lower when launching Opus 4.5 because Opus class problems were being squeezed into Sonnet workloads. Efficiency gains made it possible to serve Opus profitably at the new level. The consumption response was a classic Jevons paradox, with usage rising far more than the price reduction would have predicted, and Opus 4.6 then slotted in at the same price with a capability bump. Margins are not framed as a per token markup. Compute is fungible across model development, internal acceleration, and customer serving, so Anthropic measures return on the entire compute envelope rather than software style variable cost per call.

Fundraising, DeepSeek, and Capital Intensity

Rao joined while Anthropic was closing its Series D, mid frontier model launch and during the FTX share liquidation. Investors initially questioned whether Anthropic needed a frontier model, whether AI safety and a real business could coexist, and why the sales team was so small. The Series E closed the same day the DeepSeek news broke, with markets violently re pricing AI in real time. Since Rao joined, Anthropic has raised over $75 billion, with another $50 billion tied to the Amazon and Google compute deals. The reason for the size of the raises is the cone of uncertainty, not current losses. Returns on compute today are described as robust.

Mythos, Cyber Capability, and Phased Releases

The Mythos release marks the first time Anthropic shipped a model under a deliberately phased rollout because of a specific capability spike. Cyber is the dimension that spiked. Where a prior model found 22 vulnerabilities in an open source codebase, Mythos found roughly 250. The defensive applications, automatically patching massive codebases, are genuinely valuable, but the offensive risk is real enough that Anthropic chose to release to a smaller group first and expand access over time. Rao positions this as a template for future capability spikes, not a permanent restriction. He also describes the relationship with the US administration as cooperative, including the Department of War interaction, with Anthropic supporting a regulatory framework that does not strangle innovation but takes responsibility seriously.

Claude Inside Finance

Anthropic’s finance team is one of the strongest internal case studies. Statutory financial statements for every legal entity are produced by Claude, with a human reviewer. A skill library of more than 70 finance specific skills underpins a Monthly Financial Review skill that drafts the monthly close at 90 to 95 percent ready, so leadership meetings shift from explaining the numbers to discussing what to do about them. An internal analytics platform called Anthrop Stats compresses weekly insight cycles from hours to 30 minutes. The biggest internal token user in finance is the head of tax, building policy engines, which Rao highlights as evidence that adoption is driven by the most senior people, not just younger engineers.

Culture, Co Founders, and the Race to the Top

Seven co founders should not, on paper, work as a leadership group. Rao argues it works because the culture was set early around collaboration, intellectual honesty, transparency, and humility. The culture interview is a real veto, not a checkbox. Dario Amodei runs an all hands every two weeks with a short written piece followed by unscripted questions, and decisions, once made, get clean alignment rather than residual politics. Anthropic frames its approach as a race to the top, where being a model for how to build the technology responsibly is itself a recruiting and retention advantage.

The Virtual Collaborator and the Frontier Ahead

The product vision Rao describes is the virtual collaborator. Not just a smarter chatbot, but an agent with organizational context, access to the company’s tools, memory, and the ability to work on ideas over long horizons. Coding was the first domain to feel this, but CoWork, Anthropic’s extension of the Claude Code pattern into general knowledge work, is being adopted faster than Claude Code was at the same age. Product development inside Anthropic already looks different. Teams ship daily, with fleets of agents working across the company, and individual humans increasingly act as managers of those fleets.

Downside Risks and What Excites Him Most

The three risks Rao names if asked to do a premortem on a softer year are slower customer diffusion of model capability into real workflows, scaling laws unexpectedly flattening, and Anthropic losing its frontier position to competitors. None of these are observed today, but he is unwilling to claim them with certainty. On the upside, he is most excited about biotech and healthcare. Lab throughput rising 10x or 100x, paired with AI assisted clinical workflows, could turn currently incurable diagnoses into treatable ones within a patient’s lifetime. That is the outcome he wants the technology to chase.

Thoughts

The most consequential structural point in this interview is the framing of compute as a single fungible resource pool measured by return on the entire envelope, not as a variable cost per inference call. That accounting shift, if you accept it, breaks most of the bear cases about AI lab unit economics. The bear argument almost always assumes that a token served to a customer is the only thing the chip did that day. Rao’s version is that the same fleet trains models in the morning, runs reinforcement learning at lunch, serves customers in the afternoon, and accelerates internal engineers in the evening. If even half of that is real, the right comparison is total compute spend versus total enterprise value created by the platform, and on that ratio Anthropic looks structurally strong rather than weak.

The Jevons paradox on Opus pricing is the most actionable insight for anyone running an AI product. Most teams default to either chasing premium pricing on the newest model or undercutting to chase volume. Anthropic did something more disciplined: it left Sonnet and Haiku alone, dropped Opus when efficiency gains made it serveable, and watched aggregate usage rise faster than the price cut. The lesson is that frontier model pricing is not really a price problem. It is a capability access problem, and elasticity around the right tier is much higher than the standard SaaS playbook implies.

The Mythos cyber jump deserves more attention than it has gotten. Going from 22 to 250 vulnerabilities found in the same codebase is the kind of capability discontinuity that genuinely changes the regulatory calculus. Anthropic is signaling that it can identify these discontinuities ahead of release and choose a deployment shape that respects them. Whether peer labs adopt similar discipline is the open question. Anthropic’s race to the top framing assumes they will be forced to. The competitive market may say otherwise.

The hiring data point is the most underrated investor signal. Two departures while peer labs lost dozens, during the most aggressive talent war in tech history, is not a culture poster. It is a structural advantage that compounds every time another lab tries to buy its way to the frontier. Money can be matched. Conviction in the mission, transparent leadership, and a culture interview that can veto otherwise stellar candidates cannot. If you believe scaling laws hold, talent retention at this density is one of the few moats that actually scales with capital.

Finally, the most interesting personal admission is that Krishna Rao, a finance leader trained at Blackstone and Cedar, is openly telling investors that linear thinking is the failure mode he had to break out of. The companies that pattern match this moment to prior technology waves are mispricing it, in both directions. The cone of uncertainty Anthropic uses internally is the right metaphor for everyone else too. If you are forecasting AI as if it is cloud in 2010, you are almost certainly wrong, and the magnitude of the error is much larger than it would be in any prior era.

Watch the full conversation with Krishna Rao on Invest Like the Best here.
May 13, 2026