PJFP.com

Pursuit of Joy, Fulfillment, and Purpose

Tag: anthropic ipo

Tim Ferriss and Kevin Rose Random Show: Mortality and Grief, Zen Insights, Rock Climbing at 50, LSD for Anxiety (MM120), AI Smart Homes, and Why You Should Buy the Company Instead of the Product
Tim Ferriss and Kevin Rose reunite over tequila for another Random Show, and this one swings from the heaviest material they have covered in years (the death of their friend Om Malik, aging parents, dementia, and what grief actually is) to Zen retreat breakthroughs, rock climbing as a post-50 obsession, a phase 3 LSD trial for anxiety, AI-powered smart homes, the coming wave of AI IPOs, and the single investing lesson both keep relearning: let your winners run, and when you love a product, buy the company.

TLDW

Kevin reframes the loss of Om Malik and his father through a simple equation: grief is love with nowhere to go, and the sorrow is proof of how lucky you were. Tim adds Tim Urban’s “The Tail End” math (you have spent roughly 95% of your lifetime hours with your parents by high school graduation) and Sam Harris’s “The Last Time” meditation. Kevin recounts a micro-awakening at a five-day silent Zen retreat (“nothing lacking”), both plug their meditation app The Way with Henry Shukman, and Tim declares multi-pitch climbing in Yosemite his next deliberate-practice obsession, complete with hangboard protocols and grip-training gear. The health segment covers A2 whey, venison organ-meat sticks as a multivitamin, the 1,3-butanediol ketone controversy, ketones temporarily unlocking speech in relatives with dementia, terminal lucidity, a JAMA phase 3 trial of MM120 (lysergide) showing 12 weeks of anxiety relief from a single dose, and the Norwegian 4×4 protocol whose hippocampal benefits may persist for five years. The AI segment runs from Kevin’s Claude-coded camera system that opens his gate via license plate recognition, to Tim’s 20-year angel investing retrospective built with Claude Code and the Gmail API, to their handicapping of Google versus Anthropic versus OpenAI, China’s open-source push, local inference boxes, and why buying at IPO and holding may match venture returns.

Thoughts

The emotional spine of this episode is the best thing in it. Kevin’s formulation, that the gap left by a death “is just love at the end of the day,” is not new philosophy, but it lands differently coming from someone actively managing a dying dog, a mother with dementia, and a friend’s fresh death, all in the same month. The practical corollary the two keep circling is time-boxing: Tim Urban’s Tail End math and Sam Harris’s “last time” framing both convert vague mortality awareness into a scheduling problem. Tim credits one short blog post with causing years of family trips that his emotionally reserved family would never have taken otherwise. That is about as strong an endorsement as content can get: it changed the calendar, not just the mood.

The health middle of the show is classic Random Show in that the interesting part is the epistemology, not the products. Tim flags that the loudest critics of 1,3-butanediol ketones sell competing ketone salts, applies a shelf-life heuristic to processed meat instead of memorizing ingredient lists, and treats organ-meat sticks as a dosed multivitamin rather than a diet. The MM120 discussion is the meatiest science: a five-arm randomized trial where a single 100 microgram dose of lysergide produced roughly twelve weeks of relief in generalized anxiety disorder, which Tim, who has been diagnosed with GAD and OCD, reads as a plausible future where anxiety treatment is episodic rather than daily. The unresolved tension they name honestly: the promising dementia signals (ketones, psilocybin case reports, microdosing) all crash into the consent problem. A person who cannot consent cannot sign up for a hallucinogen, and “it might give you half a day of real conversation back” is both a miracle and an ethical minefield.

The AI section quietly contains one of the more useful predictions frameworks going: Kevin’s argument that Google’s confusing high-bandwidth TPU architecture only makes sense as a bet on continuous learning, where models stop shipping as discrete releases and start improving around the clock like a child. If self-improving models are really 12 to 18 months out, the “model drop” news cycle this episode itself participates in (new Sonnet today, Mythos tomorrow) is a temporary artifact. Tim’s counterweight is human-scale and more sobering: an AI trained on your own writing produces in 30 seconds what takes you 30 hours, and he compares the demoralization to Lee Sedol retiring after AlphaGo. His book sales chart, stable for a decade and then compounding downward every year since ChatGPT launched, is the receipts. The tension between “AI made my 20-year retrospective possible” and “AI is draining my motivation to write” is the honest version of the AI discourse most podcasts flatten into one direction.

The investing segment is the most immediately actionable. Three ideas stack neatly: let winners run (Tim has lost more money selling early than he made buying), the venture-returns myth (a famous firm’s own analysis found that buying at IPO and holding a decade roughly matched their gains from early rounds through lockup), and buy-what-you-use (the friend who spent $100k on a top-of-the-line Tesla instead of Tesla stock forfeited roughly $15 million; teenage Tim bought Pixar after seeing Toy Story). None of this is sophisticated, which is the point both make explicitly: with Anthropic and OpenAI racing to IPO, ordinary people who use these tools daily will get a shot the private markets never gave them, and the discipline that matters is holding, not access.

Key Takeaways
- Kevin and Tim lost their friend and colleague Om Malik of True Ventures within the past week; Kevin found out mid-way through a five-day silent meditation retreat.
- Kevin’s reframe on grief: the severe sense of loss is “just love at the end of the day.” The gaping hole his father’s death left is love manifested through sorrow, and recognizing that converts anguish into gratitude for having crossed paths at all.
- Tim credits Matt Mullenweg twice: for organizing the Antarctica trip where he got days of uninterrupted time with Om (including a visit to an emperor penguin colony), and for sending him Tim Urban’s blog post “The Tail End.”
- The Tail End’s core math: by high school graduation you have used up roughly 90 to 95% of the total in-person hours you will ever spend with your parents. Reading it drove Tim to organize regular family trips, awkwardness be damned, before his father’s mobility declined.
- Sam Harris’s short meditation “The Last Time” pairs with it: for many activities you will do a last time without knowing it was the last time.
- Kevin’s 15-year-old dog Toaster had a violent shaking episode (a stress syndrome after standing six hours at a vet visit, not a terminal event), and Kevin’s takeaway from being covered in the aftermath was that when you love an animal that much, none of it matters.
- At a traditional Zen sesshin with Henry Shukman and his visiting Japanese teacher Yamada Roshi, Kevin had a two-second micro-insight while working his koan: a felt sense of “nothing lacking,” where nothing could be added or taken away because everything was already fully present. Not an emotion, a steady state.
- Both are investors in The Way, Henry Shukman’s single-path guided meditation app, which they frame as an ideological investment like their funding of the dog aging study on rapamycin. Tim’s favorite sessions: “Whole Earth is Medicine” and “This Too is Me.”
- Tim’s practical meditation pitch: you do not need a retreat; 10 minutes twice daily works, and there seems to be real alchemy in the twice-a-day rhythm. Kevin, once the guy who quit everything in two weeks, is coming up on five years of consistent practice.
- A physiology aside: Henry’s instruction to drop the jaw slightly mirrors how Tim’s mandibular snoring device works. Dropping the jaw an eighth of an inch down and forward opens the airway. The ancients found it by trial and error.
- Kevin, approaching 50, wants to stop saying “one day” about his bookmarked obsessions (Japanese woodworking, ships in bottles) and actually commit to things in the next two decades.
- Tim’s next deep dive is rock climbing: his surgically repaired right elbow finally allows it, and his stretch goal is multi-pitch climbing in Yosemite despite being, in his words, deadly terrified of heights, sweaty palms included.
- Tim’s philosophy of training: “training to not die sooner than is necessary” is not a sufficient goal. He needs a concrete deadline event, the way the Lancaster Classic structured his archery, to make deliberate practice worth it.
- What sold Tim on climbing longevity: the 60-to-almost-80-year-olds at Salt Lake City gyms climbing 5.11+ on weekday mornings, out-performing what he could imagine doing, plus women who cannot do five pull-ups climbing 5.13 and 5.14 on pure technique.
- Climbing is also social in a way archery never was: bouldering routes are literally called “problems,” and strangers trade beta. After decades of solitary repetition, Tim has hit his quota.
- Training tools discussed: Michael Eckert’s finger-strength course (the multiple-time pull-up world champion Kevin just bought into), the Nug (a pocket-size wooden grip trainer Tim travels with), and Abrahangs, Emil Abrahamsson’s protocol of moderate partial-bodyweight hangs, 10 seconds on and 50 seconds off for 10 minutes twice a day, which produces outsized forearm and finger gains.
- Tim’s fantasy recommendation: The Blade Itself, whose treatment of the randomness of death (a friend of Tim’s just died in a plane crash) doubles as a gratitude practice. The audiobooks are exceptional.
- Protein talk: Kevin likes Pioneer Pastures A2 whey (30 grams a shake, lactose removed, no investor relationship); Tim gets roughly 40% of his protein from Maui Nui wild-harvested axis deer venison and treats the liver-and-heart pepper sticks as a two-or-three-a-week multivitamin.
- On processed meat and nitrates, Tim’s heuristic is shelf life: if an ultraprocessed meat lasts three years on a shelf, raise an eyebrow. Minimally processed meat almost definitionally does not keep.
- Exogenous ketones containing 1,3-butanediol may carry liver toxicity risk, though Tim notes many people pushing that claim sell competing ketone salts. His personal policy: use them intermittently, not daily.
- The startling ketone anecdote: given to relatives with dementia, sentence length roughly 5xed within 20 minutes. Caveats: it tastes like gasoline, and 1,3-butanediol can affect balance, a serious concern when a broken hip is often the beginning of the end for older adults.
- Kevin moved his mother, who has non-Alzheimer’s (likely vascular) dementia, into a new home equipped with an AI radar orb that detects falls instantly. She cannot recall breakfast but knows who he is, which he will take all day long.
- The exercise-for-brain-health protocol Tim assembled with neuroscientist Dr. Tommy Wood: Norwegian 4×4 VO2 max intervals (4 minutes on, ~3 minutes off, 4 rounds) three times weekly for five to six months produces volumetric changes in the hippocampus that appear to last up to five years.
- The only bike Tim can tolerate for it is the Kaiser M3i indoor bike, because the handlebars raise enough to spare his lower back. Kevin’s sustainable alternative: incline treadmill walking while playing Duolingo chess until 40 minutes disappear. Tim’s version of don’t-let-perfect-be-the-enemy-of-good: a 5-minute, three-set gym session still counts.
- The JAMA study that grabbed Tim: a phase 3, five-arm randomized trial of MM120 (lysergide, essentially LSD, from the company formerly known as MindMed) for generalized anxiety disorder. Effects were dose-dependent, with 100 micrograms (a standard full trip) as the apparent minimum effective dose, and relief persisting through 12 weeks after a single treatment.
- Mid-conversation they discover the trial ran at Neuroscape at UCSF, their friend Adam Gazzaley’s lab, which Kevin helped fund. Tim, clinically diagnosed with GAD and OCD, finds 12 weeks of relief from one dose remarkable.
- Related dementia signals: a case report of an elderly Japanese woman with dementia who took a five-gram “heroic dose” of psilocybin mushrooms, slept 19 hours, and woke temporarily capable of full expositional conversation instead of monosyllables; Tim has also seen an unpublished case report of LSD microdosing producing similar verbal fluidity.
- Both note the hard ethics: hallucinogens for someone who cannot consent, the devastation of a bad trip you inflicted, versus the possibility of half a day of real connection or slowed decline.
- Terminal lucidity, the well-documented phenomenon of vegetative or unresponsive patients becoming fully lucid in their final days, leaves both baffled: if cognition is fully localized in a structurally deteriorated brain, where is the lucidity coming from? Kevin’s analogy: we assume nothing is backed up to the cloud.
- Tim’s caffeine pacing hack: Nutonic nootropic toothpicks (a gift from Chris Williamson), roughly 20 to 25 milligrams of caffeine each, a hard ceiling per toothpick that prevents his chain-refill coffee problem.
- Kevin’s AI smart home: his Ubiquiti camera system has a full API, so with Claude writing the glue code, the cameras now recognize individual people (and Toaster, who gets a dog emblem), play deterrent audio at loiterers in his alleyway, and open his gate automatically when they read his license plate. The camera costs about $200; anyone can do this now.
- Tim’s flagship AI project: a 20-year retrospective of his angel investing, built with Claude Code and the Gmail API, testing his own stories about his batting average against hard data. Doing it manually would have taken a year of full-time work by multiple people.
- The humbling adjacent stat from Kevin: friends with always-on AI wearables report that about 70% of what we confidently remember is what actually happened. Startup genesis stories are the same phenomenon, a five-minute bit polished until the teller believes it.
- Tim’s most valuable everyday AI use: holistic health cross-checking (contraindications between medications and supplements, could A explain D), hallucination-limited by fact-checking across multiple LLMs.
- Tim’s contrarian AI take: for most people the honest impact is small “because most shit isn’t worth doing in the first place.” Doing something well does not make it worth doing, and AI is skyrocketing the volume of efficiently produced BS.
- The demoralization is real, though: an AI trained on your writing produces in 30 seconds what takes 30 hours. Tim compares it to the top Go player who lost the joy of the game after AlphaGo, and his all-format book sales have compounded downward every year since ChatGPT launched (roughly -5%, then -28%, then -49%, tracking toward -67%).
- The prompt experiment both loved: with cross-conversation memory enabled, ask your model “What are three to five rewarding paths I might explore in the next five years?” Tim sent the answers to close friends who called them outstanding, including a non-book business idea Kevin urged him to build. Ask AI open-ended questions the way you would ask a close friend, not robot questions.
- Kevin is prototyping “Bond,” an app built from scanned values-card decks: swipe to surface your core values, form explicit agreements with partners and friends that both sides “shake” on, weight the damage of a broken bond, and accumulate a trust ledger. He calls the underlying idea dark information: real relational data (trust, reliability, empathy) that exists everywhere but has never been given physical form.
- Tim’s writing unlock for the blank page: dictate a rambling brain dump into Wispr Flow while walking, drop it into Claude to clean up, and uncomfortable procrastinated emails come together in minutes. Gear notes: Shokz OpenMeet bone-conduction headset (open ears for traffic, recommended by Exploding Kittens co-founder Elan Lee) and a Sennheiser lav mic plus the Ferrite app as a pocket recording studio that beats studio mics in echoey hotel rooms.
- State of AI, per both: the big three are Google, Anthropic, and OpenAI, with X/Grok never count-out-able (though Anthropic and Google buying excess Colossus capacity suggests weak Grok demand; Kevin still values Grok’s X-API grounding and uses it heavily for Digg). Meta has phenomenal assets but, Kevin thinks, not the talent to keep pace. Apple is quietly a couple of years out.
- Kevin’s Google thesis: they own the full stack (TPUs, data centers, models, Android’s install base), and their confusingly high-bandwidth chip architecture is a bet that the future is continuous learning, models improving 24/7 like a child rather than shipping as discrete releases. Consensus estimates put self-improving models 12 to 18 months out.
- Kevin’s insider color: touring Google X with Sergey Brin and Bill Maris a decade-plus ago, he saw Waymos years before the public knew. Google is sitting on roughly five years of undisclosed deck and holds back frontier models partly for cost and partly to avoid government intervention. In 12 months we will know where Google really stands.
- Counterweights: ChatGPT owns consumer mindshare and OpenAI must crack advertising, which is very hard; Anthropic is reportedly the fastest-scaling enterprise business ever but keeps taking hits from the administration; no frontier lab will remain unconstrained by government; and China is releasing open-source models on par with the frontier (“doing it the American way”), while AMD’s ~$4,000 local inference box can run massive models at home, eight months behind the frontier, which for many users is fine.
- The investing lessons: let winners run (Tim: “I’ve lost more money by selling stocks early than I’ve ever probably made buying the original stock”); a famous venture firm’s internal analysis found buying at IPO and holding roughly 10 years matched their gains from early-stage investing through post-lockup; and buy the company, not just the product. Kevin’s friend David Prager spent $100k on a maxed-out Tesla instead of Tesla stock, forgoing roughly $15 million. Tim’s first stock, at about 15 years old, was Pixar, bought because Toy Story convinced him animation was the future.
- Kevin relaunched Digg: from 20,000 weekly users to nearly 500,000 and millions of monthly page views, pulling the zeitgeist from X and other feeds with heavy AI curation rather than trying to build another social network.
Detailed Summary

Grief, the Tail End, and the Last Time

The show opens with banter about alcohol taxes and ketamine before turning serious: Toaster, Kevin’s 15-year-old dog, just had a terrifying (ultimately survivable) collapse, and the pair lost their friend Om Malik of True Ventures within the week. Kevin, who got the news at a silent retreat, offers the episode’s emotional thesis: the loss and sorrow are the shape love takes when the person is gone, and he would not trade the chaos of caring for people and animals for a calmer, emptier life. Tim thanks Matt Mullenweg for the Antarctica trip that gave him days of psychologically naked time with Om, and for sending him Tim Urban’s “The Tail End,” the post whose parents-time math pushed Tim into years of deliberate family trips before his father needed a wheelchair. Sam Harris’s meditation “The Last Time” extends the theme: you rarely know a last time is the last time. Kevin’s response is to do the thing one more time anyway, bouncy-house backflips at 49 included.

Zen, Nothing Lacking, and The Way

Kevin describes his five-day traditional Zen sesshin with Henry Shukman and Yamada Roshi: wall-gazing with eyes open, koan practice on the out-breath, and private interviews with the Roshi. His micro-insight, about two seconds long, was a non-emotional steady state of “nothing lacking,” everything fully present with nothing to add or subtract, what Zen calls the removal of the veil. Tim relays his favorite sessions from The Way (the app both back as a philosophical investment, like the rapamycin dog aging study): “This Too is Me,” which dissolves the burden of a squirrel-chasing mind by including everything experience serves up as you, and Henry’s small physical instructions, like dropping the jaw, which Tim connects to his mandibular snoring device: an eighth of an inch down and forward opens the airway. His bottom line: 10 minutes twice a day captures most of the benefit, and watching the formerly two-weeks-and-out Kevin sustain five years of practice has been deeply reassuring.

Rock Climbing as the Next Decade’s Project

Kevin, marching toward 50, wants to stop bookmarking dreams (Japanese woodworking, ships in bottles) and start doing them. Tim’s answer is rock climbing: his repaired right elbow finally allows it, and his stretch goal is multi-pitch in Yosemite despite sweating through his palms at the mere thought of heights. What converted him was the Salt Lake City gym crowd at 11 a.m.: retirees in their 60s and 70s climbing 5.11+, inverted on overhangs, evidence that this sport rewards technique and consistency over youth (women who cannot do five pull-ups climb 5.13). After archery, which he loved but found definitionally solitary, climbing’s social “beta”-trading culture is the draw. The training stack: Michael Eckert’s finger-strength course, the Nug pocket grip trainer, and Abrahangs (Emil Abrahamsson’s 10-seconds-on, 50-off, 10-minute, twice-daily hang protocol). A darker aside grounds the ambition: a friend of Tim’s just died in a plane crash, and The Blade Itself keeps teaching him that life-or-death is often dumb luck.

Protein, Ketones, and the Dementia Frontier

The supplements run: Kevin’s new favorite is Pioneer Pastures A2 whey (30 grams, lactose removed, gut-friendly); Tim, disclosure-forward as always, travels with Maui Nui venison and treats the liver-and-heart sticks as a twice-weekly multivitamin. On processed meat, Tim’s heuristic is shelf life over ingredient forensics. The exogenous ketone conversation is more fraught: 1,3-butanediol may stress the liver (though the claim’s loudest advocates sell competing ketone salts), so Tim doses intermittently. The astonishing part: given to relatives with dementia, ketones 5xed sentence length within 20 minutes, going from non-answer answers to full paragraphs, “offline to online.” Balance risks make it dicey in exactly the population that needs it. Kevin’s mother’s new care home uses an AI radar orb for instant fall detection. For prevention, Tim’s protocol from conversations with Dr. Tommy Wood: Norwegian 4×4 VO2 max intervals three times a week for five to six months, whose hippocampal volumetric changes appear to persist up to five years, done on the one bike (Kaiser M3i) that does not wreck his back. Kevin’s sustainable version: incline treadmill plus Duolingo chess.

MM120, Psilocybin Case Reports, and Terminal Lucidity

Tim walks through the JAMA-published phase 3 trial of MM120 (lysergide, effectively LSD) for generalized anxiety disorder: five arms (placebo, 25, 50, 100, 200 micrograms), dose-dependent response, with 100 micrograms reading as the minimum effective dose and relief lasting through the 12-week measurement window from a single supervised treatment. Kevin clicks through mid-show and discovers it ran at Neuroscape at UCSF, their friend Adam Gazzaley’s lab, which Kevin helped fund. For Tim, clinically diagnosed with GAD and OCD, episodic rather than daily treatment is the headline. The dementia thread continues: a case report of an elderly Japanese woman who took five grams of psilocybin mushrooms, slept 19 hours, and woke into temporary full conversation; an unpublished LSD microdosing report with similar verbal fluidity. Both wrestle with consent ethics. And then terminal lucidity, the documented phenomenon of unresponsive patients becoming fully lucid days before death, which neither can explain: as Kevin puts it, if it is all localized in a deteriorated brain, where is that coming from?

AI at Home and AI on Yourself

Kevin’s Ubiquiti camera system, glued together with Claude-written code against its API, now recognizes faces (and Toaster), scolds loiterers through a speaker, and opens his gate when it reads his license plate, all on a $200 camera. Tim’s project is introspective: a Claude Code plus Gmail API retrospective of 20 years of angel investing, checking who made which introductions, what he passed on, and whether his stories about his batting average survive contact with data (they mostly did; he missed fewer explicit opportunities than he feared). Kevin cites friends with always-on AI wearables: about 70% of what we confidently remember is accurate. Tim’s daily-driver use is health: cross-referencing medications, supplements, and symptoms across multiple LLMs. His caution: most tasks AI accelerates were not worth doing, and the volume of efficient BS is skyrocketing. His countervailing enthusiasm: the “what should I do in the next five years” prompt with cross-conversation memory produced ideas good enough to deeply inform his next chapter. Ask it questions like a close friend. Kevin’s next experiment is “Bond,” a values-and-trust app for making implicit relational agreements (what he calls dark information) explicit, trackable, and reflective. Tim’s practical writing unlock: Wispr Flow voice dumps cleaned up by Claude, especially for procrastinated uncomfortable emails, recorded on a Shokz OpenMeet bone-conduction headset.

The AI Landscape and Where the Money Goes

Recorded the day a new Anthropic Sonnet launched, with Mythos due the next day, the forecasting segment lands on a big three of Google, Anthropic, and OpenAI. Kevin’s Google case: full-stack ownership (TPUs whose high-bandwidth architecture only makes sense as a bet on continuous, 24/7 self-improving learning, expected within 12 to 18 months), Android distribution, data center expertise, billion-dollar engineer retention, and a five-year hidden deck he glimpsed touring Google X with Sergey Brin and Bill Maris before Waymo was public. Google holds frontier models back for cost and regulatory reasons; within a year we will know what they have. OpenAI owns consumer mindshare but must solve ads; Anthropic is crushing enterprise ARR while absorbing slaps from the administration; no lab escapes government constraint; China’s open-source frontier-parity models and AMD’s ~$4,000 local inference box threaten the subscription model from below. The investing translation: these companies are going public, and ordinary users will finally get access. The lessons both preach: let winners run, remember that buying at IPO and holding a decade roughly matched one famous firm’s venture returns, and buy the company behind the product you love, the lesson of Prager’s $15 million Tesla and teenage Tim’s Pixar shares. Kevin closes with Digg’s relaunch (20,000 to nearly 500,000 weekly users) and Tim with the sobering chart of his AI-era book sales, compounding downward since ChatGPT.

Notable Quotes

“I realized that that gap is just love at the end of the day because I wouldn’t have it unless I loved this man so much. I cared for this person so much. How lucky am I to have crossed paths with this person to get to know them?”
Kevin Rose, on losing Om Malik

“When I lost my dad, like that is just a gaping hole of love manifested through sorrow and sadness.”
Kevin Rose, on grief as a consequence of deep love

“I had a sense of nothing lacking. Nothing needed to be added and nothing even possibly could be added and nothing possibly could be taken away because everything at that moment was full in the way that it should be.”
Kevin Rose, describing his micro-insight at the Zen retreat

“Training to not die sooner than is necessary is not sufficient for me.”
Tim Ferriss, on why he needs concrete physical goals like multi-pitch climbing in Yosemite

“Doing something well does not make it important or worth doing in the first place.”
Tim Ferriss, on AI’s honest impact when most tasks were never worth doing

“I can still write, but what they can do in 30 seconds is what would take me 30 hours. And I’m just like, it really drains the motivation for me to put in those 30 hours.”
Tim Ferriss, on AIs trained on his own writing

“It’s not about those new models dropping. It’s about just like a child learning. Tomorrow it’ll be better than today for forever.”
Kevin Rose, on Google’s bet that continuous learning replaces the model-release cycle

“You got to let your winners run as long as possible. I’ve lost more money by selling stocks early than I’ve ever probably made buying the original stock.”
Tim Ferriss, the takeaway from his 20-year angel investing retrospective

“You find something that you love and you buy said object when you should actually buy the company.”
Kevin Rose, on the $100k Tesla that should have been $15 million of Tesla stock

Watch the full conversation between Tim Ferriss and Kevin Rose here on YouTube.

Related Reading
- The Tail End (Wait But Why) the Tim Urban post that quantifies how little time you have left with the people you love.
- The Way Henry Shukman’s single-path guided meditation app that both Ferriss and Rose back and use daily.
- Terminal lucidity (Wikipedia) background on the end-of-life phenomenon neither host can explain.
- LSD (Wikipedia) context for MM120/lysergide and the history behind the generalized anxiety disorder trial.
- The Botany of Desire by Michael Pollan, the book Tim cites on how dogs (and plants) co-domesticated us as much as we domesticated them.
July 16, 2026
OpenAI’s Leaked 2025 Financials: $34 Billion in Spending, a $38.5 Billion Net Loss, and a $17 Billion Microsoft Bill Ahead of Its IPO
OpenAI’s audited 2025 financials leaked this week, and they are the clearest picture yet of what it actually costs to run the company behind ChatGPT. Independent journalist Ed Zitron first published the documents, and the Financial Times independently confirmed them. The headline: OpenAI spent $34 billion last year, booked $13.07 billion in revenue, and reported a net loss attributable to the company of $38.5 billion. The disclosure lands just days after OpenAI confidentially filed for an IPO that could value it north of $1 trillion.

TLDR

OpenAI’s audited 2025 numbers, leaked by Ed Zitron and confirmed by the Financial Times, show revenue tripling to $13.07 billion while total costs reached $34 billion, producing a $20.92 billion operating loss and a $38.53 billion net loss attributable to the company. The much larger net loss is inflated by a one-time $41.55 billion non-cash charge tied to OpenAI’s October 2025 conversion from a nonprofit to a public benefit corporation; strip the non-cash items and the loss is closer to $8 billion. R&D alone was $19.18 billion, cost of revenue (inference) was $7.5 billion, and sales and marketing ballooned to $5.73 billion. OpenAI paid Microsoft $17.2 billion in 2025 while Microsoft paid OpenAI only $303 million, exposing a deep Azure dependency. The company burned $1.60 for every dollar of revenue, down from $2.37 in 2024, and gross margin slipped from roughly 40% to 33% as more capable models consumed more compute per query. The leak arrives as OpenAI files a confidential S-1, targets a listing as early as September 2026 at up to a $1 trillion valuation, and races rival Anthropic, which is more valuable on paper and claims it is already turning an operating profit.

Thoughts

The most important thing to understand about these numbers is that there are two loss figures and the press will conflate them. The $38.53 billion net loss is the scary headline, but $41.55 billion of it is a non-cash accounting charge from converting investor convertible interests into equity during the for-profit restructuring. That charge is real on the audited statement and it will show up in the eventual S-1, but it is a one-time artifact of OpenAI’s unusual corporate history, not money that left the building. The number that describes the actual business is the $20.92 billion operating loss. That is the one to watch, and it is still enormous.

The genuinely encouraging line in the whole release is the loss-per-dollar ratio. In 2024 OpenAI spent $2.37 to generate a dollar of revenue. In 2025 that fell to $1.60. A company that is still losing $1.60 on every dollar is not a healthy business, but a company whose efficiency improved by a third in a single year while tripling its top line is at least pointed in a defensible direction. The bull case for OpenAI lives entirely in the slope of that line. If it keeps improving at that rate, the math eventually crosses over. If it stalls, the valuation is a fantasy.

The Microsoft relationship is the single most revealing disclosure, and it is wildly asymmetric. OpenAI paid Microsoft $17.2 billion in 2025. Microsoft paid OpenAI $303 million. That is a 56-to-1 ratio, and it reframes the partnership: Microsoft is not really a peer or even just an investor, it is OpenAI’s landlord and primary supplier, collecting rent on every model trained and every query answered. The April 2026 renegotiation that capped revenue-share payments at $38 billion through 2030, down from a projected $135 billion, suddenly looks less like a favor and more like OpenAI desperately trying to lower its single largest cost. The dependency cuts both ways, but right now Microsoft holds the better hand.

The structural problem hiding inside the cost of revenue line is inference. Training a model is a fixed, one-time cost. Serving it is a recurring cost that scales with every one of ChatGPT’s roughly 800 million weekly users. OpenAI spent $5.02 billion on Azure inference in the first half of 2025 alone, and the more capable its reasoning models get, the more compute each answer burns. That is why gross margin went down even as revenue went up. It is the opposite of how software is supposed to work, where the marginal cost of one more user trends toward zero. OpenAI’s marginal cost is real, large, and growing. The counterargument is that per-token inference costs have been falling roughly tenfold a year, so the unit economics could still flip. That is the entire wager.

Finally, the timing matters more than the numbers. OpenAI’s confidential S-1 means these audited figures were going to become public regardless, since the SEC requires the full prospectus at least 15 days before a roadshow. What the leak changes is who gets to study them first. Prospective IPO buyers, enterprise customers signing multi-year API contracts, and competitors now have the audited books weeks or months early, and they are reading them against Anthropic, which filed at a higher valuation and claims an operating profit. For a company asking the public markets to underwrite a $1 trillion bet on a monopoly outcome that does not yet exist, losing control of the narrative this early is not a small thing.

Key Takeaways
- OpenAI’s audited 2025 financials were first published by independent journalist Ed Zitron and independently confirmed by the Financial Times, the first verified look at the company’s books before its planned IPO.
- Revenue grew from $3.7 billion in 2024 to $13.07 billion in 2025, more than tripling year over year, making OpenAI one of the fastest-growing businesses in history.
- By the end of 2025 OpenAI was generating roughly $2 billion in monthly revenue, up from about $1 billion a quarter at the end of 2024.
- Total costs and expenses hit $34 billion in 2025, up from $12.48 billion in 2024.
- Research and development was the single largest expense at $19.18 billion, up from $7.81 billion, and exceeded total revenue on its own.
- Of that R&D spend, $10.59 billion went to Microsoft, almost certainly the GPU compute cost of training frontier models on Azure.
- Cost of revenue, the expense of serving ChatGPT responses (inference), rose from $2.65 billion to $7.5 billion.
- Sales and marketing jumped from $1.11 billion to $5.73 billion, a 418% increase.
- General and administrative costs rose from $907 million to $1.57 billion.
- The operating loss, the truest measure of day-to-day economics, grew from $8.78 billion to $20.92 billion.
- The net loss attributable to OpenAI was $38.53 billion, up nearly eightfold from $5.09 billion in 2024.
- The bulk of that jump was a one-time, non-cash $41.55 billion charge from OpenAI’s October 28, 2025 conversion to a public benefit corporation, reflecting the changing fair value of convertible interests and warrant liabilities.
- Stripping out the restructuring charge and other non-cash items such as stock-based compensation and Microsoft computing credits, the underlying loss was about $8 billion.
- Including all factors, gross net loss reached $60.35 billion, lowered to the $38.53 billion attributable figure by removing $21.82 billion attributed to noncontrolling and redeemable noncontrolling interests.
- OpenAI burned $1.60 for every $1 of revenue in 2025, an improvement from $2.37 in 2024, the clearest data point in the bull case.
- Measured as a percentage of revenue, the operating loss improved from 237% in 2024 to 160% in 2025.
- In total, OpenAI paid Microsoft $17.2 billion in 2025: $10.59 billion in R&D fees, $6.047 billion in cost of revenue, $527 million in sales and marketing, and $42 million in G&A.
- Microsoft paid OpenAI just $303 million in the same year, a 56-to-1 imbalance underscoring OpenAI’s Azure dependency.
- SoftBank paid OpenAI $867 million in 2025.
- At year-end OpenAI carried $3.64 billion in outstanding payables to Microsoft, plus tens of millions more in accrued and non-current liabilities.
- OpenAI spent $5.02 billion on Azure inference in just the first half of 2025; Azure inference from 2024 through Q3 2025 totaled $12.43 billion.
- ChatGPT serves roughly 800 million weekly users, meaning billions of queries a week, each one burning GPU time at Azure’s pricing of about $6.98 per H100 GPU-hour.
- Gross margin fell from roughly 40% in 2024 to 33% in 2025, because more capable reasoning models consume more compute per query.
- Research firm Sacra estimates OpenAI’s inference costs reached $8.4 billion in 2025 and will rise to $14.1 billion in 2026, a 68% increase.
- At year-end OpenAI held just over $50 billion in assets, with almost half in cash.
- The April 2026 Microsoft renegotiation ended exclusivity and capped revenue-share payments at $38 billion through 2030, down from a projected $135 billion, potentially saving OpenAI up to $97 billion over five years.
- OpenAI filed a confidential draft S-1 with the SEC around May 22, 2026 and confirmed it publicly on June 8, naming Goldman Sachs and Morgan Stanley as underwriters.
- The company is targeting a listing as early as September 2026 at a valuation that could exceed $1 trillion, though Sam Altman has said a public offering “may be a while.”
- OpenAI raised $122 billion earlier in 2026 at a $730 billion pre-money valuation, putting its post-money value around $852 billion.
- At an $852 billion valuation, OpenAI trades at roughly 65 times its 2025 revenue.
- Rival Anthropic also filed IPO paperwork this month after raising $65 billion at a $900-$965 billion valuation, making it more valuable on paper than OpenAI, and says it expects to report an operating profit of $559 million in the June quarter.
- HSBC analysts estimate OpenAI may need more than $207 billion in additional capital through 2030 even under optimistic projections.
- OpenAI projects profitability by 2029 or 2030; independent analysts put the more likely date at 2031 or later.
- Bridgewater partner Greg Jensen reportedly told clients the implied revenue multiples price OpenAI for “a monopoly outcome that does not yet exist.”
- Zitron separately reported OpenAI had a negative 122% non-GAAP operating margin in Q1 2026 and that ChatGPT growth has stalled, with the company projecting paid ChatGPT Plus subscriptions to fall from 44 million in 2025 toward cheaper tiers in 2026.
Detailed Summary

How the leak happened and why it matters now

The audited documents were obtained and first published by Ed Zitron on his newsletter Where’s Your Ed At, then independently verified by the Financial Times, which reviewed the same materials. That dual sourcing matters: this is not a rumor or a model, it is OpenAI’s actual audited financial statement. The timing is the story. OpenAI filed a confidential draft S-1 with the SEC around May 22, 2026 and confirmed it publicly on June 8. Under SEC rules the full prospectus must be released at least 15 days before an investor roadshow, so the 2025 numbers were going to be public soon regardless. The leak simply moved that disclosure forward, handing prospective investors, enterprise customers, and competitors an early look at the books.

Revenue tripled, costs grew faster

OpenAI’s revenue rose from $3.7 billion in 2024 to $13.07 billion in 2025, and monthly revenue reached nearly $2 billion by year-end. By almost any normal standard that is spectacular growth. The problem is that costs grew faster, reaching $34 billion against $12.48 billion the year before. The gap between what OpenAI earns and what it spends has widened every year since its founding, and 2025 is the starkest example yet. Revenue alone was outpaced by research and development as a single line item in both of the last two years.

Two loss numbers, and why both matter

There are two figures that get cited interchangeably and should not be. The operating loss of $20.92 billion is what the business spent beyond what it earned from operations: training models, serving ChatGPT, paying engineers, running marketing. The net loss attributable to OpenAI of $38.53 billion is far larger because 2025 was the year OpenAI completed its conversion from a nonprofit to a for-profit public benefit corporation, finalized on October 28, 2025. That restructuring triggered a $41.55 billion non-cash charge reflecting the changing fair value of convertible equity interests and warrant liabilities. Before the conversion, investors held convertible interest rights treated as liabilities under US accounting rules and revalued upward as OpenAI’s valuation climbed, creating the charge. It is not expected to recur. Including all minor items, gross net loss reached $60.35 billion, reduced to the $38.53 billion attributable figure after removing $21.82 billion tied to noncontrolling and redeemable noncontrolling interests, primarily the OpenAI Foundation’s stake. Strip the non-cash noise and the underlying loss was about $8 billion.

Where the $34 billion went

The spending breaks into four lines. Research and development was $19.18 billion, the largest category, with $10.59 billion of it flowing to Microsoft for training compute. Cost of revenue, the expense of serving responses to users, was $7.5 billion and captures inference, the compute consumed every time someone prompts ChatGPT or calls the API. Sales and marketing reached $5.73 billion, up 418% year over year, a striking jump for a product that grew largely by word of mouth. General and administrative costs added $1.57 billion. The shape of the spending tells you OpenAI is simultaneously racing to build better models, serve a massive and growing user base, and aggressively defend market share through marketing.

The Microsoft dependency

The most striking single disclosure is the scale of the Microsoft relationship. OpenAI paid Microsoft $17.2 billion in 2025: $10.59 billion in R&D fees for model training, $6.047 billion in cost-of-revenue for inference serving, $527 million in sales and marketing, and $42 million in G&A. Microsoft paid OpenAI just $303 million the same year. SoftBank paid OpenAI $867 million. The 56-to-1 ratio between what OpenAI pays Microsoft and what Microsoft pays back makes the structural reality plain: Microsoft is OpenAI’s largest landlord. The dynamic began shifting in April 2026, when the two renegotiated, ending Microsoft’s exclusivity and capping revenue-share payments at $38 billion through 2030, down from a projected $135 billion. That could save OpenAI up to $97 billion over five years, though Microsoft keeps its IP license through 2032 and remains the primary cloud partner.

Why inference is the core problem

Training happens once. Serving happens billions of times a day. When OpenAI releases a model it spends months and billions on training compute, a fixed cost that falls away when training ends. Inference is the opposite: every ChatGPT message runs through the model on Azure GPU hardware, consuming electricity and compute to generate a response. With roughly 800 million weekly users, that is billions of queries a week, each burning GPU time at roughly $6.98 per H100 GPU-hour on demand. OpenAI spent $5.02 billion on Azure inference in the first six months of 2025 alone. Sacra estimates full-year inference costs of $8.4 billion in 2025, rising to $14.1 billion in 2026. This is why gross margin fell from about 40% to 33% even as revenue tripled: more capable reasoning models consume far more compute per query, and revenue has not kept pace with the cost growth that capability generates.

What it means for the IPO and the race with Anthropic

OpenAI was last valued around $852 billion post-money after raising $122 billion in early 2026, which puts it at roughly 65 times 2025 revenue. It has named Goldman Sachs and Morgan Stanley as underwriters and is targeting a listing as early as September 2026 at up to a $1 trillion valuation, though Altman has hedged that it “may be a while” and that staying private might be the better course. HSBC estimates the company may need more than $207 billion in additional capital through 2030. The race is with Anthropic, which filed paperwork the same month after raising $65 billion at a $900-$965 billion valuation, making it more valuable on paper, and which says it expects a $559 million operating profit in the June quarter. The contrast is sharp: the two leading AI labs heading toward public markets at the same time, one bleeding cash at scale, the other claiming profitability, both asking investors to bet on a future that has not arrived.

Notable Quotes

“The financial condition of OpenAI is deeply concerning. $38.53 billion in losses are astronomical, and far higher than most believed it would be. Losses also appear to be mounting year-over-year at a dramatic rate, and I’m not sure how this company finds a way toward any kind of sustainability or profitability.”
Ed Zitron, the independent journalist who published the leaked audited financials

“It’s unclear what this means, nor how OpenAI reconciled the removal of $3.74 billion in costs. I will not speculate further.”
Ed Zitron, on a discrepancy he found in the restated 2024 figures

“OpenAI’s two biggest expenses are R&D and marketing. Budget cuts there, coupled with an ability to raise prices or win new sources of revenue, could see the company move into the black over time. Cutting R&D would be the most difficult part of that, given that AI companies can only hold onto their customers by generating the best-performing models.”
Jim Edwards, Fortune, on whether OpenAI has a realistic path to profitability

“What the audited documents make impossible to argue is that the path to profitability is short, clear, or cheap.”
TechTimes analysis of the leaked OpenAI financials

The implied revenue multiples price OpenAI for “a monopoly outcome that does not yet exist.”
Bridgewater partner Greg Jensen, reportedly telling clients how to read OpenAI’s valuation

“OpenAI spent $34bn last year as the ChatGPT maker poured money into a race to dominate the fast-growing AI market ahead of a planned stock market listing.”
George Hammond and Bryce Elder, Financial Times, framing the audited 2025 spend

Read Ed Zitron’s original reporting with the full breakdown here, and the Financial Times confirmation here.

Related Reading
- Ed Zitron, Where’s Your Ed At the primary source that broke the audited 2025 financials with the full line-by-line breakdown.
- OpenAI (Wikipedia) background on the company’s history, structure, and the nonprofit-to-for-profit conversion that drives the non-cash charge.
- Inference (Wikipedia) on the recurring compute cost that explains why OpenAI’s gross margin shrinks as usage grows.
- Anthropic the rival lab that filed IPO paperwork the same month at a higher valuation and claims it is already operating at a profit.
- SEC on confidential filings context for why OpenAI’s audited numbers were headed for public disclosure regardless of the leak.
June 17, 2026
Thomas Laffont of Coatue on the $4 Trillion AI IPO Wave: SpaceX, Anthropic, OpenAI, and Why the New Unicorn Economy Is Healthier
Thomas Laffont, co-founder of the $55 billion hedge fund Coatue Management, made his All-In Podcast premiere with a data-dense walk through what he calls a once-in-a-generation moment for the unicorn economy. In front of Chamath Palihapitiya, Jason Calacanis, David Sacks, and David Friedberg, he argued that a roughly $4 trillion wave of private value is about to hit the public markets, led by SpaceX, Anthropic, and OpenAI, and that the new AI-driven unicorn economy is actually healthier than the one that came before it. You can watch the full presentation and Q&A on YouTube.

TLDW

Laffont presents Coatue’s slide deck on the state of the unicorn economy and argues it has rebalanced after the excesses of 2021. The average unicorn is up about 70 percent since September 2024, AI keeps taking a bigger share of all fundraising, and the model has shifted from many small unicorns to fewer companies each raising far more, with funding per unicorn up roughly 5x since 2021. He introduces a “Magnificent 8” private index (SpaceX, Stripe, Anthropic, Databricks, Revolut, ByteDance, Anduril, and more) worth nearly $4 trillion that has crushed the public Mag 7, then shows that exits are finally thawing as SpaceX heads to an IPO in weeks and Anthropic confidentially files its S1. He lays out Coatue’s “CODE” framework for why SpaceX gets more valuable the more it launches, a counterintuitive finding that the odds of a 10x actually rise as companies get bigger (31 percent for $100 billion-plus centicorns), the explosive revenue ramp of OpenAI and Anthropic past Workday, ServiceNow, Adobe, Salesforce, and now the hyperscalers, a three-pillar map of where AI revenue comes from (consumer, ads, enterprise), and the AI memory thesis. The Q&A with Chamath and Calacanis digs into the power law, K-shaped outcomes, whether these valuations are disconnected from reality, the public market as the great antiseptic, and what happens when trillions in private value finally recycles back through GPs and LPs.

Thoughts

The most useful idea in the talk is not the $4 trillion headline, it is the cohort-health chart. Laffont splits unicorns into eras and shows that the pre-2021 cohort was healthy, roughly 80 percent had raised again or exited 20 quarters after minting, while the giant 2021 ZIRP cohort of 479 companies is stuck with under 20 percent doing either. That single comparison reframes the whole AI boom. The bullish read is that the 2024 AI cohort is small, concentrated, and cash-generative, so it looks more like the healthy pre-ZIRP group than the 2021 hangover. The bearish read is that we are watching the same movie with bigger numbers, and the test only comes when these companies face public markets. Laffont is honest that we do not yet know which cohort the AI class resembles, and that intellectual humility is what makes the deck credible rather than promotional.

The SpaceX “CODE” framework is the sharpest analytical move of the presentation. Most people would assume a launch business gets cheaper per launch as it scales. Laffont shows the opposite, the market pays more per launch as cadence rises, and explains it as a phase change in business quality: from one-time government launch revenue, to a single recurring-revenue constellation, to multiple constellations, to a platform with optional upside in space data centers, the moon, and Mars. It is a clean way to think about any company that climbs from a project business to a platform business, and it applies far beyond rockets. The lesson for investors is that valuation can rationally expand even as unit economics look like they should compress, because the nature of the revenue underneath is changing.

The counterintuitive 10x odds finding deserves more attention than it got in the room. Conventional wisdom says the bigger you are, the harder it is to grow, so a $100 billion company should be less likely to 10x than a $10 billion one. Coatue’s data says the reverse: centicorns have a 31 percent shot at a 10x, far higher than the 8 percent a unicorn has at becoming a decacorn. Laffont’s explanation is a filtering mechanism, every step up validates a compounding advantage and durability of earnings, so survivors are increasingly the kind of business that keeps compounding. This is essentially a quantitative restatement of quality investing, and it is the intellectual backbone of the LP strategy the besties tease out, just buy whoever reaches $100 billion and hold.

Where the argument gets genuinely contested is valuation, and the panel does not let it slide. The pushback that “these are not fake companies” is true and important, OpenAI and Anthropic are growing faster than any software company in history, and Anthropic reportedly had a profitable month. But growth and reality do not settle the question of price when you are paying 50 to 100 times revenue for trillion-dollar private companies, as Bill Ackman pointed out earlier in the day. Laffont’s answer is the most grounded thing he says all session: the public market is the great antiseptic, it will not care about anyone’s slide deck, and he wants to see these names withstand short sellers and skeptics. That is the right posture. The deck is a thesis, not a verdict, and the verdict arrives roughly six months and one day after the IPOs, once passive flows and supply have washed through.

The closing thread, that almost every sector is being transformed at once and we still do not have superintelligence, is the part worth sitting with. The risk in a presentation this bullish is treating the trend as destiny. The value is in the framing tools Laffont hands you, cohort health, phase-change business quality, the filtering odds, the three revenue pillars, and the antiseptic of public scrutiny. Use those to interrogate each name rather than to buy the index on faith, and the talk earns its premiere billing.

Key Takeaways
- Coatue Management is one of the most successful hedge funds of the last two decades with about $55 billion under management, and is raising roughly another billion dollars specifically to invest in AI.
- The unicorn economy is up about 70 percent on average since September 2024, and the public market has made a similar move up over the same period.
- The unicorn economy’s share of the NASDAQ rose significantly after 2015 but has plateaued in recent years, reflecting strong performance from public companies.
- AI keeps increasing its wallet share of all venture fundraising, multiple years in a row now.
- The composition of funding has changed. The unicorn “factory” peaked in the ZIRP era of 2021 and has normalized at a much lower level since.
- Funding per unicorn has increased roughly 5x since 2021. There are fewer unicorns, and each one is raising more.
- Cohort health, pre-ZIRP group: of about 73 unicorns, 20 quarters after minting roughly 80 percent had either raised a new round or exited, which is healthy.
- Cohort health, 2021 group: of about 479 unicorns, 20 quarters in, fewer than 20 percent had exited or raised again. Far larger cohort, far worse outcomes.
- The open question is which cohort the new 2024 AI cohort will resemble.
- Funding is concentrating: the top 10 companies capture a large share, and it is a small number of AI companies, not all of them, with Anthropic and OpenAI raising massive rounds.
- Laffont proposes a “Magnificent 8” private index: SpaceX, Stripe, Anthropic, Databricks, Revolut, ByteDance, Anduril, and more, spanning internet, AI, fintech, and space tech.
- That private index represents almost $4 trillion of value and has crushed the traditional public Mag 7, with almost every name outperforming.
- Exits are thawing. 2026 is on a good trend for cash returned versus consumed, not quite 2021 levels, with half a year still to go.
- That trend does not yet include three imminent liquidity events: SpaceX (IPO expected in weeks) and Anthropic (confidentially filed its S1), whose combined value could exceed the prior decade of exits combined.
- The ecosystem is far more balanced than when Laffont first presented at the 2024 All-In Summit, when it was consuming much more cash than it returned.
- OpenAI and Anthropic revenue growth is unlike anything previously seen. Starting from January 2025, they passed Workday, then ServiceNow, then Adobe, then Salesforce, and are now bigger than Google Cloud and Azure.
- On current forecasts, that revenue could pass AWS by the end of the year and exceed all of Microsoft by 2028.
- Hyperscalers are not sitting still. The largest companies in the world are funding the disruption, investing unprecedented sums to enable the ChatGPT moment.
- The SpaceX “CODE” framework: the number one driver correlated to SpaceX’s valuation is cadence of launches, and valuation per launch rises as launches increase.
- Why per-launch value rises: business quality improves through phases, pre-constellation (one-time government revenue), initial ramp (one recurring-revenue constellation), scale (multiple constellations), and platform (space data centers, moon and Mars optionality).
- Anthropic in particular is scaling like no company seen across the PC, internet, or mobile eras.
- Counterintuitive 10x odds: a unicorn has about an 8 percent chance of becoming a decacorn, a decacorn has 8 to 13 percent odds of reaching $100 billion, but a centicorn ($100 billion-plus) has a 31 percent chance of a 10x.
- Value creation has accelerated. It typically takes years to go from $500 billion to $1 trillion in market cap, yet recently three companies did it in one year and two did it in a matter of weeks.
- Cerebras is the counterexample of slow success: years of dark periods and no new capital developing its technology, then a massive OpenAI contract that quintupled the company’s value ahead of its IPO.
- Semiconductors are on a generational run, with the sector dramatically outperforming the index since the 2024 All-In Summit.
- AI memory thesis: the more an AI system knows about you, the more useful it is, so memory per user could quintuple, which helps explain recent moves in memory companies.
- Where the revenue is: the AI ecosystem is roughly $140 billion today, about $300 billion this year, and is expected to double in 2027.
- Three revenue pillars: consumer (subscribers times ARPU), ads (about a quarter of Meta and Google ads are AI-enabled today, heading toward 100 percent and roughly $150 billion), and enterprise (tools like Claude Code and Codex inside businesses).
- Disruption is hitting every sector: software, telco (Starlink-powered global phone calls), semis, energy (data centers reshaping Pennsylvania’s grid), auto (Ferrari’s electric and autonomous stumble), and consumer (GLP-1s reshaping food, alcohol, and wellness).
- Final takeaways: the new unicorn economy is healthier thanks to AI, winners are compounding faster so the cost of not owning a winner is higher than ever, disruption is everywhere, and we do not even have superintelligence yet.
- In the Q&A, both Anthropic and OpenAI publicly say they want to be public, and big outcomes now look likely to become liquid within roughly a 12-month window.
- The valuation pushback: these are not fake companies, they generate substantial revenue at scale and grow faster than anything before, and Anthropic reportedly even had a profitable month.
- The public market is framed as the great equalizer and antiseptic, but with passive buying the true price discovery may not land on day one, more like six months and a day after listing.
- A floated LP strategy: wait for whoever reaches $100 billion and concentrate capital there as the least brittle, quickest-return bet, tempered by the warning that valuations are disconnecting from any historical metric (50x to 100x revenue).
- An open risk: with so much capital, OpenAI and Anthropic could rationally start a price war, the way ride-sharing and food-delivery players once did, though heavy infrastructure spend complicates it.
Detailed Summary

The unicorn economy has rebalanced after 2021

Laffont opens by reframing a market many assume is frothy. The average unicorn is up about 70 percent since September 2024, and the public market has tracked a similar climb, so private and public value are moving together rather than diverging. The unicorn economy’s share of the NASDAQ rose sharply after 2015 and then plateaued, which he reads as a sign of how strong public companies have become. Underneath the headline, the structure of funding has changed. The 2021 ZIRP era was a unicorn factory that minted enormous numbers of companies, and that machine has since normalized to a much lower level. The result is a barbell: fewer new unicorns, but each raising far more, with funding per unicorn up roughly 5x since 2021. AI sits at the center of this, taking a steadily larger share of all venture dollars for several years running.

Cohort health is the real story

The deck’s most important slide measures the health of the ecosystem by cohort. The pre-ZIRP cohort, about 73 unicorns, looks healthy: 20 quarters after becoming unicorns, roughly 80 percent had either raised a new round or exited. The 2021 cohort tells the opposite story. It is enormous, about 479 unicorns, and 20 quarters in, fewer than 20 percent had raised again or exited. That contrast sets up the central question of the talk. A new 2024 cohort of AI companies is forming, and no one yet knows whether it will resemble the healthy pre-ZIRP group or the bloated, stuck 2021 group. Laffont’s framing leans optimistic because the AI cohort is small and concentrated, but he is careful not to declare the answer.

The Magnificent 8 and a $4 trillion private index

Funding is not just flowing to AI, it is flowing to a handful of AI names, with the top 10 capturing a large share and Anthropic and OpenAI raising the biggest rounds. From this concentration Laffont builds a private index he half-jokingly calls the Magnificent 8, a number he expects to shrink as companies go public. The members span sectors: SpaceX, Stripe, Anthropic, Databricks, Revolut, ByteDance, and Anduril, covering internet, AI, fintech, and space tech. He says he would be comfortable owning that index for the next decade-plus. Collectively it represents almost $4 trillion of value and has outperformed the public Mag 7, with nearly every constituent beating that benchmark.

Exits are thawing and a wall of liquidity is coming

One of Laffont’s recurring concerns at past summits has been balance: the unicorn economy is great at consuming cash, but a healthy ecosystem must also return it. On that score 2026 is trending well, not quite 2021, but solid with half a year left. Crucially, that figure does not yet include three imminent events. SpaceX is expected to go public within weeks, and Anthropic confidentially filed its S1 the day of the talk. Adding those up, just a few companies could deliver more liquidity than the prior ten years combined. The takeaway is that the ecosystem that was dangerously out of balance in 2024 is now meaningfully more balanced, and improving.

The revenue ramp past the hyperscalers

The growth rates of OpenAI and Anthropic, Laffont argues, are unlike anything previously seen. Charting from January 2025, the leading AI labs passed Workday, then ServiceNow, then Adobe by year end, then Salesforce by January, and are now bigger than Google Cloud and Azure. On forecast, that revenue could surpass AWS by the end of the year and exceed all of Microsoft by 2028. He stresses that the hyperscalers are not passive bystanders, they are actively funding the disruption, pouring unprecedented capital into enabling the change that began with the ChatGPT moment.

The SpaceX CODE framework

Laffont devotes real time to how Coatue thinks about SpaceX. The single factor most correlated with SpaceX’s valuation is cadence of launches, which is intuitive for a launch business. The surprise is that valuation per launch has risen rather than fallen as cadence climbed. His explanation, the CODE framework, is that the quality of the business model improves the more SpaceX launches. In phase one, pre-constellation, you are simply proving rockets, with a few government customers and lumpy, unpredictable one-time revenue. In the initial ramp you stand up a constellation, which is an end market and a recurring-revenue business that grows with every satellite and subscriber. At scale you operate multiple constellations, and Laffont expects companies, governments, and militaries to want to own their own. Ultimately it becomes a platform, with new businesses layered on top, from space data centers to the optionality of the moon and Mars.

Counterintuitive odds and the speed of value creation

Coatue bucketed companies and asked the odds of a 10x within each. A unicorn has roughly an 8 percent chance of becoming a decacorn. A decacorn has 8 to 13 percent odds of reaching $100 billion. But a centicorn, $100 billion or more, has a 31 percent chance of a 10x, counting both public and private companies. The bigger you are, the better your odds, which inverts intuition. Laffont pairs this with the sheer speed of recent value creation. Going from $500 billion to $1 trillion in market cap normally takes years, yet three companies did it in a single year and two did it in a matter of weeks. He also offers Cerebras as the patient counterexample, a chip company that endured years of dark periods and no new capital before a massive OpenAI contract quintupled its value ahead of IPO, part of a broader generational run for semiconductors.

AI memory and where the revenue actually comes from

A throughline from the day’s other speakers is that the more an AI knows about you, the more useful it is, from your restaurant preferences to your work context. Laffont turns that into a thesis: memory per user could quintuple based on what these systems require, which helps explain recent moves in memory companies. He then tackles the most contested question, where is the revenue. He sizes the AI ecosystem at about $140 billion today, roughly $300 billion this year, and doubling in 2027, built on three pillars. Consumer is subscribers times ARPU. Ads are the pillar people forget, with about a quarter of Meta and Google ads already AI-enabled and penetration heading toward 100 percent, a roughly $150 billion opportunity. Enterprise is the breakthrough category, exemplified by tools like Claude Code and Codex operating inside businesses.

Every sector is being transformed at once

What makes this era different, Laffont says, is that nearly every sector is being transformed simultaneously. Software is obvious, but look at telco, where he believes Starlink will soon power a device that lets you make a phone call anywhere on earth, attacking the global telco and broadband profit pool with a better product. Compute is driving massive change in semis, data centers are reshaping the energy equation in places like Pennsylvania, and the auto business is being upended, as Ferrari’s stumble introducing electric and autonomous technology showed. In consumer, GLP-1 drugs are profoundly changing consumption of food and alcohol and the broader focus on wellness. His takeaways close the loop: the new unicorn economy is healthier thanks to AI, winners are compounding faster so the cost of missing them is higher than ever, disruption is everywhere, and superintelligence has not even arrived yet.

The Q&A: power law, valuation, and the public market test

Chamath and Jason Calacanis press Laffont on what this means for allocators. The recurring theme is the power law and K-shaped outcomes, with gains consolidating into a small number of companies. The positive side, Laffont notes, is that outcomes are enormous and increasingly liquid within a 12-month window, and both Anthropic and OpenAI say they want to be public. The hard part is valuation. The besties cite Bill Ackman’s framing that investors are making venture bets on trillion-dollar companies at 50 to 100 times revenue. Laffont’s pushback is that these are not fake companies, they generate substantial revenue at scale and grow faster than anything before, and Anthropic reportedly had a profitable month. But he embraces the discipline ahead: the public market is the great antiseptic and will not care about anyone’s presentation, though with heavy passive buying, true price discovery may take roughly six months and a day rather than landing on day one. Asked whether the compounding is a market inefficiency or survivor bias, he declines to over-read a small sample, noting that Anthropic before Claude Code was a completely different company than after. The conversation closes on what happens when trillions recycle from GPs to LPs, the case for simply owning whoever crosses $100 billion, the risk of everyone crowding into three names, and the possibility of an eventual OpenAI versus Anthropic price war.

Notable Quotes

“So we have fewer unicorns that are each raising more.”
Thomas Laffont, summarizing how funding per unicorn has risen roughly 5x since 2021

“The reason is that the quality of SpaceX’s business model increases the more you launch.”
Thomas Laffont, explaining the CODE framework and why valuation per launch rises with cadence

“The winners are compounding faster than ever, which means the costs of not being in a winner are higher than ever.”
Thomas Laffont, on the central risk of a power-law market

“And by the way, we don’t even have super intelligence yet.”
Thomas Laffont, closing his takeaways on how early the transformation still is

“These are companies generating substantial revenue at scale that are growing faster than anything we’ve ever seen.”
Thomas Laffont, pushing back on the idea that AI valuations rest on fake companies

“It will be the great antiseptic. It will not care about my presentation.”
Thomas Laffont, on the public market as the ultimate test for SpaceX, OpenAI, and Anthropic

“Anthropic pre-cloud code was a completely different company than post cloud code.”
Thomas Laffont, on why he won’t over-read a small sample of hyper-compounders

“The power law rules our lives. All the great gains are being consolidated into small numbers of companies.”
An All-In host, framing the Q&A on concentration in private markets

This is a curated set of highlights. To hear the full presentation, the slide walkthrough, and the complete Q&A with Chamath and Jason Calacanis, watch the full conversation here.

Related Reading
- Coatue Management. Primary source for Thomas Laffont’s firm and the technology investing strategy behind the deck.
- The All-In Podcast. The show and summit where Laffont made this premiere presentation.
- Power law (Wikipedia). Background on the distribution Laffont and the hosts say governs venture and public-market returns.
- The Magnificent Seven (Wikipedia). The public-market benchmark Laffont’s private “Magnificent 8” index is measured against.
- Cerebras Systems. The AI chipmaker Laffont cites as the slow-grind IPO that was eventually transformed by a major OpenAI contract.
June 4, 2026
Gavin Baker on Orbital Compute, TSMC, Frontier AI Models, Anthropic’s Vertical Take Off, and the Coming Wafer Shortage
Gavin Baker, founder and CIO of Atreides Management, returns to Patrick O’Shaughnessy’s Invest Like the Best for his sixth appearance. He calls the current AI moment the most extraordinary moment in the history of capitalism, walks through what Anthropic’s vertical takeoff in revenue actually means, lays out why orbital compute is closer than skeptics believe, dissects the TSMC bottleneck that may be the only thing standing between today’s market and a full-on AI bubble, and rates every hyperscaler on how they have positioned for a world where frontier model providers may stop selling API access altogether.

TLDW

Anthropic added eleven billion dollars of ARR in a single month, which is roughly the combined business of Palantir, Snowflake, and Databricks built over a decade. That is the setup. From there Gavin Baker covers the March and April selloff, the contrarian read that a closed Strait of Hormuz was actually bullish for American manufacturing competitiveness, why Anthropic and OpenAI multiples may be misleadingly cheap on an unconstrained run rate basis, why Elon Musk’s discipline on SpaceX valuation created a superpower of permanent access to capital, the practical engineering case for orbital compute as racks in space rather than Pentagon sized space stations, why TSMC’s capacity discipline is the single most important variable in whether the AI cycle becomes a bubble, what Terafab in Texas changes, why the Pareto frontier of AI models has flipped from Google dominance to Anthropic and OpenAI dominance in nine months, the shift from all you can eat AI subscriptions to usage based pricing and what that means for revenue scaling, Richard Sutton’s bitter lesson as the largest risk to the AI trade, why frontier tokens still capture an overwhelming share of economic value, the role of continual learning as the third great open question, why most new chip startups should not try to build a better GPU, why Cerebras did something different and hard, why disaggregated inference may extend GPU useful lives to ten or fifteen years and rescue the private credit industry, why being in the token path is the new venture filter, the new prisoner’s dilemma around releasing frontier models via API, an honest rating of Google, Meta, Amazon, and Microsoft, why personal safety is becoming a real AI era risk, and why he remains an AI optimist maximalist who believes this could be the next Pax Americana.

Key Takeaways
- Anthropic added eleven billion dollars of ARR in one month, more than the combined businesses of Palantir, Snowflake, and Databricks built across a decade. There is no precedent for this in the history of capitalism.
- The SaaS and cloud revolution created between five and ten trillion dollars of value over twenty years. AI is replaying that compression on a timeline measured in months.
- The March selloff was a drawdown driven by disagreement with price action, not invalidated thesis. That is the kind of drawdown an investor can lean into.
- Deep Seek Monday in January 2025 was a similar setup. By the day of the selloff, AWS Asia GPU prices had already doubled, GPU availability had fallen, and it was obvious reasoning models would be vastly more compute hungry at inference. The market priced the opposite.
- The Strait of Hormuz closing was actually positive for America. US natural gas (the primary input into US electricity, which feeds AI) fell twenty percent on Bloomberg while Asian and European natural gas doubled or tripled. American manufacturing competitiveness improved overnight.
- The US is now the world’s largest producer and exporter of oil and gas. The economy is dramatically less energy intensive than in the 1970s. The shortage trauma comparison does not hold.
- Tech as a sector traded as cheaply versus the rest of the market in early April as at any point in the last ten years, into the single most bullish moment for AI fundamentals on record.
- Anthropic is dramatically more capital efficient than OpenAI, having burned roughly eighty percent less to reach a similar revenue scale. They have very different structural returns on invested capital.
- Anthropic at roughly nine hundred billion for fifty billion of ARR (growing a thousand percent) is striking. Adjusted for compute constraint, the unconstrained run rate could be one hundred fifty to two hundred billion, putting the implied multiple closer to five times.
- Claude Opus generates roughly seventy percent fewer tokens for the same question than previously, with token quantity tied to answer quality. Subscribers on flat-fee plans are getting a lobotomized model.
- Elon Musk’s superpower is twenty years of making investors money. He never pushes valuation. SpaceX compounded low thirty percent per year for a decade because Musk treats fair pricing as a sacred covenant.
- Capitalism will solve the watts shortage. The current bottleneck has shifted from chips and energy to zoning and political approval. Many capex decisions are paused until after the US midterms.
- The watts shortage probably begins to alleviate in 2027 and 2028. Orbital compute solves it longer term.
- Orbital compute is not Pentagon sized data centers in space. It is racks in space. A Blackwell rack is three thousand pounds, eight feet tall, four feet deep, three feet wide. SpaceX has shown a satellite roughly that size.
- The satellites operate in sun synchronous orbit so solar wings (around five hundred feet per side) always face the sun and the radiator on the dark side always points to deep space.
- Starlink V3 satellites already run at around twenty kilowatts. A Blackwell rack runs at one hundred kilowatts. SpaceX engineers express genuine confidence they have already solved cooling and radiator design at these scales.
- Racks in space are connected with lasers traveling through vacuum, the same lasers already on every Starlink. SpaceX operates the world’s largest satellite fleet and, via xAI Colossus, the world’s largest data center on Earth.
- Inference will move to orbit. Training will stay on Earth for a long time. Terrestrial data centers remain valuable for the rest of an investor’s career.
- The wafer bottleneck is structural and political. TSMC is essentially Taiwan’s GDP, water, and electricity. The leaders see themselves as inheritors of Morris Chang’s sacred legacy and they do not behave like a Western public company.
- Jensen Huang has never had a contract with TSMC. The relationship is run on handshakes and the assumption that things will be fair over time.
- If TSMC did everything Jensen wanted, Nvidia could be selling two to three trillion dollars of GPUs in 2026 and 2027. TSMC’s discipline is the single largest factor preventing a true AI bubble.
- Historically, foundational technologies always get a bubble. Railroads, canals, the internet. The current AI buildout is overwhelmingly funded out of operating cash flow, GPUs are running at one hundred percent utilization, and that is fundamentally different from the year 2000 fiber overbuild.
- If one of Intel or Samsung Foundry catches up at the leading node, the other will follow, and TSMC’s discipline collapses. Watch TSMC capacity decisions to predict a bubble.
- Terafab, the SpaceX and Tesla joint venture to build the world’s largest fab in America, has a partnership with Intel that grants access to fifty years of institutional foundry knowledge. The A teams at ASML, KLA, Lam Research, and Applied Materials will follow Elon’s reputation in hardware engineering.
- The hiring playbook for Terafab includes building Taiwan Town, Japan Town, and Korea Town next to the fab. Recruit the engineers and import their families, their restaurants, and their staff.
- Frontier tokens still capture an overwhelming share of all economic value created at the model layer. This is surprising and is one of the three big open questions for AI investing.
- The Pareto frontier of intelligence versus cost has flipped. Nine months ago Google’s TPU dominated every point on the frontier. Today Anthropic and OpenAI dominate, with Grok 4.3 on the frontier and Gemini 3.1 hanging on.
- Google’s conservative TPU V8 design (partly an attempt to reduce dependence on Broadcom and Nvidia) is the leading explanation for the loss of per token cost leadership.
- AI pricing is shifting from all you can eat to usage based, mirroring the cellular and long distance industries. Cellular stopped being a great growth industry when it went all you can eat. AI just made the opposite move.
- OpenAI and Anthropic together could exceed two hundred billion in ARR this year if compute keeps coming online and frontier token pricing holds.
- The two hundred fifty dollar a month consumer AI plan is no longer enough to evaluate frontier capability. Enterprise plans with usage based billing are required because rate limits are now severe.
- The three biggest open questions for AI investors are: violation of the bitter lesson via ASI or human ingenuity, whether frontier tokens keep commanding their premium, and when continual learning arrives.
- Today’s continual learning is crude reinforcement learning during mid training on verifiable tasks. True continual learning means weights updating dynamically, like a human who learns the first time they touch fire.
- Trying to build a better GPU is a losing strategy. Jensen will copy any one to three percent share design. Startups should target one percent share, do something different, and make it hard enough that Nvidia cannot fast follow.
- Disaggregated inference (separating prefill and decode) opens new design canvases. Prefill is memory capacity bound. Decode is memory bandwidth bound. Each can be optimized independently.
- Cerebras did something different and hard with wafer scale computing. Three generations of chips and real grit to get there.
- Disaggregation of inference may stretch GPU useful lives to ten or fifteen years, dropping financing costs from low sevens to five or six percent, mathematically lowering the cost of the AI buildout and likely saving the private credit industry from its SaaS loan exposure.
- Sellers of shortage outperform buyers of shortage. But owning the largest installed base of what is currently in shortage (hyperscaler CPU fleets, for example) is also a strong position.
- Most of the economic value at the application layer of AI has been destroyed, not created. The exceptions are companies in the token path or in niches small enough that frontier labs ignore them.
- Coding may be the shortest path to ASI. If you can write code, you can write code that does anything. Cursor, Cognition, and Anthropic correctly focused on it.
- Jensen could probably get close to the frontier with his own Nemotron family of models whenever he wants. The fact that he chooses not to is a strategic decision about not commoditizing his customers.
- The new prisoner’s dilemma in AI is whether frontier labs release their best model via API. If everyone agrees not to, Chinese open source falls behind. If anyone defects, the defector pulls ahead on revenue and resources, forcing everyone else to defect.
- Google still owns the largest compute installed base. Without TPU’s prior cost advantage, this matters more. YouTube data has real value in a world of robotics. GCP is going crazy.
- Meta deserves credit for becoming AI first internally faster than any other internet giant. Musa, their first MSL model, is impressively close to the Pareto frontier.
- Amazon is strong because of Trainium and robotics driven retail P&L efficiency. Nova is better than it gets credit for.
- Microsoft flinched on capex in early 2025 and lost position. Satya Nadella’s current decision to use Microsoft compute for Microsoft products rather than reselling to OpenAI is a courageous and probably correct call, even at the cost of an eight hundred dollar stock price.
- The hyperscalers most engaged with startups are Amazon and Nvidia by a mile, followed by Google. Broadcom is the favorite ASIC partner. AMD, Microsoft, and Meta have minimal startup engagement and that will cost them as the best teams are now at startups.
- Personal safety in an AI era requires a family or company safe word that cannot be socially engineered. Deepfake voice and video extortion at the speed of FaceTime is already feasible.
- Ukraine is winning largely on the back of having the best battlefield AI outside America and Israel. Adversaries are starting to internalize what AI dominance means geopolitically.
- An optimistic read is that this becomes a new Pax Americana, the way the post 1945 American nuclear monopoly was used to rebuild Germany and Japan rather than dominate.
- AI cured a friend’s daughter’s rare disease by spinning up a research effort that identified a market drug capable of impacting her condition. That is the upside that keeps Gavin an AI optimist maximalist.
Detailed Summary

The most extraordinary moment in the history of capitalism

Gavin’s framing of the current moment is unusually direct. Anthropic added eleven billion dollars of annual recurring revenue in a single month. The three highest profile SaaS companies of the last decade plus, Palantir, Snowflake, and Databricks, took a decade and tens of thousands of employees collectively to build the combined business that Anthropic added in thirty days. He has been investing through every major tech cycle and says there is no historical analog. Not the dotcom era, not the cloud transition, not mobile. This is its own thing.

The market response, then, was peculiar. The NASDAQ sold off into the single most bullish moment for AI fundamentals on record. Tech traded at roughly its widest discount versus the rest of the market in a decade. Investors who said they wished they had bought into AI during 2022, during COVID, or during Deep Seek Monday got the same valuation setup again in early April, this time with an even clearer inflection.

Why the Strait of Hormuz closing was secretly bullish for America

One reason the macro fear in March may have been mispriced is that the same geopolitical event that drove the selloff was, in practice, a relative benefit to the United States. American natural gas, the input into American electricity, which is the input into American AI training and inference, fell roughly twenty percent. Asian and European natural gas prices doubled or tripled. The US emerged with sharply improved relative manufacturing competitiveness, which is exactly what the current administration cares about.

The 1970s comparison does not hold. The US economy is dramatically less energy intensive, it is now the world’s largest producer and largest exporter of oil and gas, and there are no shortages, only price moves. That backdrop made it easier for disciplined investors to stay focused on AI fundamentals through the volatility.

Anthropic and OpenAI valuations on an unconstrained run rate

Anthropic at roughly nine hundred billion for fifty billion of ARR sounds rich until you adjust for the fact that the company is severely compute constrained. Gavin estimates that, unconstrained, Anthropic might be at one hundred fifty to two hundred billion in run rate revenue, putting the implied multiple closer to five times. He also points out that Claude Opus now generates roughly seventy percent fewer tokens for the same question than it used to. Token quantity correlates with answer quality, and Anthropic is rate limiting and shrinking outputs to ration capacity across its user base.

Anthropic and OpenAI are also structurally very different. Anthropic has burned around eighty percent less cash than OpenAI to reach a comparable revenue scale. That implies very different long term returns on invested capital, though OpenAI has done a better job locking in compute and Sarah Friar is one of the most exceptional CFOs Gavin has worked with.

Why neither lab is raising at a three trillion dollar valuation

The answer Gavin gives is that both labs are deliberately leaving valuation on the table the way Elon has done for two decades. SpaceX compounded at low thirty percent annually for a decade because Elon never pushed price. The result is a permanent superpower of access to capital. Investors trust him because they have made money with him for twenty years. That is a moat that compounds with every round.

Anthropic could probably raise at a one hundred percent premium to its rumored latest mark. They are choosing not to. In an uncertain world (Ukraine, Russia, Iran, Taiwan), preserving the ability to raise more capital later at fair prices is more valuable than maximizing this round.

Watts and wafers, the two real constraints

Capitalism is solving the watts problem. The leading PE infrastructure investors now say zoning and political approval, not chips or energy, are the gating factors. Companies are deferring big capex announcements until after the US midterms. Turbine capacity is being doubled at the manufacturers. Companies like Boom Aerospace are repurposing jet engines for grid use. Watts probably ease meaningfully in 2027 and 2028 and then orbital compute does the rest.

Wafers are the harder problem because they live in Taiwan, run on handshakes, and depend on a corporate culture that does not respond to public market incentives. TSMC is essentially the GDP, water consumption, and electricity consumption of Taiwan. Its leadership treats the company as the legacy of Morris Chang. The Silicon Shield doctrine is real and internal.

Orbital compute as racks in space

The biggest mental update Gavin asks listeners to make is to stop picturing data centers in space as Pentagon sized space stations. A Blackwell rack is three thousand pounds and roughly the size of a refrigerator. SpaceX has shown a concept satellite of about that size. Solar wings extend five hundred feet to each side and the radiator extends hundreds of feet behind, both possible because the orbit is sun synchronous and the orientation is fixed relative to the sun.

SpaceX engineers Gavin has spoken to at Starbase express genuine confidence that they have solved cooling at these power levels. They have. Starlink V3 satellites already operate at twenty kilowatts. A Blackwell rack is one hundred kilowatts. The same company operates the world’s largest satellite fleet and the world’s largest data center on Earth via xAI Colossus. The racks are connected to each other with lasers traveling through vacuum, technology already deployed in every Starlink. The naysayers, Gavin observes, are armchair skeptics and Larry Ellison’s response (he is out there landing rockets, no one else is) is the right frame.

Terafab in Texas and the threat to TSMC’s discipline

Terafab, the SpaceX and Tesla joint venture, intends to be the largest fab in the world. The partnership with Intel grants access to fifty years of foundry institutional knowledge, allowing Terafab to start three to five quarters behind the leading node rather than fifteen years behind. The A teams at the semicap equipment companies (ASML, KLA, Lam Research, Applied Materials) will follow Elon’s reputation in hardware engineering the same way they followed TSMC twenty years ago when Intel stumbled.

The talent strategy is the part most observers underestimate. Recruit the best engineers globally, then import their families, their restaurants, their staff. Build Taiwan Town, Japan Town, and Korea Town next to the fab. Optimize the human experience for the people whose work matters. Intel and Samsung do not think that way.

Bubble watch and the year 2000 comparison

Every foundational technology in modern history has had a bubble. Railroads, canals, the internet. Carlota Perez documented why. Markets correctly identify the importance, diversity of opinion collapses, supply gets ahead of demand, the bubble crashes. The current cycle has two important differences. The buildout is overwhelmingly funded out of operating cash flow, not debt. Every GPU is running at one hundred percent utilization, while at the peak of the fiber bubble ninety nine percent of fiber was unused.

TSMC discipline is the single largest reason a bubble has not formed. If Jensen could buy everything TSMC could theoretically make, Nvidia could sell two to three trillion dollars of GPUs in 2026 and 2027. At some point that becomes more than the market can absorb. If Intel or Samsung Foundry catches up at the leading node, the other will too. TSMC’s pricing discipline collapses and the bubble starts.

The Pareto frontier and the loss of Google’s cost advantage

The most important chart in AI is the Pareto frontier of model intelligence versus per token cost. Nine months ago, Google’s TPU based models dominated every point on it. OpenAI, Anthropic, and xAI sat inside the frontier. Today the frontier is dominated by Anthropic and OpenAI, with Grok 4.3 on the frontier and Gemini 3.1 hanging on by subsidization more than economics. The most likely cause is Google’s conservative TPU V8 design, an attempt to reduce dependence on Broadcom and Nvidia that sacrificed per token economics.

The bitter lesson, frontier tokens, and continual learning

Three open questions dominate AI investing. The first is whether Richard Sutton’s bitter lesson (more compute beats human algorithmic cleverness) gets violated by ASI itself optimizing for efficiency. Closer observers of AI are more skeptical of a violation. Gavin thinks ASI’s first move will be to make itself more efficient and more resourced, which is technically a temporary violation.

The second is whether frontier tokens keep capturing the overwhelming share of economic value at the model layer. Today they do, surprisingly. Gemini 3.1 Pro was mindblowing nine months ago and is intolerable today. The third is when continual learning arrives. Today’s models need a million fire touches to learn what a human learns from one. True continual learning would mean dynamic weight updates in real time and would produce a fast takeoff.

From all you can eat to usage based AI pricing

AI is shifting from flat fee plans to usage based pricing. The historical analogy is cellular and long distance. Both stopped being great growth industries when they went all you can eat. AI just made the opposite move. The consequence is that flat fee subscribers, even on premium consumer plans, get a rate limited and token throttled version of the frontier model. Enterprise plans with usage based billing are now required to evaluate true capability. Gavin thinks the combination of new compute coming online and usage based pricing is what gets OpenAI and Anthropic past two hundred billion in combined ARR this year.

Chip startups, prefill decode disaggregation, and Cerebras

Trying to build a better GPU is the wrong move. The four scaled players (Nvidia, AMD, Trainium, TPU) have copy capability for any one to three percent share design that looks attractive. The good news for startups is that disaggregated inference (separating prefill and decode) opens a richer design canvas. Prefill is memory capacity bound. Decode is memory bandwidth bound. Each can be optimized independently. Andrew Fox’s analogy is a British naval ship of the eighteenth century. Prefill is loading the cannon. Decode is firing it.

Cerebras is the model. Wafer scale computing is genuinely different and genuinely hard. It took three generations of chips to get right. Andrew Feldman and his team had the grit to keep going through chip one being a failure. The design has a high ratio of on chip compute and memory relative to shoreline IO, which is why Cerebras is now experimenting with putting an optical wafer on top of the compute wafer to solve scale out.

GPU useful lives and the rescue of private credit

One of the strongest claims in the conversation is that disaggregated inference will stretch GPU useful lives to ten or fifteen years. The skeptical narrative (GPUs are obsolete in two years, companies are cooking their depreciation books) is wrong. You can put a Cerebras system or Groq LPU in front of older Hopper or Ampere parts, use them only for prefill, and run them until they physically melt. Private credit, which is in pain from SaaS loans and which underwrote GPU loans on three to four year lives, may be saved by this.

If GPU financing rates can come down from low sevens to five or six percent, the mathematics of the AI buildout improves materially. That is a structural tailwind that compounds for years.

The application layer, the token path, and a new prisoner’s dilemma

Trillions of dollars of value have been destroyed at the application layer, not created. Cursor and Cognition are the rare scaled exceptions, and they got there by focusing on coding very early. As Amjad Masad noted, coding is plausibly the shortest path to ASI because a coding agent can write itself into any new domain. Jamin Ball’s frame is that the new venture filter is whether the company is in the token path. Data Bricks is. Most application layer startups are not.

Jensen could probably get close to the frontier with Nemotron whenever he wants, and the strategic question of whether to do that is a new prisoner’s dilemma. If every frontier lab agrees not to release best models via API, Chinese open source falls steadily behind. If anyone defects, the defector gains revenue and resources, and everyone else has to defect. The same dynamic exists between TSMC, Intel, and Samsung. If Nvidia or AMD ever truly used an alternative foundry, that foundry would catch up rapidly.

Rating the hyperscalers

Google has the largest compute installed base, the YouTube data that matters in a robotics world, and a search business that prints. Their loss of TPU cost leadership is the surprise of the year. If Google IO in five days does not produce a leapfrog model, the Nvidia centric narrative gets even stronger.

Meta deserves real credit. Zuckerberg made Meta AI first internally faster than any other internet giant, paid up for the talent contracts when no one else would, and shipped Musa as a first model from MSL that is close to the Pareto frontier. Amazon is well positioned on Trainium, robotics in retail, and a Nova model line that is better than it gets credit for. Microsoft flinched on capex in early 2025 and lost position. Satya Nadella’s current decision to use Microsoft compute for Copilot rather than reselling to OpenAI is courageous and probably correct, even at the cost of stock price.

The most interesting cross hyperscaler metric is startup engagement. Nvidia and Amazon engage deeply with startups. Google is next. Broadcom is the favored ASIC partner. AMD, Microsoft, and Meta have minimal startup engagement, which Gavin believes will cost them as the best teams now sit at startups.

Personal safety, geopolitics, and the Pax Americana case

The closing section turns darker. Personal safety in an AI era requires a family or company safe word that cannot be socially engineered. Deepfake voice and video extortion via something that looks exactly like your child calling on FaceTime is already feasible. Political violence against AI leaders is a real concern. Geopolitically, Ukraine is winning largely because it has the best battlefield AI outside America and Israel. How adversaries respond to that asymmetry is the next great variable.

Gavin’s optimistic frame is the Pax Americana. After 1945 the US had a nuclear monopoly and could have controlled the world. Instead it rebuilt Germany and Japan, both of which became the most reliable American allies for the next eighty years. If AI dominance plays out similarly, this is a generationally positive story rather than a destabilizing one. The personal anecdote that closes the conversation is a friend whose daughter was diagnosed with a rare genetic condition. He spun up agents, identified a drug already on the market that addresses her mutation, and her life is immeasurably different because of AI. That is the upside.

Thoughts

The Anthropic eleven billion in a month framing is the kind of stat that resets priors. The right way to interpret it is not as a one off but as a measure of how fast value can compound when the underlying technology improves on a curve steeper than the ability of the rest of the economy to absorb it. The skeptical question is whether that ARR is durable or whether it is heavily tied to a customer base of other AI companies that are themselves on a single venture funded year of runway. The bullish answer is that frontier coding, frontier research, and frontier enterprise tasks are not going to stop being valuable, and Anthropic is the best at all three. Both can be true. The number is still extraordinary.

The argument that TSMC discipline is the only thing preventing a bubble is the analytically tightest part of the conversation. The implied trade is to watch TSMC capacity additions like a hawk and to be more, not less, cautious if Intel Foundry or Samsung Foundry ever announce real share at the leading node. The Terafab thesis is more speculative but more interesting. If Elon’s talent recruiting playbook works and the Intel partnership gives Terafab a real seat at the table within five years, the geometry of the global semiconductor industry shifts in a way that is bullish for American manufacturing, bullish for power and water infrastructure in Texas, and ambiguous for TSMC itself.

The Pareto frontier discussion deserves more attention than it usually gets. Pricing leadership in AI is not a vanity metric. It determines who can subsidize free tier usage, who can absorb compute shortages, who can ship cheaper enterprise plans, and ultimately whose model becomes the default for any given workload. Google losing per token leadership in nine months is one of the most under analyzed events in the sector and it explains a lot about why Anthropic and OpenAI are growing the way they are. If Google IO does not produce a leapfrog model, the implied verdict on TPU V8 design choices gets a lot harsher.

The application layer destruction point is worth sitting with. Founders building on top of frontier models are competing in a world where the model itself moves faster than any moat they can build, where the model lab can absorb their niche if it gets interesting, and where the only protection is either deep token path integration or a niche so small the lab does not bother. That is a much harsher venture environment than the early SaaS era. The compensating opportunity is that one human can now run a hundred agents, so the ceiling on what a small team can build is correspondingly higher. The bet is that productivity per founder rises faster than competitive pressure from the labs. We will find out.

The orbital compute pitch is the section that will polarize listeners. The naive read is that this is science fiction. The closer read is that every component (sun synchronous orbit, laser interconnect, twenty kilowatt satellite buses, ten thousand satellite manufacturing cadence, full rocket reusability) already exists. The remaining engineering problems are repair, maintenance, and radiator scale, all of which are real but tractable on a five to ten year horizon. The strategic implication is that the political and zoning ceiling on terrestrial data centers becomes less binding if orbital compute is a credible alternative for inference workloads. The investor implication is that being short the watts and cooling complex on a five year horizon is a real trade, not a meme.

Watch the full conversation here.
May 20, 2026