PJFP.com

Pursuit of Joy, Fulfillment, and Purpose

Tag: AI chips

  • Whale Rock Capital Founder Alex Sacerdote on S-Curve Investing, Why Anthropic Is His Highest Conviction Bet, and the Decommoditization of AI Hardware

    Alex Sacerdote built Whale Rock Capital into one of the most respected technology hedge funds in the world by treating markets through a single disciplined lens: the technology adoption S-curve. In this long conversation on Invest Like the Best with Patrick O’Shaughnessy, he lays out the full framework that has carried him through internet 1.0, mobile, cloud, e-commerce, and now AI, and he explains why Anthropic became his highest conviction position, why his fund went net short application software, and why the least glamorous corner of the market, the hardware and chips that build out data centers, may be one of the best ways to play artificial intelligence right now. What follows is the working theory of a money manager who has spent twenty years trying to think exponentially while the rest of the market thinks one quarter at a time.

    TLDW

    Sacerdote walks through Whale Rock’s three-part investment framework: find the right part of an S-curve, identify the company with a durable competitive advantage, and buy when long-term earnings power is underappreciated. He tells the story of investing in Anthropic at a 180 billion dollar valuation in August 2025 after Claude Code made coding the true unlock of AI, and frames the foundational model market as a three-horse race between Anthropic, OpenAI, and Google that resolved from sixty startups into an oligopoly. He argues enterprise AI is less than 1 percent penetrated, calls the adoption shape an L curve rather than an S-curve, and warns there is not enough compute in the world. He explains why he sold almost all of his application software and went net short, why he loves the decommoditization of AI hardware (Celestica, Corning, Elite Materials, Delta, Advanced Energy, high bandwidth memory, 40-layer PCBs), introduces a modified rule of 40 for chip investing, surveys the moats that let leaders win (network effects, industry standard, scale, critical IP, brand, recursive self-improvement), discusses moving from public markets into private deals like Stripe and Anthropic, lays out Whale Rock’s fund products including the new Mega Cap Tech Fund, defends old-fashioned scuttlebutt research in an AI age, and closes on the kindest thing anyone ever did for him, his father joining the firm after 41 years at Goldman Sachs.

    Thoughts

    The most useful idea in this conversation is not the bullishness on AI, which is everywhere now, but the discipline underneath it. Sacerdote’s framework forces a separation that most investors collapse. A great market is not a great investment. A great company is not a great investment. You need a tall S-curve, a company with a moat that survives the curve, and a price that does not yet reflect the earnings power. He says the quiet part out loud: he has repeatedly bought the best companies in the world at four or five times earnings precisely because the market refuses to extrapolate exponential growth. Nvidia at four times earnings in 2023, Tesla at five times in 2019, Amazon where AWS came free. The edge is not information, it is the willingness to underwrite two to four years out when the consensus cannot see past the next quarter.

    The Anthropic story is the framework applied in real time, and it is worth noting how late and how cautious he was. Whale Rock passed on the 60 billion dollar round because gross margins were negative and coding had not yet exploded. They only got conviction once Claude Code flipped from autocomplete to agentic work, once they heard Anthropic engineers were burning 100 dollars a day in tokens, and once the math on twenty million coders implied a half trillion dollar market from coding alone. The lesson he repeats throughout, that it is okay to be late, that you can miss the first 100 percent if the curve is tall enough, is a direct rebuke to the fear of missing out that drives most AI investing. He waited for the moat to be visible before he paid up.

    His most contrarian and most actionable call is on hardware. The consensus reflex is that chips and components are commodities that get competed to zero. Sacerdote argues the opposite is happening: AI workloads growing 10x a year are pushing every layer of the server to its physical limits, and that pressure is decommoditizing the entire stack. A liquid-cooled AI server is a 300,000 dollar piece of critical infrastructure, not a 5,000 dollar throwaway box, which means the supplier becomes a permanent fixture like a parts vendor on a plane. The Celestica example is the template: a contract manufacturer left for dead since 1999 that turned out to be the sole supplier of Google’s TPU server and a leader in liquid cooling and Ethernet switching, trading at eight times earnings. If he is right that we are 30 percent short on DRAM, NAND, and PCBs, the picks-and-shovels trade has years left to run regardless of which model company wins.

    The software bear case deserves the most scrutiny because it is the most consequential and the least certain. Going from 40 to 50 percent of the portfolio in software to net short is a violent reallocation, and his reasons are layered: AI products that nobody will pay for, CIO budgets being raided to fund Anthropic tokens, pricing power evaporating, and the long-term threat that AI-native startups rebuild incumbents from scratch. But he is honest that the bull case is real too, that old technology is sticky, that companies prefer to buy rather than build, and that AI might actually make platforms like Slack or CRM more important if agents end up operating inside them. This is the genuine uncertainty in the whole AI trade. The bottom of Jensen’s cake, chips and models, is where the value has accrued so far, but historically the application layer captured most of the market cap. Sacerdote is betting that this time the infrastructure and model layers hold the value longer, and he admits the application ecosystem is still unclear and a little bit dangerous. That admission is more valuable than any of his confident calls.

    Finally, the section on research in an AI age is a quiet refutation of the idea that this work automates away. Sacerdote runs a Philip Fisher scuttlebutt operation, 2,500 to 3,000 face-to-face management meetings a year, two decades of compounding relationships, the tripod of conviction where he, his analyst, and a respected outsider all independently like an idea. AI writes better notes now, but the paragraph on top, the wisdom about what it means and how it fits the thesis, is still human. The durable moat in his own business is the same one he looks for in the companies he buys: an accumulated advantage that newcomers cannot replicate quickly. That consistency between how he invests and how he operates is the most credible thing in the interview.

    Key Takeaways

    • Whale Rock’s framework has three legs: identify the right part of a technology S-curve, find the company with a powerful competitive advantage, and invest when long-term earnings power is underappreciated.
    • The core insight is exponential, not linear. Strong tech business models grow earnings exponentially, and because the market refuses to extrapolate, you can buy elite companies at very low multiples.
    • Concrete examples of buying exponential growth cheaply: Nvidia at four times earnings in 2023, Tesla at five times in 2019, Apple at four times, and Amazon where AWS was effectively free.
    • When ChatGPT launched in November 2022, Whale Rock did a firm-wide deep dive and chose to invest in chips and infrastructure first, because demand arrives there first and the winners are knowable regardless of who wins the model layer.
    • The foundational model market went from roughly 60 startups to a three-horse race: Anthropic, OpenAI, and Google. Most startups died, Amazon never showed up, and Meta faltered and had to reboot.
    • Anthropic was the dark horse that focused purely on enterprise while OpenAI won consumer. Whale Rock made it their highest conviction position.
    • Coding is the true unlock of AI. The progression went from Microsoft Copilot at 20 dollars a month (fixing grammar, finding a bug) to Claude running agentically and writing most of the code.
    • The market math: Anthropic engineers were reportedly spending 100 dollars a day on tokens, roughly 20 to 30 thousand dollars a year, and with about 20 million coders in the world that implies a half trillion dollar market from coding alone.
    • Whale Rock invested in Anthropic at the 180 billion dollar valuation in August 2025, when the company hoped to reach 9 billion in revenue and nobody yet knew what 2026 could be.
    • Andrej Karpathy and Linus Torvalds both flipped on AI coding. Karpathy went from 80 percent handwritten code to writing almost no code except in English.
    • Models are not pure commodities. There is real differentiation: Anthropic is strong for private equity and finance, Google is strong at ingesting PDFs, and routers that switch between models mask but do not erase that differentiation.
    • Anthropic is building an ecosystem around the API (SDK, orchestration, the harness, tools), echoing how AWS built lock-in with products around commodity servers starting in 2013.
    • The 800 million people using AI are mostly using AI 1.0, a search engine on steroids. Sundar Pichai estimated only about 10 basis points of knowledge workers are truly using AI’s new capabilities.
    • Enterprise AI is less than 1 percent penetrated. Whale Rock calls the adoption shape an L curve or backwards L curve because it goes straight up, unlike the slower 30 to 50 percent growth of cloud and SaaS.
    • There is not enough compute in the world. Anthropic reportedly has half of what it needs, and Marc Andreessen said the one thing he is sure of is that there will not be enough compute for the next four years.
    • The infrastructure S-curve is only about 10 percent penetrated and remains one of the best ways to play AI.
    • Getting into private deals requires a double opt-in. Whale Rock did a 90-page deck (built with Claude Code) on the coding market to win their Anthropic allocation, and their first private was Stripe in 2020 at a 35 billion dollar valuation.
    • The unicorn private market is now bigger than most European stock markets, larger than Germany or the UK individually. Whale Rock does 2,500 to 3,000 management meetings a year, 10 to 15 percent with privates.
    • S-curves come in two sizes: mega S-curves (internet, mobile, cloud, e-commerce, AI) and sub S-curves within them. AI is the biggest of all and each curve builds on the last.
    • Adoption inflects when barriers fall. Steve Jobs cut the smartphone price to 200 dollars on a 3G touchscreen, Elon cut the EV price to 40,000 with 300-mile range and a working supply chain. Remove the barriers and you get the tornado of demand.
    • Knowing how tall the curve is tells you when to sell. Growth stops being exponential around 30 to 40 percent penetration, when the sell side catches up and big beats end. EVs hit a wall at 10 to 15 percent instead of the expected 40 to 50 percent.
    • Selling Apple in 2012 at roughly 50 percent US smartphone penetration was a mistake, because the moat let it keep compounding around 20 percent even after the explosive phase ended.
    • At strategic inflection points you cannot trust the data (Andy Grove). The signal is intuition and anecdote: a 12-year-old in China on a giant phone playing a real game, or standing-room-only sessions at the Gartner IT Symposium for AWS, VMware, and Splunk.
    • Adoption slope varies. The radio curve hit near-full penetration in about 7 years, while B2B and infrastructure (the dishwasher that has to be plugged in) take far longer. AI is fast because you just open a browser.
    • The moats that let leaders win: network effects, becoming an industry standard, rapid scale, critical intellectual property, brand, and platform lock-in. Anthropic appears to have critical IP, enterprise brand, escape velocity, and recursive self-improvement from using its own code on its own models.
    • On the internet, the leader usually goes bigger, faster, and wins, and compounds on itself (Amazon, Shopify). Exceptions come at paradigm shifts, like AOL failing to make the dialup-to-broadband transition.
    • Whale Rock went from 40 to 50 percent in software five years ago to net short entering this year, which helped performance in the first quarter. AI products were not good enough to charge for and were not moving the needle.
    • Software faces a stack of headaches: falling priority on CIO to-do lists, budget pressure from token spend, lost pricing power, hiring freezes that hurt seat-based models, and the long-term threat of AI-native replacements.
    • The classic rule of 40 is growth rate plus operating margin. Whale Rock’s modified rule of 40 for chip investing is percent of sales that are AI plus market share in that category. Software AI exposure is still only 1 to 2 percent.
    • AI may make some platforms more important. The first thing you do with Claude is plug it into Slack, which could make Slack a permanent repository, and agents may end up operating inside incumbent tools like CRM, solidifying rather than killing them.
    • The data center stood still for 40 years on Intel x86, with every component commoditized. AI changed that. Workloads growing 10x a year are driving the decommoditization of the hardware industry.
    • Celestica is the template: a contract manufacturer left for dead since 1999, sole supplier of the Google TPU server, strong in liquid cooling and Ethernet white-box switching, with 50 to 60 percent share of the cloud Ethernet switch market, once trading at eight times earnings.
    • The whole supply chain is rerating: high bandwidth memory stacked 10 chips high, 40-layer PCBs (versus 10 for a normal server), Elite Materials copper clad laminate, Corning fiber (enough to circle the world four and a half times in one Microsoft data center), and Delta and Advanced Energy power supplies seeing ASPs rise 40 percent a year.
    • Networking has three layers: scale out (racks together), scale across (data centers together), and scale up (every GPU in a rack, currently copper, eventually fiber). The copper-to-fiber shift could two-to-three-x Corning’s opportunity.
    • Whale Rock estimates the market is roughly 30 percent short on DRAM, NAND, and PCBs even at today’s 10 basis points of real AI usage.
    • Rate of change matters more than absolute level. When Claude plotted market share data it missed the rate of change, the thing that drives accelerating growth and margins as a company moves from 10 to 30 percent share.
    • Key risks: public and government negativity toward AI (Maine reportedly banned data centers, only 20 percent of people are optimistic), models hitting a wall and letting open source catch up into a race to the bottom, and a major player faltering and stranding compute.
    • Chip companies do not care who wins the token war, which makes them a relatively safe way to play AI. Jensen Huang actively wants open source to take off.
    • Research is still human work. Whale Rock runs a Philip Fisher scuttlebutt process, the tripod of conviction (Alex, the analyst, and a respected outsider), and 20 years of compounding knowledge. AI writes better notes but cannot supply the wisdom paragraph on top or pick stocks.
    • The firm’s product evolution: 15 years as a long short fund, a long only fund in 2020 that is now larger than the long short, opt-in privates formalized around 2015 and activated in 2020, an 80 percent privates hybrid fund in 2021, and the new Whale Rock Mega Cap Tech Fund.
    • The Mega Cap Tech Fund thesis: endowments are structurally underweight the largest tech companies because they believe there is no alpha in large cap. Whale Rock takes the top 30 global market caps and picks the best 12 or 13, arguing it takes 100 diversified PMs to realize Google is a winner.
    • The kindest thing anyone ever did for Sacerdote: his father, after 41 years at Goldman Sachs, joined Whale Rock as chairman and the gray hair for six years until he passed away in 2011.

    Detailed Summary

    The Anthropic Investment and the Three-Horse Race

    When ChatGPT launched in November 2022, Whale Rock immediately took its 10-person team and ran a firm-wide deep dive. Sacerdote’s first principle is that every new compute paradigm creates a new stack with new winners and losers, and in this stack the layers run from power and chips at the bottom, to the clouds, to the foundational models, to the applications on top. In early 2023 the firm deliberately positioned in chips and infrastructure first, reasoning that demand arrives there first and the winners are knowable no matter who wins above. At an April 2023 webinar they framed the model layer as a coin flip between winner-take-all, total commodity, a race to zero, or an oligopoly of three or four. Over the next three years the answer became clear: of roughly 60 startups, almost all died, Amazon never really showed up, Meta came in strong then faltered and rebooted, and Anthropic emerged as the dark horse focused purely on enterprise while OpenAI won consumer and Google remained a perennial threat. The result looked like the cloud market, where three companies underpin the entire SaaS world with excellent businesses.

    The decisive factor was code. Sacerdote says the firm was initially skeptical AI could replace labor, given the negative corporate feedback on early models. That changed in 2025 when Claude Code and the agentic coding tools exploded. The progression ran from Microsoft Copilot at 20 dollars a month, which could improve coding grammar or find a bug, to Claude running agentically and doing far more. The token economics were staggering: Anthropic engineers reportedly spending 100 dollars a day, which annualizes to 20 to 30 thousand dollars, and with 20 million coders worldwide that implied a half trillion dollar market from coding alone, on technology that was only 7 to 9 months old. Whale Rock made the investment at the 180 billion dollar valuation in August 2025, writing in their letter that the company hoped to reach 9 billion in revenue, with growth like nothing they had ever seen, 100 million to a billion on the way to 9 billion, and no one yet knowing what 2026 could bring.

    Why the Models Are Not Commodities

    Everyone expected the foundational models to be pure commodities, but Sacerdote argues there is tremendous differentiation within them. Different training methods produce different skills: Anthropic excels at anything touching private equity and finance, Google is strong at ingesting PDFs. Routers that switch between models make them look like commodities but mask genuine, critical IP. Beyond the model itself, Anthropic is building a whole ecosystem around the API: the SDK, the orchestration layer, the tools, and the harness, the software wrapped around the API that gets the most out of the model. He compares this directly to AWS in 2013, when people dismissed cloud as commodity servers in a warehouse and missed that Amazon was inventing products that slowly built lock-in. The open-source risk from China is real, but Sacerdote got comfortable that leading-edge token quality is superior, because going from 80 to 85 percent of benchmark performance is a huge unlock and the open-source players lack the compute to leapfrog the frontier.

    The S-Curve Framework in Full

    Whale Rock’s whole edge is thinking exponentially when the world thinks linearly. Sacerdote argues very few people believe you can accurately predict two, three, or four years out, but if you understand the S-curve, the moats, and how to model, you can. Every technology follows the same pattern: it exists hidden for years (smartphones 10 years before the iPhone, the internet 20 years before Netscape, EVs 15 years before Tesla went vertical in 2019) until the barriers to adoption fall and demand inflects into a tornado. Knowing how tall the curve is tells you when to sell, because exponential growth stops around 30 to 40 percent penetration when the sell side catches up. Curves can also be dynamic: AWS turned out to address a far larger TAM than expected once it became clear cloud was not actually deflationary. There are mega S-curves (internet, mobile, cloud, e-commerce, AI) and sub S-curves within them. AI is the biggest. And slope varies enormously by the nature of the technology, the radio curve hitting full penetration in 7 years, B2B and infrastructure taking decades because, like a dishwasher, they have to be plugged into existing systems.

    On timing, Sacerdote is relaxed about being late. Citing Peter Lynch, who mentored him at Fidelity and told him to white out the chart because it is all about the future, he argues it is fine to miss the first one, two, or three years and even the first 100 percent if the top of the curve is half a trillion. At strategic inflection points, per Andy Grove, you cannot trust the data, so the firm relies on intuition and anecdote: a 12-year-old in China playing a real video game on a huge phone, or the AWS session at the Gartner IT Symposium that was standing-room-only at 9, 10, and 11 in the morning. Spotting the leader pulling away matters because, on the internet, the leader usually goes bigger, faster, and wins, compounding on itself, with exceptions only at paradigm shifts like AOL missing the move from dialup to broadband.

    The Software Bear Case

    Five years ago Whale Rock had 40 to 50 percent of its portfolio in software. Their April 2023 thesis was that incumbents with huge sales forces and proprietary data would take the AI APIs and build great products. Instead, the AI products were not good enough to charge for and did not move the needle, so the firm sold almost all of its application software and entered this year net short, which helped in the first quarter. The bear case is layered: software has fallen down the CIO priority list, budgets are being raided to fund Anthropic tokens with faster ROI, annual price increases look risky, and hiring freezes hurt seat-based models. The deeper threat is that AI-native startups could rebuild any incumbent from scratch, obviating the data advantage. The bull case is genuine too: old tech is sticky (mobile games did not kill consoles, tablets did not kill the PC), companies prefer to buy rather than build, and an ERP is hard to replace. Sacerdote also floats an optimistic twist, that AI could make platforms like Slack more important as agent repositories, and that agents operating inside CRM could solidify rather than destroy it, even as the bear case is that CRM goes headless and gets relegated to a database.

    The Decommoditization of AI Hardware

    This is Sacerdote’s most differentiated call. For 40 years nothing changed in the data center; Intel x86 became the standard, compute grew 25 to 40 percent a year in line with Moore’s law, and every component, from the printed circuit board to memory to enclosures to networking, commoditized. AI broke that. Workloads now grow 10x a year and push every aspect of the hardware to its physical limits, creating both tremendous unit growth and what Whale Rock calls the decommoditization of the hardware industry. He cites Sean Maguire wishing he could run a hardware hedge fund because all the companies are public with powerful IP, and compares it to Sequoia’s best early hardware investments in Apple and Cisco. The economics flip because an AI server is a liquid-cooled, 200 to 300 thousand dollar piece of critical infrastructure where a single failure brings the whole thing down, so suppliers become permanent like a critical part on a plane.

    Celestica is the marquee example: a contract manufacturer that had been a disaster industry since 1999 and went offshore to China, but kept its IBM supercomputing heritage and talent, became the sole supplier of the Google TPU server, and was trading at eight times earnings three years ago. It turned out to be excellent at liquid cooling where others failed, holds 50 to 60 percent share of the crucial cloud Ethernet switch market, and its engineers helped write the open-source SONiC software, working closely with Broadcom. The same dynamic runs up and down the chain: high bandwidth memory stacked 10 chips high that took Samsung years to master, 40-layer PCBs versus 10 for a normal server with very few suppliers able to make them, Elite Materials supplying the copper clad laminate, and Corning’s fiber, thinner and more bendable, with enough in a single Microsoft data center to circle the world four and a half times. Networking splits into scale out, scale across, and scale up, with the eventual copper-to-fiber shift in scale up potentially two-to-three-x-ing Corning’s opportunity. Power supplies from Delta and Advanced Energy are seeing ASPs rise 40 percent a year at higher margins because each Nvidia rack uses 50 to 125 percent more power. Visibility has gone from we’ll call you next week to design this roadmap with us for four years, turning 5 percent low-margin businesses into 35 to 50 percent topline growers with rising margins, and the whole market is roughly 30 percent short on DRAM, NAND, and PCBs.

    Private Markets, Risks, and the Research Machine

    Moving from public markets into privates meant adapting to a double opt-in, where the company has to choose to let you in. Whale Rock won its Anthropic allocation partly by building a 90-page deck with Claude Code scouring the internet for feedback on the coding market. Their first private was Stripe in April 2020 at a 35 billion dollar valuation, which they could only underwrite because they knew the public comp Adyen cold, and they upsized to a 100 million dollar block. The unicorn market is now bigger than most European stock markets combined. On risk, Sacerdote worries about public and government negativity (Maine reportedly banning data centers, only 20 percent of people optimistic), the possibility that models hit a wall and open source catches up into a race to the bottom, and a major player faltering and stranding compute, though he notes someone else (like Meta stepping into a cancelled Oracle deal) would likely absorb it, and that chip companies benefit regardless of who wins the token war. He explains his caution on the application layer by noting it always comes later, the iPhone took years to spawn its app economy, and the ecosystem is still unclear and a little dangerous, while pointing to Brett Taylor’s Sierra as the kind of company that could prove it out.

    On the research itself, Sacerdote insists AI has not supplanted the analyst. Whale Rock runs the scuttlebutt approach straight out of Philip Fisher’s Common Stocks and Uncommon Profits, doing 2,500 to 3,000 face-to-face management meetings a year and talking to suppliers, customers, and competitors. AI now writes much better notes and gets the team up to speed quickly on complex areas like ABF substrates, but there must be a wisdom paragraph on top, and it cannot pick stocks or replicate the work two analysts did building conviction in AppLovin and a relationship with Adam Foroughi. He calls the firm the Whale Rock learning machine, a group of 10 highly experienced people compounding knowledge for 20 years, with the tripod of conviction (himself, his analyst, and a respected outside investor all liking an idea) as the test. The firm’s products evolved from a 15-year long short fund to a 2020 long only fund now larger than the original, opt-in privates, an 80 percent privates hybrid in 2021, and the new Mega Cap Tech Fund built on the thesis that endowments are structurally underweight the largest tech companies because they wrongly believe large cap has no alpha. He closes on his father, who left Goldman after 41 years to join Whale Rock as chairman and the gray hair until his death in 2011, a mentor remembered by countless people for his humility and grace.

    Notable Quotes

    “When you get the right part of the S-curve, you get exponential unit growth. If you have a very strong business model, your earnings don’t grow linearly, they grow exponentially.”

    Alex Sacerdote, stating the core of the Whale Rock investment framework

    “The world doesn’t think exponentially. Very few people believe you can accurately predict two, three, four years out. But if you follow and understand the S-curve and you know the moats and you know how to model, you really can predict these great things.”

    Alex Sacerdote, on why the market consistently underprices long-term earnings power

    “The enterprise AI or enterprise application AI market is less than 1 percent penetrated, and we’ve never seen, you know, we talk about S-curves, we call this an L curve, just straight up.”

    Alex Sacerdote, on why AI adoption looks different from every prior technology curve

    “We’re at 10 basis points of people really using AI and we’re already sold out. There’s not enough compute in the world. So Anthropic has half of what they need right now, and that’s before this huge takeup.”

    Alex Sacerdote, on the scale of the compute shortage relative to actual adoption

    “It’s okay to be late. It’s okay to miss the first one, two, three years in a lot of cases, because if the top of the S-curve is half a trillion, the growth can go on for a long time. It’s okay to miss the first 100 percent.”

    Alex Sacerdote, on why fear of missing out is the wrong instinct in a tall S-curve

    “The old way of software is like using a pen and paper or a horse and buggy. The new way of software is like a jet engine or frankly like the transporter from Star Trek. It’s so revolutionary it feels like it has to be disruptive.”

    Alex Sacerdote, explaining why Whale Rock went net short application software

    “You become like critical infrastructure, like selling a critical part on a plane. You’ll never get swapped out.”

    Alex Sacerdote, on how liquid-cooled AI servers turned commodity hardware suppliers into permanent fixtures

    “Why do you tell everyone your secret? It’s like why does the casino teach people how to play blackjack? It’s harder. It’s really hard to do.”

    Alex Sacerdote, quoting his mother on why a public framework does not erase the edge

    “He said, you know, I’ve been at Goldman for 41 years. How about I come and join you? I’ll be the gray hair. I’ll be the oversight. I’ll be the chairman. You do what you do.”

    Alex Sacerdote, recalling his father joining Whale Rock, the kindest thing anyone ever did for him

    Watch the full conversation here: Whale Rock Capital Founder on Investing in the Age of Exponential AI.

    Related Reading

  • Elon Musk Announces SpaceX AI Satellites, Starship Mass to Orbit, and a Moon Mass Driver to Climb the Kardashev Scale

    Elon Musk sat down with the SpaceX Starlink team for a wide ranging update that connects every recent SpaceX move into one thesis: harness far more of the sun’s energy by putting AI compute in orbit. In this SpaceX conversation, the group walks from galaxy sized framing (the Kardashev scale) all the way down to the engineering specifics of a new AI satellite, the manufacturing buildout in Bastrop, Texas, and a long term plan that ends with a mass driver on the moon. The pitch is that none of it requires magic, just scaling technology SpaceX already flies.

    TLDW

    Musk frames civilizational progress with the Kardashev scale, a measure of how much power a species harnesses, and points out that humanity uses less than a trillionth of the sun’s output, barely registering even on the Type 1 (planet) level. Because most of Earth is water and the usable sunlit land is limited, the only way to capture a meaningful fraction of the sun’s energy is to go to space, where cooling is also easier since heat radiates straight into the vacuum. Three limiting factors must be solved: mass to orbit (handled by fully and rapidly reusable Starship, which already beats the Saturn V on thrust and aims for millions of tons to orbit per year), solar power plus radiators, and AI chips. SpaceX unveils its first AI satellite design, AI1, a roughly 70 meter wingspan craft at 150 kW peak and 120 kW sustained power that matches an Nvidia GB300 rack, reuses Starlink V3 solar technology, links by laser, and runs at only a few milliseconds of latency from low orbit. Chips start as off the shelf Nvidia GB300 and Rubin parts plus a TPU reference design, then scale through a planned 100 million square foot “Terafab” toward a terawatt per year of compute, about twice current US electricity use. The endgame pushes another 1,000x by manufacturing on the moon and using a lunar mass driver to fling satellites into deep space without rockets.

    Thoughts

    The most important reframe in this conversation is that Starlink, Starship, the xAI acquisition, and a new chip factory are not separate bets. They are one bet expressed as a single number: the percentage of the sun’s energy that civilization can capture and put to work. By anchoring everything to the Kardashev scale, Musk turns “build more satellites” into a measurable physics goal rather than a product roadmap. It is a rhetorically powerful move because it makes today’s hyperscale AI buildout, which already strains terrestrial grids, look like the obvious forcing function for going to space. If you accept that compute demand keeps compounding, then the constraint stops being chips and becomes power and cooling, and space genuinely is better at both.

    The cleverest engineering insight is almost understated: an AI satellite is simpler than a Starlink satellite, not harder. A Starlink craft carries complex phased array and parabolic antennas to talk to millions of dispersed users. An orbital data center mostly needs solar cells, radiators, some laser links, and the chips. SpaceX has already industrialized the hard parts (mass produced solar arrays, constellation flight operations at 10,000 satellites, laser mesh networking), so the new product is closer to a remix of proven subsystems than a clean sheet program. That is the real argument for why SpaceX, specifically, can do this when “data center in space” has sounded like science fiction for a decade.

    The numbers are where skepticism should live, and to his credit Musk says to take the timeline with a grain of salt. An annualized gigawatt of space compute by the end of next year, scaling roughly 10x per year toward a terawatt, is an extraordinary ramp. A terawatt is about twice the entire electricity consumption of the United States, delivered as orbiting hardware. Getting there leans on Starship hitting rapid reusability and on a 100 million square foot chip fab that is ten times Gigafactory Texas. Each of those is itself a moonshot, and stacking them multiplies the risk. The honest read is that the architecture is coherent even if the schedule is aspirational.

    The moon segment is where the talk turns from aggressive to genuinely speculative, and it is the part worth watching. A lunar mass driver, essentially a long linear motor that accelerates payloads to escape velocity, only makes sense once you are already moving enormous mass and want to escape Earth’s gravity well and atmosphere entirely. It is a classic Musk pattern: solve the near term problem (mass to orbit with Starship) in a way that creates the precondition for the next, larger problem (local production on the moon). Whether or not the dates hold, the dependency chain is logical, and it explains why SpaceX keeps investing in capabilities that look excessive for today’s market.

    One underrated takeaway for readers outside aerospace: this is as much a manufacturing story as a space story. The bottleneck is not whether a single AI satellite works, it is whether you can stamp out thousands to a million of them, plus the solar, plus the chips, at volume and low cost. That is why so much of the conversation is about Bastrop production lines, a solar manufacturing facility already under construction, and the Terafab. The space hardware is the visible part; the factories are the actual product.

    Key Takeaways

    • The whole strategy is framed around the Kardashev scale, a measure of how much power a civilization harnesses, named for Russian physicist Nikolai Kardashev.
    • Type 1 harnesses a planet’s available power, Type 2 a star’s full output, and Type 3 a galaxy’s; humanity sits at the very bottom of even Type 1.
    • We currently use much less than a trillionth of the sun’s power output, and a trillion is a million times a million.
    • The sun is about 99.86% of all mass in the solar system; most of the remaining 0.14% is Jupiter, and Earth is a tiny dust mote by comparison.
    • Incident solar energy on Earth’s cross section is roughly a half billionth of the sun’s total power output.
    • Most of that sunlight is unusable because about 70% of Earth is water and much of the land is at the poles or far north where solar is weak.
    • Reaching one millionth of the sun’s output, a “micro” on the Kardashev 2 scale, would be an epic achievement relative to today, and 1% would make a civilization vastly more powerful than ours.
    • Space avoids building massive ground power plants and makes cooling easier, because waste heat can radiate directly into the vacuum.
    • Three limiting factors must be solved to scale: mass to orbit, solar power plus radiators, and AI chips.
    • Starship provides the mass to orbit and is the first rocket designed for full and rapid reusability, the breakthrough behind both multiplanetary life and ascending the Kardashev scale.
    • SpaceX catches the booster with the launch tower instead of adding heavy landing legs, an extreme mass optimization measure.
    • Starship V3 already produces more than double the thrust of the Saturn V; V4 will be roughly three times, making it the largest, heaviest, most powerful moving object ever built.
    • Starship is targeted to eventually fly more than once per hour.
    • SpaceX already delivers roughly 85 to 90% of all Earth mass to orbit with Falcon 9 and Falcon Heavy.
    • The plan is to go from around 2,500 tons to orbit per year to millions of tons per year, reaching a million tons per year in about three years.
    • The AI satellite, called AI1, is actually simpler than a Starlink satellite because it lacks the complex phased array and parabolic antennas.
    • AI1 targets 150 kW peak power and 120 kW sustained power, roughly matching an Nvidia GB300 rack of 72 GPUs.
    • Design assumptions are about 250 watts per square meter for the solar array and about 1,400 watts per square meter for the double sided radiators, both expected to improve over time.
    • Radiators are oriented knife edge to the sun and radiate from both sides; each satellite has roughly a 70 meter wingspan.
    • Each satellite carries on the order of a terabit of laser link connectivity.
    • Satellites connect to each other or to the Starlink constellation by laser, and Starlink relays data to the ground over existing Ka and Ku antennas plus laser to ground links.
    • At 600 to 800 km altitude latency is only around 3 milliseconds, since light travels about 300 km per millisecond.
    • SpaceX has about 10,000 Starlinks in orbit and is the only operator with experience flying constellations at that scale.
    • The constellation could eventually grow to thousands or even up to a million satellites; space is big enough to pack and fly them safely.
    • The satellites and solar will be built in Bastrop, Texas, where a solar manufacturing facility is already under construction.
    • The AI satellite production building and solar production are expected to be operating at reasonable volume by the end of next year.
    • SpaceX keeps making Starlink user terminals in Bastrop and is turning on new, higher volume production lines, with possibly a few hundred million terminals eventually, plus a direct to cell constellation that connects straight to phones.
    • Initial chips are off the shelf: the reference design targets Nvidia GB300 or Rubin chips, with a TPU reference design as well, and essentially any existing chip can be put into orbit.
    • The chip industry looks set to reach maybe 100 gigawatts a year of AI compute, far short of the terawatt SpaceX wants.
    • To close that gap, SpaceX plans a “Terafab,” a chip factory around 100 million square feet, roughly 10 times the size of Tesla Gigafactory Texas.
    • A terawatt of chip output per year is like a billion full reticle equivalent chips, each running about a kilowatt, plus a lot of memory.
    • The timeline targets an annualized rate of a gigawatt per year of space compute by the end of next year, scaling roughly 10x per year: 10 GW in about 2.5 years, 100 GW in about 3.5 years, then a terawatt per year, which is 1,000 GW and about twice current US electricity consumption.
    • Beyond a terawatt, the only path to another 1,000x is the moon, using local production of photovoltaics, solar, and radiators so most mass does not have to be shipped from Earth.
    • A lunar mass driver (a linear electric motor or rail gun) could accelerate AI satellites into deep space without rockets, thanks to the moon’s lack of atmosphere and one sixth gravity.
    • Bringing that much mass to the moon would also make it possible for anyone who wants to go to the moon to go, and even live there.
    • Musk stresses none of this requires magic; the AI satellite reuses Starlink V3 solar technology, and he frames the timelines as a best guess rather than a promise.
    • SpaceX has acquired xAI, now referred to as SpaceX AI, folding its AI ambitions directly into the space company.

    Detailed Summary

    The Kardashev Scale and Why Earth Barely Registers

    Musk opens with the question of how you objectively measure a civilization’s progress, the metric an alien species would use to calibrate us. The answer he reaches for is the Kardashev scale, named for the Russian physicist who proposed it, which ranks civilizations by the power they harness: a planet’s worth (Type 1), a star’s worth (Type 2), or a galaxy’s worth (Type 3). Humanity is extremely low even on Type 1. To dramatize the scale of the sun, he notes it is about 99.86% of all the mass in the solar system, with most of the rest being Jupiter and Earth a tiny dust mote in the miscellaneous category. The incident solar energy hitting Earth’s cross section is only about a half billionth of the sun’s total output, and we capture a vanishingly small slice of even that.

    Why Energy at Scale Means Going to Space

    Because roughly 70% of Earth is water and much of the remaining land sits at the poles or in far northern regions where solar is weak and few people live, the usable area for ground solar is small. To reach any meaningful percentage of the sun’s energy, you have to go to space. Musk sets the aspiration at a millionth of the sun’s output as a first “micro” milestone, noting that even 1% would make a civilization vastly more powerful than today’s. Orbit also solves two practical problems at once: you avoid building enormous terrestrial power plants, and cooling becomes easier because waste heat can be radiated straight into the vacuum rather than fought against in an atmosphere.

    The Three Limiting Factors

    Scaling to space based compute comes down to three things: a large mass to orbit capability, a lot of solar power and radiators, and a lot of AI chips. To put a hundred gigawatts and ultimately a terawatt into space, you need a terawatt of solar generation, the radiators to reject the heat, and a terawatt of AI chips. The rest of the conversation works through each limiting factor in turn, starting with the one SpaceX has spent two decades on.

    Starship and the Reusability Breakthrough

    Starship supplies the mass to orbit. Musk argues that full and rapid reusability is the fundamental breakthrough required for both multiplanetary life and climbing the Kardashev scale, since expendable rockets are simply too expensive and you cannot build enough of them. Every other mode of transport, from cars to planes to bicycles, is reusable; rockets are uniquely hard because Earth has a deep gravity well and thick atmosphere, which is why many prior reusable rocket attempts were abandoned. SpaceX pushes mass optimization to the extreme, even catching the booster with the launch tower instead of carrying heavy landing legs. The goal beyond catching the rocket is reflying it with no refurbishment, like an aircraft. Starship V3 already more than doubles the Saturn V’s thrust, V4 will be roughly triple, and the vehicle is the largest and most powerful moving object ever made, targeted to fly more than once per hour. SpaceX already lifts an estimated 85 to 90% of all Earth mass to orbit, and plans to scale from about 2,500 tons per year to millions of tons per year, reaching a million tons per year in roughly three years.

    Inside the AI Satellite (AI1)

    The team explains that a data center in space is not a building with engines bolted on; it reduces to chips plus the power and cooling to run them. The AI satellite, dubbed AI1, is actually simpler than a Starlink satellite because it skips the complex phased array and parabolic antennas, leaving mostly solar cells, a radiator, and some laser links. The draft version targets 150 kW peak power and 120 kW sustained, matching roughly what an Nvidia GB300 rack of 72 GPUs draws. Design assumptions are about 250 watts per square meter of solar array and about 1,400 watts per square meter for double sided radiators oriented knife edge to the sun, both numbers expected to improve. The result is a craft with around a 70 meter wingspan and roughly a terabit of laser connectivity. Compute racks link to each other or to the Starlink constellation by laser, and data reaches the ground via existing Ka and Ku antennas or laser to ground links. From 600 to 800 km up, latency is only about 3 milliseconds, since light travels 300 km per millisecond, so the common worry about high latency does not apply.

    Operating a Constellation of a Million Satellites

    The satellites are large, but space is enormous, so even thousands or up to a million of them would not crowd orbit; viewed against the Earth they are nearly invisible. SpaceX leans on hard won operational experience, with about 10,000 Starlinks already flying and a unique track record of operating constellations at that scale safely. Knowing how tightly satellites can be packed and flown without collisions is treated as the number one constraint when designing the constellation.

    Manufacturing in Bastrop, Texas

    The satellites and solar will be built in Bastrop, Texas, in a facility the hosts describe as already massive and about to be dwarfed by what comes next. A solar manufacturing facility is already under construction, and the AI satellite production building will follow, with both expected to operate at reasonable volume by the end of next year. The same site keeps producing Starlink user terminals and is spinning up new, higher volume lines. Musk projects there could eventually be a few hundred million Starlink terminals, alongside a direct to cell constellation that connects straight from a phone to space for high bandwidth communication.

    Chips, the Terafab, and the Road to a Terawatt

    In the near term, SpaceX simply launches chips that already exist. The current reference design targets Nvidia GB300 or Rubin chips, with a TPU reference design as well, and essentially any existing chip can be flown. The problem is that the chip industry as a whole may only reach about 100 gigawatts a year of AI compute, which does not answer how you get to a terawatt. The answer is a gigantic chip factory, a “Terafab” around 100 million square feet, roughly ten times the size of Tesla Gigafactory Texas, big enough that Musk jokes about needing Starship point to point to cross it. Even with no new fundamental breakthroughs, scaling existing chip technology to a terawatt of output per year is, from a logic die standpoint, like a billion full reticle equivalent chips each running a kilowatt, plus a lot of memory. The stated timeline is an annualized gigawatt per year of space compute by the end of next year, then scaling roughly an order of magnitude per year: about 10 GW in 2.5 years, 100 GW in 3.5 years, and eventually a terawatt per year, which is 1,000 GW, about twice the current electricity consumption of the United States. Musk repeatedly flags these as best guesses, not promises.

    The Moon, a Mass Driver, and the Next 1,000x

    Asked why stop at a terawatt, Musk says a terawatt is actually very small. Getting another three orders of magnitude, a 1,000x jump, points to the moon. The plan is local lunar production of photovoltaics, solar, and radiators, so that most of the mass does not have to be transported from Earth, with chips either shipped up or eventually made on the moon. Because the moon has no atmosphere and only one sixth of Earth’s gravity, you can accelerate AI satellites into deep space without a rocket, using an electromagnetic mass driver, essentially a rail gun or linear electric motor. A side benefit of moving that much mass to the moon is that anyone who wants to go to the moon would be able to, and could even live there. The team closes on the excitement of building a whole new kind of satellite and the sci fi prospect of a mass driver on the moon.

    Notable Quotes

    “We currently use much less than a trillionth of the power output of the sun. And a trillion is a million times a million.”

    Elon Musk, on how far humanity sits from harnessing the sun’s energy

    “The sun is about 99.86% of all mass in the solar system.”

    Elon Musk, dramatizing the scale of the star we orbit

    “You’re an extremely kick-ass civilization if you get to 1% of the sun’s energy.”

    Elon Musk, on what a meaningful Kardashev milestone would look like

    “Reusability is the fundamental breakthrough that is necessary to make life multiplanetary, as well as to ascend the Kardashev scale.”

    Elon Musk, on why Starship matters

    “An AI satellite is essentially a lot of solar cells, a radiator, and you still need some laser links, but you don’t have all of the super complex antennas that you have on a Starlink satellite.”

    Elon Musk, on why the orbital data center is simpler than Starlink

    “There’s not some magic that’s necessary that doesn’t exist for the AI satellites.”

    Elon Musk, on reusing existing Starlink technology

    “We expect that the Terafab is going to be around 100 million square feet, which is 10 times the size of the Tesla Gigafactory Texas.”

    Elon Musk, on the chip factory needed to reach a terawatt

    “The only way that we can really see that you can achieve that is on the moon with a mass driver.”

    Elon Musk, on scaling another 1,000x beyond a terawatt

    Watch the full conversation here: Elon Musk and the SpaceX team on AI satellites and climbing the Kardashev scale.

    Related Reading

    • Kardashev scale (Wikipedia), background on the Type 1, 2, and 3 framework that anchors the entire conversation.
    • Starship (SpaceX), the official page for the fully reusable vehicle behind the mass to orbit numbers.
    • Starlink, the constellation whose solar arrays, laser links, and operations the AI satellites are built on.
    • Mass driver (Wikipedia), the electromagnetic launch concept proposed for flinging satellites off the moon.
    • Nvidia GB300 (Nvidia), the GPU rack whose power profile defines the first AI satellite’s compute target.
  • Jensen Huang on Nvidia’s Supply Chain Moat, TPU Competition, China Export Controls, and Why Nvidia Will Not Become a Cloud (Dwarkesh Podcast Summary)

    TLDW (Too Long, Didn’t Watch)

    Jensen Huang sat down with Dwarkesh Patel for over 90 minutes covering Nvidia’s supply chain dominance, the TPU threat, why Nvidia will not become a hyperscaler, whether the US should sell AI chips to China, and why Nvidia does not pursue multiple chip architectures at once. Jensen framed Nvidia’s entire business as transforming “electrons into tokens” and argued that Nvidia’s real moat is not any single technology but the full stack ecosystem it has built over two decades. He was blunt about his regret over not investing in Anthropic and OpenAI earlier, passionate about keeping the American tech stack dominant worldwide, and dismissive of the idea that China’s chip industry can be meaningfully contained through export controls.

    Key Takeaways

    1. Nvidia’s moat is the ecosystem, not the chip. Jensen repeatedly emphasized that Nvidia’s competitive advantage comes from CUDA, its massive installed base, its deep partnerships across the entire supply chain, and the fact that it operates in every cloud. The moat is not a single product but an interlocking system that took 20+ years to build.

    2. Supply chain bottlenecks are temporary, energy bottlenecks are not. Jensen argued that CoWoS packaging, HBM memory, EUV capacity, and logic fabrication bottlenecks can all be resolved in two to three years with the right demand signal. The real constraint on AI scaling is energy policy, which takes far longer to fix.

    3. TPUs and ASICs are not an existential threat to Nvidia. Jensen was emphatic that no competitor has demonstrated better price-performance or performance-per-watt than Nvidia, and challenged TPU and Trainium to prove otherwise on public benchmarks like InferenceMAX and MLPerf. He described Anthropic as a “unique instance, not a trend” for TPU adoption.

    4. Jensen regrets not investing in Anthropic and OpenAI earlier. He admitted he did not deeply internalize how much capital AI labs needed and that traditional VC funding was not sufficient for companies at that scale. He described this as a clear miss, though he said Nvidia was not in a position to make multi-billion dollar investments at the time.

    5. Nvidia will not become a hyperscaler. Jensen’s philosophy is “do as much as needed, as little as possible.” Building cloud infrastructure is something other companies can do, so Nvidia supports neoclouds like CoreWeave, Nebius, and Nscale instead of competing with them. Nvidia invests in ecosystem partners rather than vertically integrating into cloud services.

    6. Jensen is strongly against US chip export controls on China. This was the longest and most heated segment of the interview. Jensen argued that China already has abundant compute, energy, and AI researchers, and that export controls have accelerated China’s domestic chip industry while causing the US to concede the world’s second-largest technology market. He compared the situation to how US telecom policy allowed Huawei to dominate global telecommunications.

    7. AI will cause software tool usage to skyrocket, not collapse. Jensen pushed back on the narrative that AI will commoditize software companies. He argued that agents will use existing tools at massive scale, causing the number of instances of products like Excel, Synopsys Design Compiler, and other enterprise tools to grow exponentially.

    8. Nvidia does not pick winners among AI labs. Jensen explained that Nvidia invests across multiple foundation model companies simultaneously and refuses to favor any single one. He cited his own company’s unlikely survival story as the reason for this humility: Nvidia’s original graphics architecture was “precisely wrong” and would have been counted out by anyone picking winners.

    9. Nvidia added Groq for premium token economics. Nvidia recently acquired Groq and is folding it into the CUDA ecosystem because the market is now segmenting into different token tiers. Some customers will pay premium prices for faster response times even at lower throughput, creating a new segment of the inference market.

    10. Without AI, Nvidia would still be very large. Jensen was clear that accelerated computing, not AI specifically, is the foundational mission of the company. Molecular dynamics, quantum chemistry, computational lithography, data processing, and physics simulation all benefit from GPU acceleration regardless of deep learning.

    Detailed Summary

    Nvidia’s Real Business: Electrons to Tokens

    Jensen opened the conversation by reframing Nvidia’s entire value proposition. When Dwarkesh suggested that Nvidia is fundamentally a software company that sends a GDS2 file to TSMC for manufacturing, Jensen pushed back hard. He described Nvidia’s job as transforming electrons into tokens, with everything in between representing an “incredible journey” of artistry, engineering, science, and invention. He said the transformation is far from deeply understood and the journey is far from over, making commoditization unlikely.

    Jensen described Nvidia as operating a philosophy of doing “as much as necessary and as little as possible.” Whatever Nvidia does not need to do itself, it partners with someone else and makes it part of the broader ecosystem. This is why Nvidia has what Jensen called probably the largest ecosystem of partners in the industry, spanning the full supply chain upstream and downstream, application developers, model makers, and all five layers of the AI stack.

    On the question of whether AI will commoditize software companies, Jensen offered a contrarian take. He argued that agents are going to use software tools at unprecedented scale, meaning the number of instances of products like Excel, Cadence design tools, and Synopsys compilers will skyrocket. Today the bottleneck is the number of human engineers. Tomorrow, those engineers will be supported by swarms of agents exploring design spaces and using the same tools humans use today. Jensen said the reason this has not happened yet is simply that the agents are not good enough at using tools. That will change.

    The Supply Chain Moat

    Dwarkesh pressed Jensen on Nvidia’s reported $100 billion (and potentially $250 billion) in purchase commitments with foundries, memory manufacturers, and packaging companies. The question was whether Nvidia’s real moat for the next few years is simply locking up scarce upstream components so that no competitor can get the memory and logic they need to build alternative accelerators.

    Jensen confirmed this is a significant advantage but framed it differently. He said Nvidia has made enormous explicit and implicit commitments upstream. The implicit commitments matter just as much: Jensen personally meets with CEOs across the supply chain to explain the scale of the coming AI industry, convince them to invest in capacity, and assure them that Nvidia’s downstream demand is large enough to justify that investment. Nvidia’s GTC conference serves this purpose too, bringing the entire ecosystem together so upstream suppliers can see downstream demand and vice versa.

    Jensen described a process of systematically “prefetching bottlenecks” years in advance. CoWoS advanced packaging was a major bottleneck two years ago, but Nvidia swarmed it with repeated doubling of capacity until TSMC recognized it as mainstream computing technology rather than a specialty product. More recently, Nvidia has invested in the silicon photonics ecosystem through partnerships with Lumentum and Coherent, invented new packaging technologies, licensed patents to keep the supply chain open, and even invested in new testing equipment like double-sided probing.

    When Dwarkesh asked about the ultimate physical bottlenecks, Jensen surprised him. The hardest bottleneck to solve is not CoWoS or HBM or EUV machines. It is plumbers and electricians needed to build data centers. Jensen used this as a launching point to criticize “doomers” who discourage people from pursuing careers in software engineering or radiology, arguing that scaring people out of these professions creates the real bottlenecks.

    On EUV and logic scaling specifically, Jensen was optimistic. He said no supply chain bottleneck lasts longer than two to three years. Once you can build one of something, you can build ten, and once you can build ten, you can build a million. The key is a clear demand signal. If TSMC is convinced of the demand, ASML will produce enough EUV machines. Meanwhile, Nvidia continues to improve computing efficiency by 10x to 50x per generation through architecture, algorithms, and system design.

    The TPU Question

    Dwarkesh pushed hard on whether Google’s TPUs represent a real threat, noting that two of the top three AI models (Claude and Gemini) were trained on TPUs. Jensen drew a sharp distinction between what Nvidia builds and what a TPU is. Nvidia builds accelerated computing, which serves molecular dynamics, quantum chromodynamics, data processing, fluid dynamics, particle physics, and AI. A TPU is a tensor processing unit optimized for matrix multiplies. Nvidia’s market reach is far greater than any TPU or ASIC can possibly have.

    Jensen emphasized programmability as Nvidia’s core architectural advantage. If you want to invent a new attention mechanism, build a hybrid SSM model, fuse diffusion and autoregressive techniques, or disaggregate computation in a novel way, you need a generally programmable architecture. The only way to achieve 10x or 100x performance leaps (versus the roughly 25% per year from Moore’s Law) is to fundamentally change the algorithm, and that requires the flexibility CUDA provides.

    On the specific question of whether hyperscalers with huge engineering teams can simply write their own kernels and bypass CUDA, Jensen acknowledged they do write custom kernels but argued that Nvidia’s engineers still routinely deliver 2x to 3x speedups when they optimize a partner’s stack. He described Nvidia’s GPUs as “F1 racers” that anyone can drive at 100 mph, but extracting peak performance requires deep architectural expertise. Nvidia uses AI itself to generate many of its optimized kernels.

    Jensen was particularly blunt about public benchmarks. He pointed to Dylan Patel’s InferenceMAX benchmark and said neither TPU nor Trainium has been willing to demonstrate their claimed performance advantages on it. He said Nvidia’s performance-per-TCO is the best in the world, “bar none,” and challenged anyone to prove otherwise.

    Regarding Anthropic’s multi-gigawatt deal with Broadcom and Google for TPUs, Jensen called it “a unique instance, not a trend.” He said without Anthropic, there would be essentially no TPU growth and no Trainium growth. He traced this back to his own mistake: when Anthropic and OpenAI needed multi-billion dollar investments from their compute suppliers to get off the ground, Nvidia was not in a position to provide that capital. Google and AWS were, and in return, Anthropic committed to using their compute.

    Nvidia’s Investment Strategy and Regrets

    Jensen was unusually candid about his regret over not investing in foundation model companies earlier. He said he did not deeply internalize how different AI labs were from typical startups. A traditional VC would never put $5 to $10 billion into a single AI lab, but that was exactly what companies like OpenAI and Anthropic needed. By the time Jensen understood this, Nvidia was not in a financial or cultural position to make those kinds of investments.

    Now, Nvidia has invested approximately $30 billion in OpenAI and $10 billion in Anthropic. Jensen said he is delighted to support both and considers their existence essential for the world. But he acknowledged that these investments came at much higher valuations than would have been possible years earlier.

    Jensen explained Nvidia’s broader investment philosophy: support everyone, do not pick winners. He invests in one foundation model company, he invests in all of them. This comes from hard-won humility. When Nvidia started, there were 60 3D graphics companies. Nvidia’s original architecture was “precisely wrong” and the company would have been at the top of most lists to fail. Jensen said he has enough humility from that experience to know that you cannot predict which AI company will ultimately succeed.

    Why Nvidia Will Not Become a Hyperscaler

    Dwarkesh pointed out that Nvidia has the cash to build and operate its own cloud infrastructure, bypassing the middleman ecosystem that converts CapEx into OpEx for AI labs. Jensen rejected this path based on his core operating philosophy.

    If Nvidia did not build its computing platform, NVLink, and the CUDA ecosystem, nobody else would have done it. He is “completely certain” of that. These are things Nvidia must do. But the world has lots of clouds. If Nvidia did not build a cloud, someone else would show up. So the answer is to support the ecosystem instead: invest in CoreWeave, Nscale, Nebius, and others to help them exist and scale, rather than competing with them.

    Jensen was clear that Nvidia is not trying to be in the financing business either. When OpenAI needed a $30 billion investment before its IPO, Nvidia stepped up because OpenAI needed it and Nvidia deeply believed in the company. But these are targeted ecosystem investments, not a strategic pivot into cloud services.

    On GPU allocation during shortages, Jensen pushed back on the narrative that Nvidia strategically “fractures” the market by giving allocations to smaller neoclouds. He said the process is straightforward: you forecast demand, you place a purchase order, and it is first in, first out. Nvidia never changes prices based on demand. Jensen said he prefers to be dependable and serve as the foundation of the industry rather than extracting maximum short-term value.

    The China Debate

    The longest and most heated section of the interview was Jensen’s case against US chip export controls on China. This was a genuine debate, with Dwarkesh pushing the national security argument and Jensen pushing back forcefully.

    Jensen’s core argument rested on several pillars. First, China already has abundant compute. They manufacture 60% or more of the world’s mainstream chips, have massive energy infrastructure (including empty data centers with full power), and employ roughly 50% of the world’s AI researchers. The threshold of compute needed to build models like Anthropic’s Mythos has already been reached and exceeded by China’s existing infrastructure.

    Second, export controls have backfired. They accelerated China’s domestic chip industry, forced their AI ecosystem to optimize for internal architectures instead of the American tech stack, and caused the United States to concede the second-largest technology market in the world. Jensen compared this directly to how US telecom policy allowed Huawei to dominate global telecommunications infrastructure.

    Third, Jensen argued that AI is a five-layer stack (energy, chips, computing platform, models, applications) and the US needs to win at every layer. Fixating on one layer (models) at the expense of another layer (chips) is counterproductive. If Chinese open source AI models end up optimized for non-American hardware and that stack gets exported to the global south, the Middle East, Africa, and Southeast Asia, the US will have lost something far more valuable than whatever marginal compute advantage the export controls provided.

    Dwarkesh countered with the Mythos example: Anthropic’s new model found thousands of high-severity zero-day vulnerabilities across every major operating system and browser, including one that had existed in OpenBSD for 27 years. If China had enough compute to train and deploy a model like Mythos at scale before the US could prepare, the cyber-offensive capabilities would be devastating.

    Jensen’s response was direct. Mythos was trained on “fairly mundane capacity” that is already abundantly available in China. The amount of compute is not the bottleneck for that kind of breakthrough. Great computer science is, and China has no shortage of brilliant AI researchers. He pointed to DeepSeek as evidence: most advances in AI come from algorithmic innovation, not raw hardware. If China’s researchers can achieve breakthroughs like DeepSeek with limited hardware, imagine what they could do with more.

    Jensen also argued for dialogue over confrontation. He said it is essential that American and Chinese AI researchers are talking to each other, and that both countries agree on what AI should not be used for. The idea that you can prevent AI risks by cutting off chip sales, when the real advances come from algorithms and computer science, reflects a fundamental misunderstanding of how AI progress works.

    The debate ended without resolution, but Jensen’s final point was sharp: “I’m not talking to somebody who woke up a loser. That loser attitude, that loser premise, makes no sense to me.”

    Why Not Multiple Chip Architectures?

    Near the end of the interview, Dwarkesh asked why Nvidia does not run multiple parallel chip projects with different architectures, like a Cerebras-style wafer-scale design or a Dojo-style huge package, or even one without CUDA.

    Jensen’s answer was simple: “We don’t have a better idea.” Nvidia simulates all of these alternative approaches in its internal simulators and they are provably worse. The company works on exactly the projects it wants to work on. If the workload were to change dramatically (not just the algorithms, but the actual market shape), Nvidia might add other accelerators.

    In fact, Nvidia recently did exactly this by acquiring Groq. The inference market is now segmenting into different tiers. Some customers will pay premium prices for extremely fast response times even if throughput is lower. This creates a new “high ASP token” segment that justifies a different point on the performance curve. But Jensen was clear: if he had more money, he would put it all behind Nvidia’s existing architecture, not diversify into alternatives.

    Nvidia Without AI

    Jensen closed by saying that even if the deep learning revolution had never happened, Nvidia would be “very, very large.” The premise of the company has always been that general-purpose computing cannot scale indefinitely and that domain-specific acceleration is the way forward. Molecular dynamics, seismic processing, image processing, computational lithography, quantum chemistry, and data processing all benefit from GPU acceleration regardless of AI. Jensen said the fundamental promise of accelerated computing has not changed “not even a little bit.”

    Thoughts

    This interview is one of the most revealing Jensen Huang conversations in years, partly because Dwarkesh actually pushes back instead of lobbing softballs. A few things stand out.

    The Anthropic regret is real and significant. Jensen is essentially admitting that Nvidia’s biggest strategic miss of the AI era was not understanding that foundation model companies needed supplier-level capital commitments, not VC funding. The fact that Google and AWS used compute investments to lock in Anthropic’s architecture choices has had downstream consequences that Nvidia is still working to unwind. When Jensen says Anthropic is “a unique instance, not a trend” for TPU adoption, he is simultaneously downplaying the threat and revealing exactly how seriously he takes it.

    The China debate is the highlight. Jensen’s argument is more nuanced than it first appears. He is not saying “sell China everything.” He is saying the current binary approach of near-total restriction has backfired by accelerating China’s domestic chip industry and pushing the Chinese AI ecosystem away from the American tech stack. His comparison to the US telecom industry losing global market share to Huawei is pointed and historically grounded. Whether you agree with his conclusion or not, the framing of AI as a five-layer stack where the US needs to compete at every layer is a useful mental model.

    The “electrons to tokens” framing is Jensen at his best. It is a simple metaphor that captures something genuinely complex about where value is created in the AI supply chain. And his insistence that the transformation is “far from deeply understood” is a subtle way of arguing that Nvidia’s competitive position will be durable because the problem space is not close to being solved.

    The Groq acquisition reveal is interesting for what it signals about the inference market. If Nvidia is creating a separate product tier for premium-priced, low-latency tokens, it suggests the company sees inference economics fragmenting significantly. This aligns with the broader trend of AI becoming an enterprise product where different customers have wildly different willingness to pay based on how they use tokens.

    Finally, Jensen’s refusal to diversify chip architectures is a bold bet. “We simulate it all in our simulator, provably worse” is an incredibly confident statement. History is full of companies that were right until they were not. But Nvidia’s track record of 50x generation-over-generation improvements through co-design across processors, fabric, libraries, and algorithms is hard to argue with. The question is whether the current paradigm of transformer-based models on GPU clusters represents a local or global optimum for AI compute.

  • All-In Podcast Recap: Epstein Files, Tether’s Billions, Nvidia Accounting & Poker Psychology

    Live from The Venetian: The Besties break down the Epstein file release, the massive margins of Tether, the Michael Burry vs. Nvidia debate, and a masterclass in risk with Alan Keating.

    In this special live episode recorded during the F1 weekend in Las Vegas, the “Besties” (Chamath Palihapitiya, Jason Calacanis, David Sacks, and David Friedberg) reunite in person. The agenda is packed: political intrigue surrounding Jeffrey Epstein, the financial dominance of stablecoins, technical debates on AI chip accounting, and high-stakes poker strategy.

    TL;DR: Executive Summary

    The US government has voted nearly unanimously to release the Epstein files, leading the hosts to speculate that the lack of leaks points to intelligence agency involvement rather than political dirt on Donald Trump. Chamath details a meeting with Tether CEO Paolo Ardoino, revealing a business holding over $100 billion in US Treasuries with profit margins potentially exceeding 95%. The group then debates Michael Burry’s short position on Nvidia, with Friedberg defending the “useful life” of AI chips under GAAP accounting. Finally, poker legend Alan Keating joins to discuss “soul reading” opponents and mastering fear in high-stakes games.


    Key Takeaways

    • The Epstein Intelligence Theory: The hosts argue that if the files contained damaging information on Donald Trump, it would have been leaked during the Biden administration. The prevailing theory discussed is that Epstein may have been an intelligence asset (CIA/Mossad/Russia), explaining the long-standing secrecy.
    • Tether is a Financial Juggernaut: Tether holds approximately $135 billion in US Treasuries and operates with roughly 100 employees. Chamath estimates the business runs at 95%+ margins, effectively exporting US dollar stability to developing nations while capturing massive interest yields.
    • Nvidia vs. Michael Burry: “The Big Short” investor Michael Burry is shorting the sector, arguing tech companies are “cooking the books” by depreciating AI chips over 6 years when they become obsolete in 3. Friedberg counters that chips retain a “useful life” for inference and background tasks long after they are no longer top-of-the-line.
    • Google Gemini 3: Google has regained the lead on LLM benchmarks with Gemini 3. The conversation highlights a shift toward proprietary silicon (TPUs) and a fragmented chip market, posing a potential long-term risk to Nvidia’s dominance.
    • The “Oppenheimer” Moment: David Friedberg reveals he decided to return as CEO of Oho after watching the movie Oppenheimer, realizing he needed to be an active operator rather than a passive board member.

    Detailed Episode Breakdown

    1. The Epstein Files Release

    In a stunning bipartisan move, the House and Senate voted nearly unanimously to release the Epstein files. The Besties analyzed why this is happening now. Sacks and Chamath suggested that because Epstein was the “most investigated human on earth,” any compromising information regarding Trump would likely have been weaponized politically by now.

    The discussion pivoted to the source of Epstein’s wealth. Chamath noted Epstein managed money for billionaires and charged inexplicable fees for “tax advice”—such as a documented $168 million payment from Apollo’s Leon Black. The hosts speculated that Epstein likely functioned as a spy or asset for intelligence agencies, which would explain the protective layer surrounding the files for so long.

    2. Tether and the Stablecoin Boom

    Chamath shared insights from a dinner with Tether CEO Paolo Ardoino. Tether’s financials are staggering: approximately $135 billion in US Treasuries and billions more in Bitcoin and gold.

    The hosts discussed the utility of stablecoins in high-inflation economies, where locals use USDT to preserve purchasing power. Because Tether earns the interest on the backing treasuries (rather than passing it to the coin holder), and operates with a lean team, the company generates billions in pure profit. Sacks noted that future US regulations might eventually force stablecoin issuers to share that yield with users, but for now, it remains one of the most profitable business models in the world.

    3. Accounting Corner: Is Nvidia Overvalued?

    Michael Burry is shorting the semiconductor sector, claiming companies are inflating earnings by depreciating Nvidia chips over 6 years despite rapid technological obsolescence.

    Friedberg launched a segment dubbed “Accounting Corner” to rebut this. He explained that under GAAP standards, an asset’s useful life is determined by its ability to generate revenue, not just its technological superiority. Even if an H100 chip isn’t the fastest on the market in year 4, it can still run inference models or handle lower-priority compute tasks, justifying the longer depreciation schedule. Chamath added that tech giants monitor “output tokens” closely; if a chip wasn’t profitable, they would simply turn it off.

    4. Poker Strategy with Alan Keating

    The episode concluded with Alan Keating, a high-stakes poker player famous for his loose, aggressive style. Keating explained his philosophy, which relies less on “solvers” (GTO strategy) and more on “soul reading”—navigating the fear and psychology of the table.

    He broke down a famous hand where he beat Doug Polk with a 4-2 offsuit, explaining that he sensed fear in Polk’s betting patterns on the turn. Keating described his approach as finding “beauty in the chaos” and dragging opponents into “deep water” where they are uncomfortable and prone to errors.


    Editorial Thoughts

    This episode marked a distinct shift in the podcast’s tone regarding crypto, moving from general skepticism to a recognition of the sheer scale and utility of stablecoins like Tether. The “Accounting Corner” segment, while technical, provided critical context for investors trying to value the AI stack—suggesting the AI boom has more fundamental accounting support than bears like Burry believe. Finally, the live format from Las Vegas brought a looser, more energetic dynamic to the conversation, highlighting the chemistry that makes the show work.