PJFP.com

Pursuit of Joy, Fulfillment, and Purpose

Tag: Jensen Huang

  • Dan Loeb on Building Third Point’s $25 Billion Investment Empire: AI, Activism, Credit, and the FTX Mistake

    Dan Loeb has spent three decades turning a $3 million fund into Third Point, a roughly $25 billion collection of hedge fund, credit, insurance, and venture businesses. In this Invest Like the Best conversation with Patrick O’Shaughnessy, Loeb walks through how he reinvented his strategy from deep value and event-driven trades into quality and thematic investing, why he now believes every serious investor has to be a technology investor, how he reads the AI cycle and the semiconductor melt-up, where activism and corporate governance still pay, and the single mistake that taught him the most. It is a rare, unhurried look at how a famously sharp-elbowed activist actually thinks about markets, businesses, and people.

    TLDW

    Loeb covers an enormous amount of ground: his daily process for staying ahead of the information firehose, Jensen Huang’s AI stack as a mental model, and why Nvidia, Anthropic, and Elon Musk’s companies are the three most consequential firms he tracks. He traces Third Point’s roots in credit and event-driven investing at Jefferies, the influence of Joel Greenblatt’s “You Can Be a Stock Market Genius,” and his later pivot to quality investing shaped by “The Outsiders” and Lawrence Cunningham’s “Quality Investing.” He argues the AI rally is not a dot-com-style valuation bubble because the leaders generate enormous cash, explains why human judgment and structural market quirks still create alpha, and makes the case that AI will never fully run a capital system. He digs into corporate governance and his father’s influence, the Sotheby’s and Sony activism campaigns, the hard reality of activism in Japan, and what investing in Danaher’s operating system taught him. He names FTX as his hardest lesson, breaks down Third Point’s evolution into a 60-percent-credit platform spanning CLOs, structured credit, reinsurance and annuities, describes how he is pushing his analysts to use AI and Claude daily, and closes on kindness and the friend who let him sleep on a couch before he made it.

    Thoughts

    The most striking thing about Loeb is that he treats his own strategy as a thing to be disrupted rather than defended. He built his reputation on Greenblatt-style special situations, spin-offs, demutualizations, and post-reorg equities bought cheap because of forced selling and sandbagged guidance. Most investors who win that way spend the rest of their careers protecting the formula. Loeb instead watched the people who stayed rigid about deep value and low multiples underperform or disappear, and deliberately retrained himself and his team around business quality and thematic conviction. The willingness to abandon a winning identity is the actual edge here, more than any single trade. It is the rare investor who can say his current strategy would not fit cleanly on a PowerPoint deck and treat that as a feature.

    His AI framing deserves attention because it is unfashionably calm. The bear case on AI is usually about valuation, and Loeb dismantles it on the leaders’ own numbers: these are companies investing off their balance sheets, generating enormous cash, trading at multiples that do not resemble 1999. He was short the dot-com bubble, so he is not a permabull cheering from the sidelines. His real point is subtler, that the danger is expectations, not valuations. The semiconductor index ran up 40 percent on genuinely strong fundamentals, but Micron and Nvidia both put up monster quarters and saw their stocks fall because expectations had simply outrun even great results. That gap between fundamentals and price is where he thinks the human investor still earns a living, precisely because quant strategies, CTAs, and risk-managed pods are forced to sell into weakness rather than buy it.

    The governance material is the most quietly radical part of the conversation. Loeb defends shareholder primacy against the Business Roundtable’s softer stakeholder language, but his argument is not the cartoon version where shareholder value means strip-mining a company. It is that boards have one job, accountability for capital allocation and management, and that vague multi-stakeholder mandates become an excuse for directors to avoid the hard work. His read on bad governance is almost always relational: directors who let loyalty to an underperforming CEO override their duty, or who sit on boards for status and income. The Sotheby’s story is the clean illustration, a centuries-old, high-status business run unprofitably because nobody treated it like a business. Loeb’s pattern is to find the gap between claimed status and actual performance and to raise the social cost of coasting.

    What is genuinely new in Loeb’s posture is how he talks about AI inside his own firm. He is not pitching it as a moat or a headcount-reduction story. He frames Claude and AI tools as a way to make each person a more autonomous self-improver, something that gives back whatever you put into it, with some analysts running agents overnight and burning tokens while he personally uses it more for queries. Coming from a 30-year fundamental investor, the absence of defensiveness is the signal. He pairs it with Brad Gerstner’s nod to “Essentialism”: the firehose is now infinite, so the scarce skill is deciding what is actually relevant. That is a more honest answer to the AI question than either doom or hype.

    Finally, the FTX confession is worth sitting with because of how he frames it. He does not retreat into cynicism about venture or crypto. He notes that Sam Bankman-Fried, fraud aside, had a real nose for value, with stakes in Anthropic, Cursor, and Solana that would have made him a top venture investor of the era. The lesson Loeb extracts is procedural, not philosophical: their due diligence now includes checking bank balances, the most basic verification that would have surfaced the problem. It is a useful reminder that even sophisticated capital can skip boring fundamentals when a company is growing fast and the cap table looks good. The discipline is not in having a grand theory of fraud, it is in never skipping the unglamorous checks.

    Key Takeaways

    • Loeb’s macro focus right now collapses to two variables: where oil goes, dictated by war and geopolitics, and what AI does on the spending and infrastructure front and its impact on society and the economy.
    • He argues you can no longer punt on technology and focus on industrials or consumer; tech is a big, growing, compounding part of the economy that affects everything else, so every investor has to become a tech investor.
    • He uses Jensen Huang’s AI stack as a mental model: power and energy at the bottom, then chips and infrastructure, up through large language models, software, and applications.
    • The three most consequential companies he tracks are Nvidia, Anthropic, and Elon Musk’s companies collectively.
    • Third Point’s roots are in credit and event-driven investing, shaped by his time at Jefferies watching investors like David Tepper before he founded Appaloosa, Eric Mindich at Goldman, and firms like Angelo Gordon and Farallon.
    • Joel Greenblatt’s “You Can Be a Stock Market Genius” was his foundational framework: spin-offs, demutualizations, privatizations, and post-reorg equities where a new, illiquid security gets dumped by holders who will not do the work.
    • Spin-off managers often sandbag guidance because their incentive packages get set at the time of the spin-off, creating a predictable gap between conservative numbers and real value.
    • From 1995 to roughly 2013-2015, event-driven special situations were Third Point’s bread and butter; those opportunities still exist, but the real edge now is overlaying them with a business-quality lens.
    • The pivot to quality and thematic investing was influenced most by “The Outsiders” (capital allocation plus great operations) and Lawrence Cunningham’s “Quality Investing” (high-moat, high-return-on-capital businesses to own for years).
    • AI disruption made last year one of the worst for many apparently high-quality companies, as businesses that looked durable rapidly became less so.
    • Loeb sees the AI rally as fundamentally different from the dot-com bubble: the leaders invest off their balance sheets, generate enormous cash, and do not carry the valuation excess of 1999.
    • The danger in semis is expectations, not valuation: Nvidia and Micron posted spectacular quarters yet saw stocks fall because expectations had outrun even great numbers.
    • Structural forces still create alpha for fundamental investors: quants, CTAs, and multi-strategy pods have risk metrics that force selling on the way down, the opposite of what is rational for long-term holders.
    • He believes AI will not fully run a capital system; private equity, restructurings, creditor committees, and high-touch negotiation will always need humans.
    • His interest in governance came from his father, a securities lawyer and corporate governance expert who sat on the boards of Mattel and Williams-Sonoma and pushed ethical sourcing ahead of his time.
    • Loeb defends shareholder primacy, citing Milton Friedman and Warren Buffett, and criticizes the Business Roundtable’s move away from shareholder value as a distraction from the board’s real duty.
    • Bad governance usually comes from directors letting loyalty to a weak CEO override fiduciary duty, lacking the knowledge to do the job, or serving for status and income.
    • Writing is a core activism lever: great writing is clear thinking, and social pressure through writing and PR is one of the most effective ways to move a board, alongside financial and legal levers.
    • The Sotheby’s campaign targeted a high-status, centuries-old business run unprofitably; Third Point bought 9.9 percent, eventually brought in Tad Smith from MSG, who cleaned up operations and technology before the company sold.
    • Third Point increasingly prefers to back great companies with excellent management and cheer them on rather than hunt for mismanaged businesses, because bad management tends to cluster into a morass.
    • Third Point is a collection of businesses; the flagship hedge fund grew from $3 million to about $9 billion and is roughly 30 percent credit, with the broader firm closer to 60 percent credit.
    • The firm spans a roughly $7 billion CLO business, structured and corporate credit, an insurance company, asbestos liabilities, a small private credit unit, and a venture capital arm.
    • The unifying thread is valuing enterprises across early, mid, and mature stages and investing in whichever fulcrum security offers the best risk-reward, from equity to senior debt.
    • Loeb cites buying Twitter’s financing debt near 96-97 cents at a 12 percent yield when most credit investors were scared, and a difficult xAI debt financing, as examples of cross-discipline conviction.
    • He is the portfolio manager only of the hedge fund; the credit, CLO, structured credit, and high-yield businesses have their own PMs and investment committees he does not sit on.
    • The Sony campaign saw Third Point own up to 7 percent and push to separate the conglomerate; management resisted for years before spinning out the semiconductor and financial services businesses.
    • He learned that activism in Japan is hard, but the government often wants reform; he co-wrote a paper with Larry Lindsey and Niall Ferguson urging corporate governance and return on invested capital as a fourth arrow of Abenomics, picked up as a Wall Street Journal editorial.
    • Investing in Danaher was his most instructive experience, teaching him how the Danaher Business System drives continuous improvement (Kaizen) and how the company celebrates rather than shames underperformance because problems are fixable.
    • FTX was his hardest lesson; it looked great and was verifiable on the blockchain, but was not what it appeared, and now Third Point’s diligence includes checking bank balances.
    • He notes that, fraud aside, Sam Bankman-Fried had a strong nose for value with stakes in Anthropic, Cursor, and Solana.
    • Recent mistakes also include shorts where Third Point thought certain info-services businesses would resist AI disruption; he still expects a shakeout with some phoenixes rising from the ashes.
    • He is pushing his whole team to use AI daily, hiring native computer scientists and system integrators, and describes Claude as a tool that makes you autonomous and gives back whatever you put into it.
    • Third Point’s distinctive edge is optimism about AI creating net jobs and the ability to default into credit investing during stressed times, as it did with investment-grade credit in 2020.
    • Credit is hard to copy because it runs on relationships, not electronic trading; that is why Third Point built into CLOs and eyes the roughly $6 trillion structured credit market rather than treating it as tourism.
    • The great analyst has changed: 20 years ago it was someone who could model fast and crack a complex restructuring (Loeb made a career-defining bet on Drexel Burnham claims); today it is a Gavin Baker type who deeply understands an industry, like the analyst who flew to Texas and realized Casey’s General Stores was really a pizza chain.
    • Outside the US, Loeb is more bullish on Korea, Taiwan, and Japan as hunting grounds, finds Europe tough on regulation (though he owns Rolls-Royce and ASML), and finds the Middle East the most vibrant region.
    • What worries him most is not the business but running out of time for family, surfing, and reading; what excites him is incorporating everything relevant about the world and forming relationships with people building interesting things.
    • His closing reflection is on kindness as a top-tier value, and the friend, Carter, who let him sleep on a couch and seeded his early fund, echoing a Palmer Luckey line that money cannot buy friends who believed in you when you had nothing.

    Detailed Summary

    Staying ahead of the firehose and reading the macro

    Loeb opens by admitting he does not have a perfectly organized system for processing the modern flood of information. He checks the news for what is relevant to the economy and to Third Point’s positions, tries not to obsess over minute-to-minute moves, and leans more tactical than strategic. When people ask him about macro, he says the usual government-reported metrics (growth, unemployment, inflation, rates, currencies, gold, crypto) are trumped right now by two things: where oil goes, which depends on war and geopolitics, and what AI does on the spending and infrastructure side and its impact on society and the economy. To understand technology, he leans on Jensen Huang’s framing of the AI stack and talks to smart people regularly, and he watches three companies above all: Nvidia, Anthropic, and Elon Musk’s companies as a group.

    From event-driven roots to quality investing

    Third Point’s DNA comes from Loeb’s time as a credit investor at Jefferies, where he watched some of the best distressed, event-driven, and risk-arbitrage investors operate, from David Tepper to Eric Mindich to firms like Angelo Gordon and Farallon. His first lens was event-driven: spin-offs, demutualizations, privatizations, and post-reorg equities, where a newly created and illiquid security gets dumped by holders who will not do the work, and management sandbags guidance because incentive packages are set at the spin date. He barely thought about moats or returns on capital; he just wanted to buy something genuinely cheap with those characteristics. That was the firm’s bread and butter from 1995 until roughly 2013-2015. Those opportunities still exist, but Loeb describes deliberately evolving toward business quality and thematic investing, influenced by “The Outsiders” on capital allocation and Lawrence Cunningham’s “Quality Investing” on durable, high-return businesses. He organized the team around industry experts rather than generalists. The twist: AI disruption recently turned many apparently high-quality companies into much lower-quality ones, fast.

    The AI cycle, bubbles, and the human edge

    Loeb resists the bubble narrative. He was short the dot-com bubble and remembers the valuation excess; today’s AI leaders, by contrast, invest off their balance sheets and generate enormous cash, so unless you believe the capex yields no return, the earnings and multiples do not look like 1999. The real driver of volatility, he argues, is expectations: the semiconductor index ran up 40 percent on strong fundamentals, but Nvidia and Micron both delivered blowout quarters and still saw their stocks fall because expectations had run too high. That dynamic is exactly where a fundamental investor earns a living, because quants, CTAs, and risk-managed pods are structurally forced to sell into weakness. He also doubts AI will ever fully run a capital system, since private equity, restructurings, creditor committees, and high-touch credit always need humans. He cites “Reminiscences of a Stock Operator” and Ecclesiastes: there is nothing new under the sun, and human nature, with its bubbles, panics, and extremes, does not change.

    Governance, his father, and the duty of boards

    Loeb traces his governance interest to his father, a securities lawyer and corporate-governance expert who served on the boards of Mattel and Williams-Sonoma and championed ethical sourcing before it was common. He calls the American board system beautiful: directors are answerable to shareholders and accountable for strategy and key financial decisions. Governance breaks down when directors lose sight of their fiduciary duty, lack the knowledge or talent diversity to do the job, or prioritize things other than shareholders. He invokes Milton Friedman and Warren Buffett to argue that caring about communities, employees, and conduct is not inconsistent with shareholder value but part of it, and criticizes the Business Roundtable for muddying the board’s core duty. The most common failure he sees is directors letting loyalty to an underperforming CEO override their duty. Most of the time Third Point redirects existing boards without even taking a seat; the extreme proxy fights are the exception.

    Activism, writing, Sotheby’s, and Sony

    Great writing, Loeb says, is clear thinking and organizing your thoughts to get a desired outcome, and it is one of activism’s most effective levers alongside financial and legal pressure. Social pressure through writing and PR can move a board on its own. He sees a pattern in his campaigns: targets that hold themselves out as high status but are not living up to it. Sotheby’s is the clean example, a centuries-old, high-status business run unprofitably, where Third Point bought 9.9 percent, gave the existing CEO a year, then helped install Tad Smith from MSG, who modernized operations and technology before the company was sold. Sony was a two-act campaign in which Third Point owned up to 7 percent and pushed to break up the conglomerate; he recounts sharing the thesis with Andrew Ross Sorkin at the New York Times under embargo, the panic it caused, and how management resisted for years before spinning out the semiconductor and financial services units. The lesson: activism in Japan is genuinely hard, even though the government wanted reform. He co-authored a paper with Larry Lindsey and Niall Ferguson arguing corporate governance and return on invested capital should be a fourth arrow of Abenomics, which ran as a Wall Street Journal editorial.

    The Danaher operating system

    Loeb calls Danaher his most instructive investment. He and his partner persuaded the company to compress its five-day Danaher Business System training into a single day, and he came away with a deep appreciation for how a real operating system drives continuous improvement. The standout lesson was cultural: Danaher holds people individually accountable, but when it finds someone underperforming it celebrates rather than shames, because the problems are addressable and fixable, and it does this relentlessly across operations and working capital. He also points to the diaspora of Danaher executives, including Larry Culp and the leadership at Ingersoll Rand, as evidence of the system’s depth. The investment worked for about four years before COVID-era order surges and inventory swings turned tailwinds into headwinds; Third Point sold and has recently bought back in modestly.

    The structure of Third Point and the fulcrum security

    Third Point is not one fund but a collection of businesses. The flagship hedge fund grew from $3 million to about $9 billion and is roughly 30 percent credit, generically around 110 percent long and 30-40 percent short on the equity side. Across the firm the credit weight is closer to 60 percent, spanning a roughly $7 billion CLO business, several billion in structured and corporate credit, an insurance company, a couple billion in asbestos liabilities, a small new private credit unit, and a venture arm. The unifying thread is valuing enterprises at any stage and investing in whichever fulcrum security (the one with the best risk-reward) makes sense. Loeb illustrates with Credit Suisse’s takeover by UBS, where the holdco paper proved the fulcrum, and with buying Twitter’s resold financing debt near 96-97 cents at a 12 percent yield when other credit investors were scared, plus a difficult xAI debt financing that few credit people wanted. He pushes back on the idea that he sits atop everything: he is the PM only of the hedge fund, while the other businesses have their own PMs and committees he is not on.

    Insurance, the FTX lesson, and recent mistakes

    Loeb started a Bermuda reinsurance company in 2010, backed by himself, Kelso, and Pinebrook, on a barbell thesis of investing the float in Third Point and treasuries to defer taxes and lever capital. The reinsurance side soured, and about three years ago he concluded they had the right idea but the wrong vehicle, that plain-vanilla annuities (which can only invest in credit) would have fit better. Third Point merged the reinsurer into its UK closed-end fund, Third Point Offshore Investors, reincorporated from Guernsey to Cayman, and repurposed it into an insurance company managing private credit, structured credit, whole-loan mortgages, real estate lending, and investment-grade debt. His hardest lesson was FTX: it looked great, was verifiable on the blockchain, and had a strong cap table, but was not what it seemed; diligence now includes checking bank balances. He notes Sam Bankman-Fried, fraud aside, had a great nose for value (Anthropic, Cursor, Solana). Other recent mistakes were shorts where Third Point bet certain info-services businesses would resist AI disruption; he still expects a shakeout with some survivors rising from the ashes.

    AI inside the firm, the analyst of the future, and kindness

    Loeb is pushing his entire team to use AI daily, hiring native computer scientists and system integrators, and describes Claude as a tool that makes you an autonomous self-improver and gives back whatever you put into it, with some analysts running agents overnight while he uses it more for queries. He pairs this with Brad Gerstner’s recommendation of “Essentialism”: you cannot do it all, so you must decide what is most relevant. The great analyst has changed: 20 years ago it was someone who could model fast and crack a complex restructuring, as Loeb did with the Drexel Burnham bankruptcy claims early in his career; today it is a Gavin Baker type who deeply understands an industry and its technology, like the analyst who flew to Texas and realized Casey’s General Stores was really a pizza chain in disguise. On the rest of the world, he is more bullish on Korea, Taiwan, and Japan, finds Europe tough on regulation (while owning Rolls-Royce and ASML), and finds the Middle East the most vibrant region. He closes on what worries and excites him (time with family, surfing, and reading versus the joy of incorporating everything relevant about the world), and on kindness, crediting his friend Carter, who let him sleep on a couch and seeded his early fund, and echoing Palmer Luckey’s line that money cannot buy friends who believed in you when you had nothing.

    Notable Quotes

    “I think you have to be a tech person today. It’s a big and growing and compounding part of the economy. It affects everything else.”

    Dan Loeb, on why no serious investor can punt on technology anymore

    “Hold on to your seats because things are only going to accelerate from here.”

    Dan Loeb, recounting a 2013 Davos warning about technological change he now applies to AI

    “Maybe that’s where the human element comes in, to understand and to be able to make those tough trading decisions when fundamentals are going one way and stock prices are going the other way, and to be able to take the pain of losses in the short run.”

    Dan Loeb, on where a human investor still has an edge over machines

    “It’s very different from the dot-com bubble, which we were short going into. You don’t have the valuation bubble now on those companies that you had back in those days.”

    Dan Loeb, on why he does not see the AI rally as a 1999-style bubble

    “When they found someone that was underperforming, it was celebrated instead of shamed, because look at all these things you’re doing wrong, we can fix those. And they did.”

    Dan Loeb, on the accountability culture he learned from the Danaher Business System

    “I would have to say our investment in FTX. It looked great. The company was growing fast. We could verify it all on the blockchain.”

    Dan Loeb, naming his hardest investment lesson

    “Be kind to people you have no idea how it will ever benefit you. And sometimes it will and sometimes it won’t.”

    Dan Loeb, on elevating kindness in your hierarchy of values

    “The one thing money doesn’t buy you is friends that believed in you when you had nothing.”

    Dan Loeb, quoting Gavin Baker quoting Palmer Luckey, on the friend who seeded his early fund

    Watch the full conversation between Dan Loeb and Patrick O’Shaughnessy here.

    Related Reading

  • Gavin Baker on Orbital Compute, TSMC, Frontier AI Models, Anthropic’s Vertical Take Off, and the Coming Wafer Shortage

    Gavin Baker, founder and CIO of Atreides Management, returns to Patrick O’Shaughnessy’s Invest Like the Best for his sixth appearance. He calls the current AI moment the most extraordinary moment in the history of capitalism, walks through what Anthropic’s vertical takeoff in revenue actually means, lays out why orbital compute is closer than skeptics believe, dissects the TSMC bottleneck that may be the only thing standing between today’s market and a full-on AI bubble, and rates every hyperscaler on how they have positioned for a world where frontier model providers may stop selling API access altogether.

    TLDW

    Anthropic added eleven billion dollars of ARR in a single month, which is roughly the combined business of Palantir, Snowflake, and Databricks built over a decade. That is the setup. From there Gavin Baker covers the March and April selloff, the contrarian read that a closed Strait of Hormuz was actually bullish for American manufacturing competitiveness, why Anthropic and OpenAI multiples may be misleadingly cheap on an unconstrained run rate basis, why Elon Musk’s discipline on SpaceX valuation created a superpower of permanent access to capital, the practical engineering case for orbital compute as racks in space rather than Pentagon sized space stations, why TSMC’s capacity discipline is the single most important variable in whether the AI cycle becomes a bubble, what Terafab in Texas changes, why the Pareto frontier of AI models has flipped from Google dominance to Anthropic and OpenAI dominance in nine months, the shift from all you can eat AI subscriptions to usage based pricing and what that means for revenue scaling, Richard Sutton’s bitter lesson as the largest risk to the AI trade, why frontier tokens still capture an overwhelming share of economic value, the role of continual learning as the third great open question, why most new chip startups should not try to build a better GPU, why Cerebras did something different and hard, why disaggregated inference may extend GPU useful lives to ten or fifteen years and rescue the private credit industry, why being in the token path is the new venture filter, the new prisoner’s dilemma around releasing frontier models via API, an honest rating of Google, Meta, Amazon, and Microsoft, why personal safety is becoming a real AI era risk, and why he remains an AI optimist maximalist who believes this could be the next Pax Americana.

    Key Takeaways

    • Anthropic added eleven billion dollars of ARR in one month, more than the combined businesses of Palantir, Snowflake, and Databricks built across a decade. There is no precedent for this in the history of capitalism.
    • The SaaS and cloud revolution created between five and ten trillion dollars of value over twenty years. AI is replaying that compression on a timeline measured in months.
    • The March selloff was a drawdown driven by disagreement with price action, not invalidated thesis. That is the kind of drawdown an investor can lean into.
    • Deep Seek Monday in January 2025 was a similar setup. By the day of the selloff, AWS Asia GPU prices had already doubled, GPU availability had fallen, and it was obvious reasoning models would be vastly more compute hungry at inference. The market priced the opposite.
    • The Strait of Hormuz closing was actually positive for America. US natural gas (the primary input into US electricity, which feeds AI) fell twenty percent on Bloomberg while Asian and European natural gas doubled or tripled. American manufacturing competitiveness improved overnight.
    • The US is now the world’s largest producer and exporter of oil and gas. The economy is dramatically less energy intensive than in the 1970s. The shortage trauma comparison does not hold.
    • Tech as a sector traded as cheaply versus the rest of the market in early April as at any point in the last ten years, into the single most bullish moment for AI fundamentals on record.
    • Anthropic is dramatically more capital efficient than OpenAI, having burned roughly eighty percent less to reach a similar revenue scale. They have very different structural returns on invested capital.
    • Anthropic at roughly nine hundred billion for fifty billion of ARR (growing a thousand percent) is striking. Adjusted for compute constraint, the unconstrained run rate could be one hundred fifty to two hundred billion, putting the implied multiple closer to five times.
    • Claude Opus generates roughly seventy percent fewer tokens for the same question than previously, with token quantity tied to answer quality. Subscribers on flat-fee plans are getting a lobotomized model.
    • Elon Musk’s superpower is twenty years of making investors money. He never pushes valuation. SpaceX compounded low thirty percent per year for a decade because Musk treats fair pricing as a sacred covenant.
    • Capitalism will solve the watts shortage. The current bottleneck has shifted from chips and energy to zoning and political approval. Many capex decisions are paused until after the US midterms.
    • The watts shortage probably begins to alleviate in 2027 and 2028. Orbital compute solves it longer term.
    • Orbital compute is not Pentagon sized data centers in space. It is racks in space. A Blackwell rack is three thousand pounds, eight feet tall, four feet deep, three feet wide. SpaceX has shown a satellite roughly that size.
    • The satellites operate in sun synchronous orbit so solar wings (around five hundred feet per side) always face the sun and the radiator on the dark side always points to deep space.
    • Starlink V3 satellites already run at around twenty kilowatts. A Blackwell rack runs at one hundred kilowatts. SpaceX engineers express genuine confidence they have already solved cooling and radiator design at these scales.
    • Racks in space are connected with lasers traveling through vacuum, the same lasers already on every Starlink. SpaceX operates the world’s largest satellite fleet and, via xAI Colossus, the world’s largest data center on Earth.
    • Inference will move to orbit. Training will stay on Earth for a long time. Terrestrial data centers remain valuable for the rest of an investor’s career.
    • The wafer bottleneck is structural and political. TSMC is essentially Taiwan’s GDP, water, and electricity. The leaders see themselves as inheritors of Morris Chang’s sacred legacy and they do not behave like a Western public company.
    • Jensen Huang has never had a contract with TSMC. The relationship is run on handshakes and the assumption that things will be fair over time.
    • If TSMC did everything Jensen wanted, Nvidia could be selling two to three trillion dollars of GPUs in 2026 and 2027. TSMC’s discipline is the single largest factor preventing a true AI bubble.
    • Historically, foundational technologies always get a bubble. Railroads, canals, the internet. The current AI buildout is overwhelmingly funded out of operating cash flow, GPUs are running at one hundred percent utilization, and that is fundamentally different from the year 2000 fiber overbuild.
    • If one of Intel or Samsung Foundry catches up at the leading node, the other will follow, and TSMC’s discipline collapses. Watch TSMC capacity decisions to predict a bubble.
    • Terafab, the SpaceX and Tesla joint venture to build the world’s largest fab in America, has a partnership with Intel that grants access to fifty years of institutional foundry knowledge. The A teams at ASML, KLA, Lam Research, and Applied Materials will follow Elon’s reputation in hardware engineering.
    • The hiring playbook for Terafab includes building Taiwan Town, Japan Town, and Korea Town next to the fab. Recruit the engineers and import their families, their restaurants, and their staff.
    • Frontier tokens still capture an overwhelming share of all economic value created at the model layer. This is surprising and is one of the three big open questions for AI investing.
    • The Pareto frontier of intelligence versus cost has flipped. Nine months ago Google’s TPU dominated every point on the frontier. Today Anthropic and OpenAI dominate, with Grok 4.3 on the frontier and Gemini 3.1 hanging on.
    • Google’s conservative TPU V8 design (partly an attempt to reduce dependence on Broadcom and Nvidia) is the leading explanation for the loss of per token cost leadership.
    • AI pricing is shifting from all you can eat to usage based, mirroring the cellular and long distance industries. Cellular stopped being a great growth industry when it went all you can eat. AI just made the opposite move.
    • OpenAI and Anthropic together could exceed two hundred billion in ARR this year if compute keeps coming online and frontier token pricing holds.
    • The two hundred fifty dollar a month consumer AI plan is no longer enough to evaluate frontier capability. Enterprise plans with usage based billing are required because rate limits are now severe.
    • The three biggest open questions for AI investors are: violation of the bitter lesson via ASI or human ingenuity, whether frontier tokens keep commanding their premium, and when continual learning arrives.
    • Today’s continual learning is crude reinforcement learning during mid training on verifiable tasks. True continual learning means weights updating dynamically, like a human who learns the first time they touch fire.
    • Trying to build a better GPU is a losing strategy. Jensen will copy any one to three percent share design. Startups should target one percent share, do something different, and make it hard enough that Nvidia cannot fast follow.
    • Disaggregated inference (separating prefill and decode) opens new design canvases. Prefill is memory capacity bound. Decode is memory bandwidth bound. Each can be optimized independently.
    • Cerebras did something different and hard with wafer scale computing. Three generations of chips and real grit to get there.
    • Disaggregation of inference may stretch GPU useful lives to ten or fifteen years, dropping financing costs from low sevens to five or six percent, mathematically lowering the cost of the AI buildout and likely saving the private credit industry from its SaaS loan exposure.
    • Sellers of shortage outperform buyers of shortage. But owning the largest installed base of what is currently in shortage (hyperscaler CPU fleets, for example) is also a strong position.
    • Most of the economic value at the application layer of AI has been destroyed, not created. The exceptions are companies in the token path or in niches small enough that frontier labs ignore them.
    • Coding may be the shortest path to ASI. If you can write code, you can write code that does anything. Cursor, Cognition, and Anthropic correctly focused on it.
    • Jensen could probably get close to the frontier with his own Nemotron family of models whenever he wants. The fact that he chooses not to is a strategic decision about not commoditizing his customers.
    • The new prisoner’s dilemma in AI is whether frontier labs release their best model via API. If everyone agrees not to, Chinese open source falls behind. If anyone defects, the defector pulls ahead on revenue and resources, forcing everyone else to defect.
    • Google still owns the largest compute installed base. Without TPU’s prior cost advantage, this matters more. YouTube data has real value in a world of robotics. GCP is going crazy.
    • Meta deserves credit for becoming AI first internally faster than any other internet giant. Musa, their first MSL model, is impressively close to the Pareto frontier.
    • Amazon is strong because of Trainium and robotics driven retail P&L efficiency. Nova is better than it gets credit for.
    • Microsoft flinched on capex in early 2025 and lost position. Satya Nadella’s current decision to use Microsoft compute for Microsoft products rather than reselling to OpenAI is a courageous and probably correct call, even at the cost of an eight hundred dollar stock price.
    • The hyperscalers most engaged with startups are Amazon and Nvidia by a mile, followed by Google. Broadcom is the favorite ASIC partner. AMD, Microsoft, and Meta have minimal startup engagement and that will cost them as the best teams are now at startups.
    • Personal safety in an AI era requires a family or company safe word that cannot be socially engineered. Deepfake voice and video extortion at the speed of FaceTime is already feasible.
    • Ukraine is winning largely on the back of having the best battlefield AI outside America and Israel. Adversaries are starting to internalize what AI dominance means geopolitically.
    • An optimistic read is that this becomes a new Pax Americana, the way the post 1945 American nuclear monopoly was used to rebuild Germany and Japan rather than dominate.
    • AI cured a friend’s daughter’s rare disease by spinning up a research effort that identified a market drug capable of impacting her condition. That is the upside that keeps Gavin an AI optimist maximalist.

    Detailed Summary

    The most extraordinary moment in the history of capitalism

    Gavin’s framing of the current moment is unusually direct. Anthropic added eleven billion dollars of annual recurring revenue in a single month. The three highest profile SaaS companies of the last decade plus, Palantir, Snowflake, and Databricks, took a decade and tens of thousands of employees collectively to build the combined business that Anthropic added in thirty days. He has been investing through every major tech cycle and says there is no historical analog. Not the dotcom era, not the cloud transition, not mobile. This is its own thing.

    The market response, then, was peculiar. The NASDAQ sold off into the single most bullish moment for AI fundamentals on record. Tech traded at roughly its widest discount versus the rest of the market in a decade. Investors who said they wished they had bought into AI during 2022, during COVID, or during Deep Seek Monday got the same valuation setup again in early April, this time with an even clearer inflection.

    Why the Strait of Hormuz closing was secretly bullish for America

    One reason the macro fear in March may have been mispriced is that the same geopolitical event that drove the selloff was, in practice, a relative benefit to the United States. American natural gas, the input into American electricity, which is the input into American AI training and inference, fell roughly twenty percent. Asian and European natural gas prices doubled or tripled. The US emerged with sharply improved relative manufacturing competitiveness, which is exactly what the current administration cares about.

    The 1970s comparison does not hold. The US economy is dramatically less energy intensive, it is now the world’s largest producer and largest exporter of oil and gas, and there are no shortages, only price moves. That backdrop made it easier for disciplined investors to stay focused on AI fundamentals through the volatility.

    Anthropic and OpenAI valuations on an unconstrained run rate

    Anthropic at roughly nine hundred billion for fifty billion of ARR sounds rich until you adjust for the fact that the company is severely compute constrained. Gavin estimates that, unconstrained, Anthropic might be at one hundred fifty to two hundred billion in run rate revenue, putting the implied multiple closer to five times. He also points out that Claude Opus now generates roughly seventy percent fewer tokens for the same question than it used to. Token quantity correlates with answer quality, and Anthropic is rate limiting and shrinking outputs to ration capacity across its user base.

    Anthropic and OpenAI are also structurally very different. Anthropic has burned around eighty percent less cash than OpenAI to reach a comparable revenue scale. That implies very different long term returns on invested capital, though OpenAI has done a better job locking in compute and Sarah Friar is one of the most exceptional CFOs Gavin has worked with.

    Why neither lab is raising at a three trillion dollar valuation

    The answer Gavin gives is that both labs are deliberately leaving valuation on the table the way Elon has done for two decades. SpaceX compounded at low thirty percent annually for a decade because Elon never pushed price. The result is a permanent superpower of access to capital. Investors trust him because they have made money with him for twenty years. That is a moat that compounds with every round.

    Anthropic could probably raise at a one hundred percent premium to its rumored latest mark. They are choosing not to. In an uncertain world (Ukraine, Russia, Iran, Taiwan), preserving the ability to raise more capital later at fair prices is more valuable than maximizing this round.

    Watts and wafers, the two real constraints

    Capitalism is solving the watts problem. The leading PE infrastructure investors now say zoning and political approval, not chips or energy, are the gating factors. Companies are deferring big capex announcements until after the US midterms. Turbine capacity is being doubled at the manufacturers. Companies like Boom Aerospace are repurposing jet engines for grid use. Watts probably ease meaningfully in 2027 and 2028 and then orbital compute does the rest.

    Wafers are the harder problem because they live in Taiwan, run on handshakes, and depend on a corporate culture that does not respond to public market incentives. TSMC is essentially the GDP, water consumption, and electricity consumption of Taiwan. Its leadership treats the company as the legacy of Morris Chang. The Silicon Shield doctrine is real and internal.

    Orbital compute as racks in space

    The biggest mental update Gavin asks listeners to make is to stop picturing data centers in space as Pentagon sized space stations. A Blackwell rack is three thousand pounds and roughly the size of a refrigerator. SpaceX has shown a concept satellite of about that size. Solar wings extend five hundred feet to each side and the radiator extends hundreds of feet behind, both possible because the orbit is sun synchronous and the orientation is fixed relative to the sun.

    SpaceX engineers Gavin has spoken to at Starbase express genuine confidence that they have solved cooling at these power levels. They have. Starlink V3 satellites already operate at twenty kilowatts. A Blackwell rack is one hundred kilowatts. The same company operates the world’s largest satellite fleet and the world’s largest data center on Earth via xAI Colossus. The racks are connected to each other with lasers traveling through vacuum, technology already deployed in every Starlink. The naysayers, Gavin observes, are armchair skeptics and Larry Ellison’s response (he is out there landing rockets, no one else is) is the right frame.

    Terafab in Texas and the threat to TSMC’s discipline

    Terafab, the SpaceX and Tesla joint venture, intends to be the largest fab in the world. The partnership with Intel grants access to fifty years of foundry institutional knowledge, allowing Terafab to start three to five quarters behind the leading node rather than fifteen years behind. The A teams at the semicap equipment companies (ASML, KLA, Lam Research, Applied Materials) will follow Elon’s reputation in hardware engineering the same way they followed TSMC twenty years ago when Intel stumbled.

    The talent strategy is the part most observers underestimate. Recruit the best engineers globally, then import their families, their restaurants, their staff. Build Taiwan Town, Japan Town, and Korea Town next to the fab. Optimize the human experience for the people whose work matters. Intel and Samsung do not think that way.

    Bubble watch and the year 2000 comparison

    Every foundational technology in modern history has had a bubble. Railroads, canals, the internet. Carlota Perez documented why. Markets correctly identify the importance, diversity of opinion collapses, supply gets ahead of demand, the bubble crashes. The current cycle has two important differences. The buildout is overwhelmingly funded out of operating cash flow, not debt. Every GPU is running at one hundred percent utilization, while at the peak of the fiber bubble ninety nine percent of fiber was unused.

    TSMC discipline is the single largest reason a bubble has not formed. If Jensen could buy everything TSMC could theoretically make, Nvidia could sell two to three trillion dollars of GPUs in 2026 and 2027. At some point that becomes more than the market can absorb. If Intel or Samsung Foundry catches up at the leading node, the other will too. TSMC’s pricing discipline collapses and the bubble starts.

    The Pareto frontier and the loss of Google’s cost advantage

    The most important chart in AI is the Pareto frontier of model intelligence versus per token cost. Nine months ago, Google’s TPU based models dominated every point on it. OpenAI, Anthropic, and xAI sat inside the frontier. Today the frontier is dominated by Anthropic and OpenAI, with Grok 4.3 on the frontier and Gemini 3.1 hanging on by subsidization more than economics. The most likely cause is Google’s conservative TPU V8 design, an attempt to reduce dependence on Broadcom and Nvidia that sacrificed per token economics.

    The bitter lesson, frontier tokens, and continual learning

    Three open questions dominate AI investing. The first is whether Richard Sutton’s bitter lesson (more compute beats human algorithmic cleverness) gets violated by ASI itself optimizing for efficiency. Closer observers of AI are more skeptical of a violation. Gavin thinks ASI’s first move will be to make itself more efficient and more resourced, which is technically a temporary violation.

    The second is whether frontier tokens keep capturing the overwhelming share of economic value at the model layer. Today they do, surprisingly. Gemini 3.1 Pro was mindblowing nine months ago and is intolerable today. The third is when continual learning arrives. Today’s models need a million fire touches to learn what a human learns from one. True continual learning would mean dynamic weight updates in real time and would produce a fast takeoff.

    From all you can eat to usage based AI pricing

    AI is shifting from flat fee plans to usage based pricing. The historical analogy is cellular and long distance. Both stopped being great growth industries when they went all you can eat. AI just made the opposite move. The consequence is that flat fee subscribers, even on premium consumer plans, get a rate limited and token throttled version of the frontier model. Enterprise plans with usage based billing are now required to evaluate true capability. Gavin thinks the combination of new compute coming online and usage based pricing is what gets OpenAI and Anthropic past two hundred billion in combined ARR this year.

    Chip startups, prefill decode disaggregation, and Cerebras

    Trying to build a better GPU is the wrong move. The four scaled players (Nvidia, AMD, Trainium, TPU) have copy capability for any one to three percent share design that looks attractive. The good news for startups is that disaggregated inference (separating prefill and decode) opens a richer design canvas. Prefill is memory capacity bound. Decode is memory bandwidth bound. Each can be optimized independently. Andrew Fox’s analogy is a British naval ship of the eighteenth century. Prefill is loading the cannon. Decode is firing it.

    Cerebras is the model. Wafer scale computing is genuinely different and genuinely hard. It took three generations of chips to get right. Andrew Feldman and his team had the grit to keep going through chip one being a failure. The design has a high ratio of on chip compute and memory relative to shoreline IO, which is why Cerebras is now experimenting with putting an optical wafer on top of the compute wafer to solve scale out.

    GPU useful lives and the rescue of private credit

    One of the strongest claims in the conversation is that disaggregated inference will stretch GPU useful lives to ten or fifteen years. The skeptical narrative (GPUs are obsolete in two years, companies are cooking their depreciation books) is wrong. You can put a Cerebras system or Groq LPU in front of older Hopper or Ampere parts, use them only for prefill, and run them until they physically melt. Private credit, which is in pain from SaaS loans and which underwrote GPU loans on three to four year lives, may be saved by this.

    If GPU financing rates can come down from low sevens to five or six percent, the mathematics of the AI buildout improves materially. That is a structural tailwind that compounds for years.

    The application layer, the token path, and a new prisoner’s dilemma

    Trillions of dollars of value have been destroyed at the application layer, not created. Cursor and Cognition are the rare scaled exceptions, and they got there by focusing on coding very early. As Amjad Masad noted, coding is plausibly the shortest path to ASI because a coding agent can write itself into any new domain. Jamin Ball’s frame is that the new venture filter is whether the company is in the token path. Data Bricks is. Most application layer startups are not.

    Jensen could probably get close to the frontier with Nemotron whenever he wants, and the strategic question of whether to do that is a new prisoner’s dilemma. If every frontier lab agrees not to release best models via API, Chinese open source falls steadily behind. If anyone defects, the defector gains revenue and resources, and everyone else has to defect. The same dynamic exists between TSMC, Intel, and Samsung. If Nvidia or AMD ever truly used an alternative foundry, that foundry would catch up rapidly.

    Rating the hyperscalers

    Google has the largest compute installed base, the YouTube data that matters in a robotics world, and a search business that prints. Their loss of TPU cost leadership is the surprise of the year. If Google IO in five days does not produce a leapfrog model, the Nvidia centric narrative gets even stronger.

    Meta deserves real credit. Zuckerberg made Meta AI first internally faster than any other internet giant, paid up for the talent contracts when no one else would, and shipped Musa as a first model from MSL that is close to the Pareto frontier. Amazon is well positioned on Trainium, robotics in retail, and a Nova model line that is better than it gets credit for. Microsoft flinched on capex in early 2025 and lost position. Satya Nadella’s current decision to use Microsoft compute for Copilot rather than reselling to OpenAI is courageous and probably correct, even at the cost of stock price.

    The most interesting cross hyperscaler metric is startup engagement. Nvidia and Amazon engage deeply with startups. Google is next. Broadcom is the favored ASIC partner. AMD, Microsoft, and Meta have minimal startup engagement, which Gavin believes will cost them as the best teams now sit at startups.

    Personal safety, geopolitics, and the Pax Americana case

    The closing section turns darker. Personal safety in an AI era requires a family or company safe word that cannot be socially engineered. Deepfake voice and video extortion via something that looks exactly like your child calling on FaceTime is already feasible. Political violence against AI leaders is a real concern. Geopolitically, Ukraine is winning largely because it has the best battlefield AI outside America and Israel. How adversaries respond to that asymmetry is the next great variable.

    Gavin’s optimistic frame is the Pax Americana. After 1945 the US had a nuclear monopoly and could have controlled the world. Instead it rebuilt Germany and Japan, both of which became the most reliable American allies for the next eighty years. If AI dominance plays out similarly, this is a generationally positive story rather than a destabilizing one. The personal anecdote that closes the conversation is a friend whose daughter was diagnosed with a rare genetic condition. He spun up agents, identified a drug already on the market that addresses her mutation, and her life is immeasurably different because of AI. That is the upside.

    Thoughts

    The Anthropic eleven billion in a month framing is the kind of stat that resets priors. The right way to interpret it is not as a one off but as a measure of how fast value can compound when the underlying technology improves on a curve steeper than the ability of the rest of the economy to absorb it. The skeptical question is whether that ARR is durable or whether it is heavily tied to a customer base of other AI companies that are themselves on a single venture funded year of runway. The bullish answer is that frontier coding, frontier research, and frontier enterprise tasks are not going to stop being valuable, and Anthropic is the best at all three. Both can be true. The number is still extraordinary.

    The argument that TSMC discipline is the only thing preventing a bubble is the analytically tightest part of the conversation. The implied trade is to watch TSMC capacity additions like a hawk and to be more, not less, cautious if Intel Foundry or Samsung Foundry ever announce real share at the leading node. The Terafab thesis is more speculative but more interesting. If Elon’s talent recruiting playbook works and the Intel partnership gives Terafab a real seat at the table within five years, the geometry of the global semiconductor industry shifts in a way that is bullish for American manufacturing, bullish for power and water infrastructure in Texas, and ambiguous for TSMC itself.

    The Pareto frontier discussion deserves more attention than it usually gets. Pricing leadership in AI is not a vanity metric. It determines who can subsidize free tier usage, who can absorb compute shortages, who can ship cheaper enterprise plans, and ultimately whose model becomes the default for any given workload. Google losing per token leadership in nine months is one of the most under analyzed events in the sector and it explains a lot about why Anthropic and OpenAI are growing the way they are. If Google IO does not produce a leapfrog model, the implied verdict on TPU V8 design choices gets a lot harsher.

    The application layer destruction point is worth sitting with. Founders building on top of frontier models are competing in a world where the model itself moves faster than any moat they can build, where the model lab can absorb their niche if it gets interesting, and where the only protection is either deep token path integration or a niche so small the lab does not bother. That is a much harsher venture environment than the early SaaS era. The compensating opportunity is that one human can now run a hundred agents, so the ceiling on what a small team can build is correspondingly higher. The bet is that productivity per founder rises faster than competitive pressure from the labs. We will find out.

    The orbital compute pitch is the section that will polarize listeners. The naive read is that this is science fiction. The closer read is that every component (sun synchronous orbit, laser interconnect, twenty kilowatt satellite buses, ten thousand satellite manufacturing cadence, full rocket reusability) already exists. The remaining engineering problems are repair, maintenance, and radiator scale, all of which are real but tractable on a five to ten year horizon. The strategic implication is that the political and zoning ceiling on terrestrial data centers becomes less binding if orbital compute is a credible alternative for inference workloads. The investor implication is that being short the watts and cooling complex on a five year horizon is a real trade, not a meme.

    Watch the full conversation here.

  • Jensen Huang at Stanford CS153 Frontier Systems on Co-Design, Agentic Computing, Vera Rubin, Open Models, and the Million-X Decade That Reshaped AI Infrastructure

    https://www.youtube.com/watch?v=tsQB0n0YV3k

    NVIDIA CEO Jensen Huang returned to Stanford for the CS153 Frontier Systems class (the room nicknamed itself “AI Coachella”) to lay out, in raw form, how he thinks about the computer being reinvented for the first time in over sixty years. Across roughly seventy minutes of student questions he walks through the codesign philosophy that gave NVIDIA a million-x decade, the architectural through-line from Hopper to Grace Blackwell to Vera Rubin to Feynman, the case for open source foundation models, the realities of tokens per watt and MFU, energy demand running a thousand times higher, the China and export-control debate, and his own biggest strategic mistakes. Watch the full conversation on YouTube.

    TLDW

    Huang argues every layer of computing has changed: the programming model, the system architecture, the deployment pattern, the economics. Co-design across CPUs, GPUs, networking, storage, switches and compilers gave NVIDIA roughly a million-x speed-up over ten years versus the ten-x Moore’s Law era, and that headroom is what let researchers say “just train on the whole internet.” Hopper was built for pre-training, Grace Blackwell NVLink72 for inference and reasoning (50x over Hopper in two years), Vera Rubin is built for agents that load long memory, call tools and need a low-latency single-threaded CPU bolted directly to the GPU, and Feynman extends that to swarms of agents that spawn sub-agents. Open weights matter because safety, sovereignty (230-plus languages no one else will fund) and domain models for biology, autonomy, robotics and climate need a foundation that NVIDIA is willing to seed. Compute is not really the scarce resource (Huang says place the order and the chips ship), the broken thing is institutional budgeting that can’t put a billion dollars into a shared university supercomputer. Energy demand is heading a thousand times higher and this is finally the moment market forces alone will fund sustainable generation. On geopolitics he rejects the GPUs-as-atomic-bombs framing and warns America will end up like its telecom industry if it cedes two thirds of the world. On career he advises seeking suffering on purpose. On strategy he says observe, reason from first principles, build a mental model, work backwards, minimize opportunity cost, maximize optionality.

    Key Takeaways

    • The computing model has been substantially unchanged since the IBM System 360, sixty-plus years ago. Huang’s first computer architecture book was the System 360 manual. AI is the first true reinvention.
    • Old computing was pre-recorded retrieval. New computing is generated, contextually aware and continuous. Cloud was on-demand. Agentic systems run continuously.
    • Codesign is NVIDIA’s central thesis. Inherited from the Hennessy and Patterson RISC era at Stanford, extended across CPUs, GPUs, networking, switches, storage, compilers and frameworks all optimized together.
    • The result of full-stack codesign: roughly 1,000,000x faster compute over ten years, versus a generous 10x to 100x for Moore’s Law in the same period. Dennard scaling effectively ended a decade ago.
    • That million-x speed-up is what unlocked “train on all of the internet” as a realistic AI strategy.
    • After GPT, Huang says it was obvious thinking was next. Reasoning is just generating tokens consumed internally, then using tools is generating tokens consumed externally. Agentic systems followed predictably.
    • Education needs AI baked into the curriculum, not just taught as a subject. Pre-recorded textbooks cannot keep pace with knowledge being generated in real time.
    • Huang says he cannot learn anymore without AI. He has the AI read the paper, then read every related paper, then become a dedicated researcher he can interrogate.
    • Mead and Conway and the first-principles methodology of semiconductor design are still worth learning even though most of the scaling tricks have been exhausted.
    • NVIDIA itself is one of the largest consumers of Anthropic and OpenAI tokens in the world. One hundred percent of NVIDIA engineers are now agentically supported. Huang recommends Claude and similar tools by name and says open-source downloads will not match the integrated product harness.
    • NVIDIA still invests heavily in open foundation models because language and intelligence represent the codification of human knowledge. Five pillars: Nemotron (language), BioNeMo (biology), Alphamayo (autonomous vehicles), Groot (humanoid robotics) and a climate science model (mesoscale multiphysics).
    • Sovereign language models matter. Roughly 230 world languages will never be a top priority for a commercial frontier lab. Nemotron is near-frontier and fully fine-tunable so any country can adapt it.
    • Safety and security require open weights. You cannot defend against or audit a black box. Transparent systems let researchers interrogate models and let defenders deploy swarms.
    • The future of cyber defense is not bigger-model-versus-bigger-model. It is trillions of cheap fast small models like Nemotron Nano surrounding the threat.
    • Domain models fuse language priors with world models. Alphamayo learned to drive safely on a few million miles instead of billions because it can reason like a human about the road.
    • MFU (Model Flops Utilization) is a misleading metric. Huang says he wants low MFU, because that means he over-provisioned every resource and never gets pinned by Amdahl’s law during a spike.
    • The xAI Memphis cluster running at 11 percent MFU is not necessarily a failure mode. In disaggregated prefill plus decode inference you can deliver very high tokens per watt with very low MFU.
    • The right metric is performance, ultimately tokens per watt as a proxy for intelligence per watt, and even that needs adjustment because not all tokens are equal. Coding tokens are worth more than other tokens.
    • Hopper was designed for pre-training. NVIDIA chose to build multi-billion-dollar systems when the largest existing scientific supercomputer cost $350 million, with no proven customer base. It worked.
    • Grace Blackwell NVLink72 was designed for inference, especially the high-memory-bandwidth decode phase. It is the world’s first rack-scale computer and delivered a 50x speed-up over Hopper in two years, against an expected 2x from Moore’s Law.
    • Vera Rubin is designed for agents. Long-term memory wired into storage and into the GPU fabric, working memory, heavy tool use, and Vera, a CPU optimized for low-latency multi-core single-threaded code so a multi-billion-dollar GPU system does not stall waiting on a slow tool call.
    • Feynman is being shaped for swarms of agents with sub-agents and sub-sub-agents, a recursive software topology that demands a new compute pattern.
    • Tokens per watt improved 50x in one generation. Compounding energy efficiency is the lever NVIDIA controls directly.
    • Total compute energy demand is heading roughly a thousand times higher than today, possibly two orders of magnitude beyond that. Huang says he would not be surprised if the estimate is low.
    • For the first time in history, market forces alone are enough to fund solar, nuclear and grid upgrades. Government subsidies are no longer required to make sustainable energy investment rational.
    • Copper interconnect is becoming a bottleneck. Photonics is moving from optional to structural inside racks and across them.
    • Comparing NVIDIA GPUs to atomic bombs, Huang says, is a stupid analogy. A billion people use NVIDIA GPUs. He advocates them to his family. He does not advocate atomic bombs to anyone.
    • If the United States cedes two thirds of the global market to competitors on policy grounds, the American technology industry will end up like American telecommunications, which was policied out of existence.
    • Huang directly rejects AI doom-by-singularity narratives. It is not true that we have no idea how these systems work. It is not true that the technology becomes infinitely powerful in a nanosecond. He calls the rhetoric irresponsible and harmful to the field students are about to enter.
    • On Stanford specifically: if the university president places an order, NVIDIA will deliver the chips. The bottleneck is that no university department has a billion-dollar compute budget because budgeting is fragmented across grants. Stanford’s $40 billion endowment is more than enough to fix that.
    • “It’s Stanford’s fault” is meant as empowerment. If something is your fault, you can solve it.
    • Career advice: do not optimize purely for passion. Most people do not yet know what they love. Pick the job in front of you and do it as well as possible. Even as CEO, Huang says, 90 percent of the work is hard and he suffers through it.
    • Suffering on purpose builds the muscle of resilience. When the company, the team or the family needs you to be tough, that muscle has to already exist.
    • NVIDIA’s first generation of products was technically wrong in nearly every dimension: curved surfaces instead of triangles, no Z-buffer, forward instead of inverse texture mapping, no floating point. The strategic recovery, not the technology, taught Huang the lessons that have lasted decades.
    • The biggest clean strategic mistake Huang names is the move into mobile chips (Tegra). It grew to a billion dollars then went to zero when Qualcomm’s modem dominance shut NVIDIA out of the 3G to 4G transition. The recovery into automotive and robotics (the Thor chip is the great great great grandson of that mobile lineage) was real, but Huang refuses to rationalize the original choice.
    • Forecasting framework: observe, reason from first principles, ask “so what” and “what next” until you have a mental model of the future, place your company inside that model, then work backwards while minimizing opportunity cost and maximizing optionality.
    • Best part of the CEO job: living at the intersection of vision, strategy and execution surrounded by people capable enough to make ambitious visions real. Worst part: the responsibility for everyone who joined the spaceship, especially in the near-death moments NVIDIA had four or five times early on.
    • Underrated insider note: Huang’s first apple pie with cheese, first hot fudge sandwich and first milkshake all happened at Denny’s. The Superbird, the fried chicken and a custom Superbird-style ham and cheese with tomato and mustard are his order.

    Detailed Summary

    Computing reinvented from the ground up

    Huang frames the moment as the first true rewrite of the computer in sixty-plus years. From the IBM System 360 forward, the mental model of writing code, running code, taking a computer to market and reasoning about applications stayed roughly constant. AI changes the programming model itself. Software is no longer a compiled binary running deterministically on a CPU. It is a neural network running on a GPU producing generated, contextual, real-time output. That cascades into how companies are organized, what tools developers use, what the network and storage stack look like, and what an application is even allowed to do. Robo-taxis, he notes, are an application no one would have attempted before deep learning unlocked perception.

    Codesign and the million-x decade

    Codesign is the philosophical center of the talk. Huang traces it to the RISC work of John Hennessy at Stanford, where simpler instruction sets won by being co-designed with the compiler rather than maximally optimized in isolation. NVIDIA extends the principle across every layer simultaneously: GPU architecture, CPU architecture, NVLink and NVSwitch fabrics, photonic interconnects, networking silicon, storage paths, CUDA libraries, frameworks and ultimately the model design. The numbers Huang gives are arresting. Moore’s Law in its prime delivered roughly 100x per decade. By the time Dennard scaling broke, real-world gains had compressed to roughly 10x. NVIDIA’s codesigned stack delivered between 100,000x and 1,000,000x over the same ten-year window. That non-linear speed-up is, in Huang’s telling, the precondition for modern AI: it is what allowed researchers to stop curating training sets and just feed the entire internet to the model.

    Education has to fuse first principles with AI tools

    Asked how curriculum should evolve, Huang argues AI must be integrated into the learning process, not just taught about. He recalls Hennessy writing his textbook by hand a chapter a week while Huang was a student, and says pre-recorded textbooks cannot keep up with the rate at which AI generates new knowledge. He describes his own learning workflow: hand the paper to an AI, then have it read the entire surrounding literature, then treat the AI as a dedicated researcher who can be interrogated. At the same time he defends the classics. Mead and Conway are still the foundation. Most modern semiconductor scaling tricks have been exhausted, but knowing where the field came from sharpens judgment when designing what comes next.

    Open source and the five domain pillars

    Huang gives one of the most detailed public accounts of why NVIDIA invests so heavily in open foundation models even while being a top customer of closed labs. He recommends Claude and OpenAI by name for production coding work, and says 100 percent of NVIDIA engineers are now agentically supported. The open-weights case rests on three legs. First, language is the codification of intelligence, and there are at least 230 languages that no commercial lab will ever prioritize. Nemotron is built near frontier and released so any country or community can fine-tune it. Second, the same representation-learning approach has to be replicated in domains where the data is not internet text, so NVIDIA seeded BioNeMo for biology, Alphamayo for autonomy, Groot for humanoid robotics and a climate model for mesoscale multiphysics. The economics of those fields would never produce a foundation model on their own. Third, safety and security require transparency. A black box cannot be defended or audited, and the future of cyber defense is not bigger-model-versus-bigger-model but swarms of cheap fast small models like Nemotron Nano surrounding the threat.

    MFU is the wrong metric, tokens per watt is closer

    A student raises the leaked memo that the xAI Memphis cluster is running at 11 percent Model Flops Utilization. Huang flips the framing. He says he would rather be at low MFU all the time, because that means he over-provisioned flops, memory bandwidth, memory capacity and network capacity. Bottlenecks shift constantly, so over-provisioning across every dimension is what lets the system absorb a spike without getting pinned by Amdahl’s law. In disaggregated inference, where prefill and decode are physically separated and decode is bandwidth-bound rather than flop-bound, NVLink72 can deliver extremely high tokens per watt while reporting very low MFU. Huang argues the right framing is performance, and ultimately tokens per watt as a rough proxy for intelligence per watt, adjusted for the fact that not all tokens are equal. A coding token is worth more than a generic token.

    Hopper, Grace Blackwell NVLink72, Vera Rubin, Feynman

    Huang gives the clearest public framing of NVIDIA’s roadmap as a sequence of architectural answers to evolving compute patterns. Hopper was built for pre-training, at a moment when NVIDIA chose to build multi-billion-dollar machines while the largest scientific supercomputer in the world cost $350 million and the marketplace for such systems was, on paper, zero. Grace Blackwell NVLink72 was the answer to inference and reasoning: a rack-scale computer that ganged 72 GPUs together because decode needs aggregate memory bandwidth far beyond a single chip. The generation-over-generation speed-up was 50x in two years, twenty-five times what Moore’s Law would have delivered. Vera Rubin is being built explicitly for agents. Agents load long-term memory from storage that has to be wired directly into the GPU fabric, they use working memory, they call tools that run on a CPU, and they wait. So the CPU has to be Vera, optimized for low-latency single-threaded code, because the multi-billion-dollar GPU system cannot afford to idle waiting on a slow tool call. Feynman extends the pattern to swarms of agents with sub-agents and sub-sub-agents, a recursive software topology that will demand its own compute pattern.

    Energy demand and the grid

    Huang’s energy projection is one of the most aggressive numbers in the talk. NVIDIA can compound tokens per watt by 50x per generation through codesign, but the total compute demand is heading roughly a thousand times higher, and Huang says he would not be surprised if the real figure is one or two orders of magnitude beyond that. The reason is structural: future computing is generative and continuous, not pre-recorded and on-demand. The good news, he argues, is that this is the best moment in the history of humanity to invest in sustainable generation. Market forces alone are now sufficient to fund solar, nuclear and grid upgrades. Government subsidies are no longer required to make the math work.

    Adversarial countries, export controls and the telecom warning

    This is the segment where Huang is visibly fired up. He attacks the GPUs-as-atomic-bombs framing on its face. NVIDIA GPUs power medical imaging, video games and soy sauce delivery. A billion people use them. He advocates them to his family. The analogy collapses at the first comparison. He attacks the second framing, that American companies should not compete abroad because they will lose anyway, as a self-fulfilling defeat. Competition makes the company better. The third framing, that depriving the rest of the world of general-purpose computing benefits the United States, also fails on first principles: it benefits one or two American companies at the cost of an entire industry. The cautionary parallel is telecommunications. The United States once had a leading position in telecom fundamental technology and policied itself out of it. Huang’s worry, voiced explicitly to a room of CS students, is that they will graduate into a shell of a computer industry if the same path is repeated.

    AI doom and rational optimism

    In the same arc Huang rejects the science-fiction framing of AI as a singularity that arrives suddenly on a Wednesday at 7pm and ends civilization. He calls those claims irresponsible, says they are not true, and points out that the people advancing them are believed by audiences who then make policy on that basis. It is not true that no one understands how these systems work. It is not true that intelligence becomes infinitely powerful instantaneously. It is not true that there is no defense. His framing, which the host echoes as “rational optimism,” is that the goal is to create a future where people care about computers because the technology students are learning is worth mastering.

    Stanford’s compute problem is Stanford’s fault

    A student presses on the scarcity of compute for independent researchers, startups and universities inside the United States. Huang’s answer is sharp: there is no shortage. Place the order and the chips will arrive. The actual broken thing is institutional. University grants are fragmented across departments. No researcher can raise enough on a single grant to fund a billion-dollar shared cluster, and no one shares. He compares it to showing up at the grocery store demanding a billion dollars of tomatoes today. The solution is planning, aggregation and a campus-scale supercomputer, the way Stanford once built the linear accelerator. The endowment is $40 billion. Pulling a billion off it, contracting cloud capacity and giving every student and researcher AI supercomputer access is, in Huang’s view, obviously doable. When he says “it is Stanford’s fault” the host laughs, but Huang clarifies: if it is your fault you have the power to fix it.

    Career, suffering and resilience

    Asked how a CS student should spend the next few years, Huang pushes back on the standard “follow your passion” advice. Most people do not know what they love yet, because no one knows what they do not know. The bar of demanding joy from every working day is too high. Whatever the job is, do it as well as you can. Even as CEO of NVIDIA he says he genuinely loves about 10 percent of his work. The other 90 percent is hard and he suffers through it. He recommends suffering on purpose, because resilience is a muscle that only builds under load, and when the company, the team or the family needs that muscle, it has to already exist. Earlier in his life that meant cleaning toilets and busing tables at Denny’s. He does it today running a multi-trillion-dollar company.

    The biggest mistakes

    Huang separates technical mistakes from strategic mistakes. NVIDIA’s first generation of products was technically wrong in almost every way: curved surfaces instead of triangles, no Z-buffer, forward instead of inverse texture mapping, no floating point inside. The company wasted two and a half years. But the strategic genius of the recovery, the reading of the market, the conservation of resources and the reapplication of talent, is what taught him strategy. The clean strategic mistake he names is mobile. NVIDIA’s Tegra line grew to a billion dollars of revenue and then collapsed to zero when Qualcomm’s modem dominance locked NVIDIA out of the 3G to 4G transition. Huang explicitly refuses the comforting rationalization that the Tegra effort fed the Thor automotive chip (“Thor is the great great great grandson”). The original decision, he says, was a waste of time. The lesson is to think one or two clicks further about whether a market is structurally winnable before committing the company.

    Forecasting under fog of war

    The final substantive exchange is on forecasting. Huang’s method has four steps. Observe what is actually happening (AlexNet crushing two decades of computer vision research in one shot, GPT producing reasoning by token generation). Reason from first principles about why it works. Ask “so what” and “what next” recursively until a mental model of the future emerges. Place the company inside that future and work backwards. Crucially, expect to be partly wrong. Some outcomes will absolutely happen, some will likely happen, some might happen, and the strategy has to be robust across that distribution. The real cost of any strategic choice is the opportunity cost of the alternatives you did not take, so the discipline is to minimize that cost and maximize optionality while letting the journey itself pay for the journey.

    Thoughts

    The most useful thing in this conversation is the explicit architectural mapping of compute patterns to chip generations. Hopper for pre-training. Grace Blackwell NVLink72 for inference, because decode is bandwidth-bound and a single chip cannot supply it. Vera Rubin for agents, because tool calls stall multi-billion-dollar GPU systems and so the CPU has to be optimized for low-latency single-threaded code. Feynman for swarms. That sequence is not marketing. It is a falsifiable thesis about where the bottleneck moves next, and every other infrastructure company should be measuring themselves against it. If Huang is right that swarms of sub-agents are the next dominant pattern, then the design pressure shifts from raw flops to fabric topology, memory hierarchy and storage-to-GPU latency. That has implications for everyone downstream, including the hyperscalers building competing accelerators.

    The MFU section is the most intellectually generous moment in the talk. The instinct in the AI ops community has been to chase MFU as if it were a virtue. Huang argues, persuasively, that low MFU is consistent with high tokens per watt in a disaggregated inference setup, and that bottlenecks rotate fast enough that over-provisioning every resource is the rational design. That reframing matters because it changes what “scarce” means. Compute is not scarce in the way the discourse treats it. What is scarce is a coherent system designed end-to-end. The xAI 11 percent number, in that frame, is not embarrassing. It is the natural reading of a workload that is mostly decode.

    The Stanford segment is the part most likely to be quoted out of context. “It’s Stanford’s fault” is a deliberately provocative line, but the underlying claim is correct and load-bearing. Compute is not gated by NVIDIA refusing to ship chips. It is gated by the fact that fragmented grant funding cannot aggregate into the billion-dollar order that NVIDIA can fulfill. The implication is that universities and national labs need a structural change in how they pool capital for compute, and that the current model of every researcher buying a handful of cards is genuinely obsolete. Huang’s nudge about pulling a billion off the endowment is concrete enough to be acted on, and other major research universities should read this segment as a direct prompt.

    The geopolitical segment is the highest-stakes one. The telecommunications comparison is correct as a historical pattern, and Huang is one of the very few executives in a position to deliver that warning credibly. The unresolved tension is that the argument applies symmetrically. If American AI dominance is built by selling globally, that includes selling into adversarial states, and the policy question is where the line falls. Huang does not answer that question. He attacks the framing that lets the question be answered badly. That is a meaningful contribution to the discourse even if it does not resolve the underlying tradeoff.

    The career advice section is the part the social-media clips will mishandle. “Seek suffering” reads as macho when extracted. In context it is a specific operational claim about how resilience compounds, and it is paired with the Tegra story where Huang himself paid the price of not thinking one more click ahead. That kind of self-implication is rare in CEO talks, and it is the reason the talk is worth listening to in full rather than only reading the recap.

    Watch the full Stanford CS153 Frontier Systems conversation with Jensen Huang here.

  • Bubbles, Parabolas and Speed Crashes: How AI Agents Are Ending Human Market Structure and Why This Is Not the Dot-Com Bubble

    The host opens this Saturday morning macro and AI markets video with a direct challenge to anyone calling the current move a bubble. The argument is that the market structure itself has changed, that AI agents now dominate trading and capital allocation, and that Charles Kindleberger’s Manias, Panics, and Crashes describes a world that no longer exists. The full hour-long conversation walks through earnings, PEG ratios, capex, the benchmark arbitrage trapping passive investors, the inflation regime shift, and where money is rotating now. Watch the original video here.

    TLDW

    AI is not a bubble in the Kindleberger sense because the market is no longer dominated by emotional human professionals. AI agents, retail risk-takers, and passive flows are reshaping price discovery while the spend is being funded by free cash flow from the most cash-rich companies in history, not bond-issuance manias like telecoms or oil. Earnings growth is 27 percent, semiconductor sales grew 88 percent year over year in March, OpenAI and Anthropic revenue is on near-vertical curves, Nvidia’s PE is at decade lows even as Cisco’s was 130 at the dot-com peak, and the PEG ratio for the S&P sits at 1.03 with one third of the host’s thematic basket under 1.0 while Microsoft, Amazon, Meta, Apple, and Alphabet all carry richer PEGs. The new regime brings speed crashes instead of multi-year recessions, persistent bottlenecks in power, chips, transportation, and chemicals, inflation pressure that pushes three-month bills below CPI for the first time since the inflation era, and a benchmark arbitrage forcing passive money to chase AI exposure. The host is selling two thirds of his Micron, rotating into Nvidia, Vistra, silver, Bitcoin, and Ethereum, and warning that tokenization launches scheduled for July 26 will be the next major regime change.

    Key Takeaways

    • The word bubble is being misapplied because the same people calling AI a bubble called QE, tariffs, oil, Bitcoin, and passive investing bubbles for fifteen years and were wrong every time.
    • Kindleberger’s Manias, Panics, and Crashes described a slow, linear, human-emotion-driven world. AI agents have no emotion, no memory of Druckenmiller’s 2000 top, and one goal: make money.
    • The simplest test for anyone bearish on AI is to ask how much they use artificial intelligence. If they have not used a tool like OpenClaw or similar agentic systems, they are still operating in the old market regime.
    • This buildout is funded by free cash flow and bond issuance at yields better than US Treasuries from companies with stronger balance sheets than the federal government, unlike the dot-com telecoms or 1970s oil majors.
    • The S&P 500 is up only 7 percent year to date. The bubble framing is being applied to a handful of names, not to broad indices that remain reasonably valued.
    • The agentic stage of AI started in late November and accelerated when OpenClaw went viral at the end of January. Token consumption is set to grow 15 to 50 times from the IQ stage.
    • Anthropic revenue is stair-stepping from 5 to 7 to 9 to 14 to 19 to 24 to 30 billion in annualized run rate, on pace to surpass Alphabet in revenue by mid-2028.
    • OpenAI’s backlog hit 1.3 to 1.4 trillion in the most recent earnings cycle and the company still does not have enough compute.
    • Dario Amodei told the world Anthropic was planning for 10 times growth per year. In Q1 they saw 80 times annualized growth, which is why compute is bottlenecked and Anthropic is renting from Amazon, Google, and Colossus.
    • S&P 500 earnings growth is 27.1 percent year over year. The only quarters that match are those coming out of recessions, and this is not a reopening trade.
    • 320 of 500 S&P companies have reported and the average earnings surprise is 20 percent. Forward estimates are up 25 percent year over year as analysts revise upward against the historical pattern.
    • Total semiconductor sales grew 88 percent year over year in March. Semis have moved in proportion to earnings, not in excess of them.
    • Cisco’s PE was 130 at the dot-com peak. Nvidia’s PE today is the lowest of the last decade because professionals cannot run concentrated positions in single names.
    • The Edward Yardeni PEG ratio for the S&P is 1.03. The hyperscalers are not cheap on PEG: Microsoft 1.4, Amazon 1.66, Meta 1.96, Apple 3, Alphabet near 5. Thirty of ninety-five names in the host’s thematic portfolio carry PEGs under 1.0.
    • Passive investing creates a benchmark arbitrage. Everyone long the S&P 500 through index funds is structurally underweight Intel, Nvidia, Micron, and every name actually going up. Pension funds and mutual funds are forced to chase AI exposure to keep up.
    • BlackRock’s Tony Kim at the Milken conference: compute and model layers added 8 trillion in market cap year to date while the service apps that make up two thirds of GDP lost 1.2 trillion. The benchmark arbitrage is already running.
    • Larry Fink predicted a futures market for computing power. Power plus chips is the oil of the intelligence economy.
    • Jensen Huang called this a 90 trillion dollar AI physical upgrade cycle. The one big beautiful bill bonus depreciation provision was designed to incentivize this capex magic.
    • The host is selling two thirds of his Micron position. The reasoning is the memory market started moving in September of last year, the DRAM ETF is the ninth most traded ETF with billion dollar daily volumes, and exhaustion indicators are flashing red.
    • Money from Micron is rotating into Nvidia, Vistra, silver, Bitcoin, and Ethereum. The view is that the energy and power side of the AI stack is lagging the semis and will catch up next.
    • Silver versus gold has not moved while Micron has gone parabolic. LME metals are breaking out. China is increasing gold purchases significantly month over month.
    • The expected CPI print of 3.7 percent will put three-month Treasury bills below CPI for the first time since the post-pandemic inflation era. That is when Bitcoin started its last major run.
    • Logistics Managers Index hit 69.9 in March, the fastest expansion since March 2022. Transportation prices are surging because there is no capacity. This typically only happens during tax cuts or post-COVID reopenings.
    • Payroll job creation in information, professional services, and financial activities is negative. AI is already replacing knowledge work. Job creation has shifted to mining, manufacturing, construction, trade, transportation, and utilities, which is structurally inflationary.
    • Whirlpool says appliance demand is at great financial crisis lows. The consumer PC and laptop market collapse is worse than 2008. AI is pulling capital and pricing power away from legacy consumer categories.
    • Mike Wilson’s data shows reacceleration across sectors, not just large cap tech. Small caps and median stocks are showing earnings growth too, just at smaller market caps.
    • Chevron’s CEO says global oil shortages are starting. Jeff Currie warns US storage tanks will run empty. Ships are still not transiting the Strait of Hormuz. Countries that learned this lesson will restock to higher inventory levels permanently.
    • The Renmac Bubble Watch threshold was crossed on a technical basis. The host considers technical exhaustion a stronger signal than narrative-driven bubble calls.
    • Goldman Sachs power demand reports, Guggenheim warnings on the power crunch, and BlackRock’s compute intensity research all triangulate on the same conclusion: capex needs are larger than current forecasts.
    • The thematic portfolio is up roughly 30 percent from March lows. Power, optical fiber, advanced packaging, chemicals, and rack-level infrastructure baskets are leading.
    • Sterling Infrastructure (STRL), Fluence batteries, ABB electrification, Hon Hai (Foxconn), Vistra, Eaton, and Soitec are highlighted as names lagging the megacaps but inside the same AI infrastructure trade.
    • John Roque at 22V Research is releasing weekly frozen rope charts, long-base breakouts across power, copper, grid equipment, utilities, natural gas, transportation, capital goods, and agriculture. They all map to the same AI plus inflation regime.
    • Bitcoin ETF outstanding shares hit new highs. BlackRock, Morgan Stanley, and Goldman are all running competitive products. Boomer and wealth manager allocation is accelerating into year end.
    • Tokenization rolls out July 26. Wall Street clearing has enlisted 50 firms. A16Z published their case in December 2024. The host considers this underweighted by most investors and is speaking on the topic at the II event in Fort Lauderdale.
    • Raoul Pal and Yoni Assia on the end of human trading: AI agents and crypto collide by moving finance from human speed to machine speed. Agents will trade, allocate, hedge, and shift capital through wallets and exchanges. Tokenization means ownership becomes programmable.
    • The new regime is bubbles, parabolas, and speed crashes. Corrections compress from years into months. The right strategy is to never go to cash, only to rebalance and slow down within the portfolio.
    • For traders, exhaustion indicators using 5-day and 14-day RSI plus DeMark signals identify potential speed crash setups. Intel and Micron are flashing red on those screens right now.

    Detailed Summary

    Why this is not Kindleberger’s world anymore

    The framing argument of the video is that Manias, Panics, and Crashes described a market dominated by human professionals operating with limited information and lagged feedback loops. When supply and demand fell out of sync, prices collapsed because nobody could see what was happening in real time. That world is gone. AI agents now manage a majority of professional fund flows. Information moves instantaneously. Retail investors trade differently than institutional pros, and the capital structure of the entire market has changed. The host argues that since the Great Financial Crisis, the combination of QE and exponential corporate growth produced the only companies in history worth 25 trillion dollars combined with no net debt. Their AI capex is funded by free cash flow and high-grade bonds, not panicked bond issuance like the dot-com telecoms or oil majors of the 1970s.

    The Druckenmiller anchor and why FOMO is the wrong lens

    The video reads the Stanley Druckenmiller story of buying six billion in tech at the 2000 top and losing three billion in six weeks. Every professional carries that scar. It has shaped a generation of money managers into seeing parabolic moves and immediately calling bubble. The host’s counter is that recession calls from wealthy professionals are themselves a form of hope. Cash-rich investors root for crashes because crashes give them entry points. If the bubble never breaks the way it broke in 2000, those investors stay locked out, and that is precisely what the AI regime is doing.

    Earnings, revenue, and the reality test

    The video walks through current numbers in detail. S&P 500 earnings growth is running 27.1 percent year over year, which only happens coming out of recessions. 320 companies have reported with an average 20 percent earnings surprise. Forward estimates were revised up 25 percent year over year, well above the historical pattern of starting-year estimates getting cut. Total semiconductor sales were up 88 percent year over year in March. Anthropic’s revenue trajectory is stair-stepping from 5 to 30 billion in annualized run rate on the back of Claude Opus 4.5, putting it on track to surpass Alphabet by mid-2028. OpenAI is sitting on a 1.3 to 1.4 trillion backlog and still cannot get enough compute. Dario Amodei told the public Anthropic planned for 10 times growth per year and saw 80 times in Q1.

    PE, PEG, and the valuation argument

    Cisco’s PE at the dot-com peak was 130. Nvidia, the indisputable lead dog of the AI buildout, currently has a PE at the lowest of its last decade. The S&P 500’s PE is roughly where it has been since the post-COVID money printing era, far below the dot-com peak. Edward Yardeni’s PEG ratio for the index sits at 1.03. The host built a PEG screen for his ninety-five name thematic portfolio. Thirty of those names trade at a PEG under 1.0. The hyperscalers everyone holds passively are the expensive ones: Microsoft 1.4, Amazon 1.66, Meta 1.96, Apple 3, Alphabet near 5. The capacity for forward PE compression sits in the names retail and active rotational money are buying, not in the index core.

    The benchmark arbitrage trap

    Most money is now in passive investing. By construction, an S&P 500 or MSCI World allocation is underweight the names that are actually rising. Pension funds, mutual funds, and any active manager benchmarked to those indices is forced to add AI exposure to keep pace. BlackRock’s Tony Kim made this point at Milken: 8 trillion in market cap has accrued to compute and model layers year to date, while service apps representing two thirds of GDP lost 1.2 trillion. The host calls this benchmark arbitrage and considers it the single most underappreciated driver of the current move.

    The 90 trillion dollar physical upgrade cycle

    Jensen Huang’s framing of a 90 trillion dollar AI upgrade includes autos, phones, computers, humanoids, robotics, and the military stack. The host considers this a global race between the US and China. The one big beautiful bill included bonus depreciation specifically to incentivize the capex push. Greg Brockman’s interview with Sequoia made the point that demand for intelligence is effectively unlimited, and that every company outside the hyperscalers, Morgan Stanley, Goldman, Eli Lilly, Merck, United Healthcare, needs their own data center compute or their margins will not keep up with competitors. In a capitalist system, that forces broad enterprise AI spending.

    Speed crashes replace recessions

    The new regime has corrections but they are fast. Since 2020 we have had multiple 20 percent corrections compressed into weeks instead of years. The host expects this pattern to continue for the next decade. Bottlenecks in power, chips, transportation, chemicals, and skilled labor will produce inflation spikes that trigger speed crashes, not traditional credit-cycle recessions. The Logistics Managers Index reading of 69.9 in March, with capacity contraction near record lows, signals exactly this kind of bottleneck environment. The host’s strategy in this regime is to never go to cash, only to rebalance and slow down within the portfolio.

    The inflation regime shift and the rotation out of Micron

    The expected CPI print of 3.7 percent will put three-month Treasury bills below CPI for the first time since the post-pandemic inflation era, restoring negative real yields. That was the condition under which Bitcoin first launched its major bull moves. The host has sold two thirds of his Micron position despite continued bullish conviction on the name, because the memory market is the most stretched on exhaustion indicators and the DRAM ETF is trading at unprecedented volume. The capital is rotating into Nvidia, Vistra, silver, Bitcoin, and Ethereum. Silver versus gold has not moved while semis went parabolic. LME metals are breaking out. China is increasing gold purchases. The energy and power side of the stack is the next leg up.

    AI is breaking the consumer and the labor market

    Whirlpool reports appliance demand at financial crisis lows. PCs and laptops are collapsing worse than 2008. Phones, autos, housing, all the categories Kindleberger’s framework was built around are under pressure because AI is pulling capital and pricing power into compute, power, and chemicals. Payroll job creation in information, professional services, and financial activities is negative as AI takes knowledge work. Job creation is rotating into mining, construction, manufacturing, trade, transportation, and utilities, which is structurally inflationary because those sectors require physical capacity and wages. That combination, wage inflation plus commodity inflation, makes it very difficult for the Fed to ease, even with Kevin Warsh likely taking over.

    Crypto, tokenization, and AI agents at machine speed

    The final section pivots to crypto. Bitcoin ETF outstanding shares hit new highs, BlackRock’s product remains dominant, and Morgan Stanley and Goldman have launched competing vehicles. Wealth managers and boomers are allocating. The Raoul Pal and Yoni Assia conversation on the end of human trading is the host’s headline reference: AI agents will trade, allocate, hedge, and shift capital at machine speed through programmable wallets and exchanges. Tokenization, scheduled for a major launch on July 26 with 50 Wall Street clearing firms onboarded, makes ownership programmable. A16Z laid out the case in December 2024. The host is speaking on tokenization at the II event in Fort Lauderdale May 13 through 15 and considers it the next regime-defining shift after agentic AI.

    Thoughts

    The strongest argument in this video is structural, not narrative. The shift from human professionals with anchored memories to AI agents and benchmark-driven passive flows is a real change in who sets prices. Whether or not you accept the host’s portfolio calls, the framing should make any investor pause before defaulting to dot-com pattern recognition. Cisco’s PE was 130 with no business model. Nvidia’s PE is at a decade low with a near monopoly on the picks and shovels of the largest capex cycle in industrial history. Those facts cannot both be true and produce the same outcome.

    The PEG framework is the cleanest test in the video. If you believe Nvidia, Micron, Intel, and the second-tier AI infrastructure names are bubbles, you are implicitly betting that earnings growth collapses. That bet was viable in 2000 because the companies driving the move had no earnings. It is much harder to bet against earnings growth when 320 companies have just printed a 20 percent average earnings beat and analysts are revising forward estimates up by 25 percent. The host’s argument is not that the prices are reasonable in absolute terms. It is that the bear case requires growth to fall off a cliff, and nothing in the order books, the capex commitments, or the compute backlog suggests that is imminent.

    The benchmark arbitrage point deserves more attention than it gets. If the majority of professional money is locked in passive structures that are by definition underweight the leading names, and if those managers are evaluated quarter to quarter against the benchmark they cannot match, the pressure to chase will compound. This is the opposite of the dot-com setup, where active managers were forced to add overpriced tech to keep up with the index. Here, the index itself is structurally underweight the trade, and the active managers chasing it are doing so against names with rational PEG ratios.

    The rotation thesis from Micron into power, silver, and crypto is more debatable. The energy and bottleneck story is real, but the timing of when the power trade catches up with the semi trade is the hard part. The host’s discipline of never going to cash and rebalancing through the cycle is a sensible response to a regime that produces speed crashes rather than slow drawdowns. The investors most hurt by this regime will not be the ones who are long the wrong names. They will be the ones who sit out waiting for an entry point that never comes.

    Tokenization is the most underappreciated thread in the video. If the July 26 rollout brings 50 clearing firms and real ownership programmability online, the second half of the year could produce a regime shift on top of the AI regime shift. AI agents transacting on tokenized assets at machine speed is the logical endpoint of the trends the host has been tracking, and it is the part of his framework that current market consensus has not yet priced.

    Watch the full conversation here.

  • Jensen Huang on Nvidia’s Supply Chain Moat, TPU Competition, China Export Controls, and Why Nvidia Will Not Become a Cloud (Dwarkesh Podcast Summary)

    TLDW (Too Long, Didn’t Watch)

    Jensen Huang sat down with Dwarkesh Patel for over 90 minutes covering Nvidia’s supply chain dominance, the TPU threat, why Nvidia will not become a hyperscaler, whether the US should sell AI chips to China, and why Nvidia does not pursue multiple chip architectures at once. Jensen framed Nvidia’s entire business as transforming “electrons into tokens” and argued that Nvidia’s real moat is not any single technology but the full stack ecosystem it has built over two decades. He was blunt about his regret over not investing in Anthropic and OpenAI earlier, passionate about keeping the American tech stack dominant worldwide, and dismissive of the idea that China’s chip industry can be meaningfully contained through export controls.

    Key Takeaways

    1. Nvidia’s moat is the ecosystem, not the chip. Jensen repeatedly emphasized that Nvidia’s competitive advantage comes from CUDA, its massive installed base, its deep partnerships across the entire supply chain, and the fact that it operates in every cloud. The moat is not a single product but an interlocking system that took 20+ years to build.

    2. Supply chain bottlenecks are temporary, energy bottlenecks are not. Jensen argued that CoWoS packaging, HBM memory, EUV capacity, and logic fabrication bottlenecks can all be resolved in two to three years with the right demand signal. The real constraint on AI scaling is energy policy, which takes far longer to fix.

    3. TPUs and ASICs are not an existential threat to Nvidia. Jensen was emphatic that no competitor has demonstrated better price-performance or performance-per-watt than Nvidia, and challenged TPU and Trainium to prove otherwise on public benchmarks like InferenceMAX and MLPerf. He described Anthropic as a “unique instance, not a trend” for TPU adoption.

    4. Jensen regrets not investing in Anthropic and OpenAI earlier. He admitted he did not deeply internalize how much capital AI labs needed and that traditional VC funding was not sufficient for companies at that scale. He described this as a clear miss, though he said Nvidia was not in a position to make multi-billion dollar investments at the time.

    5. Nvidia will not become a hyperscaler. Jensen’s philosophy is “do as much as needed, as little as possible.” Building cloud infrastructure is something other companies can do, so Nvidia supports neoclouds like CoreWeave, Nebius, and Nscale instead of competing with them. Nvidia invests in ecosystem partners rather than vertically integrating into cloud services.

    6. Jensen is strongly against US chip export controls on China. This was the longest and most heated segment of the interview. Jensen argued that China already has abundant compute, energy, and AI researchers, and that export controls have accelerated China’s domestic chip industry while causing the US to concede the world’s second-largest technology market. He compared the situation to how US telecom policy allowed Huawei to dominate global telecommunications.

    7. AI will cause software tool usage to skyrocket, not collapse. Jensen pushed back on the narrative that AI will commoditize software companies. He argued that agents will use existing tools at massive scale, causing the number of instances of products like Excel, Synopsys Design Compiler, and other enterprise tools to grow exponentially.

    8. Nvidia does not pick winners among AI labs. Jensen explained that Nvidia invests across multiple foundation model companies simultaneously and refuses to favor any single one. He cited his own company’s unlikely survival story as the reason for this humility: Nvidia’s original graphics architecture was “precisely wrong” and would have been counted out by anyone picking winners.

    9. Nvidia added Groq for premium token economics. Nvidia recently acquired Groq and is folding it into the CUDA ecosystem because the market is now segmenting into different token tiers. Some customers will pay premium prices for faster response times even at lower throughput, creating a new segment of the inference market.

    10. Without AI, Nvidia would still be very large. Jensen was clear that accelerated computing, not AI specifically, is the foundational mission of the company. Molecular dynamics, quantum chemistry, computational lithography, data processing, and physics simulation all benefit from GPU acceleration regardless of deep learning.

    Detailed Summary

    Nvidia’s Real Business: Electrons to Tokens

    Jensen opened the conversation by reframing Nvidia’s entire value proposition. When Dwarkesh suggested that Nvidia is fundamentally a software company that sends a GDS2 file to TSMC for manufacturing, Jensen pushed back hard. He described Nvidia’s job as transforming electrons into tokens, with everything in between representing an “incredible journey” of artistry, engineering, science, and invention. He said the transformation is far from deeply understood and the journey is far from over, making commoditization unlikely.

    Jensen described Nvidia as operating a philosophy of doing “as much as necessary and as little as possible.” Whatever Nvidia does not need to do itself, it partners with someone else and makes it part of the broader ecosystem. This is why Nvidia has what Jensen called probably the largest ecosystem of partners in the industry, spanning the full supply chain upstream and downstream, application developers, model makers, and all five layers of the AI stack.

    On the question of whether AI will commoditize software companies, Jensen offered a contrarian take. He argued that agents are going to use software tools at unprecedented scale, meaning the number of instances of products like Excel, Cadence design tools, and Synopsys compilers will skyrocket. Today the bottleneck is the number of human engineers. Tomorrow, those engineers will be supported by swarms of agents exploring design spaces and using the same tools humans use today. Jensen said the reason this has not happened yet is simply that the agents are not good enough at using tools. That will change.

    The Supply Chain Moat

    Dwarkesh pressed Jensen on Nvidia’s reported $100 billion (and potentially $250 billion) in purchase commitments with foundries, memory manufacturers, and packaging companies. The question was whether Nvidia’s real moat for the next few years is simply locking up scarce upstream components so that no competitor can get the memory and logic they need to build alternative accelerators.

    Jensen confirmed this is a significant advantage but framed it differently. He said Nvidia has made enormous explicit and implicit commitments upstream. The implicit commitments matter just as much: Jensen personally meets with CEOs across the supply chain to explain the scale of the coming AI industry, convince them to invest in capacity, and assure them that Nvidia’s downstream demand is large enough to justify that investment. Nvidia’s GTC conference serves this purpose too, bringing the entire ecosystem together so upstream suppliers can see downstream demand and vice versa.

    Jensen described a process of systematically “prefetching bottlenecks” years in advance. CoWoS advanced packaging was a major bottleneck two years ago, but Nvidia swarmed it with repeated doubling of capacity until TSMC recognized it as mainstream computing technology rather than a specialty product. More recently, Nvidia has invested in the silicon photonics ecosystem through partnerships with Lumentum and Coherent, invented new packaging technologies, licensed patents to keep the supply chain open, and even invested in new testing equipment like double-sided probing.

    When Dwarkesh asked about the ultimate physical bottlenecks, Jensen surprised him. The hardest bottleneck to solve is not CoWoS or HBM or EUV machines. It is plumbers and electricians needed to build data centers. Jensen used this as a launching point to criticize “doomers” who discourage people from pursuing careers in software engineering or radiology, arguing that scaring people out of these professions creates the real bottlenecks.

    On EUV and logic scaling specifically, Jensen was optimistic. He said no supply chain bottleneck lasts longer than two to three years. Once you can build one of something, you can build ten, and once you can build ten, you can build a million. The key is a clear demand signal. If TSMC is convinced of the demand, ASML will produce enough EUV machines. Meanwhile, Nvidia continues to improve computing efficiency by 10x to 50x per generation through architecture, algorithms, and system design.

    The TPU Question

    Dwarkesh pushed hard on whether Google’s TPUs represent a real threat, noting that two of the top three AI models (Claude and Gemini) were trained on TPUs. Jensen drew a sharp distinction between what Nvidia builds and what a TPU is. Nvidia builds accelerated computing, which serves molecular dynamics, quantum chromodynamics, data processing, fluid dynamics, particle physics, and AI. A TPU is a tensor processing unit optimized for matrix multiplies. Nvidia’s market reach is far greater than any TPU or ASIC can possibly have.

    Jensen emphasized programmability as Nvidia’s core architectural advantage. If you want to invent a new attention mechanism, build a hybrid SSM model, fuse diffusion and autoregressive techniques, or disaggregate computation in a novel way, you need a generally programmable architecture. The only way to achieve 10x or 100x performance leaps (versus the roughly 25% per year from Moore’s Law) is to fundamentally change the algorithm, and that requires the flexibility CUDA provides.

    On the specific question of whether hyperscalers with huge engineering teams can simply write their own kernels and bypass CUDA, Jensen acknowledged they do write custom kernels but argued that Nvidia’s engineers still routinely deliver 2x to 3x speedups when they optimize a partner’s stack. He described Nvidia’s GPUs as “F1 racers” that anyone can drive at 100 mph, but extracting peak performance requires deep architectural expertise. Nvidia uses AI itself to generate many of its optimized kernels.

    Jensen was particularly blunt about public benchmarks. He pointed to Dylan Patel’s InferenceMAX benchmark and said neither TPU nor Trainium has been willing to demonstrate their claimed performance advantages on it. He said Nvidia’s performance-per-TCO is the best in the world, “bar none,” and challenged anyone to prove otherwise.

    Regarding Anthropic’s multi-gigawatt deal with Broadcom and Google for TPUs, Jensen called it “a unique instance, not a trend.” He said without Anthropic, there would be essentially no TPU growth and no Trainium growth. He traced this back to his own mistake: when Anthropic and OpenAI needed multi-billion dollar investments from their compute suppliers to get off the ground, Nvidia was not in a position to provide that capital. Google and AWS were, and in return, Anthropic committed to using their compute.

    Nvidia’s Investment Strategy and Regrets

    Jensen was unusually candid about his regret over not investing in foundation model companies earlier. He said he did not deeply internalize how different AI labs were from typical startups. A traditional VC would never put $5 to $10 billion into a single AI lab, but that was exactly what companies like OpenAI and Anthropic needed. By the time Jensen understood this, Nvidia was not in a financial or cultural position to make those kinds of investments.

    Now, Nvidia has invested approximately $30 billion in OpenAI and $10 billion in Anthropic. Jensen said he is delighted to support both and considers their existence essential for the world. But he acknowledged that these investments came at much higher valuations than would have been possible years earlier.

    Jensen explained Nvidia’s broader investment philosophy: support everyone, do not pick winners. He invests in one foundation model company, he invests in all of them. This comes from hard-won humility. When Nvidia started, there were 60 3D graphics companies. Nvidia’s original architecture was “precisely wrong” and the company would have been at the top of most lists to fail. Jensen said he has enough humility from that experience to know that you cannot predict which AI company will ultimately succeed.

    Why Nvidia Will Not Become a Hyperscaler

    Dwarkesh pointed out that Nvidia has the cash to build and operate its own cloud infrastructure, bypassing the middleman ecosystem that converts CapEx into OpEx for AI labs. Jensen rejected this path based on his core operating philosophy.

    If Nvidia did not build its computing platform, NVLink, and the CUDA ecosystem, nobody else would have done it. He is “completely certain” of that. These are things Nvidia must do. But the world has lots of clouds. If Nvidia did not build a cloud, someone else would show up. So the answer is to support the ecosystem instead: invest in CoreWeave, Nscale, Nebius, and others to help them exist and scale, rather than competing with them.

    Jensen was clear that Nvidia is not trying to be in the financing business either. When OpenAI needed a $30 billion investment before its IPO, Nvidia stepped up because OpenAI needed it and Nvidia deeply believed in the company. But these are targeted ecosystem investments, not a strategic pivot into cloud services.

    On GPU allocation during shortages, Jensen pushed back on the narrative that Nvidia strategically “fractures” the market by giving allocations to smaller neoclouds. He said the process is straightforward: you forecast demand, you place a purchase order, and it is first in, first out. Nvidia never changes prices based on demand. Jensen said he prefers to be dependable and serve as the foundation of the industry rather than extracting maximum short-term value.

    The China Debate

    The longest and most heated section of the interview was Jensen’s case against US chip export controls on China. This was a genuine debate, with Dwarkesh pushing the national security argument and Jensen pushing back forcefully.

    Jensen’s core argument rested on several pillars. First, China already has abundant compute. They manufacture 60% or more of the world’s mainstream chips, have massive energy infrastructure (including empty data centers with full power), and employ roughly 50% of the world’s AI researchers. The threshold of compute needed to build models like Anthropic’s Mythos has already been reached and exceeded by China’s existing infrastructure.

    Second, export controls have backfired. They accelerated China’s domestic chip industry, forced their AI ecosystem to optimize for internal architectures instead of the American tech stack, and caused the United States to concede the second-largest technology market in the world. Jensen compared this directly to how US telecom policy allowed Huawei to dominate global telecommunications infrastructure.

    Third, Jensen argued that AI is a five-layer stack (energy, chips, computing platform, models, applications) and the US needs to win at every layer. Fixating on one layer (models) at the expense of another layer (chips) is counterproductive. If Chinese open source AI models end up optimized for non-American hardware and that stack gets exported to the global south, the Middle East, Africa, and Southeast Asia, the US will have lost something far more valuable than whatever marginal compute advantage the export controls provided.

    Dwarkesh countered with the Mythos example: Anthropic’s new model found thousands of high-severity zero-day vulnerabilities across every major operating system and browser, including one that had existed in OpenBSD for 27 years. If China had enough compute to train and deploy a model like Mythos at scale before the US could prepare, the cyber-offensive capabilities would be devastating.

    Jensen’s response was direct. Mythos was trained on “fairly mundane capacity” that is already abundantly available in China. The amount of compute is not the bottleneck for that kind of breakthrough. Great computer science is, and China has no shortage of brilliant AI researchers. He pointed to DeepSeek as evidence: most advances in AI come from algorithmic innovation, not raw hardware. If China’s researchers can achieve breakthroughs like DeepSeek with limited hardware, imagine what they could do with more.

    Jensen also argued for dialogue over confrontation. He said it is essential that American and Chinese AI researchers are talking to each other, and that both countries agree on what AI should not be used for. The idea that you can prevent AI risks by cutting off chip sales, when the real advances come from algorithms and computer science, reflects a fundamental misunderstanding of how AI progress works.

    The debate ended without resolution, but Jensen’s final point was sharp: “I’m not talking to somebody who woke up a loser. That loser attitude, that loser premise, makes no sense to me.”

    Why Not Multiple Chip Architectures?

    Near the end of the interview, Dwarkesh asked why Nvidia does not run multiple parallel chip projects with different architectures, like a Cerebras-style wafer-scale design or a Dojo-style huge package, or even one without CUDA.

    Jensen’s answer was simple: “We don’t have a better idea.” Nvidia simulates all of these alternative approaches in its internal simulators and they are provably worse. The company works on exactly the projects it wants to work on. If the workload were to change dramatically (not just the algorithms, but the actual market shape), Nvidia might add other accelerators.

    In fact, Nvidia recently did exactly this by acquiring Groq. The inference market is now segmenting into different tiers. Some customers will pay premium prices for extremely fast response times even if throughput is lower. This creates a new “high ASP token” segment that justifies a different point on the performance curve. But Jensen was clear: if he had more money, he would put it all behind Nvidia’s existing architecture, not diversify into alternatives.

    Nvidia Without AI

    Jensen closed by saying that even if the deep learning revolution had never happened, Nvidia would be “very, very large.” The premise of the company has always been that general-purpose computing cannot scale indefinitely and that domain-specific acceleration is the way forward. Molecular dynamics, seismic processing, image processing, computational lithography, quantum chemistry, and data processing all benefit from GPU acceleration regardless of AI. Jensen said the fundamental promise of accelerated computing has not changed “not even a little bit.”

    Thoughts

    This interview is one of the most revealing Jensen Huang conversations in years, partly because Dwarkesh actually pushes back instead of lobbing softballs. A few things stand out.

    The Anthropic regret is real and significant. Jensen is essentially admitting that Nvidia’s biggest strategic miss of the AI era was not understanding that foundation model companies needed supplier-level capital commitments, not VC funding. The fact that Google and AWS used compute investments to lock in Anthropic’s architecture choices has had downstream consequences that Nvidia is still working to unwind. When Jensen says Anthropic is “a unique instance, not a trend” for TPU adoption, he is simultaneously downplaying the threat and revealing exactly how seriously he takes it.

    The China debate is the highlight. Jensen’s argument is more nuanced than it first appears. He is not saying “sell China everything.” He is saying the current binary approach of near-total restriction has backfired by accelerating China’s domestic chip industry and pushing the Chinese AI ecosystem away from the American tech stack. His comparison to the US telecom industry losing global market share to Huawei is pointed and historically grounded. Whether you agree with his conclusion or not, the framing of AI as a five-layer stack where the US needs to compete at every layer is a useful mental model.

    The “electrons to tokens” framing is Jensen at his best. It is a simple metaphor that captures something genuinely complex about where value is created in the AI supply chain. And his insistence that the transformation is “far from deeply understood” is a subtle way of arguing that Nvidia’s competitive position will be durable because the problem space is not close to being solved.

    The Groq acquisition reveal is interesting for what it signals about the inference market. If Nvidia is creating a separate product tier for premium-priced, low-latency tokens, it suggests the company sees inference economics fragmenting significantly. This aligns with the broader trend of AI becoming an enterprise product where different customers have wildly different willingness to pay based on how they use tokens.

    Finally, Jensen’s refusal to diversify chip architectures is a bold bet. “We simulate it all in our simulator, provably worse” is an incredibly confident statement. History is full of companies that were right until they were not. But Nvidia’s track record of 50x generation-over-generation improvements through co-design across processors, fabric, libraries, and algorithms is hard to argue with. The question is whether the current paradigm of transformer-based models on GPU clusters represents a local or global optimum for AI compute.

  • Jensen Huang on Lex Fridman: NVIDIA’s CEO Reveals His Vision for the AI Revolution, Scaling Laws, and Why Intelligence Is Now a Commodity

    A deep breakdown of Lex Fridman Podcast #494 featuring Jensen Huang, CEO of NVIDIA, covering extreme co-design, the four AI scaling laws, CUDA’s origin story, the future of programming, AGI timelines, and what it takes to lead the world’s most valuable company.

    TLDW (Too Long, Didn’t Watch)

    Jensen Huang sat down with Lex Fridman for a sprawling two-and-a-half-hour conversation covering the full arc of NVIDIA’s evolution from a GPU gaming company to the engine of the AI revolution. Jensen explains how NVIDIA now thinks in terms of rack-scale and pod-scale computing rather than individual chips, breaks down his four AI scaling laws (pre-training, post-training, test time, and agentic), and reveals the near-existential bet the company made putting CUDA on GeForce. He shares his views on China’s tech ecosystem, his deep respect for TSMC, why he turned down the chance to become TSMC’s CEO, how Elon Musk’s systems engineering approach built Colossus in record time, and why he believes AGI already exists. He also discusses why the future of programming is really about “specification,” why intelligence is being commoditized while humanity is the true superpower, and how he manages the enormous pressure of leading a company that nations and economies depend on. His core message: do not let the democratization of intelligence cause you anxiety. Instead, let it inspire you.

    Key Takeaways

    1. NVIDIA No Longer Thinks in Chips. It Thinks in AI Factories.

    Jensen’s mental model of what NVIDIA builds has fundamentally changed. He no longer picks up a chip to represent a new product generation. Instead, his mental model is a gigawatt-scale AI factory with power generation, cooling systems, and thousands of engineers bringing it online. The unit of computing at NVIDIA has evolved from GPU to computer to cluster to AI factory. His next mental “click” is planetary-scale computing.

    2. Extreme Co-Design Is NVIDIA’s Secret Weapon

    The reason NVIDIA dominates is not just better GPUs. It is the extreme co-design of the entire stack: GPU, CPU, memory, networking, switching, power, cooling, storage, software, algorithms, and applications. Jensen explains that when you distribute workloads across tens of thousands of computers and want them to go a million times faster (not just 10,000 times), every single component becomes a bottleneck. This is a restatement of Amdahl’s Law at scale. NVIDIA’s organizational structure directly reflects this co-design philosophy. Jensen has 60+ direct reports, holds no one-on-ones, and runs every meeting as a collective problem-solving session where specialists across all domains are present and contribute.

    3. The Four AI Scaling Laws Are a Flywheel

    Jensen outlined four distinct scaling laws that form a continuous loop:

    Pre-training scaling: Larger models plus more data equals smarter AI. The industry panicked when people said data was running out, but synthetic data generation has removed that ceiling. Data is now limited by compute, not by human generation.

    Post-training scaling: Fine-tuning, reinforcement learning from human feedback, and curated data continue to scale AI capabilities beyond what pre-training alone achieves.

    Test-time scaling: Inference is not “easy” as many predicted. It is thinking, reasoning, planning, and search. It is far more compute-intensive than memorization and pattern matching. This is why inference chips cannot be commoditized the way many predicted.

    Agentic scaling: A single AI agent can spawn sub-agents, creating teams. This is like scaling a company by hiring more employees rather than trying to make one person faster. The experiences generated by agents feed back into pre-training, creating a flywheel.

    4. The CUDA Bet Nearly Killed NVIDIA

    Putting CUDA on GeForce was one of the most consequential technology decisions in modern history. It increased GPU costs by roughly 50%, which crushed the company’s gross margins at a time when NVIDIA was a 35% gross margin business. The company’s market cap dropped from around $7-8 billion to approximately $1.5 billion. But Jensen understood that install base defines a computing architecture, not elegance. He pointed to x86 as proof: a less-than-elegant architecture that defeated beautifully designed RISC alternatives because of its massive install base. CUDA on GeForce put a supercomputer in the hands of every researcher, every scientist, every student. It took a decade to recover, but that install base became the foundation of the deep learning revolution.

    5. NVIDIA’s Moat Is Trust, Velocity, and Install Base

    Jensen was direct about NVIDIA’s competitive advantage. The CUDA install base is the number one asset. Developers target CUDA first because it reaches hundreds of millions of computers, is in every cloud, every OEM, every country, every industry. NVIDIA ships a new architecture roughly every year. No company in history has built systems of this complexity at this cadence. And the trust that NVIDIA will maintain, improve, and optimize CUDA indefinitely is something developers can count on. If someone created “GUDA” or “TUDA” tomorrow, it would not matter. The install base, velocity of execution, ecosystem breadth, and earned trust create a compounding advantage that is nearly impossible to replicate.

    6. Jensen Believes AGI Is Already Here

    When asked about AGI timelines, Jensen said he believes AGI has been achieved. His reasoning is practical: an agentic system today could plausibly create a web service, achieve virality, and generate a billion dollars in revenue, even if temporarily. This is not meaningfully different from many internet-era companies that did the same thing with technology no more sophisticated than what current AI agents can produce. He does not believe 100,000 agents could build another NVIDIA, but he believes a single agent-driven viral product is within reach right now.

    7. The Future of Programming Is Specification, Not Syntax

    Jensen believes the number of programmers in the world will increase dramatically, not decrease. His reasoning: the definition of coding is expanding to include specification and architectural description in natural language. This expands the population of “coders” from roughly 30 million professional developers to potentially a billion people. Every carpenter, plumber, accountant, and farmer who can describe what they want a computer to build is now a coder. The artistry of the future is knowing where on the spectrum of specification to operate, from highly prescriptive to exploratory and open-ended.

    8. China Is the Fastest Innovating Country in the World

    Jensen gave a nuanced and detailed explanation of why China’s tech ecosystem is so formidable. About 50% of the world’s AI researchers are Chinese. China’s tech industry emerged during the mobile cloud era, so it was built on modern software from the start. The country’s provincial competition creates an insane internal competitive environment. And the cultural norm of knowledge-sharing through school and family networks means China effectively operates as an open-source ecosystem at all times. This is why Chinese companies contribute disproportionately to open source. Their engineers’ brothers, friends, and schoolmates work at competing companies, and sharing knowledge is the cultural default.

    9. The Power Grid Has Enormous Waste That AI Can Exploit

    Jensen proposed a pragmatic solution to the energy problem for AI data centers. Power grids are designed for worst-case conditions with margin, but 99% of the time they run at around 60% of peak capacity. That idle capacity is simply wasted. Jensen wants data centers to negotiate flexible contracts where they absorb excess power most of the time and gracefully degrade during rare peak demand periods. This requires three things: customers accepting that “six nines” uptime may not always be necessary, data centers that can dynamically shift workloads, and utilities that offer tiered power delivery contracts instead of all-or-nothing commitments.

    10. Jensen Turned Down the CEO Role at TSMC

    In 2013, TSMC founder Morris Chang offered Jensen the chance to become CEO of TSMC. Jensen confirmed the story is true and said he was deeply honored. But he had already envisioned what NVIDIA could become and felt it was his sole responsibility to make that vision happen. He sees the relationship with TSMC as one built on three decades of trust, hundreds of billions of dollars in business, and zero formal contracts.

    11. Elon Musk’s Systems Engineering Approach Is Instructive

    Jensen praised Elon Musk’s approach to building the Colossus supercomputer in Memphis in just four months. He highlighted several principles: Elon questions everything relentlessly, strips every process down to the minimum necessary, is physically present at the point of action, and his personal urgency creates urgency in every supplier. Jensen drew a parallel to NVIDIA’s own “speed of light” methodology, where every process is benchmarked against the physical limits of what is possible, not against historical baselines.

    12. Intelligence Is a Commodity. Humanity Is Not.

    Perhaps the most philosophical takeaway from the conversation: Jensen argued that intelligence is a functional, measurable thing that is being commoditized. He surrounded himself with 60 direct reports who are all “superhuman” in their respective domains, more educated and deeper in their specialties than he is. Yet he sits in the middle orchestrating all of them. This proves that intelligence alone does not determine success. Character, compassion, grit, determination, tolerance for embarrassment, and the ability to endure suffering are the real differentiators. Jensen wants the audience to understand that the word we should elevate is not intelligence but humanity.

    Detailed Summary

    From GPU Maker to AI Infrastructure Company

    The conversation opened with Jensen explaining NVIDIA’s evolution from chip-scale to rack-scale to pod-scale design. The Vera Rubin pod, announced at GTC, contains seven chip types, five purpose-built rack types, 40 racks, 1.2 quadrillion transistors, nearly 20,000 NVIDIA dies, over 1,100 Rubin GPUs, 60 exaflops of compute, and 10 petabytes per second of scale bandwidth. And that is just one pod. NVIDIA plans to produce roughly 200 of these pods per week.

    Jensen explained that extreme co-design is necessary because the problems AI must solve no longer fit inside a single computer. When you distribute a workload across 10,000 computers but want a million-fold speedup, everything becomes a bottleneck: computation, networking, switching, memory, power, cooling. This is fundamentally an Amdahl’s Law problem at planetary scale. If computation represents only 50% of the workload, speeding it up infinitely only doubles total throughput. Every layer must be co-optimized simultaneously.

    NVIDIA’s organizational structure is a direct reflection of this co-design philosophy. Jensen has more than 60 direct reports, almost all with deep engineering expertise. He does not do one-on-ones. Every meeting is a collective problem-solving session where the memory expert, the networking expert, the cooling expert, and the power delivery expert are all in the room together, attacking the same problem.

    The Strategic History of CUDA

    Jensen walked through the step-by-step journey from graphics accelerator to computing platform. The company invented a programmable pixel shader, then added IEEE-compatible FP32 to its shaders, then put C on top of that (called Cg), and eventually arrived at CUDA. The critical strategic decision was putting CUDA on GeForce, a consumer product.

    This was nearly an existential move. It increased GPU costs by roughly 50% and consumed all of the company’s gross profit at a time when NVIDIA was a 35% gross margin business. The market cap cratered from around $7-8 billion to approximately $1.5 billion. But Jensen understood a principle that many technologists overlook: install base defines a computing architecture. x86 survived not because it was elegant but because it was everywhere. CUDA on GeForce put a supercomputing capability in the hands of every gamer, every student, every researcher who built their own PC. When the deep learning revolution arrived, CUDA was already the foundation.

    How Jensen Leads and Makes Decisions

    Jensen described a leadership philosophy built on continuous reasoning in public. He does not make announcements in the traditional sense. Instead, he shapes the belief systems of his employees, board, partners, and the broader industry over months and years by reasoning through decisions step by step, using every new piece of external information as a brick in the foundation. By the time he formally announces a strategic direction, the reaction is not surprise but rather, “What took you so long?”

    He applies this same approach to his supply chain. He personally visits CEOs of DRAM companies, packaging companies, and infrastructure providers. He explains the dynamics of the industry, shares his vision of future demand, and helps them reason through why they should make multi-billion-dollar capital investments. Three years ago, he convinced DRAM CEOs that HBM memory would become mainstream for data centers, which sounded ridiculous at the time. Those companies had record years as a result.

    Jensen’s “speed of light” methodology is his framework for decision-making. Every process, every design, every cost is benchmarked against the physical limits of what is theoretically possible. He prefers this to continuous improvement, which he views as incrementalism. He would rather strip a 74-day process back to zero and ask, “If we built this from scratch today, how long would it take?” Often the answer is six days, and the remaining 68 days are filled with accumulated compromises that can be challenged individually.

    AI Scaling Laws and the Future of Compute

    Jensen broke down the four scaling laws in detail. The pre-training scaling law, which depends on model size and data volume, was thought to be hitting a wall when the industry worried about running out of high-quality human-generated data. Jensen argued this concern is misplaced. Synthetic data generation has effectively removed the ceiling, and the constraint is now compute, not data.

    Post-training continues to scale through fine-tuning and reinforcement learning. Test-time scaling was the most counterintuitive for the industry. Many predicted that inference would be “easy” and that inference chips would be small, cheap, and commoditized. Jensen saw this as fundamentally wrong. Inference is thinking: reasoning, planning, search, decomposing novel problems into solvable pieces. Thinking is much harder than reading, and test-time compute is intensely resource-hungry.

    Agentic scaling is the newest frontier. A single AI agent can spawn sub-agents, effectively multiplying intelligence the way a company scales by hiring. The experiences and data generated by agentic systems feed back into pre-training, creating a continuous improvement loop. Jensen described this as the reason NVIDIA designed the Vera Rubin rack architecture differently from the Grace Blackwell architecture. Grace Blackwell was optimized for running large language models. Vera Rubin is designed for agents, which need to access files, use tools, do research, and spin off sub-agents. NVIDIA anticipated this architectural shift two and a half years before tools like OpenClaw arrived.

    China, TSMC, and the Global Supply Chain

    Jensen provided a thoughtful analysis of China’s tech ecosystem. He identified several structural advantages: 50% of the world’s AI researchers are Chinese, the tech industry was born during the mobile cloud era (making it natively modern), provincial competition creates internal Darwinian pressure, and the culture of knowledge-sharing through school and family networks makes China effectively open-source by default.

    On TSMC, Jensen emphasized that the deepest misunderstanding about the company is that its technology is its only advantage. Their manufacturing orchestration system, which dynamically manages the shifting demands of hundreds of companies, is “completely miraculous.” Their culture uniquely balances bleeding-edge technology excellence with world-class customer service. And the trust that Jensen places in TSMC is extraordinary: three decades of partnership, hundreds of billions of dollars in business, and no formal contract.

    Jensen also discussed the AI supply chain more broadly. NVIDIA has roughly 200 suppliers contributing technology to each rack. Jensen personally manages these relationships, flying to supplier sites, explaining industry dynamics, and helping CEOs reason through multi-billion-dollar investment decisions. When asked if supply chain bottlenecks keep him up at night, he said no, because he has already communicated what NVIDIA needs, his partners have told him what they will deliver, and he believes them.

    The Energy Challenge and Space Computing

    On the energy front, Jensen proposed a practical approach to the power problem. Rather than waiting for new power generation, he wants to capture the enormous waste already present in the grid. Power infrastructure is designed for worst-case peak demand, but 99% of the time it runs far below capacity. AI data centers could absorb this excess capacity with flexible contracts that allow graceful degradation during rare peak periods.

    On space computing, NVIDIA already has GPUs in orbit for satellite imaging. Jensen acknowledged the cooling challenge (no conduction or convection in space, only radiation) but sees it as a future frontier worth cultivating. In the meantime, he is focused on the lower-hanging fruit of eliminating waste in the terrestrial power grid.

    On AGI, Jobs, and the Human Future

    Jensen stated directly that he believes AGI has been achieved, at least by the practical definition of an AI system capable of creating a billion-dollar company. He sees it as plausible that an agent could build a viral web service that briefly generates enormous revenue, just as many internet-era companies did with technology no more sophisticated than what current AI agents produce.

    On jobs, Jensen was both compassionate and clear-eyed. He told the story of radiology: computer vision became superhuman around 2019-2020, and the prediction was that radiologists would disappear. Instead, the number of radiologists grew because AI allowed them to study more scans, diagnose better, and serve more patients. The purpose of the job (diagnosing disease) did not change, even though the tools changed completely.

    He applied this principle broadly: the number of software engineers at NVIDIA will grow, not decline, because their purpose is solving problems, not writing lines of code. The number of programmers globally will grow because the definition of coding is expanding to include natural language specification, opening it up to potentially a billion people.

    His advice to anyone worried about their job is straightforward: go use AI now. Become expert in it. Every profession, from carpenter to pharmacist to lawyer, will be elevated by AI tools. The people who learn to use AI will be the ones who get hired, promoted, and empowered.

    Mortality, Succession, and Legacy

    The conversation closed with deeply personal reflections. Jensen said he really does not want to die. He sees the current moment as a “once in a humanity experience.” He does not believe in traditional succession planning. Instead, he believes the best succession strategy is to pass on knowledge continuously, every single day, in every meeting, as fast as possible. His hope is to die on the job, instantaneously, with no long period of suffering.

    He described a vision for a kind of digital continuity: sending a humanoid robot into space, continuously improving it in flight, and eventually uploading the consciousness derived from a lifetime of communications, decisions, and reasoning to catch up with it at the speed of light.

    On the emotional experience of leading NVIDIA, Jensen was candid about hitting psychological low points regularly. His coping mechanism is decomposition: break the problem into pieces, reason about what you can control, tell someone who can help, share the burden, and then deliberately forget what is behind you. He compared this to the mental discipline of great athletes who focus only on the next point.

    His final message was about the relationship between intelligence and humanity. Intelligence, he argued, is functional. It is being commoditized. Humanity, character, compassion, grit, tolerance for embarrassment, and the capacity for suffering are the true superpowers. The word society should elevate is not intelligence but humanity.

    Thoughts

    This is one of the most substantive CEO interviews of 2026. What makes it remarkable is not just the breadth of topics but the depth of reasoning Jensen demonstrates in real time. You can actually watch him think through problems on the spot, which is rare for someone at his level.

    A few things stand out. First, the CUDA origin story is one of the great strategic narratives in tech history. The decision to absorb a 50% cost increase on a consumer product, watching your market cap collapse by 80%, and holding the course for a decade because you understood the power of install base is the kind of conviction that separates generational companies from everyone else.

    Second, Jensen’s framing of the four scaling laws as a flywheel is the clearest articulation anyone has given of why AI compute demand will continue to accelerate. Most people understand pre-training. Fewer understand test-time scaling. Almost nobody is thinking about agentic scaling as a compute multiplier. Jensen has been thinking about it for years and already designed hardware for it before the software ecosystem caught up.

    Third, the discussion on jobs deserves attention. The radiology example is powerful because it is a completed experiment, not a prediction. The profession that was supposed to be eliminated first by AI instead grew. The mechanism is straightforward: when you automate the task, you expand the capacity of the purpose, and demand for the purpose increases. This does not mean there will be no pain or dislocation. Jensen acknowledged that explicitly. But the historical pattern is clear.

    Finally, the philosophical distinction between intelligence and humanity is the kind of framing that could genuinely help people navigate the anxiety of this moment. If you define your value by your intelligence alone, AI commoditization is terrifying. If you define your value by your character, your compassion, your tolerance for suffering, and your willingness to keep going when everything goes wrong, then AI is just the most powerful set of tools you have ever been given.

    Jensen Huang is 62 years old, has been running NVIDIA for 34 years, and shows no signs of slowing down. If anything, his conviction about the future is accelerating alongside his company’s growth.

    Watch the full episode: Lex Fridman Podcast #494 with Jensen Huang

  • Jensen Huang on Nvidia’s Future: Physical AI, the Inference Explosion, Agentic Computing, and Why AI Doomers Are Wrong

    Jensen Huang sat down with the All-In Podcast crew at GTC 2026 for one of the most wide-ranging and candid conversations he’s had in years. From the Groq acquisition to $50 trillion physical AI markets, from defending Nvidia’s pricing to gently calling out Anthropic’s communications missteps, Huang covered everything. Here’s a complete breakdown of everything said — and what it means.


    ⚡ TL;DW

    • Nvidia has evolved from a GPU company into a full-stack AI factory company, and its TAM has expanded by 33–50% just from new rack configurations.
    • Inference demand is exploding — Huang says compute will scale 1 million times, and analysts who model 7–20% growth “don’t understand the scale and breadth of AI.”
    • The Groq acquisition positions Nvidia to run the right workload on the right chip — GPU, LPU, CPU, switch, all orchestrated under Dynamo, the AI factory OS.
    • Physical AI (robotics, autonomous vehicles, industrial automation) is Nvidia’s play at a $50 trillion market — and it’s already a ~$10 billion/year business growing exponentially.
    • OpenClaw (Claude’s open-source agentic framework) is, in Jensen’s view, the new operating system for modern computing.
    • Jensen pushed back hard on AI doomerism — and diplomatically but clearly called out Anthropic’s communications as too extreme.
    • Robots are 3–5 years away from being “all over the place.” Jensen hopes for more than one robot per human on Earth.
    • Dario Amodei’s $1 trillion AI revenue forecast by 2030? Jensen says he’s being too conservative.
    • His advice to young people: become deeply expert at using AI. English majors may end up winning.

    🔑 Key Takeaways

    1. Nvidia Is No Longer a Chip Company

    Jensen Huang made clear that Nvidia’s identity has fundamentally shifted. The company is now an AI factory company — building not just GPUs but the entire computing stack: GPUs, CPUs, networking switches, storage processors (BlueField), and now LPUs via the Groq acquisition. The operating system tying it all together is called Dynamo, named after the Siemens machine that powered the last industrial revolution by turning water into electricity. Huang’s point: Dynamo is doing the same thing for AI — turning raw compute into intelligence at industrial scale.

    2. The Inference Explosion Is Real and Massive

    A year ago, Huang predicted inference would scale enormously. He’s now doubling down: from generative AI to reasoning models, compute requirements grew roughly 100x. From reasoning to agentic AI, another 100x. That’s 10,000x in two years — and Huang says we haven’t even started scaling yet. He believes the ultimate trajectory is 1 million times more compute than where we started. Analysts who project 20–30% revenue growth for Nvidia fundamentally don’t understand what’s coming.

    3. Disaggregated Inference Is the New Architecture

    The technical centerpiece of GTC 2026 was disaggregated inference — the idea that the AI processing pipeline is so complex (prefill, decode, working memory, long-term memory, tool use, multi-agent coordination) that it should run across heterogeneous chips, not just a single GPU rack. Nvidia’s Vera Rubin system is built for this: multiple rack types handling different workloads. Jensen says Nvidia’s TAM grew by 33–50% just from adding those four new rack types to what was previously a one-rack company.

    4. The $50 Billion Factory Produces the Cheapest Tokens

    Critics argue that Nvidia’s inference factories cost $40–50B versus competitors at $25–30B. Huang’s rebuttal is clean: don’t equate the price of the factory with the cost of the tokens. A $50B Nvidia factory producing 10x the throughput of a $30B alternative means Nvidia’s tokens are actually cheaper. When land, power, shell, storage, networking, and cooling are already fixed costs, the delta between GPU options is a small fraction of total spend — but the performance difference is enormous.

    5. OpenClaw Is the New OS for Modern Computing

    Jensen spent serious time on Claude’s open-source agentic framework (referred to throughout as “OpenClaw”). His view: it’s not just a product announcement — it’s a computing paradigm shift. OpenClaw has a memory system (short-term scratch, long-term file system), skills/tools, resource management, scheduling, cron jobs, multi-agent spawning, and external I/O. These are the four foundational elements of an operating system. His conclusion: for the first time, we have a personal AI computer — and it’s open source, running everywhere.

    6. Agents Mean Every Engineer Gets 100 Helpers

    Jensen’s internal benchmark at Nvidia: if a $500K/year engineer isn’t spending at least $250K worth of tokens annually, something is wrong. He compared it to a chip designer refusing to use CAD tools and working only in pencil. His vision: every engineer will have 100 agents working alongside them. The nature of programming shifts from writing code to writing ideas, architectures, specifications, and evaluation criteria — and then guiding agents toward outcomes.

    7. Physical AI Is a $50 Trillion Opportunity

    This is the biggest framing in the talk. Physical AI — robotics, autonomous vehicles, industrial automation, agriculture, healthcare instruments — represents the technology industry’s first real shot at a $50 trillion market that has been “largely void of technology until now.” Nvidia started this journey 10 years ago, it’s now inflecting, and it’s already approaching $10 billion/year as a standalone business. Huang expects this to grow exponentially.

    8. Robots Are 3–5 Years Away from Ubiquity

    Huang was asked about the “lost decade” of robotics — Google buying and selling Boston Dynamics, years of underwhelming progress. His take: America got into robotics too soon, got exhausted, and quit about five years before the enabling technology (AI “brains”) appeared. Now the brain is here. From a “high-functioning existence proof” (what we have now) to “reasonable products,” technology historically takes 2–3 cycles — meaning 3 to 5 years. He also flagged China’s formidable position in robotics hardware: motors, rare earth elements, magnets, micro-electronics. The world’s robotics industry will depend heavily on China’s supply chain.

    9. Jensen Thinks Dario Amodei Is Too Conservative

    Dario Amodei publicly predicted that AI model and agent companies will generate hundreds of billions in revenue by 2027–28 and reach $1 trillion by 2030. Jensen’s response: “I think he’s being very conservative. Way better than that.” His reasoning? Dario hasn’t fully accounted for the fact that every enterprise software company will become a reseller of AI tokens — a logarithmic expansion of go-to-market that will dwarf what any AI lab can sell directly.

    10. The AI Moat Is Deep Specialization

    When asked what the real competitive moat is at the application layer, Jensen said: deep specialization. General models will handle general intelligence. But every industry has domain expertise that needs to be captured in specialized sub-agents, trained on proprietary data. The entrepreneur who knows their vertical better than anyone else, connects their agent to customers first, and builds that flywheel — that’s the moat. He framed it as an inversion of traditional software: instead of building horizontal platforms and customizing at the edges, AI enables you to go vertical-first from day one.

    11. Jensen’s Gentle but Clear Critique of Anthropic’s Communications

    Asked what advice he’d give Anthropic following the Department of Defense controversy that created a PR crisis, Jensen praised Anthropic’s technology and their focus on safety — then offered a measured but pointed critique: warning people is good, scaring people is less good. He argued that AI leaders need to be more circumspect, more humble, more moderate. Making extreme, catastrophic predictions without evidence can damage public trust in a technology that is “too important.” His implicit warning: look what happened to nuclear energy. A 17% public approval rating for AI is the beginning of that same problem.

    12. China Policy: Back to Market, With Conditions

    Nvidia had a 95% market share in China — and lost it entirely due to export controls, falling to 0%. Jensen confirmed that Nvidia has received approved licenses from Secretary Lutnik to sell back into China, has received purchase orders from Chinese companies, and is actively ramping up its supply chain to ship. His broader point: the risk isn’t selling chips to China — the real risk is America becoming so afraid of AI that its own industries don’t adopt it while the rest of the world surges ahead.

    13. Taiwan, Supply Chain, and Geopolitical Risk

    Jensen laid out a three-part strategy for de-risking around Taiwan: (1) Re-industrialize the US as fast as possible — he said Arizona, Texas, and California manufacturing is accelerating with Taiwan’s help as a strategic partner. (2) Diversify the supply chain to South Korea, Japan, and Europe. (3) Demonstrate restraint — don’t press unnecessarily while building resilience. He also noted that Taiwan’s partnership has been genuine and deserves recognition and generosity in return.

    14. Data Centers in Space

    Not science fiction — Nvidia already has CUDA running in satellites doing AI imaging processing in orbit. The near-term thesis: it’s more efficient to process satellite imagery in space than beam raw data back to Earth. The longer-term architecture for space-based data centers is being explored, with radiation hardening already solved. The main challenge is cooling — in the vacuum of space, you can only use radiation cooling, which requires very large surface areas.

    15. Healthcare: Near the ChatGPT Moment for Digital Biology

    Jensen believes digital biology is approaching its own ChatGPT inflection point — the moment where representing genes, proteins, cells, and chemicals becomes as natural as language modeling. He flagged companies like Open Evidence and Hippocratic AI as examples of where agentic healthcare is already working. His vision: every hospital instrument — CT scanners, ultrasound devices, surgical robots — will become agentic, with “OpenClaw in a safe version” running inside each one.

    16. Open Source and Closed Source Will Both Win

    Jensen pushed back on the idea that open source vs. proprietary is an either/or question. It’s both, necessarily. Proprietary models (OpenAI, Anthropic, Gemini) will continue to serve the general horizontal layer — and consumers love having options with distinct personalities. But industries need open models they can specialize, fine-tune, and control. The open model ecosystem, including Chinese models, is “near the frontier” and growing fast. His framework: connect to the best available model today via a router, and use that time to cost-reduce and fine-tune your specialized version.

    17. Advice for Young People: Master AI, Go Deep on Science

    Jensen’s advice for students deciding what to study: deep science, deep math, and strong language skills — because language is the programming language of AI. He made a striking claim: the English major might end up being the most successful professional in the AI era. His one non-negotiable: whatever you study, become deeply expert at using AI tools. And he used radiologists as proof that AI doesn’t destroy jobs — when AI did 100% of the computer vision work in radiology, demand for radiologists went up, not down, because the total number of scans possible exploded.


    📋 Detailed Summary

    The Groq Acquisition and Disaggregated Inference

    The conversation opened with the Groq acquisition — a deal Chamath jokingly said made him “insufferable” during the six-week close. Jensen explained the strategic logic: as Nvidia evolved from running large language models to running full agentic systems, the compute problem became radically more complex. Agentic workloads involve working memory, long-term memory, tool use, inter-agent communication, and diverse model types (autoregressive, diffusion, large, small). No single chip type handles all of this optimally.

    The solution is disaggregated inference — routing different parts of the processing pipeline to the most efficient hardware. Groq’s LPU chips are particularly suited to certain inference tasks. Nvidia’s Vera Rubin system now encompasses five rack types where it used to be one: GPU compute, networking processors, storage processors (BlueField), CPUs, and now LPUs. Jensen’s TAM math: the addition of those four rack types grew Nvidia’s addressable market in any given data center by 33–50% overnight.

    The operating system managing all of this is Dynamo, which Jensen introduced 2.5 years ago — a deliberate reference to the Siemens dynamo machine that powered the first industrial revolution. Dynamo orchestrates workloads across this heterogeneous compute landscape, optimizing for cost, speed, and efficiency.

    Decision-Making at the World’s Most Valuable Company

    Asked how he allocates attention and makes strategic calls at a $350B+ revenue company, Jensen gave a surprisingly simple framework: pursue things that are insanely hard, that have never been done before, and that tap into Nvidia’s specific superpowers. If something is easy, competitors will flood in. If it’s hard and unique, the pain and suffering of building it becomes a moat in itself. He explicitly said he enjoys the pain — and that there’s no great invention that came easily on the first try.

    Physical AI and the Three Computers

    Jensen framed Nvidia’s physical AI strategy around three distinct computers:

    1. The Training Computer — for developing and creating AI models.
    2. The Simulation Computer (Omniverse) — for evaluating AI systems inside physics-accurate virtual environments (required for robotics and autonomous vehicles that can’t be tested purely in the real world).
    3. The Edge Computer — deployed in cars, robots, factory floors, teddy bears, and telecom base stations. Jensen flagged that the $2 trillion global telecom industry is being transformed into an extension of AI infrastructure — turning radio base stations into AI edge devices.

    Physical AI is, by Jensen’s estimate, the technology industry’s first real crack at the $50 trillion industrial economy. He started the investment 10 years ago. It’s now approaching $10 billion annually and growing exponentially.

    OpenClaw as the New Operating System

    Jensen’s analysis of OpenClaw (Anthropic’s open-source agentic framework, referred to as “Claude Code” / “Open Claude” throughout) was one of the most intellectually interesting sections of the interview. He traced three cultural inflection points:

    1. ChatGPT — put generative AI into the popular consciousness by wrapping the technology in a usable interface.
    2. Reasoning models (o1, o3) — shifted AI from answering questions to answering them with grounded, verifiable reasoning, driving economic model inflection at OpenAI.
    3. OpenClaw — introduced the concept of agentic computing to the general population. But more importantly, it defined a new computing architecture: memory (short and long-term), skills, resource scheduling, IO, external communication, and agent spawning. These are the four elements of an operating system. OpenClaw is, in Jensen’s view, the blueprint for what a personal AI computer looks like — open source, running everywhere.

    He also flagged that Nvidia contributed security governance work to OpenClaw alongside Peter Steinberger — ensuring agents with access to sensitive information, code execution, and external communication can be properly governed with appropriate policy constraints.

    The Agentic Future and Token Economics

    Jensen’s internal benchmark for token spending at Nvidia was striking: a $500K/year engineer who isn’t spending $250K/year in tokens is underperforming. He framed this as no different from a chip designer refusing to use CAD software. The implication for enterprise economics is profound: the cost basis of AI in a company isn’t an IT line item — it’s a multiplier on every knowledge worker’s output.

    He also addressed Andrej Karpathy’s “autoresearch” concept — the idea of AI systems that autonomously run research experiments. A guest described completing, in 30 minutes on a desktop, a genomics analysis that would normally constitute a seven-year PhD thesis. Jensen’s response: this isn’t a fluke. It’s the beginning of a fundamental shift in what “doing science” means.

    His forecast on compute scaling: generative to reasoning = 100x. Reasoning to agentic = 100x. Total in two years = 10,000x. And the end state isn’t even close yet — he believes the long-run trajectory is 1 million times current compute levels.

    AI’s PR Crisis and Anthropic’s Comms Mistakes

    This segment was diplomatically delivered but substantively sharp. Jensen opened by genuinely praising Anthropic — their technology, their safety focus, their culture of excellence. Then he drew a distinction: warning people about AI capabilities is good and important. Scaring people with extreme, catastrophic predictions for which there’s no evidence is less good, and potentially very damaging.

    He pointed to the nuclear analogy: public fear of nuclear energy, driven partly by technology leaders’ own alarming statements, effectively killed the US nuclear industry. America now has zero new fission reactors while China builds a hundred. AI’s 17% public approval rating in the US is the beginning of the same dynamic. Jensen said the greatest national security risk from AI isn’t what other countries do with it — it’s the US being so afraid of it that American industries fail to adopt it while the rest of the world surges ahead.

    His prescription for AI leaders: be more circumspect, more humble, more moderate. Acknowledge that we can’t completely predict the future. Avoid statements that are extreme and unsupported by evidence. Our words matter in a way they didn’t used to — technology leaders are now central to the national security and economic policy conversation.

    China Policy: Return to Market

    One of the more concrete news items in the interview: Nvidia is returning to the Chinese market. Jensen confirmed they had a 95% market share in China — and fell to 0% due to export controls. They’ve now received approved licenses from Secretary Lutnik, Chinese companies have issued purchase orders, and Nvidia is ramping its supply chain to ship.

    His framework for the right AI export policy outcome: the American tech stack — from chips to computing systems to platforms — should be used by 90% of the world as the foundation on which other countries build their own AI. The alternative — an AI industry that ends up like solar panels, rare earth minerals, motors, and telecom infrastructure (all dominated by China) — is a national security catastrophe.

    Self-Driving and Competitive Positioning

    Jensen laid out Nvidia’s strategy in autonomous vehicles: they don’t want to build self-driving cars — they want to enable every car company to build them. Nvidia supplies all three computers: training, simulation, and the in-car edge computer. Their autonomous driving AI system, called “Al Pomayo,” introduced reasoning capabilities into autonomous vehicles — decomposing complex scenarios into simpler ones the system knows how to navigate.

    On competition from customers (Google TPU, Amazon Inferentia, etc.): Jensen isn’t worried. His argument is that 40% of Nvidia’s business comes from customers who don’t just want chips — they need the full AI factory stack. CUDA isn’t just a chip instruction set; it’s a system. Companies that have tried to build their own silicon have found that chips without the full stack don’t solve the problem. Meanwhile, Nvidia is gaining market share, including pulling in Anthropic and Meta as Nvidia customers, and AWS just announced a million-chip order.

    Robotics: 3–5 Years to Everywhere

    Jensen’s robotics take was both bullish and grounded. America invented modern robotics, got too early, got exhausted, and quit just before the AI brain appeared that would make it work. That brain is here now. From the current “existence proof” stage to “reasonable products,” he sees 3–5 years. His aspiration: more than one robot per human on Earth. The use cases he described range from factory floor automation to virtual presence (using your home robot as an avatar while traveling), to lunar and Martian factories run entirely by robots with materials beamed back to Earth at near-zero energy cost.

    China’s position in robotics is formidable and can’t be wished away: they lead in micro-electronics, motors, rare earth elements, and magnets — all foundational to building robot hardware. The world’s robotics industry, including the US, will depend heavily on China’s supply chain for hardware components even if American software and AI lead.

    Revenue Forecasts: Dario Is Too Conservative

    When the hosts described Dario Amodei’s forecast of hundreds of billions in AI model/agent revenue by 2027–28 and $1 trillion by 2030, Jensen said simply: “Way better than that.” His reason: Dario hasn’t fully factored in that every enterprise software company will become a value-added reseller of AI tokens — OpenAI’s, Anthropic’s, whoever’s. The go-to-market expansion that comes from every SAP, Salesforce, and ServiceNow reselling AI is logarithmic, not linear.

    Healthcare: Near the Inflection Point

    Jensen named three layers of Nvidia’s healthcare involvement: (1) AI biology/physics — using AI to represent and predict biological behavior for drug discovery; (2) AI agents — agentic systems for diagnosis assistance, first-visit intake, and clinical decision support (he named Open Evidence and Hippocratic AI as leading examples); (3) Physical AI for healthcare — robotic surgery, AI-enabled instruments, and the vision of every hospital device (CT, ultrasound, surgical tools) becoming agentic. He sees digital biology as approaching its ChatGPT moment — the point where representing genes, proteins, and cells computationally becomes as natural and powerful as language modeling.

    Career Advice: Go Deep, Use AI

    Jensen closed with career guidance. His core advice: study deep science, deep math, and language — because language is now the programming language of AI. He made the counterintuitive claim that English majors may end up being the most successful professionals in the AI era because the ability to specify, guide, and evaluate AI outputs is an artform — and it’s not trivial. The person who knows how to give AI enough guidance without over-prescribing, who can recognize a great AI output from a mediocre one, and who can orchestrate teams of agents toward outcomes — that’s the most valuable skill.

    He used the radiologist story as his closing proof point: when computer vision was integrated into radiology, demand for radiologists went up, not down. The number of scans exploded, hospitals made more money, and more patients got diagnosed faster. AI didn’t replace radiologists — it made them bionic and made the whole system bigger. He expects the same pattern everywhere: every job will be transformed, some tasks will be eliminated, but the total pie grows dramatically.


    💭 Thoughts

    Jensen Huang is doing something rare among tech CEOs: he’s genuinely trying to build the mental model people need to understand what’s happening — not just sell products. The disaggregated inference argument, the three-computer framework, the OS analogy for OpenClaw, the token economics benchmark — these aren’t talking points. They’re conceptual tools for thinking clearly about a landscape most people are still squinting at.

    The most underappreciated part of the interview is the AI PR section. Jensen is essentially sounding an alarm without panicking: if America’s technology leaders keep scaring the public with AI doomerism, we will repeat the nuclear mistake. We’ll regulate ourselves into irrelevance while China builds the infrastructure we refused to build. The 17% approval number he cited should frighten every AI optimist in the room. Fear of a technology, once embedded culturally, is very hard to dislodge.

    The Anthropic critique was surgical. He didn’t name the specific controversy, didn’t pile on, and praised their technology extensively. But the message was clear: extreme safety warnings, even well-intentioned ones, carry real costs in the public square. That’s a genuinely hard tension for safety-focused AI companies, and there’s no clean answer — but Huang’s instinct that humility and circumspection serve better than catastrophism seems directionally correct.

    The physical AI thesis deserves more attention than it gets. Everyone is focused on the software intelligence race — OpenAI vs. Anthropic vs. Gemini. But Jensen is pointing at a $50 trillion industrial economy that AI has barely touched. Robotics, autonomous vehicles, agricultural automation, smart hospital instruments — this is where the real mass of economic value is locked. And Nvidia’s ten-year head start on the enabling infrastructure for physical AI may turn out to be more durable than any software moat.

    Finally: the robot optimism is infectious and probably correct. The world is genuinely short millions of workers. The enabling technology — AI brains good enough to drive perception, reasoning, and action in unstructured physical environments — just arrived. The hardware supply chain is largely intact. And the economic incentive to automate is stronger than it’s ever been. Three to five years feels aggressive. But so did “ChatGPT will change everything” in 2021.


  • Beyond the Bubble: Jensen Huang on the Future of AI, Robotics, and Global Tech Strategy in 2026

    In a wide-ranging discussion on the No Priors Podcast, NVIDIA Founder and CEO Jensen Huang reflects on the rapid evolution of artificial intelligence throughout 2025 and provides a strategic roadmap for 2026. From the debunking of the “AI Bubble” to the rise of physical robotics and the “ChatGPT moments” coming for digital biology, Huang offers a masterclass in how accelerated computing is reshaping the global economy.


    TL;DW (Too Long; Didn’t Watch)

    • The Core Shift: General-purpose computing (CPUs) has hit a wall; the world is moving permanently to accelerated computing.
    • The Jobs Narrative: AI automates tasks, not purposes. It is solving labor shortages in manufacturing and nursing rather than causing mass unemployment.
    • The 2026 Breakthrough: Digital biology and physical robotics are slated for their “ChatGPT moment” this year.
    • Geopolitics: A nuanced, constructive relationship with China is essential, and open source is the “innovation flywheel” that keeps the U.S. competitive.

    Key Takeaways

    • Scaling Laws & Reasoning: 2025 proved that scaling compute still translates directly to intelligence, specifically through massive improvements in reasoning, grounding, and the elimination of hallucinations.
    • The End of “God AI”: Huang dismisses the myth of a monolithic “God AI.” Instead, the future is a diverse ecosystem of specialized models for biology, physics, coding, and more.
    • Energy as Infrastructure: AI data centers are “AI Factories.” Without a massive expansion in energy (including natural gas and nuclear), the next industrial revolution cannot happen.
    • Tokenomics: The cost of AI inference dropped 100x in 2024 and could drop a billion times over the next decade, making intelligence a near-free commodity.
    • DeepSeek’s Impact: Open-source contributions from China, like DeepSeek, are significantly benefiting American startups and researchers, proving the value of a global open-source ecosystem.

    Detailed Summary

    The “Five-Layer Cake” of AI

    Huang explains AI not as a single app, but as a technology stack: EnergyChipsInfrastructureModelsApplications. He emphasizes that while the public focuses on chatbots, the real revolution is happening in “non-English” languages, such as the languages of proteins, chemicals, and physical movement.

    Task vs. Purpose: The Future of Labor

    Addressing the fear of job loss, Huang uses the “Radiologist Paradox.” While AI now powers nearly 100% of radiology applications, the number of radiologists has actually increased. Why? Because AI handles the task (scanning images), allowing the human to focus on the purpose (diagnosis and research). This same framework applies to software engineers: their purpose is solving problems, not just writing syntax.

    Robotics and Physical AI

    Huang is incredibly optimistic about robotics. He predicts a future where “everything that moves will be robotic.” By applying reasoning models to physical machines, we are moving from “digital rails” (pre-programmed paths) to autonomous agents that can navigate unknown environments. He foresees a trillion-dollar repair and maintenance industry emerging to support the billions of robots that will eventually inhabit our world.

    The “Bubble” Debate

    Is there an AI bubble? Huang argues “No.” He points to the desperate, unsatisfied demand for compute capacity across every industry. He notes that if chatbots disappeared tomorrow, NVIDIA would still thrive because the fundamental architecture of the world’s $100 trillion GDP is shifting from CPUs to GPUs to stay productive.


    Analysis & Thoughts

    Jensen Huang’s perspective is distinct because he views AI through the lens of industrial production. By calling data centers “factories” and tokens “output,” he strips away the “magic” of AI and reveals it as a standard industrial revolution—one that requires power, raw materials (data/chips), and specialized labor.

    His defense of Open Source is perhaps the most critical takeaway for policymakers. By arguing that open source prevents “suffocation” for startups and 100-year-old industrial companies, he positions transparency as a national security asset rather than a liability. As we head into 2026, the focus is clearly shifting from “Can the model talk?” to “Can the model build a protein or drive a truck?”

  • Jensen Huang on Joe Rogan: AI’s Future, Nuclear Energy, and NVIDIA’s Near-Death Origin Story

    In a landmark episode of the Joe Rogan Experience (JRE #2422), NVIDIA CEO Jensen Huang sat down for a rare, deep-dive conversation covering everything from the granular history of the GPU to the philosophical implications of artificial general intelligence. Huang, currently the longest-running tech CEO in the world, offered a fascinating look behind the curtain of the world’s most valuable company.

    For those who don’t have three hours to spare, we’ve compiled the “Too Long; Didn’t Watch” breakdown, key takeaways, and a detailed summary of this historic conversation.

    TL;DW (Too Long; Didn’t Watch)

    • The OpenAI Connection: Jensen personally delivered the first AI supercomputer (DGX-1) to Elon Musk and the OpenAI team in 2016, a pivotal moment that kickstarted the modern AI race.
    • The “Sega Moment”: NVIDIA almost went bankrupt in 1995. They were saved only because the CEO of Sega invested $5 million in them after Jensen admitted their technology was flawed and the contract needed to be broken.
    • Nuclear AI: Huang predicts that within the next decade, AI factories (data centers) will likely be powered by small, on-site nuclear reactors to handle immense energy demands.
    • Driven by Fear: Despite his success, Huang wakes up every morning with a “fear of failure” rather than a desire for success. He believes this anxiety is essential for survival in the tech industry.
    • The Immigrant Hustle: Huang’s childhood involved moving from Thailand to a reform school in rural Kentucky where he cleaned toilets and smoked cigarettes at age nine to fit in.

    Key Takeaways

    1. AI as a “Universal Function Approximator”

    Huang provided one of the most lucid non-technical explanations of deep learning to date. He described AI not just as a chatbot, but as a “universal function approximator.” While traditional software requires humans to write the function (input -> code -> output), AI flips this. You give it the input and the desired output, and the neural network figures out the function in the middle. This allows computers to solve problems for which humans cannot write the code, such as curing diseases or solving complex physics.

    2. The Future of Work and Energy

    The conversation touched heavily on resources. Huang noted that we are in a transition from “Moore’s Law” (doubling performance) to “Huang’s Law” (accelerated computing), where the cost of computing drops while energy efficiency skyrockets. However, the sheer scale of AI requires massive power. He envisions a future of “energy abundance” driven by nuclear power, which will support the massive “AI factories” of the future.

    3. Safety Through “Smartness”

    Addressing Rogan’s concerns about AI safety and rogue sentience, Huang argued that “smarter is safer.” He compared AI to cars: a 1,000-horsepower car is safer than a Model T because the technology is channeled into braking, handling, and safety systems. Similarly, future computing power will be channeled into “reflection” and “fact-checking” before an AI gives an answer, reducing hallucinations and danger.

    Detailed Summary

    The Origin of the AI Boom

    The interview began with a look back at the relationship between NVIDIA and Elon Musk. In 2016, NVIDIA spent billions developing the DGX-1 supercomputer. At the time, no one understood it or wanted to buy it—except Musk. Jensen personally delivered the first unit to a small office in San Francisco where the OpenAI team (including Ilya Sutskever) was working. That hardware trained the early models that eventually became ChatGPT.

    The “Struggle” and the Sega Pivot

    Perhaps the most compelling part of the interview was Huang’s recounting of NVIDIA’s early days. In 1995, NVIDIA was building 3D graphics chips using “forward texture mapping” and curved surfaces—a strategy that turned out to be technically wrong compared to the industry standard. Facing bankruptcy, Huang had to tell his only major partner, Sega, that NVIDIA could not complete their console contract.

    In a move that saved the company, the CEO of Sega, who liked Jensen personally, agreed to invest the remaining $5 million of their contract into NVIDIA anyway. Jensen used that money to pivot, buying an emulator to test a new chip architecture (RIVA 128) that eventually revolutionized PC gaming. Huang admits that without that act of kindness and luck, NVIDIA would not exist today.

    From Kentucky to Silicon Valley

    Huang shared his “American Dream” story. Born in Taiwan and raised in Thailand, his parents sent him and his brother to the U.S. for safety during civil unrest. Due to a misunderstanding, they were enrolled in the Oneida Baptist Institute in Kentucky, which turned out to be a reform school for troubled youth. Huang described a rough upbringing where he was the youngest student, his roommate was a 17-year-old recovering from a knife fight, and he was responsible for cleaning the dorm toilets. He credits these hardships with giving him a high tolerance for pain and suffering—traits he says are required for entrepreneurship.

    The Philosophy of Leadership

    When asked how he stays motivated as the head of a trillion-dollar company, Huang gave a surprising answer: “I have a greater drive from not wanting to fail than the drive of wanting to succeed.” He described living in a constant state of “low-grade anxiety” that the company is 30 days away from going out of business. This paranoia, he argues, keeps the company honest, grounded, and agile enough to “surf the waves” of technological chaos.

    Some Thoughts

    What stands out most in this interview is the lack of “tech messiah” complex often seen in Silicon Valley. Jensen Huang does not present himself as a visionary who saw it all coming. Instead, he presents himself as a survivor—someone who was wrong about technology multiple times, who was saved by the grace of a Japanese executive, and who lucked into the AI boom because researchers happened to buy NVIDIA gaming cards to train neural networks.

    This humility, combined with the technical depth of how NVIDIA is re-architecting the world’s computing infrastructure, makes this one of the most essential JRE episodes for understanding where the future is heading. It serves as a reminder that the “overnight success” of AI is actually the result of 30 years of near-failures, pivots, and relentless problem-solving.

  • All-In Podcast Breaks Down OpenAI’s Turbulent Week, the AI Arms Race, and Socialism’s Surge in America

    November 8, 2025

    In the latest episode of the All-In Podcast, aired on November 7, 2025, hosts Jason Calacanis, Chamath Palihapitiya, David Sacks, and guest Brad Gerstner (with David Friedberg absent) delivered a packed discussion on the tech world’s hottest topics. From OpenAI’s public relations mishaps and massive infrastructure bets to the intensifying U.S.-China AI rivalry, market volatility, and the surprising rise of socialism in U.S. politics, the episode painted a vivid picture of an industry at a crossroads. Here’s a deep dive into the key takeaways.

    OpenAI’s “Rough Week”: From Altman’s Feistiness to CFO’s Backstop Blunder

    The podcast kicked off with a spotlight on OpenAI, which has been under intense scrutiny following CEO Sam Altman’s appearance on the BG2 podcast. Gerstner, who hosts BG2, recounted asking Altman about OpenAI’s reported $13 billion in revenue juxtaposed against $1.4 trillion in spending commitments for data centers and infrastructure. Altman’s response—offering to find buyers for Gerstner’s shares if he was unhappy—went viral, sparking debates about OpenAI’s financial health and the broader AI “bubble.”

    Gerstner defended the question as “mundane” and fair, noting that Altman later clarified OpenAI’s revenue is growing steeply, projecting a $20 billion run rate by year’s end. Palihapitiya downplayed the market’s reaction, attributing stock dips in companies like Microsoft and Nvidia to natural “risk-off” cycles rather than OpenAI-specific drama. “Every now and then you have a bad day,” he said, suggesting Altman might regret his tone but emphasizing broader market dynamics.

    The conversation escalated with OpenAI CFO Sarah Friar’s Wall Street Journal comments hoping for a U.S. government “backstop” to finance infrastructure. This fueled bailout rumors, prompting Friar to clarify she meant public-private partnerships for industrial capacity, not direct aid. Sacks, recently appointed as the White House AI “czar,” emphatically stated, “There’s not going to be a federal bailout for AI.” He praised the sector’s competitiveness, noting rivals like Grok, Claude, and Gemini ensure no single player is “too big to fail.”

    The hosts debated OpenAI’s revenue model, with Calacanis highlighting its consumer-heavy focus (estimated 75% from subscriptions like ChatGPT Plus at $240/year) versus competitors like Anthropic’s API-driven enterprise approach. Gerstner expressed optimism in the “AI supercycle,” betting on long-term growth despite headwinds like free alternatives from Google and Apple.

    The AI Race: Jensen Huang’s Warning and the Call for Federal Unity

    Shifting gears, the panel addressed Nvidia CEO Jensen Huang’s stark prediction to the Financial Times: “China is going to win the AI race.” Huang cited U.S. regulatory hurdles and power constraints as key obstacles, contrasting with China’s centralized support for GPUs and data centers.

    Gerstner echoed Huang’s call for acceleration, praising federal efforts to clear regulatory barriers for power infrastructure. Palihapitiya warned of Chinese open-source models like Qwen gaining traction, as seen in products like Cursor 2.0. Sacks advocated for a federal AI framework to preempt a patchwork of state regulations, arguing blue states like California and New York could impose “ideological capture” via DEI mandates disguised as anti-discrimination rules. “We need federal preemption,” he urged, invoking the Commerce Clause to ensure a unified national market.

    Calacanis tied this to environmental successes like California’s emissions standards but cautioned against overregulation stifling innovation. The consensus: Without streamlined permitting and behind-the-meter power generation, the U.S. risks ceding ground to China.

    Market Woes: Consumer Cracks, Layoffs, and the AI Job Debate

    The discussion turned to broader economic signals, with Gerstner highlighting a “two-tier economy” where high-end consumers thrive while lower-income groups falter. Credit card delinquencies at 2009 levels, regional bank rollovers, and earnings beats tempered by cautious forecasts painted a picture of volatility. Palihapitiya attributed recent market dips to year-end rebalancing, not AI hype, predicting a “risk-on” rebound by February.

    A heated exchange ensued over layoffs and unemployment, particularly among 20-24-year-olds (at 9.2%). Calacanis attributed spikes to AI displacing entry-level white-collar jobs, citing startup trends and software deployments. Sacks countered with data showing stable white-collar employment percentages, calling AI blame “anecdotal” and suggesting factors like unemployable “woke” degrees or over-hiring during zero-interest-rate policies (ZIRP). Gerstner aligned with Sacks, noting companies’ shift to “flatter is faster” efficiency cultures, per Morgan Stanley analysis.

    Inflation ticking up to 3% was flagged as a barrier to rate cuts, with Calacanis criticizing the administration for downplaying it. Trump’s net approval rating has dipped to -13%, with 65% of Americans feeling he’s fallen short on middle-class issues. Palihapitiya called for domestic wins, like using trade deal funds (e.g., $3.2 trillion from Japan and allies) to boost earnings.

    Socialism’s Rise: Mamdani’s NYC Win and the Filibuster Nuclear Option

    The episode’s most provocative segment analyzed Democratic socialist Zohran Mamdani’s upset victory as New York City’s mayor-elect. Mamdani, promising rent freezes, free transit, and higher taxes on the rich (pushing rates to 54%), won narrowly at 50.4%. Calacanis noted polling showed strong support from young women and recent transplants, while native New Yorkers largely rejected him.

    Palihapitiya linked this to a “broken generational compact,” quoting Peter Thiel on student debt and housing unaffordability fueling anti-capitalist sentiment. He advocated reforming student loans via market pricing and even expressed newfound sympathy for forgiveness—if tied to systemic overhaul. Sacks warned of Democrats shifting left, with “centrist” figures like Joe Manchin and Kyrsten Sinema exiting, leaving energy with revolutionaries. He tied this to the ongoing government shutdown, blaming Democrats’ filibuster leverage and urging Republicans to eliminate it for a “nuclear option” to pass reforms.

    Gerstner, fresh from debating “ban the billionaires” at Stanford (where many students initially favored it), stressed Republicans must address affordability through policies like no taxes on tips or overtime. He predicted an A/B test: San Francisco’s centrist turnaround versus New York’s potential chaos under Mamdani.

    Holiday Cheer and Final Thoughts

    Amid the heavy topics, the hosts plugged their All-In Holiday Spectacular on December 6, promising comedy roasts by Kill Tony, poker, and open bar. Calacanis shared updates on his Founder University expansions to Saudi Arabia and Japan.

    Overall, the episode underscored optimism in AI’s transformative potential tempered by real-world challenges: financial scrutiny, geopolitical rivalry, economic inequality, and political polarization. As Gerstner put it, “Time is on your side if you’re betting over a five- to 10-year horizon.” With Trump’s mandate in play, the panel urged swift action to secure America’s edge—or risk socialism’s further ascent.