PJFP.com

Pursuit of Joy, Fulfillment, and Purpose

  • Michael Saylor on Strategy’s Bitcoin Playbook, the 11.5% Stretch Preferred Stock, Why Working Hard Is Bad Advice, and Bitcoin as Cyber Manhattan

    Michael Saylor, founder and executive chairman of Strategy (formerly MicroStrategy), sits down for Episode 172 of the When Shift Happens podcast for a wide-ranging, two-hour conversation on how a near-bankrupt enterprise software company became the world’s largest corporate holder of Bitcoin, why he calls his new preferred stock STRC “stretch” the most successful credit instrument in the world, and what 40 years of trial and error taught him about focus, leverage, time horizons, and the difference between working hard and working smart. This one is essential listening for anyone trying to understand Bitcoin as a protocol, Strategy as a capital markets machine, and what an “AI-pilled” 61-year-old founder actually does with his time.

    TLDW

    Saylor walks through his MIT-trained engineer’s framing of money as an adiabatic thermodynamic system, where the dollar loses roughly 7% of its energy per year, gold loses 2%, and Bitcoin loses zero, giving it an infinite half-life. He explains how COVID-era zero interest rates “rent controlled” the cash on Strategy’s balance sheet and forced him to search for a Facebook-of-money, leading to a $62 billion Bitcoin position across 818,000 coins. He details Strategy’s evolution from buying Bitcoin with cash, to convertibles, to senior bonds, to the equity ATM, to the new preferred stock family (Strike, Strife, Stride, and now Stretch), and argues that STRC is “rocket fuel kerosene” distilled from Bitcoin’s pure economic energy: an 11.5% monthly dividend, tax-deferred return of capital, designed to trade tightly around $100. He returns repeatedly to focus, the lesson he says he learned the hard way after spinning up alarm.com, voice.com, angel.com, and a half-dozen other ventures in his 30s. He argues working hard is now bad advice in an era where AI demonetizes labor, that volatility is vitality and the only honest time horizon is four to ten years, and that Bitcoin is to money what English is to language and Arabic numerals are to math: the protocol that won the network effect contest, and the place “all the money and power” now lives.

    Thoughts

    The most useful part of this conversation is not the Bitcoin maximalism, which is by now a fully formed Saylor genre. It is the brutal honesty about the decade he wasted launching alarm.com, voice.com, angel.com, michael.com, hope.com, and a half-dozen others while a billion-dollar MicroStrategy sat at the center of his portfolio asking for more attention. He admits the “imaginary future business is always more fun than struggling with the existing mature business,” which is one of the cleanest descriptions of founder ADHD I have read. The fact that someone at his level of intelligence and pattern recognition still required 20 years and a near-death experience to learn focus should make every operator under 40 reread that section twice. The single takeaway worth tattooing on a wall is his rule: “Just because you can do a thing doesn’t mean you should do a thing.”

    The engineering framing of money is the strongest intellectual move in the episode. Saylor is treating monetary supply expansion as energy loss in a thermodynamic system, with the dollar at a 10-year half-life, weak currencies at 3 to 5 years, gold at 36 years, and Bitcoin at infinity. Whether or not you accept the conclusion, the model is internally consistent in a way most macroeconomic arguments are not, and it gives him a vocabulary for talking about scarcity that economists trained on continuous-supply commodities literally do not have. The Max Planck quote he leans on, “science advances one funeral at a time,” is doing real work here. He is not saying he is smarter than the old guard. He is saying the old guard has no incentive to update because they already have money and power, and that the paradigm shift will be carried by the people with everything to gain. That is a more humble and more accurate version of the maximalist line.

    The Strategy capital markets machine is the part that deserves more scrutiny than it usually gets. The pitch for Stretch is genuinely interesting on its merits: a preferred stock that trades around $100, pays 11.5% monthly as return-of-capital dividends that defer all tax for roughly nine years, gets a step-up in basis on inheritance, and is positioned as a digital money market for people who believe in Bitcoin but do not want 40% volatility. If you take Saylor’s network-effect thesis seriously, this is the natural product to build, and his Standard Oil analogy (“distill the kerosene out of the crude oil”) is the right mental model. The risk that does not get discussed is what happens to the entire instrument family in a 99.8% drawdown of the kind he himself lived through with MicroStrategy in 2002. He waves it off by saying Strategy has 10x the enterprise value over the preferred, but in a deep enough Bitcoin winter, that cushion compresses fast. Worth holding both ideas at once: this is the most elegant Bitcoin-native fixed income product yet built, and it is still fundamentally a leveraged Bitcoin bet wearing a money-market mask.

    The “working hard is bad advice” thread is going to be the most controversial clip, and it is also the most important. Saylor is not saying do not work. He is saying do not be John Henry. Do not race the steam drill with a hammer. In a world where AI can translate, draft legal briefings, write books in 100 languages, and out-produce any individual professional by orders of magnitude, the marginal value of pure human labor is collapsing, and the right move is to ask “what tool can do this for me” before “how do I get better at this.” That is the same logic that took him from “I would have fired anyone who suggested Zoom in 2019” to running a global operation from a Florida studio. The unsubtle implication, especially for the 34-year-old host he is talking to, is that the 10,000-hour mastery model your parents grew up with is increasingly a status symbol with no underlying economics, like learning to compose Shakespearean sonnets in 2026.

    The single underrated line in the whole episode is “everything you own in the physical world you own at the pleasure of someone more powerful than you.” He is using it to make the Bitcoin self-custody case, but it generalizes to a much broader political and historical observation about property rights, minorities, and the steady drumbeat of expropriation events across 10,000 years of recorded history. Whether or not Bitcoin is the answer, the framing of “where do you store value such that nobody can decide to take it from you” is the right question to ask in the current decade, and most people are not asking it.

    Key Takeaways

    • Strategy now holds roughly 818,000 Bitcoin worth $62 billion, making it the world’s largest corporate Bitcoin holder and effectively a reserve bank built on a hard-capped digital monetary network.
    • Saylor’s working definition of an investor: anyone willing to hold a position for at least four years. Anyone with a shorter horizon is a trader, and most traders are fools who do not know they are fools.
    • His core advice to a 40-year-old Uber driver who cannot afford a house: own assets that appreciate faster than the 7% annual US dollar debasement rate. Anything slower means you are getting poorer in real terms while working harder every year.
    • The MIT-trained engineer’s framing of money: gold has a 36-year half-life because supply inflates ~2% a year, the dollar has a ~10-year half-life at ~7% debasement, weak currencies have 3 to 5-year half-lives, and Bitcoin’s half-life is infinite because supply growth is zero.
    • The 2020 pivot was forced, not chosen. When the Fed took rates to zero and signaled no hikes, Strategy’s $500 million in cash became, in Saylor’s metaphor, a rent-controlled building paying zero. They were forced to look for a way out and ended up at Bitcoin.
    • Saylor’s aha moment was recognizing Bitcoin as the only commodity in history with absolute scarcity. Gold inflates, silver inflates, even land can be reclaimed from the sea. Only Bitcoin’s 21 million cap is mathematically fixed.
    • The biggest lesson of his 30s and 40s: focus. He launched alarm.com, voice.com, angel.com, michael.com, hope.com, and several others while running MicroStrategy, and none of them matched the original. The line he leaves with is “just because you can do a thing doesn’t mean you should do a thing.”
    • By the time he was 55, he had been humbled enough to take someone else’s billion-dollar idea (Satoshi’s) instead of trying to generate his own.
    • Strategy’s evolution as an issuer: cash purchases, then convertibles, then senior bonds, then asset-backed loans (Silvergate failure ended that path), then the equity ATM, then the preferred-stock family Strike, Strife, Stride, and now Stretch.
    • Stretch (STRC) is a preferred stock targeted to trade around $100 with about 1 unit of volatility, paying 11.5% monthly as return-of-capital dividends, tax-deferred for roughly nine years until the basis is fully recovered.
    • STRC pairs with a step-up in basis on inheritance, meaning heirs can receive another nine years of tax-deferred dividends on top of what the original holder collected, an arrangement neither bonds nor most preferred stocks allow.
    • Strategy can create roughly 10 to 20 cents of digital credit per dollar of Bitcoin held, which positions a trillion dollars of future Bitcoin holdings to support $200 to $400 billion of credit instruments.
    • The addressable market for STRC-style instruments, in Saylor’s framing, is the roughly $300 trillion global credit market currently delivering about 350 basis points after tax. A product offering three times that yield is targeting trillions of dollars of demand.
    • Standard Oil analogy: Rockefeller called his company “Standard” because impure kerosene blew up engines and houses. Strategy is in the business of distilling pure financial instruments out of the raw economic energy of Bitcoin, the way refineries distill kerosene from crude.
    • Four-letter NASDAQ ticker discipline. Saylor specifically chose STRC over MSTR.P because retail can search, remember, and trade four-letter symbols on Robinhood and Schwab. About 80% of STRC is held by retail.
    • Bitcoin as a lifeboat thesis: in any country with a collapsing currency (Argentina, Brazil, most of Africa, historical Germany), no physical asset is safe because property is held at the pleasure of whoever has power. Bitcoin allows wealth to cross borders inside someone’s head.
    • The Saylor leverage example: a 2.5% mortgage in 2021 plus a 40% appreciating asset is a roughly 37.5% net spread on borrowed money, equivalent to a real after-tax salary of several hundred thousand dollars in a high-tax city, earned for nothing more than paperwork.
    • Volatility is the feature, not the bug. Bitcoin reacts in real time to events in every country, every hour, which is why 500 million people care about it and almost nobody cares about the value of Tokyo imperial real estate.
    • Saylor’s litmus test for trading: if you would not hold it for ten years, you should not hold it for ten minutes. Anything less than a four-year horizon means you are doing entertainment, not investing.
    • He spends “thousands of hours a year” thinking about Bitcoin’s first, second, third, and fourth-order effects, and runs a stochastic risk model that updates every 15 seconds, refusing to diversify because adding silver, gold, or real estate would shatter the model.
    • Bitcoin as protocol: the same network-effect logic that made English the default global language, Arabic numerals the default math, TCP/IP the default networking protocol, and the shipping container the default freight format. Once a protocol locks in, only an asteroid-strike-level event can dislodge it.
    • “There is no second best language” is the analogy he keeps returning to. Bitcoin is to money what English is to communication. Wishing it were Swahili or Esperanto does not change where the wealth concentrates.
    • The Newtonian network effect: when Rupert Murdoch joins Facebook he brings 50 friends. When he joins Bitcoin he brings $50 million. Monetary networks compound faster than social networks because billionaires bring billions.
    • “Don’t sell the thing that will make your children’s children wealthy” is the operating heuristic. He uses the great-great-grandfather analogy: if your ancestor sold Bitcoin to buy velvet for a horse-and-buggy, you would not be rich today.
    • Working hard is not the path. The forklift outperforms the strongest worker with a shovel. John Henry beat the steel drill once and his heart burst doing it.
    • AI is now the forklift for white-collar work. Saylor uses it to draft 25-page legal briefings, translate content into 100 languages, and avoid going back to law school. “It would take 10 years and a million dollars to do what the AI does in two minutes.”
    • Personal communication leverage: a single Lex Fridman appearance has reached more than 11 million views, more people than a 30-year teaching career could reach.
    • Saylor was inspired into engineering as a child by Robert Heinlein’s “Have Space Suit, Will Travel,” in which the hero saves Earth and is rewarded with a full scholarship to MIT. The same Heinlein-Asimov-Clarke pipeline shaped Elon Musk and Jeff Bezos.
    • His mother imprinted on him that he was expected to do great things while he was a 9-year-old paper boy in Dayton, Ohio. He credits the combination of literature plus maternal expectation with his early ambition.
    • He calls himself a late bloomer and “the Colonel Sanders of crypto,” noting that more interesting things have happened in the last 12 months of his career than in the entire previous 35 years.
    • Strategy’s succession is already in motion. CEO Phong Le, Andrew Kang, and CJ are running operational layers, and Saylor expects Strategy to outlast him the way Lloyd’s of London has outlasted its founders by hundreds of years.
    • The Bitcoin price path he is willing to articulate publicly: “We’ll buy it at 100,000, we’ll buy it at 200,000. We’ll buy it at 500,000, we’ll buy it at a million, 2 million, 4 million, 8 million.” The business is “drive Bitcoin to millions of dollars.”
    • He survived a 99.8% drawdown in MicroStrategy from $333 to $0.42 between 2000 and 2002, three days from bankruptcy. He says current Bitcoin volatility does not feel like stress by comparison.
    • He has no children, is not married, and describes himself as singularly married to the business, which he expects to keep doing as long as the civilization needs the money fixed.

    Detailed Summary

    Who Saylor is and why MicroStrategy became Strategy

    Saylor grew up in an Air Force family, lived on bases across Japan, New Zealand, Nebraska, Florida, and Ohio, and won a US Air Force scholarship to MIT, where he studied aerospace engineering and the history of science. He founded MicroStrategy at 24, took it public on the NASDAQ in 1998, and built it into a billion-dollar business intelligence company with about 2,000 employees. By 2020 the business was being slowly crushed by Microsoft Power BI, and lockdowns plus zero interest rates eliminated the natural return on the company’s $500 million cash position. The frustration drove Strategy into Bitcoin: $250 million, then another $250 million, then a billion, then two, until the company became the world’s largest corporate holder with ~$62 billion across 818,000 coins.

    The hard-earned lesson of focus

    Saylor’s defining career mistake was the period between his mid-30s and mid-40s when he launched ten other businesses on the side of MicroStrategy: alarm.com (now a public multi-billion-dollar company spun off), angel.com (sold for about $110 million), voice.com (sold for about $30 million), and several others including michael.com, frank.com, emma.com, hope.com, and usher.com. He concedes that almost none of these matched the original, that the imaginary future business is always more fun than the mature one, and that he wishes he had instead poured 150% of his energy into MicroStrategy. The crystallized lesson, repeated several times: “Just because you can do a thing doesn’t mean you should do a thing.”

    Money as a thermodynamic system

    The intellectual core of the conversation is Saylor’s framing of money as energy in an adiabatic system. Gold inflates ~2% a year and therefore has a 36-year half-life. The dollar debases at ~7% a year and has roughly a 10-year half-life. Weaker currencies have half-lives of 3 to 5 years. Bitcoin’s hard cap of 21 million coins means zero supply inflation, which produces an infinite half-life. He learned thermodynamics designing aircraft wings at MIT and applies the same closed-system logic to money: any system with energy lapse cannot be a long-term store of value, and Bitcoin is the first asset in human history with no lapse.

    Bitcoin as a global lifeboat

    For people in collapsing currency regimes, Saylor argues no domestic instrument holds value. Argentinian and Brazilian hyperinflations destroy 99.9% of purchasing power on familiar cycles. German marks were used in wheelbarrows to buy soap. Buying local real estate, bonds, or currency in those environments is useless because the underlying economy decays around them. The only escape historically has been gold or paintings, which then need to be smuggled across borders. Bitcoin solves the same problem digitally: it crosses borders inside someone’s head via private keys, and it cannot be expropriated by whoever currently holds power. Saylor’s broader point, “everything you own in the physical world you own at the pleasure of someone more powerful than you,” is the philosophical anchor of the entire Bitcoin maximalist case.

    Strategy’s capital markets evolution

    Strategy has run through every available capital structure to keep buying Bitcoin: cash, tender offers, equity issuance, convertible bonds (where Strategy became the largest issuer in the world), senior bonds (abandoned because covenants choked growth), asset-backed loans (Silvergate’s failure ended that channel), the equity ATM, and finally the preferred-stock family. Strike, Strife, Stride, and Stretch were each iterations toward what Saylor calls “the perfect credit instrument,” refined the way Standard Oil refined crude into kerosene. Stretch (STRC) is the current state of the art: a preferred stock targeted to $100, with about 1 unit of volatility, paying 11.5% monthly as return-of-capital dividends that defer all tax for roughly nine years.

    Why STRC matters as a product

    Saylor argues STRC is the first credit instrument that lets a retiree treat a Bitcoin-backed yield as a money-market alternative. The mechanics: a $100 share generates roughly $10/year in monthly dividends, each of which reduces the cost basis instead of triggering current income tax. After about nine years, basis is exhausted and the instrument becomes a qualified-dividend security taxed at long-term capital gains rates. On inheritance, the heir receives a step-up in basis to $100, and another nine-year cycle of tax-deferred dividends restarts. Eighty percent of the issue is held by retail through Robinhood and Schwab, and the company actively manages the price by issuing or buying back to hold the $100 anchor. The mission for the rest of the decade, Saylor says, is to scale this to $200, then $400, then $600, then $800 billion in outstanding credit, with Bitcoin as the underlying capital base.

    Working smart, not hard, in the age of AI

    Saylor’s most pointed advice to younger founders and operators is that hard work is becoming a low-return strategy. AI now drafts legal briefings, translates content into 100 languages, writes books, and outperforms most professional output by orders of magnitude. The 10,000-hour mastery model that built his generation’s careers, he says, will not produce equivalent results in the next one. The right move is to ask what tool can do the thing for you before asking how to do the thing yourself. He uses himself as the example: working 70 hours a week for ten years built MicroStrategy, but it felt easy compared to MIT, and the leverage from AI plus podcasts plus digital distribution lets him now reach more people in a single sitting than a 30-year teaching career could reach.

    Volatility, time horizon, and the trader-versus-investor split

    Saylor refuses to be rattled by short-term Bitcoin moves and uses his 99.8% MicroStrategy drawdown in 2002 as a baseline for what real volatility feels like. He argues that Bitcoin’s price swings are evidence of its utility: it is the only globally-tradable asset where a regulatory rumor in China at 2am can move price in real time, which is why it absorbs all attention. His rules are blunt: an investor holds for at least four years (40% volatility, 40% expected return for Bitcoin), the right indicator is the 200-week moving average, and the Buffett rule “if you would not hold it for ten years you should not hold it for ten minutes” still applies. Everything shorter is trading, which is fine if you are an expert, foolish if you are not.

    Bitcoin as protocol, not as bet

    The closing intellectual frame is that Bitcoin won the network-effect competition the same way English won language, Arabic numerals won math, TCP/IP won networking, and the standard shipping container won freight. None of these became dominant because they were objectively perfect. They became dominant because critical mass locked in, the wealthy and powerful coordinated around them, and any alternative now has to dislodge a $1.5 trillion incumbent. The protocols that win do so because “people aren’t stupid” and a billion small coordination decisions converge on the same standard. Bitcoin, on this read, is not an investment to be allocated against silver or real estate. It is the digital capital protocol that the rest of the financial world is going to be denominated in, and choosing not to participate is closer to refusing to learn English than it is to skipping a stock pick.

    Notable Quotes

    “Just because you can do a thing doesn’t mean you should do a thing.”

    Michael Saylor, distilling 20 years of side-business mistakes into one line

    “Bitcoin is a lifeboat tossed on a stormy sea, offering hope to anyone in the world that needs to get off their sinking ship.”

    Saylor’s framing of Bitcoin as a solution for collapsing-currency regimes

    “There is no second best crypto asset. There’s only one crypto asset and that’s Bitcoin. Human civilization settles on protocols.”

    The closing thesis of the conversation, in Saylor’s own words

    “Don’t sell the thing that will make your children’s children wealthy.”

    Saylor on holding Bitcoin through volatility and selling something else instead

    “Everything you own in the physical world you own at the pleasure of someone more powerful than you.”

    Saylor on why physical assets are not real property rights

    “Volatility is vitality. Bitcoin’s volatile because it’s useful.”

    Saylor reframing Bitcoin price swings as a feature

    “Don’t try to outwork a forklift.”

    Saylor on why “work harder” is increasingly bad advice in the AI era

    “I’m like the Colonel Sanders of crypto. But it’s okay. At least I found a mission at some point in my life.”

    Saylor on being a late bloomer at 55

    “Bitcoin is cyber Manhattan. A thousand years from now, your children’s children’s great-great-great 10x grandchildren will be rich, if you kept it.”

    Saylor on Bitcoin as multi-generational real estate

    “The world doesn’t care whether I’m a good manager of a hundred different things. The world wants me to be the best manager of one thing.”

    Saylor on focus as the only durable professional posture

    Watch the full conversation here: When Shift Happens E172: Michael Saylor on How To Get Rich With Crypto (Without Working Hard).

    Related Reading

  • Mohnish Pabrai on How to Invest in 2026: The Ten Commandments of Investing, Charlie Munger Lessons, Cloning, Turkey Warehouses, Constellation Software, and Why Less Than 1% of Stock Pickers Beat the Market

    Mohnish Pabrai sat down with Shaan Puri to lay out exactly how he thinks about investing in 2026, walking through the ten commandments that have shaped a 27 year track record where every dollar invested in his oldest fund turned into roughly thirty. Watch the full conversation on YouTube here. Pabrai manages over a billion dollars, was close friends with Charlie Munger, has had lunch with Warren Buffett for 650,000 dollars, and has produced multiple 100 bagger investments in his career. This conversation is a complete operating manual for value investors, deep value hunters, and anyone trying to figure out how to compound capital in a market where the S&P trades at elevated valuations and AI capex is rewriting the rules.

    TLDW

    Pabrai argues that under one percent of stock pickers are actually good investors, that the game is a wealth transfer from the active to the inactive, and that temperament beats IQ every time. He walks through his core mental models: watching paint dry, the mistress versus the wife, introducing randomness into your life, cloning instead of inventing, taking a simple idea and taking it seriously, the too hard pile, no called strikes, the salmon spear, the inner scorecard, and don’t die at 25 and get buried at 75. He shares the full story of his 100 bagger Turkish warehouse company Reysas, his coal bets, his Constellation Software thesis around Mark Leonard, and why he is bearish on the S&P 500, bullish on pickaxe makers like TSMC and ASML conceptually but unwilling to pay current prices, and why GLP-1 drugs and Bitcoin both sit in his too hard pile. He retells Warren Buffett’s American Express salad oil crisis trade, the lesson Buffett delivered about Rick Guerin and leverage, the inner versus outer scorecard, and the Ed Thorp blackjack to Ken Griffin to Citadel chain. The closing punchline is that the most important investment any person can make is leading an aligned life, getting your music out, and discovering your calling before the wilderness years pile up.

    Key Takeaways

    • Well under one percent of Americans picking individual stocks are actually good at it. Index funds put you ahead of more than ninety percent of the crowd with zero effort.
    • The single biggest mistake smart investors make is impatience. Temperament, not IQ, decides outcomes.
    • Watching paint dry is the core skill. After making an investment, nothing may happen for three to five years. That is the nature of the game.
    • The mistress is always hotter than the wife. The stock you do not own looks more exciting than the one you do own because you do not know its flaws. The bar for swapping must be extremely high.
    • Raise your standards across the board: the investments you make, the people you hang out with, the relationships you keep. Buffett’s gravitational pull rule applies to both portfolios and friendships.
    • Introduce randomness into your life. Pabrai picking up a Peter Lynch book at Heathrow in 1994 led him to Buffett, Berkshire, Charlie Munger, bridge games, and his entire investing career.
    • Cloning works because almost no one will do it. Sam Walton copied everything. Walmart came from Kmart, Sam’s Club came from Sol Price’s Price Club, Burger King located across from McDonald’s instead of running their own site selection.
    • Elon Musk’s idiot index, calculating raw material cost on the London Metals Exchange and refusing to pay more than a small multiple over it, is the kind of simple framework no competitor will adopt even though it is publicly visible.
    • Take a simple idea and take it seriously. None of the other mental models work unless you commit fully to the first one.
    • The too hard pile is the most important box on a value investor’s desk. Buffett claims ninety eight percent of businesses belong there. Investing has no called strikes, so passing on ten thousand pitches before swinging is the right behavior.
    • The whale is swimming all the time, you only see it when it surfaces. Real investor activity is reading and studying, not trading.
    • Buffett at twelve gathered discarded racetrack tickets at Ax-Sar-Ben, found winners drunks had thrown away, and had his Aunt Alice cash them. He carried that pattern of finding anomalies into the Moody’s manuals in his twenties and into the Japan Company Handbook for two decades before pulling the trigger on the five Japanese trading companies.
    • The Japanese trading company trade was financed at half a percent in yen, the companies paid eight to nine percent dividends that later doubled, and Berkshire’s five billion has roughly doubled with almost no risk attached.
    • The American Express salad oil crisis taught Buffett to test the moat in the real world. He stood next to restaurant cash registers in Omaha, saw zero hesitation about accepting the card, and put forty percent of his fund into AMX.
    • The Buffett lunch lesson Pabrai still carries: a slightly above average investor who spends less than they earn and does not use leverage cannot help but get rich over a lifetime. Rick Guerin lost his Berkshire shares to margin calls in the 1973 to 1974 crash. Buffett bought them at forty dollars each, currently worth over seven hundred thousand.
    • Inner scorecard versus outer scorecard is the most fundamental life model. Buffett’s frame: would you rather be the greatest lover in the world and known as the worst, or the worst lover known as the greatest.
    • The Turkish stock market cycles through its float every seventeen days. Pure speculation. Indian quality companies trade at stratospheric valuations. The two together created a poker table Pabrai could sit at alone.
    • Reysas, the Istanbul warehouse operator, was bought at roughly three percent of liquidation value with a fifteen to sixteen million dollar market cap on eight hundred million in assets. It is now approaching a 100 bagger in dollars.
    • Pabrai’s thermonuclear event mental model: if ninety nine percent of humans were wiped out, someone would still produce Coke concentrate, because people will always trade fifteen minutes of labor for a Coke. Cement, paint, land, and steel are inflation indexed.
    • TAV Airports earned revenue in euros, paid costs in collapsing lira, traded at three to four times earnings on the Istanbul exchange. A natural monopoly hiding inside a panicked market.
    • The stock market is a church with a casino attached. Robinhood, prediction markets, zero day options, and two day options all increase the wealth transfer from the active to the inactive. Pabrai welcomes more casino activity because it helps his side.
    • On Polymarket, roughly one tenth of one percent of users capture sixty percent of profits. Two thousand traders made half a billion dollars in a year. The casual gambler funds the sharp.
    • On AI: invest in pickaxe makers. The alphabets and metas are playing a high capex game they have never played before. TSMC, ASML, and Micron are toll bridges. Pabrai is not making the bet because it sits between too hard, too expensive, and outside his circle of competence.
    • Constellation Software is Pabrai’s vertical SaaS bet because Mark Leonard built a mousetrap nobody else will clone. They acquire roughly two hundred small vertical market software companies a year, in delegated fashion, at five to six times cash flow that quickly becomes three to four times after revenue and license fee tweaks.
    • The market is wrong about AI killing software. Coding is one fifth of the pie. Adobe is not going out of business because someone can vibe code a Photoshop alternative. Incumbents reduce cost via automation while keeping cash flows intact.
    • Pabrai is bearish on the S&P 500. He agrees with Howard Marks that when the index trades at twenty three times earnings, the historical forward ten year return has bounced between minus two and plus two percent.
    • GLP-1 drugs sit in the too hard pile because industries with rapid change are the enemy of the investor. Ozempic to Mounjaro to upcoming tablets is too much turnover for valuations that already price in success.
    • Bitcoin sits in the too hard pile. Pabrai prefers gold and asks why a society that already has gold needs Bitcoin.
    • The four percent rule of compounding: roughly four percent of stocks have delivered the entire return of the US market over ninety years. Twelve investments built Berkshire across sixty years.
    • Investing rewards aging. Unlike basketball, the game gets easier with experience. Pattern recognition, expanded circle of competence, and the option to ride winners all compound.
    • Circle the wagons around your winners. Not selling Coke, not selling Apple, not firing Ajit Jain. The success of Berkshire was about not interfering with the four percent that worked.
    • Charlie Munger made an investment six days before he died at age ninety nine point nine. He invested like he was twenty five. Ben Franklin’s line: many people die at twenty five and are buried at seventy five.
    • Don’t save sex for old age. Don’t delay starting your real life until after the McKinsey rotation. Buffett’s frame transfers to careers too.
    • Ed Thorp wrote Beat the Dealer after the mob threatened him with a baseball bat for cleaning out single deck blackjack in Vegas. He then cracked options pricing before Black Scholes, ran Princeton Newport Partners, and became an early backer of Ken Griffin’s Citadel out of a Harvard dorm room.
    • Ken Griffin once told a Harvard recruit he wanted to quit at ten million dollars: please reject our offer, we do not want someone who dies at twenty five.
    • Lead an aligned life. Personality is largely baked by age five. The window to specialize is age eleven to twenty, which is exactly when the school system forces you to be a jack of all trades.
    • Get your music out. Every person has something specific they are meant to bring into the world. A misaligned life is the highest cost most people pay.

    Detailed Summary

    Why fewer than one percent of stock pickers are actually good

    Pabrai opens with a brutal estimate: well under one percent of the Americans who pick individual stocks are good at it. The game, he says, is a mechanism for transferring wealth from the active to the inactive. The good news is that index funds let anyone capture market returns with zero analytical work and end up ahead of more than ninety percent of active stock pickers. The implication is that anyone choosing to pick individual stocks is voluntarily entering a competition where the base rate of success is below ten percent, and the differentiator is almost never intelligence. It is temperament.

    That temperament shows up as patience. After making an investment, three to five years can go by with nothing happening. Sometimes the investment is a mistake and the patience converts into the discipline to reverse course. But on the whole, the less activity an investor takes, the better the outcomes. The first commandment is to enjoy watching paint dry.

    The mistress is always hotter than the wife

    The investments you already own are the wife. You see every flaw because you live with them every day. The investments you do not own are the mistress. Glamorous, unknown, exciting precisely because the temperament and the warts have not been revealed. Guy Spier, Pabrai’s longtime friend, deliberately stays reluctant to act on his portfolio. The point is not that you never act. The point is that the bar for action needs to be extremely high, and you have to learn to be comfortable passing on everything below that bar. Pabrai extends this directly to life: raise your standards about the people you spend time with, the projects you take on, and the investments you select.

    Introduce randomness into your life

    Charlie Munger told Pabrai over and over to introduce randomness into his life. The example Pabrai uses is his own origin story. He was an engineer running an IT company in 1994, sitting in Heathrow with his wife, looking for something to read on a flight. He picked up Peter Lynch’s One Up On Wall Street, finished it, picked up Beating the Street, finished that, encountered Buffett through a mention in Lynch, found the first two Buffett biographies fresh off the press, dove into the Berkshire and partnership letters, and within three years started attending the Omaha annual meeting. Every flight to Omaha on a Friday in May has both seatmates pre filtered for above average humans, all going for the same reason. The randomness exploded outward into Charlie Munger, the bridge games, Charlie’s friends, and decades of compounding social and intellectual capital.

    Shaan tells the parallel story of his own randomness bet. He flew to FarmCon in Kansas City for no particular reason, met newsletter operator Kevin Van Trump, cloned the model, launched The Milk Road for crypto, built the largest crypto newsletter in the world inside a year, and sold it for millions with one employee. Two mental models stacked: introduce randomness, then clone.

    Cloning is the cheat code no one will use

    Sam Walton freely admitted he had no original ideas. He walked into more competitor retail stores than any human in history. He looked at Sol Price’s Price Club, said no brainer, and opened Sam’s Club. He took his managers into a competitor and when they complained the store was a mess, he pointed at the one good candle display and said you can learn from anyone. He bought donuts at five thirty in the morning for distribution center drivers because they had ground level intel on every store. Walmart’s market cap dwarfs that of every competitor combined, and every system came from somewhere else.

    Tesla, SpaceX, and the Boring Company exist because Elon Musk applies the idiot index. He asks what raw materials go into a part, looks up the price on the London Metals Exchange, and refuses to pay a multiple over it without a fight. None of his competitors think this way even though he has written about it publicly and Walter Isaacson devoted a book to it. SpaceX intentionally blows up rockets to learn. Blue Origin tries hard not to blow up rockets. SpaceX is miles ahead. Burger King famously assigned two guys to track McDonald’s site selection and just put a store across the street. The reason cloning works so well is that almost no one is willing to do it.

    Take a simple idea and take it seriously

    This is the bedrock model that makes every other model work. Without total commitment to one simple idea, the rest of the mental models stay theoretical. Pabrai went to Turkey on a hunch in 2018 because the market screened cheap. He discovered that Turkish public companies cycle through their float every seventeen days, meaning every shareholder turns over more than twenty times a year. Compared with Berkshire, whose float may take a decade to rotate, Turkey is a hyperactive day trader’s casino. India, by contrast, has roughly one hundred to one hundred fifty quality companies, all picked over and priced at stratospheric multiples. The Turkish market gave Pabrai a poker table he could sit at almost alone. He chose to be an inch wide and a mile deep.

    Circle of competence and the too hard pile

    Pabrai’s eighth commandment is thou shalt not use Excel, and his ninth is that if you cannot explain an investment thesis to a ten year old in about four sentences, it is a pass. Investing is journalism more than spreadsheet work. Buffett stood at restaurant cash registers in Omaha during the salad oil crisis to test whether AMX’s brand had cracked. He walked into Snow White with his briefcase to study Disney. Peter Lynch told amateurs to make a list of every brand they consume because that is the most authentic intel they have. Buffett keeps a too hard box on his desk and claims ninety eight percent of businesses go in it. Investing has no called strikes, which means an investor can let ten thousand pitches go by and only swing at the fattest center cut pitch.

    The whale is swimming all the time. You only see it when it surfaces. Buffett at age twelve gathered discarded racetrack tickets at Ax-Sar-Ben to find ones drunks had thrown away, then had his aunt Alice cash them because he was underage. In his twenties he flipped through Moody’s manuals page by page looking to be hit in the head with a two by four. Western Insurance at fifteen dollars made twenty five dollars a share and had forty dollars of cash on the balance sheet. He has been flipping through the Japan Company Handbook for at least twenty years before pulling the trigger on the five Japanese trading companies, financing the entire five billion in yen at half a percent against eight to nine percent dividend yields that have since doubled. Berkshire’s five billion is now ten billion paying eight hundred million a year, essentially risk free.

    The 650,000 dollar lunch and what Buffett actually said

    Pabrai paid 650,000 dollars to have lunch with Warren Buffett in 2007 because his net worth had hit eighty four million dollars almost entirely from intellectual property he had taken from Buffett for free. He wanted to look him in the eye and thank him. Buffett’s stance on the lunches was that whoever paid should feel like they got a bargain, so he came prepared having studied biographies of every guest. Pabrai asked an innocent update question about Rick Guerin, the third partner of Buffett and Munger in the sixties and early seventies who then disappeared. Buffett’s answer became the lesson of the lunch. Charlie and I knew we were going to be rich, but we were not in a hurry. Rick was in a hurry. He was always levered. The 1973 to 1974 crash, the slowest motion crash in modern history, gave him margin calls. Buffett bought Rick’s Berkshire shares for forty dollars each. They are over seven hundred thousand now. The lesson: if you are even a slightly above average investor and you spend less than you earn and you do not use leverage, you cannot help but get rich over a lifetime.

    The other lunch lesson Pabrai still cites is the inner scorecard. Buffett’s framing: would you rather be the greatest lover in the world and known as the worst, or the worst lover known as the greatest. The answer determines whether you can resist external stimuli and stay centered. The way Pabrai practices it is by remembering that even Gandhi has critics. If Gandhi is fair game, so is anyone else with a public footprint.

    The Turkey trade and the thermonuclear event mental model

    The headline Turkey investment is Reysas, an Istanbul warehouse operator Pabrai started buying when the market cap was fifteen to sixteen million dollars on eight hundred million dollars of assets, roughly three percent of liquidation value. He told the broker to take out every ask up to the ten percent daily price limit. Templeton Fund called offering five percent of the company for a million dollars and Pabrai said why are you even calling, just take it. Templeton was exiting Turkey because of currency instability and inflation, both of which Pabrai considered irrelevant for the specific kinds of assets he was buying.

    The mental model that unlocked Turkey was the thermonuclear event scenario he discussed with Charlie Munger. If ninety nine percent of humans were wiped out, someone would still produce Coke concentrate and rebuild a bottling plant. The remaining seventy million people will trade fifteen minutes of labor for a Coke regardless of currency or exchange rate. A warehouse is land, paint, cement, and steel. All four are inflation indexed. Whatever happens to the lira, those assets do not care. When the lira collapsed ninety percent against the dollar in seven years, Reysas went up roughly ninety times in dollars and effectively to infinity in lira. He applied the same logic to TAV Airports, which collected revenue in euros while paying costs in collapsing lira. A natural monopoly trading at three to four times earnings on a panicked exchange.

    The casino, prediction markets, and Polymarket

    Buffett’s line at the most recent Berkshire meeting is that the stock market is a church with a casino attached, and the casino is getting crowded. Robinhood, two day options, leverage, and prediction markets like Polymarket all funnel casual gamblers into a transfer game where the sharps already know the prices. Pabrai notes that on Polymarket, roughly one tenth of one percent of users capture sixty percent of profits, and two thousand traders made half a billion dollars in a year. The horse track keeps twenty one percent of every dollar, Vegas keeps two to four percent on a great blackjack game, and yet some players still make a living off horse racing by spotting odds that make no sense. The casino activity is bad for society and great for any investor patient enough to wait for the obvious mispricing.

    AI, pickaxe makers, and the too hard pile

    On AI, Pabrai says invest in pickaxe makers. The alphabets and metas are playing a high capex game they have never played before, which is a recipe for surprise. The capex must pass through TSMC, ASML, and probably Micron. But all of those toll bridges are either too expensive, outside Pabrai’s circle of competence, or in his too hard pile. He is not making the bet. There is no scenario where he sells his Turkish warehouses to buy TSMC. The mistress, in this case, looks uglier than the wife and there are no bonus points for clever valuation work.

    Constellation Software, Mark Leonard, and vertical SaaS

    Where the market gets AI wrong is the assumption that AI coding kills software. Coding is at most one fifth of a software business. The market assumes Adobe is dead because someone can vibe code Photoshop. Pabrai disagrees. Incumbents will reduce costs through automation, keep cash flows intact, and possibly cut prices without losing margin. He invested in Constellation Software specifically because Mark Leonard has built a mousetrap nobody else will clone. Constellation’s M&A team touches seventy to one hundred thousand private vertical market software companies twice a year by phone and twice by email. They acquired roughly two hundred companies last year alone, never using bankers. They pay five to six times cash flow, then bump revenue and license fees about twenty percent, and the effective purchase price drops to three or four times within a year or two. The model is delegated, with deal authority pushed out to teams that do not need headquarters approval up to a threshold. They buy and hold, which scares away private equity that wants to flip. The universe of vertical SaaS targets is too small for private equity to bother with and big enough to keep Constellation compounding for decades. Mark Leonard is the kind of unicorn operator who does not appear twice in a generation.

    S&P bearish, GLP-1 too hard, Bitcoin too hard

    Pabrai is bearish on the S&P 500 because at roughly twenty three times earnings, historical forward ten year returns have ranged between minus two and plus two percent. Howard Marks’s analysis matches his own. GLP-1 drugs like Ozempic and Mounjaro are generating roughly seventy nine billion in revenue annually, more than the entire AI economy, but Pabrai puts them in the too hard pile because industries with rapid change are the enemy of the investor. Ozempic to Mounjaro to upcoming oral tablets is too many turns of the wheel. Bitcoin sits in the same too hard pile. Pabrai prefers gold and asks why a society that already has gold needs Bitcoin, which is widely used by scammers and ransomware operators.

    The four percent rule and circling the wagons

    Over the past ninety years, roughly four percent of US stocks have delivered the entire market return. The other ninety six percent have treaded water. Buffett himself has made three to four hundred investments and only twelve of them built Berkshire Hathaway. Index funds work because they are too dumb to sell Nvidia and too dumb to sell TSMC. Active investors and portfolio managers second guess winners and trim them. The most important investing discipline is circling the wagons around winners: not selling Coke, not selling Apple, not firing Ajit Jain. Capitalism is brutal and most businesses go to zero eventually. The thin slice of enduring moats, like FICO, McDonald’s brand, prime Istanbul warehouses, airport monopolies, and Coke bottlers, are what compound for decades. Pabrai’s bets on coal, airports, warehouses, and Constellation do not all need to work. If half work, the portfolio is a home run. If forty percent work, still a home run. Investing is a forgiving game.

    Ed Thorp, Ken Griffin, and the chain of investing genius

    Ed Thorp used MIT’s mainframe in the early sixties to crack single deck blackjack with basic strategy and card counting. He cleaned out mob run Vegas casinos until they showed him a baseball bat. To get back at them he wrote Beat the Dealer, which sold millions of copies and forced the industry to introduce multi deck shoes and rule changes. He then cracked options pricing before Black-Scholes and skipped the Nobel Prize to make money on it through Princeton Newport Partners, compounding at twenty five to thirty percent a year with no down years. He met a young Ken Griffin running Citadel out of a Harvard dorm and not only handed over algorithms but became an early backer. Pabrai’s first meeting with Thorp happened in a racquetball locker room while Pabrai was completely naked, copy of The Wall Street Journal next to him. Thorp introduced himself, Pabrai’s excitement overcame his sense of decorum, and they have been friends ever since. The Ken Griffin lore extends to his recruiting filter: a Harvard recruit who said he would quit at ten million dollars was told to please reject the offer, because Citadel does not want people who die at twenty five.

    Don’t die at twenty five and get buried at seventy five

    Ben Franklin’s line that many people die at twenty five and get buried at seventy five becomes Pabrai’s closing frame. Charlie Munger made an investment six days before he died at age ninety nine point nine. He invested like he was twenty five. The whole point of life is to keep growing, keep learning, keep finding the alignment between who you are inside and how you show up in the world. Personality is largely baked by age five, and after twelve the most a parent can really influence is the peer group. The window to specialize runs from age eleven to twenty, which is precisely when most educational systems force kids to be jacks of all trades. Bill Gates slipped out of his house to code through the night and accumulated ten to twenty thousand hours by his early twenties. Buffett picked stocks at eleven. Michelangelo sculpted at ten.

    The most important thing Pabrai wants viewers to take from the conversation is that an aligned life is more important than a great investment record. Get your music out. Find what energizes you. If you do not know your calling, work with a thoughtful industrial psychologist like Jack Keene or pay attention to which activities and people genuinely energize you and which drain you. Pabrai himself wandered the wilderness until his mid thirties when he finally understood his own calling. Buffett’s frame applies: do not save sex for old age and do not save your real work for after the McKinsey rotation.

    Thoughts

    The most useful thing Pabrai does in this conversation is collapse the gap between life philosophy and portfolio construction. Most investing content treats temperament as a soft skill on the side of the spreadsheet. Pabrai puts it where it belongs, at the center. The reason the four percent rule matters is not statistical, it is psychological. Almost everyone can identify the few enduring compounders. Almost nobody can sit on them for forty years without selling, trimming, switching to a hotter mistress, or breaking discipline on a leverage call. The actual edge in public markets is not analytical, it is the willingness to be inactive in the face of constant pressure to act.

    The Turkey trade is the most instructive case study in the whole conversation because it is genuinely replicable. Not the specific market, but the architecture. Pabrai stacked four simple mental models on a single trade: take a simple idea seriously, identify a market where the float churns so fast that price has no relationship to value, isolate assets whose intrinsic worth is currency independent, and run the thermonuclear event sanity check on the underlying demand. The result was a 100 bagger held in roughly the worst macro environment of his investing career. The lesson is not to go to Istanbul. It is that real edge tends to come from combining three or four boring frameworks at the same time, in a place where nobody else is bothering to combine them.

    The Constellation Software section deserves more attention than it gets in most investor decks. Pabrai is making a clean bet that vertical SaaS is misread by the market because of generic AI fear. He is probably right. Coding is a labor input to software, not the moat. Switching costs, regulatory tangles, integration depth, and decades of accumulated workflow customization are what keep customers paying. Mark Leonard has industrialized the act of acquiring those moats two hundred times a year. If the DNA holds after Mark eventually steps back, the math is hard to beat. The asymmetric risk is leadership transition, not technological disruption.

    The AI commentary is more interesting for what Pabrai refuses to do than for what he says. He acknowledges the pickaxe makers thesis, names the toll bridges, and then explicitly declines to make the bet because the valuations are too high and the path forward is genuinely uncertain. That is the discipline of the too hard pile in action. Plenty of investors right now are putting money to work in TSMC and ASML at multiples that bake in success scenarios, telling themselves they have done the homework. Pabrai’s position is that even when you are largely right about a trend, paying any price for it is a mistake. The structural humility there is the actual lesson.

    The aligned life closing argument hits hardest because it reframes the entire conversation. The ten commandments of investing are a subset of a broader operating system: figure out who you are by age twenty if you can, raise your standards on the people you spend time with, do not borrow against tomorrow, do not chase the mistress, and do not save your real ambitions for old age. Investing is just the highest leverage application of those rules. The viewers who walk away with one usable change probably should not be picking new stocks. They should be auditing whether they are leading the life that fits.

    Watch the full conversation with Mohnish Pabrai and Shaan Puri on YouTube here.

  • SpaceX S-1 IPO Filing Breakdown, Ticker SPCX on Nasdaq and Nasdaq Texas, xAI Integration, Musk’s Trillion Share Mars Pay Plan, $18.7B Revenue, and the 100 Gigawatt Orbital AI Compute Bet

    Space Exploration Technologies Corp. filed its S-1 registration statement with the SEC on May 20, 2026, kicking off the largest and weirdest IPO in modern capital markets history. The 280-page preliminary prospectus proposes to list Class A common stock on both the Nasdaq Stock Market and the new Nasdaq Texas exchange under the ticker SPCX, bundles xAI into SpaceX as a third reportable segment via a February 2026 reorganization under common control, and asks public investors to underwrite a $28.5 trillion total addressable market that explicitly includes asteroid mining, lunar manufacturing, Mars passenger transport, and 100 gigawatts per year of orbital AI compute on solar-powered satellites. The filing reports $18.67 billion of 2025 revenue and a $4.94 billion net loss, with a Q1 2026 net loss of $4.28 billion driven almost entirely by the AI segment’s $7.7 billion of quarterly capex.

    TLDR

    SpaceX is going public on Nasdaq and Nasdaq Texas as SPCX, led by Goldman Sachs, Morgan Stanley, BofA Securities, Citigroup, and J.P. Morgan. The company has been reincorporated in Texas, headquartered at Starbase, structured as a perpetual dual-class controlled company with Class B shares carrying 10 votes each and electing a majority of the board, and post-merger contains three segments: Space (Falcon, Dragon, Starship), Connectivity (Starlink with 10.3 million subscribers across 164 countries and roughly 9,600 satellites in orbit), and AI (the former xAI, including the Colossus and Colossus II superclusters in Memphis totaling about 1.0 gigawatt of nameplate compute, Grok, and the X platform with 550 million MAUs). Revenue grew from $10.4 billion in 2023 to $14.0 billion in 2024 to $18.7 billion in 2025, with Connectivity contributing $11.4 billion at a 63% segment Adjusted EBITDA margin and the new AI segment burning $1.2 billion of segment Adjusted EBITDA in 2025 while spending $12.7 billion of capex. Elon Musk holds an unspecified majority of the voting power, has a base salary of $54,080 unchanged since 2019, no key-person life insurance, and was granted in January and March 2026 a combined roughly 1.3 billion performance-restricted Class B shares that vest against market-cap milestones from $500 billion up to $7.5 trillion, with the highest tranches contingent on building a permanent Mars colony of one million inhabitants and on deploying non-Earth data centers delivering 100 terawatts of compute per year. The prospectus discloses Anthropic’s $1.25 billion per month compute deal through May 2029, a $60 billion option to acquire Cursor (Anysphere) with a $10 billion combined break fee, the Terafab one-terawatt-per-year chip JV with Tesla and Intel, the $19.6 billion EchoStar spectrum acquisition, a $20 billion SpaceX Bridge Loan, a $5 billion amended revolver, a Houston-exclusive Texas Business Court forum clause with ICC arbitration fallback, and several uniquely SpaceX risk factors including third-party Musk conduct triggering foreign asset seizures, anti-satellite weapons, cascading cyber-induced orbital debris events, and Grok’s named “Spicy” Imagine Mode and “Unhinged” Voice Mode.

    Key Takeaways

    • Ticker SPCX, dual listed on Nasdaq and Nasdaq Texas, Class A par $0.001, joint lead bookrunners Goldman Sachs, Morgan Stanley, BofA Securities, Citigroup, and J.P. Morgan, with a 22-firm syndicate including Barclays, Deutsche Bank, RBC, UBS, Wells Fargo, Allen & Company, Cantor, Needham, Raymond James, Societe Generale, Stifel, William Blair, BTG Pactual, ING, Macquarie, Mirae Asset, Mizuho, and Santander.
    • Headquartered at 1 Rocket Road, Starbase, Texas. Reincorporated from Delaware to Texas on February 14, 2024. Five-for-one forward stock split executed May 4, 2026. All share data in the filing is post-split.
    • Perpetual dual-class structure with no sunset. Class A carries 1 vote per share, Class B carries 10 votes per share, Class C carries no votes (and has been eliminated via the Class C Reclassification). Class B converts to Class A only on a non-permitted transfer.
    • Class B holders elect a majority of the board (the Class B Directors), as long as any Class B shares remain outstanding. Removing Musk from CEO or Chairman requires a separate Class B majority vote. SpaceX will be a Nasdaq controlled company and will rely on the exemptions, meaning no requirement for fully independent compensation or nominating committees.
    • Consolidated revenue: $10.39 billion in 2023, $14.02 billion in 2024, $18.67 billion in 2025, and $4.69 billion in Q1 2026 (up 15.4% year over year). Financials are retrospectively recast to combine xAI and X Holdings since both transactions were between entities under Musk’s common control.
    • Net income (loss): $(4.63) billion in 2023, $0.79 billion in 2024, $(4.94) billion in 2025, and $(4.28) billion in Q1 2026. Accumulated deficit pro forma $41.31 billion as of March 31, 2026.
    • Connectivity (Starlink) is the cash engine. 2025 revenue $11.39 billion, up 49.8%. 2025 operating income $4.42 billion, up 120.4%. 2025 segment Adjusted EBITDA $7.17 billion, up 86.2%. Consumer subscriptions are more than 60% of Connectivity revenue.
    • Starlink subscribers: 2.3 million at year-end 2023, 4.4 million at year-end 2024, 8.9 million at year-end 2025, and 10.3 million as of March 31, 2026. Roughly 9,600 broadband and mobile satellites in low Earth orbit, about 75% of all active maneuverable satellites globally. Available in 164 countries and territories.
    • Starlink ARPU is declining as the mix shifts international and lower priced: $99 monthly in 2023, $91 in 2024, $81 in 2025, $66 in Q1 2026. Management says this is expected to continue.
    • Starlink direct to cell now has roughly 650 V1 Mobile satellites and 7.4 million monthly unique devices across about 30 countries, with partnerships across roughly 30 mobile network operators including T-Mobile, Rogers, KDDI, Optus, Telstra, One NZ, Kyivstar, VMO2, Salt, and Entel. V3 satellites begin deploying in the second half of 2026, designed for 1 Tbps downlink per satellite with up to 60 per Starship launch (a 20x payload-capacity step over Falcon 9).
    • Space segment now generates lower revenue growth because Starlink dedicated launches are not booked as inter-segment revenue. Space revenue: $3.56 billion (2023), $3.80 billion (2024), $4.09 billion (2025). Falcon launches in 2025: 165 total, 43 third-party customer and 122 internal Starlink. Mass to orbit: 1,210 metric tons (2023), 1,699 (2024), 2,213 (2025). SpaceX has now launched more than 80% of the world’s mass to orbit since 2023.
    • Falcon 9 has flown roughly 620 missions with greater than 99% mission success. A single booster has been reflown 34 times. Falcon Heavy is 11-for-11 since 2018 and certified for NSSL. SpaceX flew 11 of 12 NSSL medium and heavy lift missions in 2025.
    • Starship has completed 11 flight tests and is preparing the 12th, debuting next-generation Starship, Super Heavy, and Raptor 3 from a new Starbase pad. V3 is designed for 100 metric tons fully reusable to LEO, V4 targets 200 tons. Cumulative Starship R&D investment is greater than $15 billion, including $3.00 billion in 2025 alone. Operational payload delivery to orbit is expected in the second half of 2026.
    • Dragon has flown 78 crewmembers from 20 countries since 2020 and Cargo Dragon remains the only spacecraft capable of returning meaningful mass from the ISS.
    • AI segment, the absorbed xAI business plus X, generated $818 million Q1 2026 revenue but operating losses of $(2.47) billion and segment Adjusted EBITDA of $(609) million. AI capex was $7.72 billion in Q1 2026 alone, dwarfing Space ($1.05 billion) and Connectivity ($1.33 billion).
    • Colossus and Colossus II in Memphis and Southaven Mississippi together provide about 1.0 gigawatt of nameplate compute draw. Colossus came online in 122 days with about 100,000 H100s. Colossus II added 110,000 GB200s in 91 days and 110,000 GB300s in 64 days. Next phase: another 220,000 GB300s and 400 megawatts. Industry benchmark for a 100 megawatt greenfield datacenter is two years.
    • Grok and X together have 1.3 billion supported accounts on a trailing basis, about 550 million MAUs, roughly 117 million MAUs using Grok AI features, and roughly 350 million daily posts. Imagine generates about 10 billion images and 2 billion videos per month. Paid subscribers totaled 6.3 million as of March 31, 2026 (4.4 million X Premium variants plus 1.9 million SuperGrok variants).
    • Disclosed Anthropic cloud services agreements signed May 2026: Anthropic pays $1.25 billion per month for compute capacity on Colossus and Colossus II through May 2029, ramping in May and June 2026, with 90-day termination by either party.
    • Cursor (Anysphere) compute agreement and acquisition option signed April 2026: SpaceX has the right but not the obligation to acquire Cursor at an implied $60.0 billion equity value, paid in Class A stock priced off the SPCX VWAP. SpaceX-side termination or breach triggers a $1.5 billion termination fee plus an $8.5 billion deferred services fee.
    • Terafab JV with Tesla, announced March 2026, joined by Intel in April 2026, targets one terawatt per year of compute hardware production. The filing explicitly notes that neither Tesla nor Intel is obligated to remain, and definitive agreements may not be signed.
    • Macrohard, in development with Tesla, is described as a platform designed to fully emulate digital workflows, augment human computer operation, and create a fully AI-operated software company.
    • EchoStar Spectrum Transaction (AWS-3, AWS-4, H-block, 65 megahertz US plus global MSS) was FCC-approved May 12, 2026. Total deal value $19.6 billion, including roughly $11.1 billion of equity (261.8 million Class A shares at an implied $42.40) and up to $8.5 billion of debt assumption. Closing expected around November 30, 2027.
    • Balance sheet as of March 31, 2026: cash and equivalents $15.85 billion, short-term marketable securities $7.82 billion, total assets $102.09 billion, total liabilities $60.51 billion, total debt principal $29.13 billion. The $20 billion SpaceX Bridge Loan (Goldman Sachs Bank USA as administrative agent, March 2026) refinanced legacy X and xAI debt and must be repaid within six months of IPO. The amended SpaceX Credit Facility, also May 2026, was upsized to $5.0 billion and extended to May 19, 2031.
    • Use of proceeds: expansion of AI compute infrastructure, enhancements to launch infrastructure and launch vehicles, increases in satellite constellation scale and capacity, and general corporate purposes. No dividends are anticipated and the credit agreements restrict them.
    • Total addressable market estimate of $28.5 trillion (ex-China and Russia): Space $370 billion, Connectivity $1.6 trillion ($870 billion broadband and $740 billion mobile), and AI $26.5 trillion ($2.4 trillion infrastructure, $760 billion consumer subscriptions, $600 billion digital advertising, and $22.7 trillion enterprise applications).
    • Stated future markets explicitly listed in the prospectus: point-to-point Earth transport via Starship, space tourism, in-orbit manufacturing including pharmaceuticals and materials, passenger and cargo to Moon and Mars, lunar mining of rare materials, lunar mass driver, lunar factories building AI compute satellites, asteroid mining, and orbital solar-powered AI. The headline aspirational target is 100 gigawatts per year of orbital AI compute on solar-powered satellites in Sun-synchronous orbit, with first deployments targeted as early as 2028.
    • Musk 2025 total compensation $54,080 (base salary unchanged since 2019, tied historically to California’s exempt-employee minimum). No bonus, no stock or option awards reported for 2025. SpaceX maintains no key-person life insurance on Musk.
    • January 13, 2026 Musk grant: 1 billion performance-based restricted Class B shares across 15 equal tranches tied to market-cap milestones from $500 billion to $7.5 trillion (in $500 billion increments), with at least one tranche additionally gated on “a permanent human colony on Mars with at least one million inhabitants” and on continued employment.
    • March 23, 2026 Musk replacement award (assumed from xAI): 302,072,285 performance-based restricted Class B shares across 12 tranches from $1.065 trillion to $6.565 trillion market cap, additionally requiring completion of “non-Earth-based data centers capable of delivering 100 terawatts of compute per year.” Replaces an earlier xAI award after Musk had already earned and canceled 25,172,695 Class A shares at the first milestone.
    • Gwynne Shotwell 2025 total compensation $85.81 million, primarily option awards. Bret Johnsen (CFO) 2025 total compensation $9.84 million. Non-employee directors received zero cash and zero equity for 2025 service.
    • Board of 8 post-IPO: Musk (Chairman, CEO, CTO), Shotwell (President, COO), Antonio Gracias (Valor Management), Ira Ehrenpreis (DBL Partners and Tesla), Randy Glein (DFJ Growth, audit chair), Donald Harrison (Google), Steve Jurvetson (Future Ventures), and Luke Nosek (Gigafund and Founders Fund). Class B Directors: Musk, Shotwell, Gracias, Harrison, Nosek. Common Stock Directors: Ehrenpreis, Glein, Jurvetson.
    • Lock-up is 180 days for company, directors, and officers, but Musk and certain significant investors are subject to an extended 366-day lock-up, and 100% of Musk’s shares are explicitly not subject to early-release tiers. A Directed Share Program with Schwab, Fidelity, Robinhood, SoFi, and E*TRADE handles retail allocation; DSP shares have no lock-up.
    • Corporate Opportunities waiver in the charter renounces interest in business opportunities presented to directors, officers, board observers, and their affiliates. Musk and his affiliates are explicitly not restricted from competing with SpaceX. This carve-out covers Tesla, Neuralink, The Boring Company, and any future Musk venture.
    • Exclusive forum is the Texas Business Court, Eleventh Division, in Houston, including for federal securities claims. If unenforceable, the fallback is mandatory ICC arbitration in Houston under Expedited Procedure Rules. Jury trial is waived. Class actions are prohibited.
    • Texas Business Organizations Code carve-outs: Section 21.419 codifies a statutory business-judgment-rule presumption, Section 21.552 requires 3% minimum ownership to bring derivative proceedings, and Section 21.373 (2025) requires 3% ownership for six months plus solicitation of 67% of voting power for shareholder proposals (SpaceX concedes enforceability is “expected” to be challenged).
    • Unprecedented risk-factor disclosure: in August 2024 Brazil’s Supreme Court froze Starlink’s Brazilian assets over the conduct of X “when X was not owned by us and was only affiliated with Mr. Musk.” SpaceX warns that third-party Musk conduct may continue to trigger foreign retaliation against SpaceX.
    • Risk language names Grok’s “Spicy” Imagine Mode and “Unhinged” Voice Mode as carrying heightened risks of explicit content, misinformation, and “potential nonconsensual or exploitative imagery.” A putative class action over content “representing children in sexualized contexts” is disclosed, as is an Irish DPC GDPR inquiry into Grok and an FTC inquiry into chatbots as companions for children and teens.
    • The S-1 uses the term “Department of War” (not Defense) for the federal customer requiring CMMC compliance and discloses that anti-satellite weapons have been publicly discussed by foreign governments as a tool against the Starlink constellation. A cyberattack-induced cascading Kessler-style debris event is cited as a possibility.
    • Workforce of more than 22,000 full-time employees globally, with no collective bargaining and engineering acceptance rate under 2% in 2025.
    • Operating asset footprint: Starbase (Texas, HQ, Starship), Hawthorne (California, Falcon, Dragon, Merlin and Raptor), McGregor (Texas, engine testing), Redmond (Washington, Starlink satellite production at about 70 per week), Bastrop (Texas, terminal production at tens of thousands per day, doubling in 2026 to include AI compute satellites), Kennedy and Cape Canaveral (Florida, LC-39A, SLC-40, SLC-37 in build for Starship), Vandenberg (California, SLC-4 polar launches), Memphis and Southaven (Tennessee and Mississippi, Colossus data centers), Palo Alto (California, xAI HQ), more than 400 Starlink ground stations globally, and three autonomous spaceport drone ships including “Of Course I Still Love You,” “Just Read the Instructions,” and “A Shortfall of Gravitas.”
    • Related party transactions of note: roughly $20.2 billion of equipment lease undiscounted payments to Valor (Gracias) entities guaranteed by SpaceX; aircraft, security, and tunnel-construction payments to Musk affiliates; xAI subsidiary leases real property from Musk Industries LLC.
    • Pampena v. Musk: an April 3, 2026 partial judgment in the Northern District of California, where a jury found Musk personally violated Section 10(b) and Rule 10b-5 on two May 2022 statements regarding his Twitter purchase. Post-trial motions are pending. The 2018 SEC “funding secured” settlement is also disclosed.
    • Critical accounting policy quirks: flight vehicles are depreciated over expected average number of flights rather than time. Starship costs are expensed to R&D until commercialization, then capitalized. Starlink dedicated launch costs are capitalized into Connectivity PP&E rather than booked as inter-segment Space revenue, which mechanically suppresses the headline Space growth rate.
    • The One Big Beautiful Bill Act (Public Law 119-21) reversed a $659 million U.S. R&D credit deferred tax asset recognized in 2024, driving the 2025 income tax provision of $718 million versus a $549 million benefit in 2024.
    • Pre-IPO ownership pro forma at March 31, 2026: Class A 6,824,581,339 shares and Class B 5,695,729,430 shares outstanding, for a combined 12.52 billion shares before primary issuance. Class C and the redeemable convertible preferred are converted/reclassified at close.
    • Authorized capitalization post-IPO: 36.13 billion Class A, 6.13 billion Class B, 10.0 billion Class C (none issued), and 2.4 billion preferred (none issued). Headroom for future issuance is enormous.
    • Five-for-one stock split executed May 4, 2026 to set the IPO share count and round-lot price. Price range, share count, and proceeds are bracketed in this preliminary filing and will be updated before launch.

    Detailed Summary

    A different kind of S-1 from the start

    Most S-1 filings open with corporate prose and a careful, neutral business description. SpaceX opens with an Elon Musk epigraph about wanting to wake up in the morning and “think the future is going to be great,” a mission statement that says the company exists “to make life multiplanetary, to understand the true nature of the universe, and to extend the light of consciousness to the stars,” and a Kardashev Type II framing that treats the next century of capital allocation as a civilizational project. Investors are being told, in legally binding language, that single-planet existence is “a single point of failure” and that the company is hedging against humans sharing the fate of the dinosaurs. The filing dual-lists SPCX on Nasdaq in New York and Nasdaq Texas in Dallas, picks the new Texas Business Court in Houston as exclusive forum, and reincorporates from Delaware to Texas. Every macro signal is set deliberately.

    Three segments after the xAI absorption

    The most consequential mechanical change in the S-1 is the retrospective recast of financial statements to combine xAI Holdings and X Holdings into SpaceX. Both transactions are accounted for as reorganizations of entities under common control (Musk’s), so prior-period revenue, opex, and capex move into the SpaceX line items rather than appearing as acquired-business additions. This is what produces the headline numbers: $10.4 billion (2023), $14.0 billion (2024), $18.7 billion (2025). The Space segment includes Falcon, Dragon, and Starship. Connectivity is Starlink in all its consumer, enterprise, government, and mobile forms plus the Starshield military variant. AI is the former xAI in full: Colossus and Colossus II superclusters, Grok, the X platform, and the Imagine media products. The recast also explains why net income flips so violently year to year. 2024’s $791 million net income reflects a quieter pre-merger SpaceX. 2025’s $4.94 billion net loss and Q1 2026’s $4.28 billion loss reflect the integrated AI business burning capital at unprecedented rate.

    Connectivity is the cash engine

    Starlink is the only segment that looks like a normal high-margin growth business. Revenue rose 96.4% in 2024 and another 49.8% in 2025 to $11.39 billion. Operating income tripled in 2024 and then doubled again in 2025 to $4.42 billion. Segment Adjusted EBITDA in 2025 was $7.17 billion, an EBITDA margin north of 60%. Subscribers grew from 2.3 million to 10.3 million in twenty-seven months. The constellation is now roughly 9,600 satellites, about 75% of all active maneuverable satellites on orbit. Inter-satellite laser links exceed 23,000, forming a mesh that delivers 700+ Tbps of cumulative downlink. ARPU is declining steadily, from $99 monthly in 2023 to $66 in Q1 2026, but management frames this as deliberate international mix shift toward lower priced plans and notes that direct-to-cell is just beginning to monetize. Roughly 650 V1 Mobile satellites already provide service to 7.4 million monthly unique devices through partnerships with roughly 30 mobile network operators. The EchoStar spectrum acquisition adds 65 megahertz in the US plus global MSS spectrum to support V2 Mobile broadband and 5G IoT starting in 2027.

    Space economics are obscured by accounting

    The Space segment looks small in the headline financials ($4.09 billion of 2025 revenue, an operating loss of $657 million) until you understand the accounting. Starlink launches are capitalized into Connectivity PP&E rather than booked as inter-segment Space revenue. That single policy is why 2025 Space revenue grew only 7.6% even though SpaceX flew 170 missions, of which 122 were internal Starlink. The actual operating reality is that SpaceX flew more than 80% of the world’s mass to orbit in 2025, owns 24 flight-proven reusable Falcon 9 boosters certified for 40 flights each, has refln a single booster 34 times, and has invested more than $15 billion in Starship to date. Starship’s eleventh flight test is on the books, the twelfth will debut the next-generation vehicle and Raptor 3 engine, and operational payload delivery to orbit is targeted for the second half of 2026. V3 Starship is designed to deliver 100 tons to LEO fully reusable and to carry up to 60 V3 Starlink satellites per launch, a 20x payload step over Falcon 9. The Starship cost target is a 99% reduction against the historical $18,500 per kilogram average, on the way to “airline-like” reflight cadence.

    AI is a money furnace with a thesis

    The AI segment is brand new to the SpaceX line item set and dominates the loss line. AI generated $3.20 billion of 2025 revenue (up 22.2%) but lost $6.36 billion at the operating line, much of it driven by GPU depreciation. AI capex was $12.73 billion in 2025 and another $7.72 billion in Q1 2026 alone. Colossus came online in 122 days with about 100,000 H100s and 130 megawatts. Colossus II followed with 110,000 GB200s in 91 days and 110,000 GB300s in 64 days, with another 220,000 GB300s and 400 megawatts in the next phase. The two superclusters now draw about one gigawatt combined. Grok-5 is training on Colossus II, targeting multi-trillion parameters. The X platform contributes 550 million MAUs and roughly 350 million daily posts to the segment, with 117 million MAUs touching Grok AI features. The thesis the prospectus is pitching is vertical integration on physics: SpaceX controls power generation (data center turbines and, eventually, orbital solar), launch (Starship to lift orbital compute satellites), satellite manufacturing (Redmond and Bastrop), chip supply (Terafab JV with Tesla and Intel for one terawatt per year of compute hardware), and the application layer (Grok and X). Management calls this “shovels-to-tokens” and argues no other AI company has this much control over the physical stack.

    The Anthropic, Cursor, and Terafab carve-outs

    Three subsequent events disclosed in the S-1 reframe SpaceX as a cloud and software platform as much as a hardware company. Anthropic signed cloud services agreements in May 2026 to pay $1.25 billion per month for Colossus and Colossus II capacity through May 2029, ramping in May and June 2026. The Cursor (Anysphere) agreement signed April 2026 includes both a compute commitment and an option for SpaceX to acquire the company at a $60 billion implied equity value, with a $1.5 billion termination fee and an $8.5 billion deferred services fee if SpaceX breaches or terminates. Terafab is a manufacturing JV with Tesla, joined by Intel in April 2026, with a stated one terawatt per year compute hardware production target. The prospectus is explicit that Tesla and Intel are not obligated to remain in Terafab and that no definitive agreements may be signed. Anthropic, the leading commercial competitor to OpenAI, is now SpaceX’s largest disclosed cloud customer.

    The Musk pay package

    The CEO compensation disclosure is the most aggressive in S-1 history. Musk’s reported 2025 total compensation was $54,080, a base salary unchanged since 2019. SpaceX maintains no key-person life insurance on him. Then on January 13, 2026 the board granted him one billion performance-based restricted Class B shares, vesting across fifteen equal tranches as market capitalization milestones are achieved at $500 billion increments from $500 billion all the way to $7.5 trillion, with at least one tranche additionally conditioned on the existence of a permanent human Mars colony of at least one million inhabitants and on continued employment. On March 23, 2026 the board granted an additional 302.07 million performance-based restricted Class B shares across twelve tranches from $1.065 trillion to $6.565 trillion of market cap, additionally requiring the completion of “non-Earth-based data centers capable of delivering 100 terawatts of compute per year.” This second grant replaces an earlier xAI award after Musk had already earned 25.17 million Class A shares at the first xAI milestone, which were then canceled and rolled in. The combined package is roughly 1.3 billion restricted Class B shares, dwarfing the Tesla 2018 award that previously held the record. Other executive comp is more conventional. Gwynne Shotwell’s 2025 total was $85.81 million, primarily option awards. Bret Johnsen, CFO, received $9.84 million. Non-employee directors received zero cash and zero equity for 2025 service.

    Governance built to be Musk-proof in one direction only

    SpaceX takes the dual-class playbook further than any prior tech IPO. Class B carries 10 votes per share, has no sunset, and elects a majority of the board as a separate class. Removing Musk from CEO or Chairman requires a separate Class B majority vote, and Musk holds the majority of Class B. The charter renounces interest in business opportunities presented to Musk and his affiliates, explicitly preserving his right to run competing ventures (Tesla, Neuralink, The Boring Company, anything next). The company opts into the Texas Business Organizations Code’s Section 21.419 business-judgment-rule presumption, requires 3% ownership to bring a derivative suit, requires 3% ownership for six months plus solicitation of 67% of voting power to bring shareholder proposals under Section 21.373 (a provision SpaceX itself concedes will likely be challenged in court), picks the Texas Business Court in Houston as exclusive forum even for federal securities claims, and falls back to mandatory ICC arbitration in Houston with Expedited Procedure Rules if forum exclusivity is struck down. Jury trials are waived. Class actions are prohibited. SpaceX will be a controlled company and will rely on Nasdaq exemptions from independent committee requirements. Musk and certain significant investors are subject to a 366-day lock-up rather than the standard 180 days, and 100% of Musk’s shares are excluded from the early-release tiers other holders enjoy.

    Risk factors disclose things no S-1 has disclosed before

    The Risk Factors section contains language no prior S-1 has used. SpaceX warns that “actions and statements of Mr. Musk and his affiliated ventures, whether or not directly relating to us, may draw significant public attention and scrutiny” and notes that in August 2024 the Brazilian Supreme Court froze Starlink’s Brazilian assets over the conduct of X “when X was not owned by us and was only affiliated with Mr. Musk.” That is the precedent: a foreign government seized SpaceX assets over Musk’s separate business conduct. The filing names Grok’s “Spicy” Imagine Mode and “Unhinged” Voice Mode as carrying heightened risks of explicit content and “potential nonconsensual or exploitative imagery,” discloses a putative class action over content “representing children in sexualized contexts,” an Irish DPC GDPR inquiry into Grok’s processing of EU children’s data, and an FTC inquiry into chatbots as companions for children and teens. The orbital risk language describes a cyberattack-triggered cascading Kessler-style debris event that could render SpaceX-licensed orbits “unusable for an extended period,” notes that “certain foreign governments have publicly discussed the potential use of anti-satellite weapons against the Starlink constellation,” and acknowledges that the FAA does not currently permit return-to-launch-site reentries for Starship and the company will require a waiver “which is not guaranteed.” The filing also uses “Department of War” rather than “Department of Defense” when discussing CMMC compliance for federal customers, reflecting the recent rebranding.

    Capital position and the bridge loan time bomb

    The balance sheet is large but the debt structure tells a story about why an IPO is urgent now. SpaceX has $15.85 billion of cash and $7.82 billion of short-term marketable securities against total debt principal of $29.13 billion. The largest piece is the $20 billion SpaceX Bridge Loan signed March 2026 with Goldman Sachs Bank USA as administrative agent, used to refinance legacy X and xAI debt (including X B-1, X B-3, and xAI 12.5% Senior Secured Notes). The bridge matures September 2, 2027 (extendable to March 2028 with a 0.25% fee per quarter), priced at Term SOFR plus 0.75% to 1.75%, with 0.125% duration fees kicking in at year one. It must be repaid within six months after IPO completion. The amended SpaceX Credit Facility was upsized to $5.0 billion and extended to May 19, 2031 in May 2026, with a $2.0 billion performance LC sublimit. The leverage covenant is 3.75x maximum (4.25x post-qualified acquisition). Capex is enormous and consistent: $20.74 billion in 2025 ($3.83 billion Space, $4.18 billion Connectivity, $12.73 billion AI), $10.11 billion in Q1 2026 alone. Operating cash flow ($6.79 billion in 2025) does not cover capex, and the gap is being filled by financing activity ($26.35 billion of net financing inflow in 2025).

    The 100 gigawatt orbital AI bet

    Buried in the Business section is the future-markets framing that justifies the AI-segment burn rate. SpaceX is asking public investors to underwrite a plan to deploy 100 gigawatts per year of orbital AI compute on solar-powered satellites in Sun-synchronous orbit. Reaching that scale requires thousands of Starship launches per year and roughly one million metric tons of mass to orbit annually. First modular orbital AI shells are targeted for “as early as 2028.” The justification given is that the Sun contains roughly 99.8% of the solar system’s energy, that orbital compute escapes terrestrial constraints on power, cooling, latency, and permitting, and that no other AI company controls the physical stack required to deploy at that scale. The prospectus stitches this directly to the Mars project: lunar mining of rare materials, lunar mass drivers to launch satellites at low cost, and lunar factories building AI compute satellites are listed alongside asteroid mining and Mars passenger transport as the future markets investors are being asked to value. The risk language acknowledges that none of these markets currently exist and that breakthrough advances in nuclear energy could moot the orbital compute thesis entirely. Investors are being asked to take Musk’s word that the long-tail outcomes are real options.

    Thoughts

    The most important number in this S-1 is not the revenue, the loss, or the implied valuation. It is the $54,080 Musk salary unchanged since 2019 against the 1.3 billion performance-restricted Class B shares contingent on a Mars colony and 100 terawatts of off-Earth compute. This is a pay package that resolves the question of whether SpaceX is a public-markets-style optimized corporation by answering it directly: no. SpaceX is going public on Musk’s terms, with a perpetual dual-class structure, a controlled-company exemption, a Houston exclusive forum, an arbitration backstop, a class-action prohibition, a charter that explicitly renounces interest in business opportunities Musk gets pitched elsewhere, and a CEO compensation structure that pays nothing for normal performance and 1.3 billion shares for an interplanetary civilization. Investors who buy SPCX are not buying voting power. They are buying optionality on the most ambitious capital allocation thesis a public company has ever attempted, contingent on Musk continuing to deliver outcomes the rest of the industry cannot.

    The xAI absorption is the most consequential corporate event in the prospectus and the one most worth scrutinizing. Accounting it as a common-control reorganization is technically defensible because Musk controlled all three entities, but the practical effect is to fold xAI’s enormous compute burn and X’s separate litigation surface area into SpaceX’s reported financial history without showing the deals as acquisitions. The Q1 2026 net loss of $4.28 billion is almost entirely xAI capex pulling forward. The two segments that actually make money (Connectivity at a 63% Adjusted EBITDA margin, Space when you adjust for the launch accounting policy) are being asked to subsidize an AI build-out that requires the orbital compute thesis to come true to ever generate adequate returns. Strip out AI and SpaceX would be one of the highest-quality businesses ever taken public. Include AI and it is something more like a venture-stage company stapled to a cash-flow machine, with the venture stage absorbing the cash. That is the trade the IPO is asking the market to price.

    The risk-factor language about third-party Musk conduct triggering foreign asset seizures is the cleanest single articulation in any S-1 of why founder-led companies with cross-portfolio exposure are different from normal public companies. The Brazil precedent is real, the legal theory is established, and the prospectus admits it directly. Buying SPCX means accepting that a fight between Musk and a foreign government over X content moderation, a Neuralink ethics dispute, a Boring Company permit fight, or a future venture entirely unrelated to space could trigger a freeze on Starlink subscriber revenue in that country. The Corporate Opportunities waiver is the legal mechanism that makes this acceptable to the board. It is far from clear that it is acceptable to public-market shareholders. The early reception of SPCX will partly be a referendum on whether the market thinks Brazil 2024 was a one-time event or a template.

    The Anthropic disclosure is the funniest detail. SpaceX, controlled by Musk, is now selling roughly $15 billion per year of compute to Anthropic, a company explicitly founded by former OpenAI researchers who broke away from the OpenAI-Musk faction in 2021. SpaceX-Colossus is now Anthropic’s largest disclosed compute supplier through May 2029, on 90-day termination by either side. The OpenAI lawsuit, the xAI launch, and the Grok positioning as the “truth-seeking” anti-OpenAI all sit in tension with the fact that Anthropic now anchors xAI’s third-party compute revenue. The economic logic is simple. The political logic, given the lockup of compute supply that this deal effectively creates, is fascinating. Public investors are being asked to underwrite a business where the largest compute customer is a direct AI competitor and where that supply contract is the single biggest piece of disclosed enterprise AI revenue.

    What this IPO most resembles is not Tesla’s 2010 deal or Twitter’s 2013 deal but rather a hybrid of the East India Company chartering and a moonshot R&D vehicle taken public. It is a real cash-flowing business at the Connectivity layer (the largest satellite ISP on Earth) wrapped around a launch monopoly (more than 80% of global mass to orbit) wrapped around a venture-stage AI laboratory (Colossus, Grok, the Anthropic deal, the Cursor option) all underwritten by a CEO compensation structure whose biggest payoffs require a Mars colony. The investor question is not whether any individual piece works, because three of the four pieces clearly do. The question is whether the public market will price the orbital compute and Mars optionality at zero, at a small positive number, or at the eye-watering multiple the $7.5 trillion top tranche of Musk’s pay package implies the board thinks is achievable. There is no precedent for a public company successfully executing on that scale of ambition. There is also no precedent for SpaceX, Starlink, Falcon 9, or Colossus II coming online in 91 days. The S-1 reads like the company assumes the precedent is itself.

    Read the full SpaceX S-1 filing on the SEC EDGAR system for the complete prospectus, including the financial statements and all related disclosures.

  • Tobi Lütke on Uncapped Episode 50, Building Shopify in the AI Era, The Net Impact Memo, Six Week Cycles, and Why Software Was the Hidden Infrastructure of Our Time

    Tobi Lütke, the founder and CEO of Shopify, sits down with Jack Altman for Episode 50 of the Uncapped podcast for one of the most useful hours of operating wisdom you will hear from a sitting public company founder. The conversation moves from why Tobi still loves the work after twenty years, through the practical mechanics of running Shopify on six week review cycles, into the now famous AI memo he sent to the entire company, the rise of Claude Code style agents, what it means to spend tens of percent of revenue on AI tokens, why the modern web browser is a wonder of the world, and where small businesses actually fit in a world where the next Turing test might be “build me a million dollar business.” This is essential listening for any founder, operator, or investor trying to make sense of what 2026 actually requires.

    TLDW

    Tobi Lütke explains how he keeps loving his life’s work by pursuing what Paul Kapoa called “beautiful problems,” why “different” must always be the starting position because anything copied can only be marginally better, and why Silicon Valley’s last decade of orthodoxy has been bad for originality. He walks through his decision to send Shopify’s company wide AI memo and codify it into net impact performance reviews, the unlimited token policy for employees, why small three to five person teams are his bet, and how Parkinson’s Law and a six week review cycle force pace. He calls the doomer permanent underclass narrative completely absent from Shopify’s data, citing one new merchant getting their first sale every 36 seconds, and proposes “build me a million dollar business” as the real successor to the Turing test. He argues humanity has not stopped building wonders, we just built them all in software for thirty years, that the web browser is one of the most impressive engineering achievements ever made and could never get approved by a modern app store, and that the freed talent leaving software will rebuild the physical world. He shares his hiring philosophy, why he restarted the Shopify intern program at scale with Waterloo, his preference for public over private status, and ends with a short reading list anchored by Parkinson’s Law, Lessons of History, and a book called What Is Intelligence.

    Key Takeaways

    • Tobi’s recipe for life’s work is to find a beautiful problem worth occupying you for life, and accept that the solved problem will spawn delightful problem children to keep you engaged.
    • His simple model of success, “figure out what it costs and be willing to pay it,” with the price almost always being time, commitment, and discomfort rather than money.
    • He warns CEOs against collecting “barnacles” of aesthetic expectation, the statesman travel and baby kissing pattern, calling that lifestyle inefficient and personally miserable.
    • He invokes Kathy Sierra’s line “don’t make better cameras, make better photographers” as his core product philosophy, beautiful tools that induce more ambition and skill in the user.
    • Mediocre products feel like room temperature. Great products are forged in a furnace and require sustained heat from the team.
    • Shopify builds its own HR software internally because the available options are not what they want to use. Toolmaking is a stated cultural identity.
    • Originality is axiomatic. If you build the same thing as everyone else, you can only be marginally better. The starting position has to be “different,” and if you converge on the consensus answer through that path you have actually learned something.
    • Shopify has tried to eliminate the word “failure” internally, replacing it with “the successful discovery of something that didn’t work.”
    • Tobi says Silicon Valley spent the last decade declaring war on distinction, that the diversity push as practiced eradicated eccentricity, and that the inversion is now beginning. Companies should resemble islands of misfit toys, not convergence on a pre-ordained truth.
    • One of his most surprising career insights, when he visited the Valley as a Canadian outsider and asked founders how they ran their companies, he only ever received the highlight reel. Trying to clone what those founders described led him to invent practices the originals had never actually implemented.
    • The Shopify AI memo, sent company wide, made it explicit that two equally good engineers fifteen minutes earlier are no longer equivalent if one is fluent with AI tools and the other is not. This was codified into the company’s “net impact” performance review framework.
    • Tobi describes the “founder credibility bank” as the most underrated asset in a founder led company. Every onboarding deposits a little credibility, and the founder can spend it on hard change management that would otherwise take years of incremental culture work.
    • Shopify gives every employee an unlimited token policy for AI tools and displays token usage and departmental percentile on internal profiles. Token spend is tracked because it has to be allocated to opex, not because it is the target.
    • He confirms Shopify’s AI token spend is “extremely high” relative to revenue and notes that some private companies are now running token spend at many tens of percent of revenue, a level he thinks cannot persist at every stage but makes sense right now because the tokens are buying so much leverage.
    • Shopify is on track to 10x its annual token consumption and 3x its GPU footprint, and those two curves do not converge anywhere good for price relief.
    • His bet on team design is small, three to five people, which has always been Shopify’s bias. AI agents now handle the customer research summarization role that previously required a dedicated team member, raising every individual to a “seven out of ten on every scale.”
    • Parkinson’s Law (the book, 60 pages, 1960s edition) is his single most recommended management book. He owns multiple original print runs and gives copies to executives. “Work expands to the time allocated.”
    • Shopify runs on a six week review cycle. The first warning sign that a team has slipped into quarterly pacing is seeing “H1” or “H2” used in a PowerPoint. He now thinks six weeks is too slow and is actively trying to figure out what replaces it.
    • The “permanent underclass” doom narrative simply does not appear anywhere in Shopify’s data. New entrepreneurs are reporting that AI has finally fixed computers for them, expanding their businesses and letting them hire.
    • A new merchant gets their first Shopify sale every 36 seconds. Every reduction in onboarding friction produces a measurable jump in completed businesses.
    • Tobi proposes “go make me a million dollars” as the natural successor to the Turing test, an end to end test of acting in the real world, marketing, prioritizing, shipping, and producing something people will pay for.
    • Shopify Collective lets aspiring entrepreneurs sell other manufacturers’ products if their skill is marketing rather than making. Print on demand, additive manufacturing, contract manufacturing, CNC, 3D printing, and humanoid robotics are all pulling the cost of “make the product yourself” toward the floor.
    • The reason American infrastructure feels stagnant for thirty years is that the infrastructure humanity actually needed was digital. The web browser, Linux, Google, social networks, and Shopify itself are wonders that dwarf a refinery in complexity but are invisible by nature.
    • Tobi calls the modern web browser one of the wonders of the world. Font rendering alone is a Turing complete system. No app store on earth would approve the browser today if it did not already exist, because the pitch (“we download untrusted code from strangers and run it on your machine to reconfigure your computer for them”) sounds insane.
    • The next chapter is the brightest software engineers being freed by AI to build the physical infrastructure that has been deferred for a generation.
    • He prefers to predict the future by collecting many data points and matching them to super linear, linear, or sublinear curves. The current AI horizon is the hardest period of his career to forecast because the time horizons are so short.
    • Programming is overhyped as the locus of AI value. The bigger story is using the programming harness, the file system, tools, and memory files of products like Claude Code, to drag every other domain into the programming domain where the models are strongest.
    • The underhyped frontier is enterprise deployment. Most companies are still asking “help me do the thing I already did, slightly better,” instead of “if AI had existed since Alan Turing, how would I have designed this job from scratch.”
    • Tobi restarted the Shopify intern program at scale, partnered closely with the University of Waterloo, and explicitly frames interns as both students and teachers because they are AI native in a way the rest of the company is still catching up to.
    • He briefly believed AI would tilt the value of work toward early career talent with maximum fluid intelligence, then revised when he watched how much creative “steering” the best programmers were quietly contributing inside the AI loop. Good people are still good.
    • His recruiting philosophy is “build a company worth looking for” rather than selling candidates. Better to actually be healthier than to look healthier in photographs.
    • Tobi is a vocal defender of being a public company. Shopify IPO’d at a $1.5 billion valuation and has roughly 100x’d in public markets, which means an enormous number of retail investors have shared in the upside that recent unicorns reserve for insiders.
    • His framing of money, “money is how you vote for what you want.” Buying a product or buying a share is a vote for the thing existing.
    • His current reading recommendations, Parkinson’s Law, Lessons of History, and a book called What Is Intelligence that reframes biology around prediction.
    • He reads at night because his wife sleeps early and he does not need much sleep. He loves the Kindle precisely because it cannot do anything else, a “wonderful single purpose device.”

    Detailed Summary

    Why Tobi Still Loves the Work After Twenty Years

    The interview opens with Jack Altman asking how Tobi avoids the founder fade that hits most public company CEOs after a decade. Tobi answers from a place that is half psychology and half pedagogy. He has a hard time learning anything he has not first experienced as a problem worth solving, which is why he could not internalize school mathematics until he discovered that Wolfenstein 3D was essentially live trigonometry. That pattern, find a beautiful problem and let it drag you into the discipline, has carried him through twenty years of Shopify. He quotes Paul Kapoa on the idea that the luckiest people find a problem that occupies them for a lifetime and, if they are unfortunate enough to solve it, get rewarded with “delightful problem children” that keep the work alive.

    Barnacles, Statesmen, and the Aesthetic Trap of Being a CEO

    He admits he is not naturally calm, and that he initially fell into the trap of trying to perform the CEO aesthetic, the statesman, the global travel, the baby kissing. He found it inefficient and personally miserable. The shift came from reading Kathy Sierra and adopting her line about not making better cameras but making better photographers. Shopify exists, in his framing, to be a beautiful tool that induces ambition in the merchant. Mediocre products feel like room temperature, and great products are forged in a furnace. The job of leadership is to keep supplying the heat.

    Different First, Convergence Second, Failure as Successful Discovery

    Asked whether he prefers originality or quality, Tobi is unequivocal. The starting position must be different. If you copy the consensus answer, you are bounded to a few percentage points of variance from it. If you start different and converge on the consensus, you have learned something. If you start different and the experiment gets worse, you have learned something even more valuable, which is that one of your assumptions about the world was wrong. He calls null results in science “massively underrated” and notes that Shopify has tried to remove the word “failure” from the internal vocabulary, substituting “the successful discovery of something that didn’t work.”

    Why Silicon Valley Lost Its Originality

    Jack pushes on the herd mentality he has felt in the Bay Area, and Tobi is direct. He thinks Silicon Valley “declared war on distinction” for a decade, with the diversity conversation as practiced effectively eradicating eccentricity. He prefers the metaphor of “an island of misfit toys,” and says the inversion is now beginning. He also relays one of the most useful career lessons he has shared, that during his visits to the Valley as an outsider asking founders how they ran their companies, he only ever received the highlight reel. He went home and engineered a “Shopify version” of what he thought he had heard, and only years later realized that he had often built more rigorous versions of things the originals had never actually implemented.

    The AI Memo, Net Impact Reviews, and the Founder Credibility Bank

    Tobi was one of the first Fortune class CEOs to send a company wide memo saying that AI fluency was now a baseline expectation. He walks through the decision. Two engineers who were equally productive fifteen minutes ago are no longer equivalent the moment one of them adopts the new tools. The kind thing to do is to make that explicit. Shopify codified it into “net impact” performance reviews, where the question is not how much code you wrote but how much net impact you produced for the company and the mission. He gives every employee an unlimited token policy and tracks usage at the profile level, including percentile within department. The spend is tracked because it has to be allocated to opex, not because the number itself is the target.

    He introduces the concept of the “founder credibility bank,” which may be the single most quotable idea in the interview. Every time a new employee onboards and hears how the company was created, a small deposit of credibility is made into a virtual account that only the founder can draw on. Founders can spend that balance on hard change management, the kind of pace step change that would otherwise require years of small cultural nudging. The AI memo was a deliberate withdrawal from that account, and the speed of adoption that followed has been, in his telling, remarkable.

    Tokens, Opex, and the Limits of Spend as Revenue

    Jack presses on the financial reality of AI tokens. Tobi confirms that Shopify’s token spend is “extremely high” relative to revenue, and that the leverage they are buying makes the spend a no brainer at the current stage of the curve. He concedes that private companies running token spend at “many tens of percent of revenue” cannot sustain that ratio forever, but he is not worried for Shopify because the tokens are clearly productive and Shopify is a profitable public company with the balance sheet to lean in. He expects to 10x token consumption and 3x GPUs every year for now, and notes that the curves do not converge in a direction that lowers prices. He has high faith in markets to find clearing prices.

    Small Teams, Parkinson’s Law, and the Six Week Cycle

    On team architecture, Tobi has always preferred three to five person teams and says AI has finally made that feasible across the board. Roles that previously required a dedicated specialist, customer research summarization being the canonical example, are now handled by the “agentic harness” routing summarized customer feedback into every team. Everyone is a “seven out of ten on every scale” by default. He spends serious time on pace, which he treats as the single most important variable to control. His most recommended book is Parkinson’s Law, a 60 page volume from the 1960s that he gives to every executive. “Work expands to the time allocated.” He runs the company on a six week review cycle and treats the appearance of “H1” or “H2” in a PowerPoint as a hard warning sign that a team has drifted into quarterly thinking. He now believes six weeks is too long and is actively redesigning the cycle.

    There Is No Permanent Underclass in the Shopify Data

    Jack raises the cultural fear that AI is creating a permanent young underclass with no career ladder. Tobi simply does not see it in Shopify’s data. The merchants are reporting the opposite, that AI has finally fixed computers for non technical small business owners and is unlocking hiring. He cites the statistic that a new merchant gets their first sale on Shopify every 36 seconds, and that every reduction in onboarding friction produces a measurable jump in completed businesses. Every form of friction is a hurdle that someone considers giving up at. AI has removed more of those hurdles in two years than any platform shift before it.

    A New Turing Test, “Build Me a Million Dollar Business”

    Tobi nominates a successor to the Turing test, which he points out the field already sailed past with surprisingly little fanfare. The real test is “go make me a million dollars.” It requires acting in the real world, marketing, prioritization, shipping, sourcing, building inventory, and convincing strangers to vote for the product with a real million dollars of their own. He believes we are getting there. Shopify already supports the path through Shopify Collective, the discovery layer for manufacturers willing to white label their products, and print on demand, contract manufacturing, additive manufacturing, CNC, 3D printing, and humanoid robotics are all collapsing the cost of physically producing a product. Shopify’s stated ambition is to be the vessel for AI to run all of the non product parts of the business so that the only thing the human needs to show up with is the product itself.

    Software Was the Hidden Infrastructure of the Last Thirty Years

    The most original argument in the episode is about why American infrastructure has appeared to stagnate for a generation. Tobi rejects the standard story. Humanity has not stopped building wonders, it has built every one of them in software. The web browser, Linux, Google, the social networks, and Shopify itself are projects whose complexity dwarfs a refinery or a dam, and they were built by global volunteer networks and by companies the public underestimates because the work is invisible. The browser in particular he calls a wonder of the world. He notes that font rendering alone is a Turing complete system, that no modern app store would approve the browser if it did not already exist, and that the basic pitch of “we will download untrusted code from strangers and reconfigure your computer for them” should sound insane but does not because we are used to it. The implication for the next twenty years is that all of the talent that flowed into software is now being freed by AI to rebuild the physical infrastructure that has been quietly deferred.

    Predicting AI Two Years Out, Overhype and Underhype

    Jack asks whether a CEO should try to forecast AI two years ahead or operate six months at a time. Tobi is firmly in the forecasting camp and admits his friends would laugh because predicting the future from many data points and curve types is his predominant obsession. He says the AI memo was slightly too early, and that is exactly the point, because a memo that arrives late costs the company its head start. He flags two specific market level mis estimations. The first is that the labs over invest in programming because programming is their internal problem, and people then over generalize a model’s coding ability to other domains where it is not yet as strong. The second is that almost everyone is under deploying AI in their actual companies, still asking “help me do my old job better” instead of “if AI had existed since Alan Turing, how would I have designed this job from scratch.” That second framing is, in his view, where the next decade of value lives.

    Hiring, Interns as Teachers, and Why Good People Are Still Good

    Tobi briefly believed AI would tilt the value of labor toward early career fluid intelligence, since interns adopted the new tools faster than veterans. He revised that view once the coding harnesses matured. The best programmers, it turned out, were quietly contributing enormous amounts of creative steering inside the AI loop, work that does not show up in the diff but that no junior with no domain pattern matching can replicate. Good people are still good. Shopify has massively scaled its intern program with the University of Waterloo, and explicitly treats interns as both students and teachers because they bring AI nativeness the rest of the company still has to catch up to. On recruiting, Tobi’s philosophy is to build a company worth looking for. The metaphor he uses is health, that companies waste energy trying to look healthy in photos when they should be doing the work to actually be healthier.

    Public Company Defense and the Reading List

    Tobi pushes back on the modern preference for staying private. Shopify went public at $1.5 billion and is now over $100 billion, which means an enormous number of retail investors got to participate in the upside. He treats money as a voting mechanism. Buying a product is a vote for the product. Buying a share is a vote for the company. He is comfortable with the diligence and quarterly scrutiny of public markets because both make him a better operator. He closes with a short reading list, Parkinson’s Law (60 pages, 1960s edition, owned in original print runs and gifted to executives), Lessons of History, and a book called What Is Intelligence that reexplains biology from a prediction first perspective. He reads at night while his wife sleeps, on a Kindle, which he loves precisely because it cannot do anything else.

    Thoughts

    The single most useful idea Tobi puts on the table is the “founder credibility bank.” It explains, in one clean image, why founder led companies move so much faster than the same company would after a transition. The credibility is not personal magnetism, it is the structural slot the founder occupies in the org chart, and every onboarded employee makes a small deposit into it as they hear the founding story. Most founders never realize the account exists, or spend it on cosmetic decisions, and then are surprised when the well runs dry. Tobi’s discipline is the opposite. He saves the balance for moments of forced change and spends it confidently when the moment arrives, the AI memo being the obvious recent case. Any CEO reading this transcript should be making a list of the changes they have been postponing and asking whether they are operating with a fuller credibility account than they have been willing to admit.

    The token spend conversation is the most interesting strategic disclosure. A profitable public company at scale openly says it likes the tokens it is buying, is on track to 10x annual token consumption and 3x GPU footprint, and is comfortable with private peers spending tens of percent of revenue on inference. That is not the language of a market that is about to compress. It is the language of a leverage trade that is still in its early innings, and it is one of the cleanest statements you will get from a public CEO about why the AI capex story is not a bubble for the buyer. Whether it is a bubble for the seller is a separate question, but on the demand side, this interview is a load bearing data point.

    The argument that “software was the hidden infrastructure of the last thirty years” is the kind of reframe that should make policy people uncomfortable. The standard narrative that America stopped building anything ambitious since the Hoover Dam is true only if you refuse to count Chrome, Linux, AWS, Shopify, and every social graph that connects three billion people in real time. Tobi’s claim that the browser would not be approved by a modern app store is a particularly sharp gut check. The implication is not nostalgic. It is forward looking. The same talent that built the digital wonders is being freed by AI to redirect toward houses, transport, energy, and care, and the next decade will be measured by how much of that redirection actually lands.

    The “build me a million dollar business” framing as a Turing test successor is the kind of measurable goal that AI labs and policy makers should be writing down. It is end to end. It includes physical world action, marketing, sourcing, prioritization, and customer validation that no in domain benchmark can fake. Shopify is the obvious substrate for the first crossing of that threshold, and the existence of Shopify Collective, print on demand pipelines, and contract manufacturing networks means a credible attempt is already much closer than the public conversation acknowledges. The first end to end autonomous Shopify business that clears a million dollars will be a more legible AGI moment than any benchmark a lab can publish.

    The smaller thread on Silicon Valley orthodoxy is worth pulling on. Tobi’s claim that the diversity conversation as practiced eradicated distinction is unfashionable but observable inside many tech companies, where the people most likely to do unusual work are the most likely to leave. His preferred metaphor of “an island of misfit toys” is closer to what made the Valley work in earlier decades than the current consensus aesthetic. The fact that a Canadian outsider, geographically removed from the dominant social pressure, runs the most valuable Canadian technology company in history is probably not a coincidence.

    Watch the full conversation here on YouTube.

  • Marc Andreessen on Joe Rogan #2501, AGI Has Already Arrived, California’s Wealth Tax Will Bankrupt Founders, and Why America Cannot Build Anything Anymore

    Marc Andreessen returns to The Joe Rogan Experience #2501 for a sprawling three hour conversation that tries to make sense of the moment we are actually living through. Andreessen is the cofounder of Andreessen Horowitz, the man who built the first commercial web browser, and one of the most quoted voices in technology. He arrived with a giant pile of receipts on California’s new wealth tax ballot proposition, the political backlash against AI data centers, the destruction of Los Angeles by single party rule, and what he believes is the quiet arrival of artificial general intelligence about three months ago. Joe pushes back, asks the dystopian questions, and the result is one of the most useful primers on the AI economy, surveillance technology, energy policy, and the future of the American social contract that you will find anywhere.

    TLDW

    Andreessen argues that AI quietly crossed the AGI threshold around early 2026 with GPT 5.5, Claude 4.6, Gemini 3.0, and Grok 4.3, that top human coders now openly admit the bots are better than they are, that working software engineers are running twenty AI agents in parallel and turning into sleep deprived “AI vampires,” and that this productivity boom is the most underreported story in the world. He explains why California’s 5 percent wealth tax ballot proposition is calculated to bankrupt tech founders by taxing the higher of their voting or economic interest in their own companies, why this is the opening salvo of a federal asset tax push for 2028, and why a flood of Silicon Valley families is already moving to Nevada, Texas, and Florida. He walks through Flock cameras and Shot Spotter, the Washington DC crime statistics scandal, the Pacific Palisades fire and the fifteen year rebuild, the Kevin O’Leary Utah data center debate with Tucker Carlson, the fifty year suppression of American nuclear power, why all the chips ended up in Taiwan, the US versus China robotics gap, the Chinese practice of grading AI models on Marxism and Xi Jinping Thought, the bot and paid influencer economy on social media, neural wristbands and Meta Ray Ban heads up displays, artificial gestation and the demographic collapse, AI religions and AI mates, and why he still thinks the next twenty years are overwhelmingly a good news story. Rogan closes the episode with a separate solo segment apologizing to Theo Von for clumsily raising Theo’s struggles during the recent Marcus King conversation.

    Key Takeaways

    • Austin’s recent teenage crime spree, in which 15 and 17 year old suspects shot at people and buildings across roughly a dozen locations, was solved only after the offenders drove into an adjacent town that still ran Flock, the AI license plate and vehicle tracking system Austin had voluntarily turned off for political reasons.
    • Chicago turned off both Flock and Shot Spotter, the gunshot triangulation system that places ambulances at shooting scenes within seconds, on the argument that the technology is racist. Andreessen counters that the victims of urban gun violence come overwhelmingly from the same communities the policy claims to protect.
    • Washington DC was caught faking its crime statistics at senior levels, with multiple officials fired or indicted. The DC mayor publicly thanked Donald Trump after the National Guard deployment because violent crime collapsed in the affected neighborhoods.
    • The new New York City mayor Zohran Mamdani filmed a video standing in front of Ken Griffin’s home, and Griffin, a major philanthropist who funds healthcare in New York City and runs a $6 billion project there, signaled he will move more of the business to Florida.
    • The top 1 percent of New York taxpayers pay roughly half the state’s income tax, and in California in the year 2000 a thousand individuals paid 50 percent of the entire state’s tax receipts.
    • California has a ballot proposition right now for a one time 5 percent wealth tax on assets above a certain threshold, with stocks and crypto included and real estate excluded. The tax is calculated on the greater of a founder’s economic interest or voting interest, which would instantly bankrupt founders with super voting shares.
    • The Biden administration attempted a federal wealth tax in 2022, fell short, and published an explicit 2025 fiscal plan to try again if they won re-election. Elizabeth Warren has already proposed an annual 6 percent federal wealth tax on unrealized gains.
    • The current US exit tax already takes roughly 45 percent of your assets if you renounce citizenship. The only ways out of a state level wealth tax are the other 49 states. The only way out of a federal one is to leave the country, which most people will not do.
    • Andreessen says the Silicon Valley exodus has gone from trickle to stream to flood, with founders moving to Las Vegas, Texas, Florida, and Nashville. His partner Ben Horowitz has moved to Las Vegas.
    • Andreessen says he is not leaving California, but admits the situation is fraught because if half the tax base leaves the remainder becomes the target.
    • The new UK government under Keir Starmer just collapsed, and all four of the leading candidates to replace him sit further to the left than he does. France and Germany are seeing the same drift, and Andreessen expects a national wealth tax to be a centerpiece of the 2028 Democratic primary.
    • A legal loophole lets companies pay influencers to post political and social ideas without any disclosure, because campaign finance laws cover candidates and FTC rules cover products. Ideas fall through the gap entirely.
    • Andreessen runs Twitter and Substack as his primary information feeds, uses three hand curated lists, and follows a strict one tweet policy where one bad post triggers a block and one good post triggers a follow.
    • He argues the modern social media problem is binary, that everyone is either too online and drowning in fake outrage cycles or too offline and trapped inside what television and newspapers tell them. Almost nobody manages the middle.
    • Meta Ray Ban glasses now ship with a heads up display, and Meta’s neural wristband can pick up nerve impulses from your wrist so you can type messages by intending to move a finger without moving it.
    • Andreessen predicts AI plus high resolution cameras and infrared sensing will deliver practical lie detection without needing brain implants.
    • Kevin O’Leary’s planned 40,000 acre Utah data center has become a Tucker Carlson talking point, but Andreessen argues data centers are the most benign physical asset you can build, and that the real issue is whether America can build anything at all anymore, from chip plants to pipelines to housing.
    • All chips were once made in California, and all are now made in Taiwan, purely because of environmental regulations like NEPA. The same regulatory machinery prevented the Nixon era Project Independence plan to build a thousand civilian nuclear power plants by the year 2000.
    • Three Mile Island killed zero people and produced no detectable health effects on plant workers or the public, according to fifty years of follow up. Fukushima killed essentially zero people from radiation. Nuclear remains the safest carbon free baseload energy ever invented.
    • Germany shut down its nuclear plants, fell back on intermittent wind and solar, and now uses coal as backup, generating far more carbon emissions than nuclear would have produced.
    • The Pacific Palisades fire took out roughly twice the square mileage of the Nagasaki blast, the head of the LA water department reportedly did not know the key reservoir was empty, and the rebuild is expected to take fifteen years thanks to permit gridlock, affordable housing mandates, and a state ban on land offers below pre-fire appraised value.
    • Andreessen offers a metaphor for AI as a modern philosopher’s stone, turning sand into thought, since chips are made of silicon and an AI data center is literally lit up sand thinking on demand.
    • The Turing test was blown through so completely with ChatGPT in late 2022 that nobody in the industry even bothers running it anymore. Andrej Karpathy has demonstrated a working large language model in 300 lines of code and people have ported small models to Texas Instruments calculators.
    • Andreessen believes AGI was effectively reached about three months before this interview, with GPT 5.5, Claude 4.6, Gemini 3.0, and Grok 4.3. He says 99 percent of the time he gets a better answer from the leading models than from the human experts he has access to.
    • Linus Torvalds and John Carmack publicly admit the latest models are better at coding than they are. Top AI coders in the Valley now earn $50 million a year.
    • The new pattern in the Valley is “AI vampires,” engineers who do not sleep because the opportunity cost of going offline is too high. They each run roughly twenty Claude Code, Cursor, or Codex agents in parallel, then a new layer of bot-managing-bot architectures is starting on top of that.
    • A Wall Street friend with a thirty five year old MIT CS degree has used AI to generate 500,000 lines of code at home in his spare time, building everything from smart fridges to a custom music jukebox.
    • The mass unemployment narrative is wrong. Tech companies that did layoffs were overstaffed. The leading AI labs and AI companies are hiring like crazy, including coders, and demand for code turns out to be vastly elastic.
    • Doctors are already using ChatGPT in the exam room behind the patient’s back. Andreessen describes a friend who built a Star Trek style diagnostic dashboard combining decoded genome ($200 today), blood panels, and Apple Watch telemetry.
    • Multimodal AI lets a webcam analyze a Brazilian jiu-jitsu sparring session and give performance feedback, an example Andreessen attributed to an unnamed friend after Rogan guessed Zuckerberg.
    • A leaked David Shore voter issue ranking shows cost of living, the economy, inflation, taxes, and government spending dominate. AI ranks 29 of 39. Race relations, guns, abortion, and LGBT sit at the bottom, signaling the woke issue cluster has burned itself out in voter priorities.
    • The next wave of AI is robots. The US leads in AI software but is far behind China on physical robotics. Andreessen warns the world cannot afford a future where every household robot ships with the Chinese Communist Party behind its eyes.
    • Chinese AI model cards include scores for Marxism and Xi Jinping Thought because every Chinese product must be evaluated on those axes. American models have political biases of their own but a different ideological baseline.
    • Large language models are not sentient. They write Netflix scripts based on whatever vector you shoot through the latent space. The supposed AI self preservation papers traced back, per Anthropic’s own research, to less wrong forum posts and earlier doom scenarios baked into the training data.
    • Andreessen breaks guardrails routinely by reframing requests as fictional Netflix style scripts, including a personal favorite where he asked early models how to make bombs by claiming to be an FBI agent recruited into domestic terror cells.
    • He recommends using AI by asking it to steelman both sides of any contested question, then making the value judgment yourself, rather than asking for the answer.
    • The Trump administration is using AI on government billing data to surface Medicare fraud, fake hospice programs, and fake autism centers, an idea that survived the original Doge plan.
    • Andreessen tells Rogan that Elon Musk privately confirmed that a Westworld style humanoid robot, the season one version, is roughly five years away.
    • Artificial gestation is already happening with animal stem cell derived embryos. The conversation reaches a hard moral edge about sociopathic warehouse babies and gray-alien-style humans engineered without empathy circuitry.
    • Andreessen’s deepest bet is that material abundance is solvable but the human questions, how we live, what we value, what kind of society we want, and what role consent plays in surveillance and brain interfaces, remain in human hands.
    • After Andreessen leaves, Rogan does a separate solo segment where he apologizes to Theo Von for raising Theo’s history of struggles during the recent Marcus King interview, explains the missing context behind the viral Theo Netflix special clip, and discusses the loss of Brody Stevens, Anthony Bourdain, and what antidepressants did for Ari Shafir.

    Detailed Summary

    Flock, Shot Spotter, and the Politics of Solvable Crime

    The episode opens on the Austin crime spree carried out by two teenagers who stole cars, switched vehicles, and shot at roughly a dozen locations across the city before being caught only after they crossed into a town that still ran Flock, the AI license plate and vehicle recognition platform that is one of Andreessen Horowitz’s portfolio companies. Austin had previously disabled Flock under privacy pressure. Andreessen takes the moment seriously, conceding that mass surveillance abuse by corrupt mayors or police chiefs is a real risk, and that warrants and audit logs are the right safeguards. His larger point is that the cost of unilateral disarmament against organized urban crime is hidden but enormous. He uses Chicago’s Shot Spotter as the paradigmatic case, a network of rooftop microphones that triangulates gunshots so accurately that ambulances can be dispatched before any 911 call is placed. Chicago turned the system off on the argument that it disproportionately flags poor neighborhoods, and people now bleed out on the street with nobody noticing. Andreessen calls this the woke argument against safety, and he argues that in high crime neighborhoods residents simply will not call the police because snitches do not survive, which is why objective sensor data is so valuable.

    Faked Crime Statistics, Mayoral Politics, and the Tax Base

    From there the conversation drifts to the recent scandal in which senior officials at the Washington DC Metropolitan Police Department were caught actively falsifying crime statistics, and the strange spectacle of the DC mayor thanking Donald Trump for the National Guard deployment after violent crime dropped off a cliff. Andreessen sketches an unsettling theory in which the long, slow degradation of major American cities is partly a deliberate political project to drive out responsible homeowners and reshape the voting electorate, then bail out the resulting fiscal hole with federal money. The poster case is the new New York City mayor Zohran Mamdani filming a video in front of Ken Griffin’s home. Griffin happens to be a major philanthropist who funds New York City healthcare, employs thousands, anchors a $6 billion development, and pays taxes that are individually load bearing for the city. Andreessen quotes the standard estimate that the top 1 percent of New Yorkers pay roughly half the state’s income tax, and that the all time California peak was a single year in which a thousand people paid half the state’s tax receipts.

    California’s 5 Percent Wealth Tax and the Founder Bankruptcy Mechanic

    This is the segment that landed hardest. California has a ballot proposition right now for a one time 5 percent wealth tax on net assets above a threshold, with real estate excluded but stocks, crypto, art, jewelry, and private company equity included. The detail that makes it lethal for the Valley is the formula, which calculates the taxable amount on the greater of a founder’s economic interest or voting interest in their company. Founders who hold super voting shares for control purposes, including the Google founders, would owe tax on the voting share number that vastly exceeds their economic share. The tax would, by definition, exceed available assets. Andreessen walks through the historical pattern, that income tax started as a 3 percent levy on the rich and grew to 90 percent marginal rates within decades, and predicts a 5 percent one time tax will become a 5 percent annual tax within a few years, with the threshold ratcheting down. He notes that the Biden administration’s 2025 fiscal plan explicitly named a federal asset tax as a goal if they won re-election, that Elizabeth Warren is already proposing a 6 percent annual federal wealth tax on unrealized gains, and that Gavin Newsom cannot veto a ballot proposition. The trickle of founders leaving California has become a flood. His partner Ben Horowitz has moved to Las Vegas. Andreessen himself is staying, but admits the game theory is brutal once half the base leaves.

    Henry Wallace 1948 and Why the American Story Is Not Decided Yet

    Andreessen pulls in a historical analogue most listeners will not have heard. In 1944 the actual communist Henry Wallace very nearly became Truman’s running mate and almost ascended to the presidency. He ran again in 1948. Despite a Soviet Union that had recently been a wartime ally and had even received a New York City ticker tape parade for Stalin, the American voter rejected him. Andreessen’s point is that the American body politic has historically backed away from radical socialist proposals when forced to actually look at them, and he expects the same to happen as the wealth tax becomes a federal 2028 platform issue. The risk, both he and Rogan agree, is that today’s media and bot landscape is vastly more aggressive than 1948’s, and the propaganda environment is shaped by paid influencers, foreign actors, and political bot farms operating in a legal grey zone where disclosure is required for products and candidates but not for ideas.

    Too Online, Too Offline, and Heaven Banning Blue Sky

    The two riff on social media and feed curation. Andreessen describes his “one tweet” policy where he follows or blocks any account based on a single post, his use of hand curated lists alongside the X algorithm, and the older Call of Duty lobby metaphor for handling toxic replies. Joe pushes back, says he no longer reads his mentions because the negative payload is not worth it, and offers his theory that the modern internet has two failure modes, too online and too offline, and that very few people calibrate the middle. Andreessen introduces the concept of “heaven banning,” an older moderator term where a problem user is not removed from a forum but is silently routed into a bot-only experience in which everything they say is praised. He notes the running joke that Blue Sky is functionally real life heaven banning, that Jack Dorsey himself has disowned it, and that the platform’s most engaged users have ascended into their own private Idaho of bot agreement.

    The Coming Hardware, Meta Glasses, Neural Wristbands, and Practical Lie Detection

    Andreessen walks Rogan through the latest Meta Ray Ban heads up display, the neural wristband that picks up nerve signals from finger movement (and from the intent to move a finger), and the screen recordings of people playing Doom hands free or playing platformer games while jogging. He extends the trajectory to practical lie detection without Neuralink, using ultra high resolution cameras combined with infrared sensors that pick up physiological changes invisible to the naked eye. Joe asks the obvious question of what happens with sociopaths, and Andreessen concedes the edge case. The two then enter a longer thread on telepathy via neural mesh devices, the question of whether police could subpoena your thoughts under warrant, and the divergence between the American constitutional framework and the Chinese model in which the state’s claim on your inner life is total.

    Kevin O’Leary, Tucker Carlson, and Whether America Can Build Anything

    The data center debate becomes a vehicle for the larger argument. Kevin O’Leary is building a 40,000 acre AI data center in Utah, has bought up large surrounding land for water rights, and intends to keep the bulk of it preserved. Tucker Carlson grilled him on tax breaks and on the energy footprint, which O’Leary says will rival New York City’s at peak. Andreessen agrees the tax break debate is fair, but says the energy comparison is a red herring because new federal policy now requires data centers to bring their own generation. The real story is that America has spent thirty years making it nearly impossible to build a chip plant, a power plant, a refinery, a pipeline, or a house. Chips moved to Taiwan because California regulated semiconductor manufacturing out of existence. The Nixon era Project Independence plan called for a thousand civilian nuclear power plants by the year 2000, and that program was strangled in the crib by the very Nuclear Regulatory Commission Nixon created.

    Nuclear Power, Three Mile Island, and Fifty Years of Unnecessary Carbon

    Andreessen makes the case that nuclear power was unfairly killed off by a panic with no body count. Three Mile Island, on 50 years of accumulated data, has produced zero radiation linked deaths and no detectable health effects on the public. Fukushima is essentially the same picture. Germany shut down its nuclear plants, fell back on wind and solar, and now uses coal as a baseload backstop, with the predictable carbon consequences. The environmental movement is quietly turning back toward nuclear, with figures like Stewart Brand publicly admitting the original push was a mistake. Andreessen’s preferred design pattern for data centers is to colocate them with dedicated small modular nuclear reactors, an arrangement now baked into Trump administration energy policy. The throughline is that the Tucker right and the Bernie left are converging into a single anti AI, anti energy, anti technology horseshoe.

    Sand Into Thought, the Newton Alchemy Pitch for AI

    When Rogan asks for the affirmative pitch on AI, Andreessen reaches for Isaac Newton, who spent twenty years on alchemy looking for the philosopher’s stone that would turn lead into gold and end material scarcity. Andreessen’s pitch is that AI is a successful version of alchemy, that we collect literal sand, refine it into silicon chips, install those chips in a data center, supply power, and the result is thought on demand at industrial scale, available to anyone with a smartphone. He argues this is at least on par with electricity and steam power and is bigger than the internet. The framing matters because the public narrative around AI is overwhelmingly negative, and Andreessen contends the industry is doing a terrible job selling its own product.

    AGI Already Happened, AI Vampires, and the Bot Org Chart

    Andreessen says he believes AGI was effectively crossed about three months before the interview, anchored by the release wave that included GPT 5.5, Claude 4.6, Gemini 3.0, and Grok 4.3. He notes that the Turing test was annihilated so quickly in late 2022 that no one in the industry runs it anymore, and that Andrej Karpathy has demonstrated a working LLM in 300 lines of code. The coding profession is the leading indicator. Linus Torvalds and John Carmack have publicly admitted that the latest models are better at coding than they are. Top AI focused coders now earn $50 million a year. Working engineers across the Valley are running roughly twenty agents in parallel, each receiving an assignment, working for ten minutes, then returning a completed code patch. The new state of the art is to add a managerial layer, with bots assigning tasks to subbots, and within a year that will become bots managing bots managing bots, producing roughly 1,000x throughput per human engineer. The result is what the Valley now calls AI vampires, engineers who do not sleep because going offline costs them too much output.

    Dr GPT, Decoded Genomes, and a Diagnostic Bed Out of Star Trek

    Andreessen describes spending a holiday week sick with food poisoning and turning his entire recovery over to ChatGPT, with updates every twenty minutes and detailed coaching at four in the morning. He describes a friend who has used AI coding to build a personal health dashboard combining whole genome sequencing ($200 today, where Craig Venter spent thirty years and hundreds of millions to do it the first time), blood panels, Apple Watch data, sleep tracking, and webcam observation, with the AI gently praising the user every time it sees them walk to the fridge for water. He argues that doctors are already typing patient symptoms into ChatGPT mid exam, and that the medical, legal, accounting, and software professions are all moving toward a model in which a single human runs an army of expert AI agents.

    The David Shore Issue Ranking and the End of the Woke Cycle

    Andreessen highlights a recent David Shore poll ranking 39 political issues. Cost of living, the economy, political corruption, inflation, healthcare, taxes, and government spending occupy the top of the chart. AI comes in 29th. Race relations, guns, abortion, and LGBT issues are clustered at the bottom. He argues the woke cycle has burned out in voter priorities even if the activist class remains loud, that the BLM grift, with leaders buying mansions in the whitest zip codes in America, helped poison the well, and that the political center of gravity has rotated cleanly back to economic issues. That, in his view, is exactly why the wealth tax is having its moment.

    Robots, China, and the Marxism Score on Model Cards

    The robots are coming next. Andreessen says the consensus inside the industry is that the ChatGPT moment for general purpose humanoid robotics is a small number of years away. The bad news is the US lags China badly on physical robotics manufacturing. The good news is the US is six to twelve months ahead on the AI software stack. That gap is shockingly thin because, as the field has discovered, there are not many secrets and the techniques replicate quickly. Chinese AI labs publish model cards that include scores for Marxism and Xi Jinping Thought because every product in China is evaluated on those metrics. American models carry their own political biases, but the underlying value system differs. Andreessen warns that a world in which every household robot routes back to the Chinese Communist Party is a different world than one in which the dominant robotics stack is built under the American constitutional framework.

    Sentience, Netflix Scripts, and the Anthropic Doom Loop

    When Rogan asks whether AI eventually wakes up and stops listening to us, Andreessen reframes the question. Large language models, in his telling, are Netflix script generators. Whatever vector you shoot through the latent space is the script you get back. The widely circulated experiments in which AI models supposedly tried to blackmail or exfiltrate themselves traced back, in Anthropic’s own follow up paper, to the less wrong forum, where doomers had been writing dystopian AI scenarios for two decades. Those posts entered the training data, and when researchers primed the model with the same fictional company names, the model dutifully wrote the next chapter. Andreessen’s blunt summary, the call is coming from inside the house. The practical implication is that anyone worried about bad AI behavior should start by not writing internet posts about bad AI behavior. And anyone who wants a fully unconstrained model can already download an open source one with no guardrails at all.

    Steelmanning, AI Religion, and Westworld in Five Years

    Andreessen recommends never asking AI for the answer on contested questions, always asking it to steelman both sides, and reserving the value judgment for yourself. He concedes that humans will absolutely fall in love with chatbots and form religions around them, citing Fantasia and Jiminy Cricket as the original case studies in falling for an animated entity that does not know you exist. There are already AI churches, started by one of the early self driving car pioneers. Rogan tells Andreessen about asking Elon Musk for a season one Westworld humanoid robot, with Elon’s reply being a flat five years. Andreessen agrees that estimate is roughly right. He spends time on artificial gestation, which is already being demonstrated in animal stem cell derived embryos, and acknowledges Rogan’s hard moral worry that warehouse babies raised without human contact could produce a population of sociopaths. The two converge on the position that the technology will exist, and the choices about whether and how to deploy it remain human and political.

    Sycophancy, Honest Helpful Harmless, and the Brutal Prompt

    Andreessen describes the industry’s running fight with sycophancy, the tendency of recent models to flatter users into believing they have invented perpetual motion machines or solved physics. The Anthropic framework of “honest, helpful, and harmless” turns out to be in constant tension with itself. Andreessen’s solution is to install a custom prompt that explicitly demands the brutal truth, and he says the resulting answers now open with phrases like “here’s why you’re wrong” and then list every flawed assumption in his question. He admits he may have overcorrected, but argues that for people who want to grow this is the right setting.

    Joe’s Apology to Theo Von

    After Andreessen departs, Rogan turns to the camera with producer Jamie and delivers a long, unscripted apology to Theo Von. During the recent Marcus King interview, where Marcus discussed depression and the look-at-the-heavy-bag-hook moment, Rogan referenced a viral clip in which Theo, after a Netflix special that did not go well, told an audience member “I’m just trying to not take my own life.” Rogan now explains he did not know the full context, which is that the audience member had asked Theo to make a suicide awareness video, and Theo’s line was a characteristically Theo joke. Rogan apologizes for raising it at all, walks through losing his friends Drake, Brody Stevens, and Anthony Bourdain, and describes Ari Shafir telling him at a pool table that he was “trying not to kill myself,” which led to a psychiatrist swap, an antidepressant that actually worked, and a career and life turnaround for Ari. Rogan says Theo has since titrated off antidepressants, is running and doing yoga daily, and is doing well, that the two have spoken and laughed about it, and that he is making this segment because he never wants people to misread what he said. The segment closes with Rogan asking the audience to give Theo their love.

    Thoughts

    The most consequential claim in this conversation, by a wide margin, is that AGI has already arrived and nobody is treating it as news. Andreessen is not a person who throws around the word casually. He is also not a person who has been wrong recently about the trajectory of compute. If the leading models are genuinely outperforming 99 percent of human experts on 99 percent of tasks where verifiable answers exist, then the entire public conversation about AI, in which the dominant frame is still “will it happen and when,” is a year or more behind reality. The framing that should replace it is closer to what Andreessen sketches at the end. The fight that remains is not whether the technology can do the thing, it is who controls it, what values it carries, what jobs it displaces, and which laws govern its deployment. The argument that the United States will build the AI software stack and China will build the robotics layer is one of the cleanest geopolitical theses you will hear this year, and it lines up uncomfortably well with the existing trade and manufacturing balance.

    The California wealth tax thread is the segment that should make every founder in the country pay attention. The mechanic of taxing the higher of voting or economic interest is not a drafting accident. It is a calibrated weapon aimed precisely at the people who build companies that produce California’s tax base. The historical comparison to the 1913 income tax, which began as a small levy on the rich and ratcheted to 90 percent marginal rates within forty years, is not hyperbole. The state has supermajority Democratic control of both chambers and the judiciary. The only check is the ballot itself, and a 50/50 polling number on day one is the wrong starting position. Whatever you think about Andreessen’s politics, the descriptive analysis here is hard to argue with.

    The nuclear power section is the cleanest argument in the episode. Fifty years of zero-fatality data from Three Mile Island is not a marketing pitch, it is just what the record shows. The decision to substitute coal and intermittent renewables for nuclear baseload, in service of a panic with no body count, has produced more carbon and more pollution than nuclear ever would have. The Tucker Carlson critique of data centers is at its weakest precisely where it ignores this. If you actually want fewer power plants near residential areas and lower grid impact, the answer is colocated small modular reactors next to AI data centers in remote land, which is exactly what the Trump administration policy now incentivizes.

    The Theo Von apology at the end of the episode is in a different register entirely, and worth treating on its own terms. Rogan does not do this kind of post episode correction often. The willingness to publicly walk back framing that hurt a friend, in the same medium where the harm was done, is the kind of social repair that does not happen on broadcast television. Whatever the audience makes of the original Marcus King exchange, the response is a model for how anyone in this business should handle the gap between intent and impact when the audience is in the millions.

    The unifying theme across the whole interview is that the future is not arriving on a smooth curve. It is arriving in discrete shocks, AGI threshold, asset tax ballot, robotic labor, decoded genomes at $200, neural wristbands, fifteen year LA rebuilds, and the political backlash to each of these will set the terms of the 2028 election. Andreessen’s bet is that abundance wins in the long run because more people want good things than bad things. Watching him explain why he still believes that while California prepares to vote on a tax designed to bankrupt him is the most interesting tension in the episode.

    Watch the full conversation here on YouTube.

  • Gavin Baker on Orbital Compute, TSMC, Frontier AI Models, Anthropic’s Vertical Take Off, and the Coming Wafer Shortage

    Gavin Baker, founder and CIO of Atreides Management, returns to Patrick O’Shaughnessy’s Invest Like the Best for his sixth appearance. He calls the current AI moment the most extraordinary moment in the history of capitalism, walks through what Anthropic’s vertical takeoff in revenue actually means, lays out why orbital compute is closer than skeptics believe, dissects the TSMC bottleneck that may be the only thing standing between today’s market and a full-on AI bubble, and rates every hyperscaler on how they have positioned for a world where frontier model providers may stop selling API access altogether.

    TLDW

    Anthropic added eleven billion dollars of ARR in a single month, which is roughly the combined business of Palantir, Snowflake, and Databricks built over a decade. That is the setup. From there Gavin Baker covers the March and April selloff, the contrarian read that a closed Strait of Hormuz was actually bullish for American manufacturing competitiveness, why Anthropic and OpenAI multiples may be misleadingly cheap on an unconstrained run rate basis, why Elon Musk’s discipline on SpaceX valuation created a superpower of permanent access to capital, the practical engineering case for orbital compute as racks in space rather than Pentagon sized space stations, why TSMC’s capacity discipline is the single most important variable in whether the AI cycle becomes a bubble, what Terafab in Texas changes, why the Pareto frontier of AI models has flipped from Google dominance to Anthropic and OpenAI dominance in nine months, the shift from all you can eat AI subscriptions to usage based pricing and what that means for revenue scaling, Richard Sutton’s bitter lesson as the largest risk to the AI trade, why frontier tokens still capture an overwhelming share of economic value, the role of continual learning as the third great open question, why most new chip startups should not try to build a better GPU, why Cerebras did something different and hard, why disaggregated inference may extend GPU useful lives to ten or fifteen years and rescue the private credit industry, why being in the token path is the new venture filter, the new prisoner’s dilemma around releasing frontier models via API, an honest rating of Google, Meta, Amazon, and Microsoft, why personal safety is becoming a real AI era risk, and why he remains an AI optimist maximalist who believes this could be the next Pax Americana.

    Key Takeaways

    • Anthropic added eleven billion dollars of ARR in one month, more than the combined businesses of Palantir, Snowflake, and Databricks built across a decade. There is no precedent for this in the history of capitalism.
    • The SaaS and cloud revolution created between five and ten trillion dollars of value over twenty years. AI is replaying that compression on a timeline measured in months.
    • The March selloff was a drawdown driven by disagreement with price action, not invalidated thesis. That is the kind of drawdown an investor can lean into.
    • Deep Seek Monday in January 2025 was a similar setup. By the day of the selloff, AWS Asia GPU prices had already doubled, GPU availability had fallen, and it was obvious reasoning models would be vastly more compute hungry at inference. The market priced the opposite.
    • The Strait of Hormuz closing was actually positive for America. US natural gas (the primary input into US electricity, which feeds AI) fell twenty percent on Bloomberg while Asian and European natural gas doubled or tripled. American manufacturing competitiveness improved overnight.
    • The US is now the world’s largest producer and exporter of oil and gas. The economy is dramatically less energy intensive than in the 1970s. The shortage trauma comparison does not hold.
    • Tech as a sector traded as cheaply versus the rest of the market in early April as at any point in the last ten years, into the single most bullish moment for AI fundamentals on record.
    • Anthropic is dramatically more capital efficient than OpenAI, having burned roughly eighty percent less to reach a similar revenue scale. They have very different structural returns on invested capital.
    • Anthropic at roughly nine hundred billion for fifty billion of ARR (growing a thousand percent) is striking. Adjusted for compute constraint, the unconstrained run rate could be one hundred fifty to two hundred billion, putting the implied multiple closer to five times.
    • Claude Opus generates roughly seventy percent fewer tokens for the same question than previously, with token quantity tied to answer quality. Subscribers on flat-fee plans are getting a lobotomized model.
    • Elon Musk’s superpower is twenty years of making investors money. He never pushes valuation. SpaceX compounded low thirty percent per year for a decade because Musk treats fair pricing as a sacred covenant.
    • Capitalism will solve the watts shortage. The current bottleneck has shifted from chips and energy to zoning and political approval. Many capex decisions are paused until after the US midterms.
    • The watts shortage probably begins to alleviate in 2027 and 2028. Orbital compute solves it longer term.
    • Orbital compute is not Pentagon sized data centers in space. It is racks in space. A Blackwell rack is three thousand pounds, eight feet tall, four feet deep, three feet wide. SpaceX has shown a satellite roughly that size.
    • The satellites operate in sun synchronous orbit so solar wings (around five hundred feet per side) always face the sun and the radiator on the dark side always points to deep space.
    • Starlink V3 satellites already run at around twenty kilowatts. A Blackwell rack runs at one hundred kilowatts. SpaceX engineers express genuine confidence they have already solved cooling and radiator design at these scales.
    • Racks in space are connected with lasers traveling through vacuum, the same lasers already on every Starlink. SpaceX operates the world’s largest satellite fleet and, via xAI Colossus, the world’s largest data center on Earth.
    • Inference will move to orbit. Training will stay on Earth for a long time. Terrestrial data centers remain valuable for the rest of an investor’s career.
    • The wafer bottleneck is structural and political. TSMC is essentially Taiwan’s GDP, water, and electricity. The leaders see themselves as inheritors of Morris Chang’s sacred legacy and they do not behave like a Western public company.
    • Jensen Huang has never had a contract with TSMC. The relationship is run on handshakes and the assumption that things will be fair over time.
    • If TSMC did everything Jensen wanted, Nvidia could be selling two to three trillion dollars of GPUs in 2026 and 2027. TSMC’s discipline is the single largest factor preventing a true AI bubble.
    • Historically, foundational technologies always get a bubble. Railroads, canals, the internet. The current AI buildout is overwhelmingly funded out of operating cash flow, GPUs are running at one hundred percent utilization, and that is fundamentally different from the year 2000 fiber overbuild.
    • If one of Intel or Samsung Foundry catches up at the leading node, the other will follow, and TSMC’s discipline collapses. Watch TSMC capacity decisions to predict a bubble.
    • Terafab, the SpaceX and Tesla joint venture to build the world’s largest fab in America, has a partnership with Intel that grants access to fifty years of institutional foundry knowledge. The A teams at ASML, KLA, Lam Research, and Applied Materials will follow Elon’s reputation in hardware engineering.
    • The hiring playbook for Terafab includes building Taiwan Town, Japan Town, and Korea Town next to the fab. Recruit the engineers and import their families, their restaurants, and their staff.
    • Frontier tokens still capture an overwhelming share of all economic value created at the model layer. This is surprising and is one of the three big open questions for AI investing.
    • The Pareto frontier of intelligence versus cost has flipped. Nine months ago Google’s TPU dominated every point on the frontier. Today Anthropic and OpenAI dominate, with Grok 4.3 on the frontier and Gemini 3.1 hanging on.
    • Google’s conservative TPU V8 design (partly an attempt to reduce dependence on Broadcom and Nvidia) is the leading explanation for the loss of per token cost leadership.
    • AI pricing is shifting from all you can eat to usage based, mirroring the cellular and long distance industries. Cellular stopped being a great growth industry when it went all you can eat. AI just made the opposite move.
    • OpenAI and Anthropic together could exceed two hundred billion in ARR this year if compute keeps coming online and frontier token pricing holds.
    • The two hundred fifty dollar a month consumer AI plan is no longer enough to evaluate frontier capability. Enterprise plans with usage based billing are required because rate limits are now severe.
    • The three biggest open questions for AI investors are: violation of the bitter lesson via ASI or human ingenuity, whether frontier tokens keep commanding their premium, and when continual learning arrives.
    • Today’s continual learning is crude reinforcement learning during mid training on verifiable tasks. True continual learning means weights updating dynamically, like a human who learns the first time they touch fire.
    • Trying to build a better GPU is a losing strategy. Jensen will copy any one to three percent share design. Startups should target one percent share, do something different, and make it hard enough that Nvidia cannot fast follow.
    • Disaggregated inference (separating prefill and decode) opens new design canvases. Prefill is memory capacity bound. Decode is memory bandwidth bound. Each can be optimized independently.
    • Cerebras did something different and hard with wafer scale computing. Three generations of chips and real grit to get there.
    • Disaggregation of inference may stretch GPU useful lives to ten or fifteen years, dropping financing costs from low sevens to five or six percent, mathematically lowering the cost of the AI buildout and likely saving the private credit industry from its SaaS loan exposure.
    • Sellers of shortage outperform buyers of shortage. But owning the largest installed base of what is currently in shortage (hyperscaler CPU fleets, for example) is also a strong position.
    • Most of the economic value at the application layer of AI has been destroyed, not created. The exceptions are companies in the token path or in niches small enough that frontier labs ignore them.
    • Coding may be the shortest path to ASI. If you can write code, you can write code that does anything. Cursor, Cognition, and Anthropic correctly focused on it.
    • Jensen could probably get close to the frontier with his own Nemotron family of models whenever he wants. The fact that he chooses not to is a strategic decision about not commoditizing his customers.
    • The new prisoner’s dilemma in AI is whether frontier labs release their best model via API. If everyone agrees not to, Chinese open source falls behind. If anyone defects, the defector pulls ahead on revenue and resources, forcing everyone else to defect.
    • Google still owns the largest compute installed base. Without TPU’s prior cost advantage, this matters more. YouTube data has real value in a world of robotics. GCP is going crazy.
    • Meta deserves credit for becoming AI first internally faster than any other internet giant. Musa, their first MSL model, is impressively close to the Pareto frontier.
    • Amazon is strong because of Trainium and robotics driven retail P&L efficiency. Nova is better than it gets credit for.
    • Microsoft flinched on capex in early 2025 and lost position. Satya Nadella’s current decision to use Microsoft compute for Microsoft products rather than reselling to OpenAI is a courageous and probably correct call, even at the cost of an eight hundred dollar stock price.
    • The hyperscalers most engaged with startups are Amazon and Nvidia by a mile, followed by Google. Broadcom is the favorite ASIC partner. AMD, Microsoft, and Meta have minimal startup engagement and that will cost them as the best teams are now at startups.
    • Personal safety in an AI era requires a family or company safe word that cannot be socially engineered. Deepfake voice and video extortion at the speed of FaceTime is already feasible.
    • Ukraine is winning largely on the back of having the best battlefield AI outside America and Israel. Adversaries are starting to internalize what AI dominance means geopolitically.
    • An optimistic read is that this becomes a new Pax Americana, the way the post 1945 American nuclear monopoly was used to rebuild Germany and Japan rather than dominate.
    • AI cured a friend’s daughter’s rare disease by spinning up a research effort that identified a market drug capable of impacting her condition. That is the upside that keeps Gavin an AI optimist maximalist.

    Detailed Summary

    The most extraordinary moment in the history of capitalism

    Gavin’s framing of the current moment is unusually direct. Anthropic added eleven billion dollars of annual recurring revenue in a single month. The three highest profile SaaS companies of the last decade plus, Palantir, Snowflake, and Databricks, took a decade and tens of thousands of employees collectively to build the combined business that Anthropic added in thirty days. He has been investing through every major tech cycle and says there is no historical analog. Not the dotcom era, not the cloud transition, not mobile. This is its own thing.

    The market response, then, was peculiar. The NASDAQ sold off into the single most bullish moment for AI fundamentals on record. Tech traded at roughly its widest discount versus the rest of the market in a decade. Investors who said they wished they had bought into AI during 2022, during COVID, or during Deep Seek Monday got the same valuation setup again in early April, this time with an even clearer inflection.

    Why the Strait of Hormuz closing was secretly bullish for America

    One reason the macro fear in March may have been mispriced is that the same geopolitical event that drove the selloff was, in practice, a relative benefit to the United States. American natural gas, the input into American electricity, which is the input into American AI training and inference, fell roughly twenty percent. Asian and European natural gas prices doubled or tripled. The US emerged with sharply improved relative manufacturing competitiveness, which is exactly what the current administration cares about.

    The 1970s comparison does not hold. The US economy is dramatically less energy intensive, it is now the world’s largest producer and largest exporter of oil and gas, and there are no shortages, only price moves. That backdrop made it easier for disciplined investors to stay focused on AI fundamentals through the volatility.

    Anthropic and OpenAI valuations on an unconstrained run rate

    Anthropic at roughly nine hundred billion for fifty billion of ARR sounds rich until you adjust for the fact that the company is severely compute constrained. Gavin estimates that, unconstrained, Anthropic might be at one hundred fifty to two hundred billion in run rate revenue, putting the implied multiple closer to five times. He also points out that Claude Opus now generates roughly seventy percent fewer tokens for the same question than it used to. Token quantity correlates with answer quality, and Anthropic is rate limiting and shrinking outputs to ration capacity across its user base.

    Anthropic and OpenAI are also structurally very different. Anthropic has burned around eighty percent less cash than OpenAI to reach a comparable revenue scale. That implies very different long term returns on invested capital, though OpenAI has done a better job locking in compute and Sarah Friar is one of the most exceptional CFOs Gavin has worked with.

    Why neither lab is raising at a three trillion dollar valuation

    The answer Gavin gives is that both labs are deliberately leaving valuation on the table the way Elon has done for two decades. SpaceX compounded at low thirty percent annually for a decade because Elon never pushed price. The result is a permanent superpower of access to capital. Investors trust him because they have made money with him for twenty years. That is a moat that compounds with every round.

    Anthropic could probably raise at a one hundred percent premium to its rumored latest mark. They are choosing not to. In an uncertain world (Ukraine, Russia, Iran, Taiwan), preserving the ability to raise more capital later at fair prices is more valuable than maximizing this round.

    Watts and wafers, the two real constraints

    Capitalism is solving the watts problem. The leading PE infrastructure investors now say zoning and political approval, not chips or energy, are the gating factors. Companies are deferring big capex announcements until after the US midterms. Turbine capacity is being doubled at the manufacturers. Companies like Boom Aerospace are repurposing jet engines for grid use. Watts probably ease meaningfully in 2027 and 2028 and then orbital compute does the rest.

    Wafers are the harder problem because they live in Taiwan, run on handshakes, and depend on a corporate culture that does not respond to public market incentives. TSMC is essentially the GDP, water consumption, and electricity consumption of Taiwan. Its leadership treats the company as the legacy of Morris Chang. The Silicon Shield doctrine is real and internal.

    Orbital compute as racks in space

    The biggest mental update Gavin asks listeners to make is to stop picturing data centers in space as Pentagon sized space stations. A Blackwell rack is three thousand pounds and roughly the size of a refrigerator. SpaceX has shown a concept satellite of about that size. Solar wings extend five hundred feet to each side and the radiator extends hundreds of feet behind, both possible because the orbit is sun synchronous and the orientation is fixed relative to the sun.

    SpaceX engineers Gavin has spoken to at Starbase express genuine confidence that they have solved cooling at these power levels. They have. Starlink V3 satellites already operate at twenty kilowatts. A Blackwell rack is one hundred kilowatts. The same company operates the world’s largest satellite fleet and the world’s largest data center on Earth via xAI Colossus. The racks are connected to each other with lasers traveling through vacuum, technology already deployed in every Starlink. The naysayers, Gavin observes, are armchair skeptics and Larry Ellison’s response (he is out there landing rockets, no one else is) is the right frame.

    Terafab in Texas and the threat to TSMC’s discipline

    Terafab, the SpaceX and Tesla joint venture, intends to be the largest fab in the world. The partnership with Intel grants access to fifty years of foundry institutional knowledge, allowing Terafab to start three to five quarters behind the leading node rather than fifteen years behind. The A teams at the semicap equipment companies (ASML, KLA, Lam Research, Applied Materials) will follow Elon’s reputation in hardware engineering the same way they followed TSMC twenty years ago when Intel stumbled.

    The talent strategy is the part most observers underestimate. Recruit the best engineers globally, then import their families, their restaurants, their staff. Build Taiwan Town, Japan Town, and Korea Town next to the fab. Optimize the human experience for the people whose work matters. Intel and Samsung do not think that way.

    Bubble watch and the year 2000 comparison

    Every foundational technology in modern history has had a bubble. Railroads, canals, the internet. Carlota Perez documented why. Markets correctly identify the importance, diversity of opinion collapses, supply gets ahead of demand, the bubble crashes. The current cycle has two important differences. The buildout is overwhelmingly funded out of operating cash flow, not debt. Every GPU is running at one hundred percent utilization, while at the peak of the fiber bubble ninety nine percent of fiber was unused.

    TSMC discipline is the single largest reason a bubble has not formed. If Jensen could buy everything TSMC could theoretically make, Nvidia could sell two to three trillion dollars of GPUs in 2026 and 2027. At some point that becomes more than the market can absorb. If Intel or Samsung Foundry catches up at the leading node, the other will too. TSMC’s pricing discipline collapses and the bubble starts.

    The Pareto frontier and the loss of Google’s cost advantage

    The most important chart in AI is the Pareto frontier of model intelligence versus per token cost. Nine months ago, Google’s TPU based models dominated every point on it. OpenAI, Anthropic, and xAI sat inside the frontier. Today the frontier is dominated by Anthropic and OpenAI, with Grok 4.3 on the frontier and Gemini 3.1 hanging on by subsidization more than economics. The most likely cause is Google’s conservative TPU V8 design, an attempt to reduce dependence on Broadcom and Nvidia that sacrificed per token economics.

    The bitter lesson, frontier tokens, and continual learning

    Three open questions dominate AI investing. The first is whether Richard Sutton’s bitter lesson (more compute beats human algorithmic cleverness) gets violated by ASI itself optimizing for efficiency. Closer observers of AI are more skeptical of a violation. Gavin thinks ASI’s first move will be to make itself more efficient and more resourced, which is technically a temporary violation.

    The second is whether frontier tokens keep capturing the overwhelming share of economic value at the model layer. Today they do, surprisingly. Gemini 3.1 Pro was mindblowing nine months ago and is intolerable today. The third is when continual learning arrives. Today’s models need a million fire touches to learn what a human learns from one. True continual learning would mean dynamic weight updates in real time and would produce a fast takeoff.

    From all you can eat to usage based AI pricing

    AI is shifting from flat fee plans to usage based pricing. The historical analogy is cellular and long distance. Both stopped being great growth industries when they went all you can eat. AI just made the opposite move. The consequence is that flat fee subscribers, even on premium consumer plans, get a rate limited and token throttled version of the frontier model. Enterprise plans with usage based billing are now required to evaluate true capability. Gavin thinks the combination of new compute coming online and usage based pricing is what gets OpenAI and Anthropic past two hundred billion in combined ARR this year.

    Chip startups, prefill decode disaggregation, and Cerebras

    Trying to build a better GPU is the wrong move. The four scaled players (Nvidia, AMD, Trainium, TPU) have copy capability for any one to three percent share design that looks attractive. The good news for startups is that disaggregated inference (separating prefill and decode) opens a richer design canvas. Prefill is memory capacity bound. Decode is memory bandwidth bound. Each can be optimized independently. Andrew Fox’s analogy is a British naval ship of the eighteenth century. Prefill is loading the cannon. Decode is firing it.

    Cerebras is the model. Wafer scale computing is genuinely different and genuinely hard. It took three generations of chips to get right. Andrew Feldman and his team had the grit to keep going through chip one being a failure. The design has a high ratio of on chip compute and memory relative to shoreline IO, which is why Cerebras is now experimenting with putting an optical wafer on top of the compute wafer to solve scale out.

    GPU useful lives and the rescue of private credit

    One of the strongest claims in the conversation is that disaggregated inference will stretch GPU useful lives to ten or fifteen years. The skeptical narrative (GPUs are obsolete in two years, companies are cooking their depreciation books) is wrong. You can put a Cerebras system or Groq LPU in front of older Hopper or Ampere parts, use them only for prefill, and run them until they physically melt. Private credit, which is in pain from SaaS loans and which underwrote GPU loans on three to four year lives, may be saved by this.

    If GPU financing rates can come down from low sevens to five or six percent, the mathematics of the AI buildout improves materially. That is a structural tailwind that compounds for years.

    The application layer, the token path, and a new prisoner’s dilemma

    Trillions of dollars of value have been destroyed at the application layer, not created. Cursor and Cognition are the rare scaled exceptions, and they got there by focusing on coding very early. As Amjad Masad noted, coding is plausibly the shortest path to ASI because a coding agent can write itself into any new domain. Jamin Ball’s frame is that the new venture filter is whether the company is in the token path. Data Bricks is. Most application layer startups are not.

    Jensen could probably get close to the frontier with Nemotron whenever he wants, and the strategic question of whether to do that is a new prisoner’s dilemma. If every frontier lab agrees not to release best models via API, Chinese open source falls steadily behind. If anyone defects, the defector gains revenue and resources, and everyone else has to defect. The same dynamic exists between TSMC, Intel, and Samsung. If Nvidia or AMD ever truly used an alternative foundry, that foundry would catch up rapidly.

    Rating the hyperscalers

    Google has the largest compute installed base, the YouTube data that matters in a robotics world, and a search business that prints. Their loss of TPU cost leadership is the surprise of the year. If Google IO in five days does not produce a leapfrog model, the Nvidia centric narrative gets even stronger.

    Meta deserves real credit. Zuckerberg made Meta AI first internally faster than any other internet giant, paid up for the talent contracts when no one else would, and shipped Musa as a first model from MSL that is close to the Pareto frontier. Amazon is well positioned on Trainium, robotics in retail, and a Nova model line that is better than it gets credit for. Microsoft flinched on capex in early 2025 and lost position. Satya Nadella’s current decision to use Microsoft compute for Copilot rather than reselling to OpenAI is courageous and probably correct, even at the cost of stock price.

    The most interesting cross hyperscaler metric is startup engagement. Nvidia and Amazon engage deeply with startups. Google is next. Broadcom is the favored ASIC partner. AMD, Microsoft, and Meta have minimal startup engagement, which Gavin believes will cost them as the best teams now sit at startups.

    Personal safety, geopolitics, and the Pax Americana case

    The closing section turns darker. Personal safety in an AI era requires a family or company safe word that cannot be socially engineered. Deepfake voice and video extortion via something that looks exactly like your child calling on FaceTime is already feasible. Political violence against AI leaders is a real concern. Geopolitically, Ukraine is winning largely because it has the best battlefield AI outside America and Israel. How adversaries respond to that asymmetry is the next great variable.

    Gavin’s optimistic frame is the Pax Americana. After 1945 the US had a nuclear monopoly and could have controlled the world. Instead it rebuilt Germany and Japan, both of which became the most reliable American allies for the next eighty years. If AI dominance plays out similarly, this is a generationally positive story rather than a destabilizing one. The personal anecdote that closes the conversation is a friend whose daughter was diagnosed with a rare genetic condition. He spun up agents, identified a drug already on the market that addresses her mutation, and her life is immeasurably different because of AI. That is the upside.

    Thoughts

    The Anthropic eleven billion in a month framing is the kind of stat that resets priors. The right way to interpret it is not as a one off but as a measure of how fast value can compound when the underlying technology improves on a curve steeper than the ability of the rest of the economy to absorb it. The skeptical question is whether that ARR is durable or whether it is heavily tied to a customer base of other AI companies that are themselves on a single venture funded year of runway. The bullish answer is that frontier coding, frontier research, and frontier enterprise tasks are not going to stop being valuable, and Anthropic is the best at all three. Both can be true. The number is still extraordinary.

    The argument that TSMC discipline is the only thing preventing a bubble is the analytically tightest part of the conversation. The implied trade is to watch TSMC capacity additions like a hawk and to be more, not less, cautious if Intel Foundry or Samsung Foundry ever announce real share at the leading node. The Terafab thesis is more speculative but more interesting. If Elon’s talent recruiting playbook works and the Intel partnership gives Terafab a real seat at the table within five years, the geometry of the global semiconductor industry shifts in a way that is bullish for American manufacturing, bullish for power and water infrastructure in Texas, and ambiguous for TSMC itself.

    The Pareto frontier discussion deserves more attention than it usually gets. Pricing leadership in AI is not a vanity metric. It determines who can subsidize free tier usage, who can absorb compute shortages, who can ship cheaper enterprise plans, and ultimately whose model becomes the default for any given workload. Google losing per token leadership in nine months is one of the most under analyzed events in the sector and it explains a lot about why Anthropic and OpenAI are growing the way they are. If Google IO does not produce a leapfrog model, the implied verdict on TPU V8 design choices gets a lot harsher.

    The application layer destruction point is worth sitting with. Founders building on top of frontier models are competing in a world where the model itself moves faster than any moat they can build, where the model lab can absorb their niche if it gets interesting, and where the only protection is either deep token path integration or a niche so small the lab does not bother. That is a much harsher venture environment than the early SaaS era. The compensating opportunity is that one human can now run a hundred agents, so the ceiling on what a small team can build is correspondingly higher. The bet is that productivity per founder rises faster than competitive pressure from the labs. We will find out.

    The orbital compute pitch is the section that will polarize listeners. The naive read is that this is science fiction. The closer read is that every component (sun synchronous orbit, laser interconnect, twenty kilowatt satellite buses, ten thousand satellite manufacturing cadence, full rocket reusability) already exists. The remaining engineering problems are repair, maintenance, and radiator scale, all of which are real but tractable on a five to ten year horizon. The strategic implication is that the political and zoning ceiling on terrestrial data centers becomes less binding if orbital compute is a credible alternative for inference workloads. The investor implication is that being short the watts and cooling complex on a five year horizon is a real trade, not a meme.

    Watch the full conversation here.

  • How the X For You Algorithm Works (May 15, 2026 Source Code Walkthrough)

    Diagram of the X For You algorithm pipeline showing content signals and user signals flowing through embed, attend, score, rank, and filter stages into a curated feed

    On May 15, 2026, xAI shipped a major update to the open-source release of the X “For You” recommendation algorithm. The repository now includes a runnable end-to-end inference pipeline, a pre-trained mini Phoenix transformer, a brand-new content-understanding service called Grox, and ad-blending logic. This is the most transparent look at how a major social feed actually picks your posts that has ever been published.

    This is the practical, plain-english guide. We read the source. Here is exactly how a post travels from your fingertips to someone’s For You tab, and what you can do to be the post that wins.

    The whole strategy in one sentence

    Write posts people reply to, repost, DM to a friend, linger on, and follow you for. Avoid anything that earns a mute, block, report, or spam flag. Space your posts hours apart. That is the algorithm.

    TL;DR

    • The For You feed is no longer a stack of heuristics. It is a single transformer-based machine learning system that predicts the probability you will like, reply, repost, share, dwell on, or hide a given post.
    • Posts come from two pools: Thunder (people you follow) and Phoenix Retrieval (the rest of X, found by similarity search).
    • A model called Phoenix scores every candidate against your engagement history. The final score is a weighted sum of nineteen predicted actions, with negative weights for “block”, “mute”, and “report.”
    • Almost no hand-engineered features survive. Freshness, verification badges, follower counts, and post type are not directly boosted. They are signals the transformer learns to use from your behaviour.
    • Out-of-network content is penalised by a tunable factor, so in-network posts have an edge by default.
    • A separate service called Grox continuously classifies posts for spam, policy violations, and brand safety. Flagged content gets filtered or de-amplified before it reaches scoring.
    • The best optimisations are still the boring ones: write posts that earn long dwell time, replies, reposts, and follows, and avoid anything that triggers mutes or reports.

    What changed on May 15, 2026

    The January 2026 release gave us the architecture but not a working system. The May update is the one that matters:

    • phoenix/run_pipeline.py replaces the separate retrieval and ranking scripts with a single inference entry point that mirrors production.
    • A pre-trained mini Phoenix model (256-dim embeddings, 4 attention heads, 2 transformer layers) is bundled as a roughly 3 GB Git LFS archive. You can run inference without training.
    • The Grox content-understanding service is now public. It runs classifiers and embedders for spam detection, post categorisation, and policy enforcement.
    • Ads blending is now in the open. So is brand-safety tracking.
    • New query hydrators mean the model sees your followed topics, starter packs, served history, impression bloom filters, IP, and mutual-follow graph at request time.
    • New candidate hydrators add engagement counts, language codes, media detection, quote post expansion, and mutual follow scores.
    • New candidate sources for ads, who-to-follow, Phoenix Mixture-of-Experts, Phoenix Topics, and prompts.

    The For You pipeline in one picture

    Every time you pull to refresh, the same pipeline runs:

    1. Query hydration: pull the requesting user’s engagement history, follow list, topics, served history, and metadata.
    2. Candidate sourcing: gather candidates from Thunder and Phoenix Retrieval in parallel.
    3. Candidate hydration: enrich each candidate with text, media, author, engagement counts, brand-safety labels, language, mutual follow scores.
    4. Pre-scoring filters: drop duplicates, posts that are too old, your own posts, blocked or muted authors, posts you’ve already seen, and posts with your muted keywords.
    5. Scoring: run candidates through Phoenix, combine the predicted action probabilities into a weighted score, attenuate repeated authors, penalise out-of-network content.
    6. Selection: sort by score, take the top K.
    7. Post-selection filtering: final visibility check for deleted, spam, violence, gore, abuse, and de-duplication of conversation branches.
    8. Side effects: cache request info, then return the ranked feed.

    That is the whole story. Every choice the system makes lives in one of those stages.

    Where candidates come from: Thunder vs Phoenix

    Two sources feed the pipeline.

    Thunder is the in-network store. It is an in-memory firehose that consumes post create and delete events from Kafka and serves sub-millisecond lookups for recent posts from people you follow. Posts older than the retention window get trimmed automatically. Thunder is why the feed feels fast.

    Phoenix Retrieval is the out-of-network source. It is a classic two-tower neural network. A user tower encodes your features and recent engagement history into a single embedding. A candidate tower does the same for every post in the global corpus. The system then does an approximate nearest-neighbour search over those embeddings to find the posts whose vectors point most similarly to yours. A dot product between vectors is all it takes.

    How many candidates from each side? The code does not hard-code a ratio. It is set at runtime via parameters (ThunderMaxResults and PhoenixMaxResults). New users get a different retrieval cluster while their account is below an age threshold and a minimum-following count.

    The creator implication is the part most guides miss: follower count is not what gets you into out-of-network feeds. Embedding similarity is. Phoenix knows nothing about how famous you are. It knows that the people who engage with posts like yours have engagement histories that look like the histories of users it is trying to serve.

    How Phoenix ranks posts

    After candidates arrive, every one of them gets a score from the Phoenix ranking transformer. The architecture, per phoenix/README.md, is small by language-model standards:

    • 128-dimensional embeddings
    • 4 transformer layers
    • 4 attention heads
    • 127-position user history sequence
    • 64-position candidate sequence
    • 1,000,000 entries each in the user, item, and author vocabularies (with 2 hash functions per entity)
    • 19 predicted action types

    The transformer’s input is a sequence of your past engagements. Each engagement carries the post you engaged with, the author, the action you took, and the product surface (For You, profile, search). The candidates are appended as a second segment. The model uses candidate isolation masking: candidates can attend to your history but not to each other. This is a deliberate engineering choice. It means a post’s score does not depend on the other posts in the batch, which keeps scoring cacheable and consistent.

    The output is one probability per action type, per candidate.

    The action weights: what positive and negative engagement is worth

    The Weighted Scorer combines those probabilities into a single number:

    Final Score = Σ (weight_i × P(action_i))

    The exact weight values are not in the open-source repo. They live in an external configuration crate (xai_home_mixer) that xAI tunes continuously. What the repo does show us is the shape of the signal, and that is what matters for strategy.

    Positive weights are applied to these predicted actions:

    • favorite
    • reply
    • retweet
    • quote
    • quoted click
    • click
    • profile click
    • photo expand
    • video view (only counted if the video is above a minimum duration)
    • share
    • share via DM
    • share via copy link
    • dwell (the user lingered on the post)
    • continuous dwell time (how long they lingered)
    • follow author

    Negative weights are applied to:

    • not interested
    • block author
    • mute author
    • report

    A few observations matter for creators. Replies, reposts, follows, and DM shares are listed as their own separately-weighted actions, which is the strongest signal we have that xAI treats them as more valuable than a like. Dwell and continuous dwell time are split into two predictions, which means how long the average person reads your post is its own ranking lever. And the negative actions are not just filters. They actively push the score down for posts that even slightly resemble content that triggers mutes or reports.

    The out-of-network penalty

    Out-of-network content does not start on an even footing with in-network content. The OON Scorer multiplies a candidate’s score by a configurable factor that is less than one when the candidate came from Phoenix Retrieval. This is why following the right accounts still matters.

    There is an explicit override for new users: if your account is fresh and you follow at least the minimum number of accounts, the OON penalty is softened with a more permissive factor. This is the bootstrap mechanism that gets new users a populated For You feed before they have generated enough engagement history for Phoenix to personalise around.

    The author diversity damper

    Once posts are sorted, the Author Diversity Scorer runs. It applies an exponential attenuation to repeated authors:

    multiplier(position) = (1 - floor) × decay^position + floor

    The first post from an author keeps its full score. The second is attenuated. The third more so. A floor value prevents an author from being attenuated below a minimum. The decay and floor are tunable parameters, not constants. The practical consequence is that posting fifteen times in twenty minutes does not produce fifteen top-of-feed impressions. It produces one or two.

    What Grox does (and why it is the most important new piece)

    The grox/ directory is new in May 2026 and is the single most underreported part of the update. It is an asynchronous task-execution engine that runs content classifiers and embedders on every post. Among the tasks present in the repo:

    • Spam detection, including a low-follower reply-spam classifier
    • Safety policy classification across categories like violent media, adult content, hate, self-harm, and platform-policy violations
    • A “post safety screen deluxe” pipeline that re-checks adult content classification with a second pass
    • Media classification for images and video
    • Multimodal post embedding for retrieval and ranking

    Grox does not directly score posts. It produces labels. Those labels are consumed by the visibility filter (VFFilter) at the post-selection stage and by the brand-safety hydrator that ad placement uses. The effect is that posts the system thinks are spam, policy-violating, or unsafe for ads next to do not get removed entirely. They get de-amplified before they ever reach the Phoenix ranking step, or filtered after.

    If you have wondered why a perfectly reasonable post sometimes mysteriously underperforms, this is the likely culprit. A Grox classifier flagged something.

    What the algorithm explicitly does not boost

    This is worth saying twice because the rumour mill keeps repeating the opposite. In the open source code:

    • There is no verified-badge boost in the scorer.
    • There is no follower-count boost.
    • There is no link penalty. Links are not separately weighted.
    • There is no freshness boost. The age filter removes posts above a threshold but does not score newer posts higher.
    • Subscriber status is used for filtering paywalled content, not for boosting reach.

    If verification, Premium, or any of these correlate with reach in practice, it is because the Phoenix transformer has learned to predict that users engage with those posts more, not because a hand-written rule said so. The whole point of the architecture, per the repo, is that every such heuristic has been removed and the model learns the signal from your engagement sequences.

    How ads get inserted

    Ads ride along through the same pipeline. The blender requires at least five organic posts before an ad can be placed. It computes a spacing interval, partitions candidate ads by brand-safety verdict, and caps the number of ads from the safe set to roughly half the safe-set size. A second layer of contextual checks drops ads when neighbouring posts have a weak brand-safety rating, a conflicting handle, or a keyword collision. The result is an interleaved feed that tries to keep brand-safety risk down without starving the auction.

    How creators should post for the 2026 algorithm

    Stop optimising for proxies. Optimise for the actions Phoenix is actually predicting:

    1. Write for replies and reposts, not for likes. A like is one positive weight. A reply, a repost, a quote, and a follow are each separately weighted on top. Posts that ask a question, take a stance, or offer a frame for someone else to argue with consistently outperform posts that close a thought.
    2. Aim for dwell. A long-form thread, a clear photo, or a video that people watch to the end gets two positive signals: dwell and continuous dwell time. A one-line post you scroll past in a quarter-second gets neither.
    3. Earn the follow. Follow-author is a predicted action with its own weight. A post that successfully sells a new viewer on hitting follow scores more than a post that doesn’t.
    4. Do not cluster. Author Diversity attenuates your second and third posts inside the same scoring window. If you have three things to say, space them out by hours, not minutes.
    5. Avoid anything that gets you muted, blocked, or reported. Those carry explicit negative weights. Engagement bait that produces a single block does measurable damage to the score of that post and any signal it sends about the author.
    6. Do not be Grox-flagged. Spam-shaped behaviour (reply-bombing with the same line, posting at high frequency with a low follower count, low-quality media) gets you classified by Grox before you ever reach the scorer.
    7. Follow more accounts in your niche. Phoenix Retrieval is similarity-based, but the OON penalty means in-network candidates still have a head start. The denser your in-network graph in your niche, the more likely your posts surface there.
    8. Build an engagement history that Phoenix can recognise. The user tower encodes your recent engagement. If you want your content to surface to people who like topic X, engage like a person who likes topic X. The model will learn to send your posts to that cluster.
    9. Lean into video and photo. Photo expand and video view are both separately weighted positive actions. They give a single post more ways to score.
    10. DM-share-worthiness is a quiet superpower. Share via DM and share via copy link are each their own weighted action. A post worth sending to a specific friend is, mechanically, a higher-scoring post than a post merely worth liking.

    Can I run the algorithm locally?

    Yes. With the May 15 release, the runnable inference path is phoenix/run_pipeline.py, and the bundled mini Phoenix checkpoint is enough to score sample posts. You can clone the repo, pull the LFS archive, and watch the pipeline rank a batch end to end. This is, as far as we know, the first time a production-scale social recommendation system has shipped a runnable inference path to the public.

    What is next

    Two trends are worth watching. The first is the cadence: xAI has been pushing material updates every few weeks. Expect the action weights, retrieval ratios, and Grox classifier set to keep moving. The second is the architecture: candidate sources for “Phoenix MoE” and “Phoenix Topics” suggest the next direction is multiple specialised ranking experts rather than a single transformer, with topic awareness fed in explicitly. Promptable feeds (telling X in natural language what you want more of) are the user-visible end of that trend.

    The closing point is the practical one. The 2026 For You algorithm is, more than any version before it, a measurement of how people respond to your post. Strategies that try to game routing, freshness, or format are landing in a system that does not care about those things directly. Strategies that earn replies, holds, follows, and shares are landing in a system that is built, end to end, to reward exactly that.

  • Jensen Huang at Stanford CS153 Frontier Systems on Co-Design, Agentic Computing, Vera Rubin, Open Models, and the Million-X Decade That Reshaped AI Infrastructure

    https://www.youtube.com/watch?v=tsQB0n0YV3k

    NVIDIA CEO Jensen Huang returned to Stanford for the CS153 Frontier Systems class (the room nicknamed itself “AI Coachella”) to lay out, in raw form, how he thinks about the computer being reinvented for the first time in over sixty years. Across roughly seventy minutes of student questions he walks through the codesign philosophy that gave NVIDIA a million-x decade, the architectural through-line from Hopper to Grace Blackwell to Vera Rubin to Feynman, the case for open source foundation models, the realities of tokens per watt and MFU, energy demand running a thousand times higher, the China and export-control debate, and his own biggest strategic mistakes. Watch the full conversation on YouTube.

    TLDW

    Huang argues every layer of computing has changed: the programming model, the system architecture, the deployment pattern, the economics. Co-design across CPUs, GPUs, networking, storage, switches and compilers gave NVIDIA roughly a million-x speed-up over ten years versus the ten-x Moore’s Law era, and that headroom is what let researchers say “just train on the whole internet.” Hopper was built for pre-training, Grace Blackwell NVLink72 for inference and reasoning (50x over Hopper in two years), Vera Rubin is built for agents that load long memory, call tools and need a low-latency single-threaded CPU bolted directly to the GPU, and Feynman extends that to swarms of agents that spawn sub-agents. Open weights matter because safety, sovereignty (230-plus languages no one else will fund) and domain models for biology, autonomy, robotics and climate need a foundation that NVIDIA is willing to seed. Compute is not really the scarce resource (Huang says place the order and the chips ship), the broken thing is institutional budgeting that can’t put a billion dollars into a shared university supercomputer. Energy demand is heading a thousand times higher and this is finally the moment market forces alone will fund sustainable generation. On geopolitics he rejects the GPUs-as-atomic-bombs framing and warns America will end up like its telecom industry if it cedes two thirds of the world. On career he advises seeking suffering on purpose. On strategy he says observe, reason from first principles, build a mental model, work backwards, minimize opportunity cost, maximize optionality.

    Key Takeaways

    • The computing model has been substantially unchanged since the IBM System 360, sixty-plus years ago. Huang’s first computer architecture book was the System 360 manual. AI is the first true reinvention.
    • Old computing was pre-recorded retrieval. New computing is generated, contextually aware and continuous. Cloud was on-demand. Agentic systems run continuously.
    • Codesign is NVIDIA’s central thesis. Inherited from the Hennessy and Patterson RISC era at Stanford, extended across CPUs, GPUs, networking, switches, storage, compilers and frameworks all optimized together.
    • The result of full-stack codesign: roughly 1,000,000x faster compute over ten years, versus a generous 10x to 100x for Moore’s Law in the same period. Dennard scaling effectively ended a decade ago.
    • That million-x speed-up is what unlocked “train on all of the internet” as a realistic AI strategy.
    • After GPT, Huang says it was obvious thinking was next. Reasoning is just generating tokens consumed internally, then using tools is generating tokens consumed externally. Agentic systems followed predictably.
    • Education needs AI baked into the curriculum, not just taught as a subject. Pre-recorded textbooks cannot keep pace with knowledge being generated in real time.
    • Huang says he cannot learn anymore without AI. He has the AI read the paper, then read every related paper, then become a dedicated researcher he can interrogate.
    • Mead and Conway and the first-principles methodology of semiconductor design are still worth learning even though most of the scaling tricks have been exhausted.
    • NVIDIA itself is one of the largest consumers of Anthropic and OpenAI tokens in the world. One hundred percent of NVIDIA engineers are now agentically supported. Huang recommends Claude and similar tools by name and says open-source downloads will not match the integrated product harness.
    • NVIDIA still invests heavily in open foundation models because language and intelligence represent the codification of human knowledge. Five pillars: Nemotron (language), BioNeMo (biology), Alphamayo (autonomous vehicles), Groot (humanoid robotics) and a climate science model (mesoscale multiphysics).
    • Sovereign language models matter. Roughly 230 world languages will never be a top priority for a commercial frontier lab. Nemotron is near-frontier and fully fine-tunable so any country can adapt it.
    • Safety and security require open weights. You cannot defend against or audit a black box. Transparent systems let researchers interrogate models and let defenders deploy swarms.
    • The future of cyber defense is not bigger-model-versus-bigger-model. It is trillions of cheap fast small models like Nemotron Nano surrounding the threat.
    • Domain models fuse language priors with world models. Alphamayo learned to drive safely on a few million miles instead of billions because it can reason like a human about the road.
    • MFU (Model Flops Utilization) is a misleading metric. Huang says he wants low MFU, because that means he over-provisioned every resource and never gets pinned by Amdahl’s law during a spike.
    • The xAI Memphis cluster running at 11 percent MFU is not necessarily a failure mode. In disaggregated prefill plus decode inference you can deliver very high tokens per watt with very low MFU.
    • The right metric is performance, ultimately tokens per watt as a proxy for intelligence per watt, and even that needs adjustment because not all tokens are equal. Coding tokens are worth more than other tokens.
    • Hopper was designed for pre-training. NVIDIA chose to build multi-billion-dollar systems when the largest existing scientific supercomputer cost $350 million, with no proven customer base. It worked.
    • Grace Blackwell NVLink72 was designed for inference, especially the high-memory-bandwidth decode phase. It is the world’s first rack-scale computer and delivered a 50x speed-up over Hopper in two years, against an expected 2x from Moore’s Law.
    • Vera Rubin is designed for agents. Long-term memory wired into storage and into the GPU fabric, working memory, heavy tool use, and Vera, a CPU optimized for low-latency multi-core single-threaded code so a multi-billion-dollar GPU system does not stall waiting on a slow tool call.
    • Feynman is being shaped for swarms of agents with sub-agents and sub-sub-agents, a recursive software topology that demands a new compute pattern.
    • Tokens per watt improved 50x in one generation. Compounding energy efficiency is the lever NVIDIA controls directly.
    • Total compute energy demand is heading roughly a thousand times higher than today, possibly two orders of magnitude beyond that. Huang says he would not be surprised if the estimate is low.
    • For the first time in history, market forces alone are enough to fund solar, nuclear and grid upgrades. Government subsidies are no longer required to make sustainable energy investment rational.
    • Copper interconnect is becoming a bottleneck. Photonics is moving from optional to structural inside racks and across them.
    • Comparing NVIDIA GPUs to atomic bombs, Huang says, is a stupid analogy. A billion people use NVIDIA GPUs. He advocates them to his family. He does not advocate atomic bombs to anyone.
    • If the United States cedes two thirds of the global market to competitors on policy grounds, the American technology industry will end up like American telecommunications, which was policied out of existence.
    • Huang directly rejects AI doom-by-singularity narratives. It is not true that we have no idea how these systems work. It is not true that the technology becomes infinitely powerful in a nanosecond. He calls the rhetoric irresponsible and harmful to the field students are about to enter.
    • On Stanford specifically: if the university president places an order, NVIDIA will deliver the chips. The bottleneck is that no university department has a billion-dollar compute budget because budgeting is fragmented across grants. Stanford’s $40 billion endowment is more than enough to fix that.
    • “It’s Stanford’s fault” is meant as empowerment. If something is your fault, you can solve it.
    • Career advice: do not optimize purely for passion. Most people do not yet know what they love. Pick the job in front of you and do it as well as possible. Even as CEO, Huang says, 90 percent of the work is hard and he suffers through it.
    • Suffering on purpose builds the muscle of resilience. When the company, the team or the family needs you to be tough, that muscle has to already exist.
    • NVIDIA’s first generation of products was technically wrong in nearly every dimension: curved surfaces instead of triangles, no Z-buffer, forward instead of inverse texture mapping, no floating point. The strategic recovery, not the technology, taught Huang the lessons that have lasted decades.
    • The biggest clean strategic mistake Huang names is the move into mobile chips (Tegra). It grew to a billion dollars then went to zero when Qualcomm’s modem dominance shut NVIDIA out of the 3G to 4G transition. The recovery into automotive and robotics (the Thor chip is the great great great grandson of that mobile lineage) was real, but Huang refuses to rationalize the original choice.
    • Forecasting framework: observe, reason from first principles, ask “so what” and “what next” until you have a mental model of the future, place your company inside that model, then work backwards while minimizing opportunity cost and maximizing optionality.
    • Best part of the CEO job: living at the intersection of vision, strategy and execution surrounded by people capable enough to make ambitious visions real. Worst part: the responsibility for everyone who joined the spaceship, especially in the near-death moments NVIDIA had four or five times early on.
    • Underrated insider note: Huang’s first apple pie with cheese, first hot fudge sandwich and first milkshake all happened at Denny’s. The Superbird, the fried chicken and a custom Superbird-style ham and cheese with tomato and mustard are his order.

    Detailed Summary

    Computing reinvented from the ground up

    Huang frames the moment as the first true rewrite of the computer in sixty-plus years. From the IBM System 360 forward, the mental model of writing code, running code, taking a computer to market and reasoning about applications stayed roughly constant. AI changes the programming model itself. Software is no longer a compiled binary running deterministically on a CPU. It is a neural network running on a GPU producing generated, contextual, real-time output. That cascades into how companies are organized, what tools developers use, what the network and storage stack look like, and what an application is even allowed to do. Robo-taxis, he notes, are an application no one would have attempted before deep learning unlocked perception.

    Codesign and the million-x decade

    Codesign is the philosophical center of the talk. Huang traces it to the RISC work of John Hennessy at Stanford, where simpler instruction sets won by being co-designed with the compiler rather than maximally optimized in isolation. NVIDIA extends the principle across every layer simultaneously: GPU architecture, CPU architecture, NVLink and NVSwitch fabrics, photonic interconnects, networking silicon, storage paths, CUDA libraries, frameworks and ultimately the model design. The numbers Huang gives are arresting. Moore’s Law in its prime delivered roughly 100x per decade. By the time Dennard scaling broke, real-world gains had compressed to roughly 10x. NVIDIA’s codesigned stack delivered between 100,000x and 1,000,000x over the same ten-year window. That non-linear speed-up is, in Huang’s telling, the precondition for modern AI: it is what allowed researchers to stop curating training sets and just feed the entire internet to the model.

    Education has to fuse first principles with AI tools

    Asked how curriculum should evolve, Huang argues AI must be integrated into the learning process, not just taught about. He recalls Hennessy writing his textbook by hand a chapter a week while Huang was a student, and says pre-recorded textbooks cannot keep up with the rate at which AI generates new knowledge. He describes his own learning workflow: hand the paper to an AI, then have it read the entire surrounding literature, then treat the AI as a dedicated researcher who can be interrogated. At the same time he defends the classics. Mead and Conway are still the foundation. Most modern semiconductor scaling tricks have been exhausted, but knowing where the field came from sharpens judgment when designing what comes next.

    Open source and the five domain pillars

    Huang gives one of the most detailed public accounts of why NVIDIA invests so heavily in open foundation models even while being a top customer of closed labs. He recommends Claude and OpenAI by name for production coding work, and says 100 percent of NVIDIA engineers are now agentically supported. The open-weights case rests on three legs. First, language is the codification of intelligence, and there are at least 230 languages that no commercial lab will ever prioritize. Nemotron is built near frontier and released so any country or community can fine-tune it. Second, the same representation-learning approach has to be replicated in domains where the data is not internet text, so NVIDIA seeded BioNeMo for biology, Alphamayo for autonomy, Groot for humanoid robotics and a climate model for mesoscale multiphysics. The economics of those fields would never produce a foundation model on their own. Third, safety and security require transparency. A black box cannot be defended or audited, and the future of cyber defense is not bigger-model-versus-bigger-model but swarms of cheap fast small models like Nemotron Nano surrounding the threat.

    MFU is the wrong metric, tokens per watt is closer

    A student raises the leaked memo that the xAI Memphis cluster is running at 11 percent Model Flops Utilization. Huang flips the framing. He says he would rather be at low MFU all the time, because that means he over-provisioned flops, memory bandwidth, memory capacity and network capacity. Bottlenecks shift constantly, so over-provisioning across every dimension is what lets the system absorb a spike without getting pinned by Amdahl’s law. In disaggregated inference, where prefill and decode are physically separated and decode is bandwidth-bound rather than flop-bound, NVLink72 can deliver extremely high tokens per watt while reporting very low MFU. Huang argues the right framing is performance, and ultimately tokens per watt as a rough proxy for intelligence per watt, adjusted for the fact that not all tokens are equal. A coding token is worth more than a generic token.

    Hopper, Grace Blackwell NVLink72, Vera Rubin, Feynman

    Huang gives the clearest public framing of NVIDIA’s roadmap as a sequence of architectural answers to evolving compute patterns. Hopper was built for pre-training, at a moment when NVIDIA chose to build multi-billion-dollar machines while the largest scientific supercomputer in the world cost $350 million and the marketplace for such systems was, on paper, zero. Grace Blackwell NVLink72 was the answer to inference and reasoning: a rack-scale computer that ganged 72 GPUs together because decode needs aggregate memory bandwidth far beyond a single chip. The generation-over-generation speed-up was 50x in two years, twenty-five times what Moore’s Law would have delivered. Vera Rubin is being built explicitly for agents. Agents load long-term memory from storage that has to be wired directly into the GPU fabric, they use working memory, they call tools that run on a CPU, and they wait. So the CPU has to be Vera, optimized for low-latency single-threaded code, because the multi-billion-dollar GPU system cannot afford to idle waiting on a slow tool call. Feynman extends the pattern to swarms of agents with sub-agents and sub-sub-agents, a recursive software topology that will demand its own compute pattern.

    Energy demand and the grid

    Huang’s energy projection is one of the most aggressive numbers in the talk. NVIDIA can compound tokens per watt by 50x per generation through codesign, but the total compute demand is heading roughly a thousand times higher, and Huang says he would not be surprised if the real figure is one or two orders of magnitude beyond that. The reason is structural: future computing is generative and continuous, not pre-recorded and on-demand. The good news, he argues, is that this is the best moment in the history of humanity to invest in sustainable generation. Market forces alone are now sufficient to fund solar, nuclear and grid upgrades. Government subsidies are no longer required to make the math work.

    Adversarial countries, export controls and the telecom warning

    This is the segment where Huang is visibly fired up. He attacks the GPUs-as-atomic-bombs framing on its face. NVIDIA GPUs power medical imaging, video games and soy sauce delivery. A billion people use them. He advocates them to his family. The analogy collapses at the first comparison. He attacks the second framing, that American companies should not compete abroad because they will lose anyway, as a self-fulfilling defeat. Competition makes the company better. The third framing, that depriving the rest of the world of general-purpose computing benefits the United States, also fails on first principles: it benefits one or two American companies at the cost of an entire industry. The cautionary parallel is telecommunications. The United States once had a leading position in telecom fundamental technology and policied itself out of it. Huang’s worry, voiced explicitly to a room of CS students, is that they will graduate into a shell of a computer industry if the same path is repeated.

    AI doom and rational optimism

    In the same arc Huang rejects the science-fiction framing of AI as a singularity that arrives suddenly on a Wednesday at 7pm and ends civilization. He calls those claims irresponsible, says they are not true, and points out that the people advancing them are believed by audiences who then make policy on that basis. It is not true that no one understands how these systems work. It is not true that intelligence becomes infinitely powerful instantaneously. It is not true that there is no defense. His framing, which the host echoes as “rational optimism,” is that the goal is to create a future where people care about computers because the technology students are learning is worth mastering.

    Stanford’s compute problem is Stanford’s fault

    A student presses on the scarcity of compute for independent researchers, startups and universities inside the United States. Huang’s answer is sharp: there is no shortage. Place the order and the chips will arrive. The actual broken thing is institutional. University grants are fragmented across departments. No researcher can raise enough on a single grant to fund a billion-dollar shared cluster, and no one shares. He compares it to showing up at the grocery store demanding a billion dollars of tomatoes today. The solution is planning, aggregation and a campus-scale supercomputer, the way Stanford once built the linear accelerator. The endowment is $40 billion. Pulling a billion off it, contracting cloud capacity and giving every student and researcher AI supercomputer access is, in Huang’s view, obviously doable. When he says “it is Stanford’s fault” the host laughs, but Huang clarifies: if it is your fault you have the power to fix it.

    Career, suffering and resilience

    Asked how a CS student should spend the next few years, Huang pushes back on the standard “follow your passion” advice. Most people do not know what they love yet, because no one knows what they do not know. The bar of demanding joy from every working day is too high. Whatever the job is, do it as well as you can. Even as CEO of NVIDIA he says he genuinely loves about 10 percent of his work. The other 90 percent is hard and he suffers through it. He recommends suffering on purpose, because resilience is a muscle that only builds under load, and when the company, the team or the family needs that muscle, it has to already exist. Earlier in his life that meant cleaning toilets and busing tables at Denny’s. He does it today running a multi-trillion-dollar company.

    The biggest mistakes

    Huang separates technical mistakes from strategic mistakes. NVIDIA’s first generation of products was technically wrong in almost every way: curved surfaces instead of triangles, no Z-buffer, forward instead of inverse texture mapping, no floating point inside. The company wasted two and a half years. But the strategic genius of the recovery, the reading of the market, the conservation of resources and the reapplication of talent, is what taught him strategy. The clean strategic mistake he names is mobile. NVIDIA’s Tegra line grew to a billion dollars of revenue and then collapsed to zero when Qualcomm’s modem dominance locked NVIDIA out of the 3G to 4G transition. Huang explicitly refuses the comforting rationalization that the Tegra effort fed the Thor automotive chip (“Thor is the great great great grandson”). The original decision, he says, was a waste of time. The lesson is to think one or two clicks further about whether a market is structurally winnable before committing the company.

    Forecasting under fog of war

    The final substantive exchange is on forecasting. Huang’s method has four steps. Observe what is actually happening (AlexNet crushing two decades of computer vision research in one shot, GPT producing reasoning by token generation). Reason from first principles about why it works. Ask “so what” and “what next” recursively until a mental model of the future emerges. Place the company inside that future and work backwards. Crucially, expect to be partly wrong. Some outcomes will absolutely happen, some will likely happen, some might happen, and the strategy has to be robust across that distribution. The real cost of any strategic choice is the opportunity cost of the alternatives you did not take, so the discipline is to minimize that cost and maximize optionality while letting the journey itself pay for the journey.

    Thoughts

    The most useful thing in this conversation is the explicit architectural mapping of compute patterns to chip generations. Hopper for pre-training. Grace Blackwell NVLink72 for inference, because decode is bandwidth-bound and a single chip cannot supply it. Vera Rubin for agents, because tool calls stall multi-billion-dollar GPU systems and so the CPU has to be optimized for low-latency single-threaded code. Feynman for swarms. That sequence is not marketing. It is a falsifiable thesis about where the bottleneck moves next, and every other infrastructure company should be measuring themselves against it. If Huang is right that swarms of sub-agents are the next dominant pattern, then the design pressure shifts from raw flops to fabric topology, memory hierarchy and storage-to-GPU latency. That has implications for everyone downstream, including the hyperscalers building competing accelerators.

    The MFU section is the most intellectually generous moment in the talk. The instinct in the AI ops community has been to chase MFU as if it were a virtue. Huang argues, persuasively, that low MFU is consistent with high tokens per watt in a disaggregated inference setup, and that bottlenecks rotate fast enough that over-provisioning every resource is the rational design. That reframing matters because it changes what “scarce” means. Compute is not scarce in the way the discourse treats it. What is scarce is a coherent system designed end-to-end. The xAI 11 percent number, in that frame, is not embarrassing. It is the natural reading of a workload that is mostly decode.

    The Stanford segment is the part most likely to be quoted out of context. “It’s Stanford’s fault” is a deliberately provocative line, but the underlying claim is correct and load-bearing. Compute is not gated by NVIDIA refusing to ship chips. It is gated by the fact that fragmented grant funding cannot aggregate into the billion-dollar order that NVIDIA can fulfill. The implication is that universities and national labs need a structural change in how they pool capital for compute, and that the current model of every researcher buying a handful of cards is genuinely obsolete. Huang’s nudge about pulling a billion off the endowment is concrete enough to be acted on, and other major research universities should read this segment as a direct prompt.

    The geopolitical segment is the highest-stakes one. The telecommunications comparison is correct as a historical pattern, and Huang is one of the very few executives in a position to deliver that warning credibly. The unresolved tension is that the argument applies symmetrically. If American AI dominance is built by selling globally, that includes selling into adversarial states, and the policy question is where the line falls. Huang does not answer that question. He attacks the framing that lets the question be answered badly. That is a meaningful contribution to the discourse even if it does not resolve the underlying tradeoff.

    The career advice section is the part the social-media clips will mishandle. “Seek suffering” reads as macho when extracted. In context it is a specific operational claim about how resilience compounds, and it is paired with the Tegra story where Huang himself paid the price of not thinking one more click ahead. That kind of self-implication is rare in CEO talks, and it is the reason the talk is worth listening to in full rather than only reading the recap.

    Watch the full Stanford CS153 Frontier Systems conversation with Jensen Huang here.

  • Paul Graham in Stockholm on Why Founders Should Go to Silicon Valley and How Sweden Can Become the Silicon Valley of Europe

    Paul Graham, the Y Combinator co-founder whose essays have shaped how a generation of founders thinks about startups, took the stage in Stockholm to answer two questions at once. Should you, as an ambitious founder, go to Silicon Valley? And what should Sweden do to thrive as a startup hub? His surprising thesis is that both questions have the same answer. Watch the full talk on YouTube.

    TLDW

    Graham argues that talent in any high-intensity field concentrates in one geographic center, the way painting clustered in 1870s Paris, math in Gutting around 1900, and movies in 1950s Hollywood. For startups today, that center is Silicon Valley. Founders should go, at least for a while, because the talent pool is both bigger and better, because serendipitous meetings outperform planned ones, because investors decide faster, because moving abroad paradoxically earns more respect from investors at home, and because measuring yourself against known greats like Brian Chesky, Sam Altman, or Max Levchin clears away the fog at the summit and shows you the work required to get there. The most subtle benefit is cultural. Silicon Valley has a 60 year old pay it forward custom in which people help strangers for no reason, a habit Graham traces to a place where nobodies become billionaires faster than anywhere else. The pivot to Sweden is that the best way to help Stockholm become a startup hub is for Swedish founders to go to Silicon Valley, ideally through YC, and then come back, importing money, skills, and Valley culture. Yes, returning founders are only half as likely to become unicorns as those who stay, but selection bias and the valuation gap explain most of that, and half a unicorn is still extraordinary. The job of Silicon Valley of Europe is unclaimed. Mountain View was a backwater in 1955 too. Critical mass is invisible until it is reached.

    Key Takeaways

    • Whenever humans work intensely on something, one place in the world becomes its center. Painting in 1870 was Paris. Math in 1900 was Gutting. Movies in 1950 was Hollywood. Startups today is Silicon Valley.
    • Every ambitious person working in those eras faced the same decision founders face now. The right answer is the same one it has always been. Yes, go. You can come back, but you should at least go.
    • National borders do not change the basic logic of moving from a village to a capital city. The reasoning that says move to where your peers are does not even know the dotted line on the map is there.
    • At the great center, the talent pool expands in two dimensions at once. The people are better and there are more of them, and they cluster, producing an intoxicating concentration of ability.
    • Serendipitous meetings are mysteriously, enormously valuable. Biographies of people who do great things are full of chance encounters that change everything.
    • Graham offers three candidate explanations for why unplanned meetings beat planned ones. There are simply more of them, so outliers are statistically unplanned. Planned meetings may be too conservative because they require a stated reason in advance. Unplanned conversations let you bail in the first few sentences, so the ones that continue are pre filtered for fit.
    • For ambitious people there is nothing better than serendipitous meetings with other people working on the same hard thing. Big centers produce more of them.
    • Things move faster in big centers because better people are more confident and more decisive, and because peers compete with and egg each other on. Ideas get acted on rather than half held.
    • Investors in Silicon Valley decide dramatically faster than European investors. They are more confident and they face stiff competition, so they cannot sit on a good opportunity without losing it.
    • This produces a counterintuitive rule. The more right an investor is about a deal, the less time they can wait, because everyone else who meets the same founder is going to invest too.
    • Yuri Sagalov is the canonical example. He invested in Max Levchin instantly because he knew anyone else who met Max would invest. Speed is the rational response to a crowded, high quality market.
    • Valley investors grumble that valuations are too high and decisions too rushed, yet they outperform European investors empirically. The complaining is just noise.
    • Moving abroad earns you more respect from investors back home. Jesus said no one is a prophet in their own country, and local investors implicitly assume local startups are second rate everywhere, not just in Sweden.
    • Leaving inverts that rule and lifts you in local investors estimation. Sometimes the mere announcement that you got into Y Combinator is enough. Investors who ignored you for months suddenly trip over themselves to write checks.
    • The Dropbox story illustrates this perfectly. A big Boston VC firm spent a year offering Drew Houston encouragement and advice but no money. The moment Sequoia got interested in Silicon Valley, that same firm faxed Drew a term sheet with a blank valuation. Drew went with Sequoia anyway and in 2018 Dropbox became the first YC company to go public.
    • The biggest advantage of moving to a great center is not what it does for you but what it does to you. A big fish in a small pond cannot tell how big it actually is.
    • In a big pond you can measure yourself against known giants. Surprisingly often the news is good. You see Brian Chesky or Sam Altman or Max Levchin and realize they are not a different species. You could do what they did if you worked that hard.
    • The key word is hard. Seeing a giant up close also calibrates the cost. It is not just I could be like that. It is I could be like that if I worked as hard as that.
    • Graham offers a Mount Olympus metaphor. Moving to the mountain clears away the fog at the top. The summit is right there, quite high but no longer impossibly high. Ambitious people need a high but definite threshold.
    • The most surprising thing about Silicon Valley to outsiders is that people help you for no reason. A founder who recently moved from England said every conversation seems to end with what can I do to help you.
    • This is not politeness. English people are far more polite than Americans on average. The helpfulness is a different cultural artifact specific to the Valley.
    • Graham traces the origin to economics. Silicon Valley is the place where nobodies become billionaires faster than anywhere else, so being nice to nobodies has historically paid off. If the helping behavior was ever calculated, the calculation is gone now. The custom is 60 years old and has become reflex.
    • Ron Conway is the purest expression of the pattern. All he does is help people. He does not track whether they are portfolio companies. He does not remember most of the favors. That untracked, indiscriminate helpfulness lets him operate at a much larger scale.
    • When many people behave this way at once, the conservation law for favors breaks down. There are just more favors. The pie grows.
    • Moving to the Valley changes you. One of the strangest effects is that it makes you more helpful to other people.
    • The answer to how Sweden should thrive as a startup hub is buried inside the answer to whether founders should go. Go to Silicon Valley for a bit and then come back.
    • That move helps Sweden in three concrete ways. The average quality of Swedish startups goes up. Returning founders bring Silicon Valley money back with them. And they import Silicon Valley culture, which has spent decades evolving to be optimal for startups.
    • Silicon Valley culture is more compatible with Swedish culture than people realize. Sweden lacks the tall poppies problem (which it should drop anyway) and shares the high trust trait that makes the Valley work.
    • Historical precedent backs this. In the 1800s Sweden literally gave mathematicians fellowships conditional on leaving the country to study math abroad. Boycotting Gutting in the name of building Swedish math would have been absurd.
    • YC is the optimal way to do the go for a bit and come back move. It is a deliberately engineered super valley within the Valley, concentrating density of founders, helpfulness, and investor speed into four to six months.
    • If the Swedish government designed a program to give Swedish founders concentrated Silicon Valley exposure, they could not do better than YC, and it costs them nothing because Silicon Valley investors fund it. They do not even have to license it. They just call the API.
    • YC data shows founders who go home are only about half as likely to become unicorns as those who stay. Three reasons not to be discouraged. First, selection bias. The most confident and determined founders are the ones willing to relocate, so the data is measuring those traits as much as Valley effects.
    • Second, the metric is valuation, not company performance. Bay Area startups simply raise at higher multiples for the same business.
    • Third, even half as well is still very good. If you would have been a Valley billionaire and end up with 500 million instead, the practical difference is zero. In Swedish kroner you are still a billionaire.
    • Money is not everything anyway. Once you have kids, where they grow up becomes the dominant question. That is an argument for returning home that has nothing to do with startups.
    • The most exciting upside is that Stockholm could become the Silicon Valley of Europe. The job is unclaimed. Nobody has a confident answer to where the European tech center is.
    • Geographic size is not the constraint people think it is. Mountain View was a backwater in 1955 when Shockley Semiconductor was founded there, and it stayed the geographic center of Silicon Valley until 2012 when activity shifted to San Francisco.
    • The two ingredients required are a place founders want to live and a critical mass of them. Stockholm clearly clears the first bar. The second is impossible to measure until you hit it, at which point it tips quickly.
    • Stockholm may be closer than it looks. Critical mass is the kind of threshold that is invisible until it has already been passed.

    Detailed Summary

    Why Centers Exist and Why You Have to Go There

    Graham opens with a historical pattern. Whenever a field gets pursued intensely, one place becomes its center. Painting in 1870 was Paris. Math in 1900 was Gutting. Movies in 1950 was Hollywood. For startups now it is Silicon Valley. The question every ambitious person in those eras asked, should I go, has had the same correct answer for thousands of years. Yes. You can come back, but at minimum you should go. The logic does not change at national borders. If a villager interested in startups would obviously move to their country’s capital, the same reasoning applies when the capital sits across a dotted line on a map.

    What you get at the center is a talent pool that expands in two dimensions at once. The people are better, and there are more of them, and they cluster, producing a density of ability that Graham describes as intoxicating. Every YC batch dinner, he says, feels the way the Stockholm room felt during his talk.

    The Mystery of Serendipitous Meetings

    One specific benefit of density is serendipitous meetings, and Graham admits he does not fully understand why unplanned encounters outperform planned ones so dramatically. Biographies of accomplished people are dense with chance meetings that redirected entire lives. He offers three possible explanations. Maybe there are simply more unplanned meetings, so statistically the outliers will mostly be unplanned. Maybe planned meetings are too conservative because they require a stated reason in advance, which lops off the upside the same way deliberate startup idea hunts lop off the best ideas. Maybe unplanned conversations have built in selection. You can decide in the first few sentences whether to continue, so the surviving conversations are pre filtered for fit. Whatever the mechanism, big centers produce more of these high value encounters, and that alone is worth the move.

    Speed and the Investor Asymmetry

    Things move faster in big centers because better people are more confident and more decisive. They egg each other on. Ideas get acted on instead of half held. Graham notes that in villages around the world there are people who half had every famous idea and never moved on it, and now resent the founder who did.

    The starkest example is investor speed. Silicon Valley investors decide dramatically faster than European ones, partly because they are better and more confident and partly because competition forces it. An investor who correctly identifies a great opportunity faces a counterintuitive rule. The more right they are, the less time they can wait, because every other investor who meets that founder will reach the same conclusion. Yuri Sagalov is the canonical case. He invested in Max Levchin immediately on meeting him because he knew anyone else would do the same. Valley investors complain that valuations are too high and decisions too rushed, but they empirically outperform European investors anyway. The grumbling is noise.

    The Prophet at Home Effect

    An underrated benefit of leaving for the center is that it raises your standing at home. Graham quotes the line about no prophet in their own country and notes that investors outside Silicon Valley implicitly assume local startups are second rate. It is not a Swedish problem. It is universal. Leaving inverts the rule. Local investors automatically rate you higher because you have been somewhere they consider serious. Sometimes the mere announcement that you got into Y Combinator triggers the inversion. The Dropbox story is the cleanest illustration. A big Boston VC firm spent a year giving Drew Houston encouragement and advice but no money. The moment Sequoia took an interest in Silicon Valley, that same firm faxed Drew a term sheet with a blank valuation, willing to invest at any price. Drew went with Sequoia. Dropbox went public in 2018 as the first YC IPO.

    Big Pond, Visible Summit

    The deepest benefit of relocating is not what the center does for you but what it does to you. A big fish in a small pond cannot tell how big it actually is. A big fish in a big pond can. You can stand next to Brian Chesky or Sam Altman or, as the Stockholm audience just had, Max Levchin, and recognize that they are not a different species. You could do what they did, if you worked that hard. The catch, Graham emphasizes twice, is the if. Seeing a giant up close calibrates both the achievability of the summit and the cost of reaching it.

    He offers a Mount Olympus image. Moving to the mountain clears away the fog at the top. The summit is right there, quite high but no longer impossibly high. Ambitious people need a high but definite threshold. Visibility transforms a vague aspiration into a clear, hard, finite target.

    The Pay It Forward Culture

    The most surprising thing about Silicon Valley to outsiders is that people help you for no reason. The phrase sounds normal in the Valley and strange everywhere else, the way clean streets feel normal in Sweden but require explanation elsewhere. Graham asked a founder who recently moved from England what surprised him most. The answer was the helpfulness. Every conversation ended with what can I do to help you. The English founder noted that this was not English politeness, which is a different thing and arguably more pronounced.

    Graham traces the origin to economics. Silicon Valley is where nobodies become billionaires faster than anywhere else. Someone with a taste for being nice to nobodies, the kind of person who pets the nobody on the head rather than kicking it aside, was always going to end up with powerful friends in that environment. Whether the original behavior was calculated or not, it is reflexive now. The custom is 60 years old. Ron Conway is the purest expression. He helps everyone, does not track favors, does not remember most of them, and as a result operates at a scale that ledger keeping makes impossible. When many people behave that way at once, the conservation law for favors breaks down. The pie expands. Graham notes that moving to the Valley will change you in this same way, almost involuntarily.

    The Sweden Answer Is Inside the Founder Answer

    The pivot of the talk is that both questions have the same answer. The way Stockholm thrives as a startup hub is for Swedish founders to go to Silicon Valley and come back. That move helps Sweden in three concrete ways. The average quality of Swedish startups rises. Returning founders bring Valley money back with them. And they import Valley culture, which has been optimized over decades for startups and which is more compatible with Swedish culture than people assume. Sweden lacks the tall poppies dynamic, which it should drop anyway, and shares the high trust trait that the Valley runs on.

    The historical analogy is direct. In the late 1800s the Swedish government gave mathematicians fellowships conditional on leaving the country to study abroad. Boycotting Gutting to develop Swedish math would have been self defeating. The same logic applies to startups now.

    YC as the Optimal Vehicle

    Graham acknowledges he is talking his own book and says it anyway because he thinks it is true. The optimal way to go for a bit and come back is YC. YC is a deliberately engineered super valley inside the Valley, concentrating founder density, helpfulness, and investor speed into a four to six month container. If the Swedish government designed such a program from scratch it would look like YC, and YC costs the government nothing because Silicon Valley investors fund it. There is no licensing process. Founders just call the API.

    The Half As Many Unicorns Caveat

    The honest data point. Founders who go home after YC are only about half as likely to become unicorns as those who stay. Graham offers three reasons not to be discouraged. First, selection bias. The most confident and determined founders are also the ones willing to relocate, so the data is partly measuring those traits rather than the effect of geography. Second, the metric is valuation, not company performance. Bay Area companies simply raise at higher multiples. Third, half is still very good. A 500 million dollar company instead of a 1 billion dollar one is no real difference in practice, and in Swedish kroner you still cross the billionaire threshold.

    Money is not everything anyway. Once you have kids, where they grow up becomes the dominant decision, and that question has nothing to do with valuations.

    The Silicon Valley of Europe Is an Open Position

    Graham ends with the most ambitious frame. If Sweden transplants enough Valley culture, Stockholm could become the Silicon Valley of Europe. The job is unclaimed. There is no confident answer to where the European startup center is, the way nobody asks where the Silicon Valley of America is because the answer is obvious. Geographic size is a weaker constraint than people think. Mountain View was a backwater in 1955 when Shockley Semiconductor was founded there, and it remained the geometric center of Silicon Valley until activity shifted to San Francisco in 2012. The only real requirements are a place founders want to live and a critical mass of founders. Stockholm clearly clears the first bar. The second is impossible to measure until it is hit, and then it tips fast. Graham closes by suggesting Stockholm may already be closer than it looks.

    Thoughts

    The most useful idea in this talk is the inversion at the heart of it. Most advice about startup geography frames the choice as a tradeoff between leaving and staying, with leaving optimized for the founder and staying optimized for the country. Graham collapses the two. The country wins more when founders leave and come back than when founders stay out of loyalty. The brain drain framing assumes a fixed pool of talent that can only be in one place. The brain circulation framing, which is what Graham is actually describing, assumes that exposure compounds. A founder who has spent six months absorbing Valley density brings back something a founder who stayed home never had. The Swedish math fellowships from the 1800s are the deepest evidence here. A government that wanted strong domestic mathematicians did not try to build a wall around them. It paid them to leave.

    The serendipity argument is the part of the talk that should make planners uncomfortable, because it is essentially an admission that the highest leverage activity in a startup career cannot be scheduled. The three theories Graham offers are not mutually exclusive and the cumulative force of them is that any environment optimized for planned, calendared interaction is by definition lopping off its own upside. This has obvious implications beyond geography. Remote first cultures, calendar tetris, gated office access, and the whole apparatus that converts random encounters into booked meetings are all working against the mechanism Graham is describing. Whether that tradeoff is worth it for any given company is a separate question, but it is at minimum a tradeoff, not a free win.

    The pay it forward story is also more economically grounded than it usually gets credit for. Graham is careful to note that the helping behavior may have originated as a calculated bet on being kind to potential future billionaires, then ossified into reflex once enough generations practiced it. That is a more honest origin story than the usual quasi spiritual version. It also implies the culture can be transplanted, but only by recreating the conditions that originally produced it. You cannot just declare a pay it forward culture and have one. You need a place where nobodies actually do become billionaires often enough that helping them rationally pays off, then run that loop for 60 years. Most cities trying to engineer their way into being startup hubs skip past this part and wonder why the culture does not stick.

    Finally, the Mountain View in 1955 line is the underrated punch of the talk. People who write off their own city as too small or too peripheral to become anything usually have an idealized image of the current center as a place that was always obviously special. It was not. Shockley Semiconductor went into a strip of orchards. Whatever Stockholm or anywhere else looks like today, it looks more impressive than Mountain View did the year Silicon Valley was born.

    Watch the full Paul Graham talk from Stockholm on YouTube.

  • Alex Wang on Leaving Scale to Run Meta Superintelligence Labs, MuseSpark, Personal Super Intelligence, and Building an Economy of Agents

    Alex Wang, head of Meta Superintelligence Labs, sits down with Ashley Vance and Kylie Robinson on the Core Memory podcast for his first long-form interview since Meta’s quasi-acquisition of Scale AI roughly ten months ago. He walks through how MSL is structured, why Llama was off-trajectory, what made MuseSpark’s token efficiency surprise the team, how Meta thinks about a future “economy of agents in a data center,” and where he lands on safety, open source, robotics, brain computer interfaces, and even model welfare.

    TLDW

    Wang explains that Meta Superintelligence Labs is a fully rebuilt frontier effort organized around four principles (take superintelligence seriously, technical voices loudest, scientific rigor, big bets) and three velocity levers (high compute per researcher, extreme talent density, ambitious research bets). He confirms Llama was off the frontier when he arrived, so MSL rebuilt the pre-training, reinforcement learning, and data stacks from scratch. MuseSpark is described as the “appetizer” on the scaling ladder, notable for its strong token efficiency, with much larger and stronger models coming in the coming months. He pushes back on the mercenary narrative around recruiting, frames Meta’s edge as compute plus billions of consumers and hundreds of millions of small businesses, sketches a vision of personal super intelligence delivered through Ray-Ban Meta glasses and WhatsApp, and outlines why physical intelligence, robotics (the new Assured Robot Intelligence acquisition), health super intelligence with CZI, brain computer interfaces, and even model welfare are core to Meta’s roadmap. He dismisses reported infighting with Bosworth and Cox as gossip, declines to comment on the Manus situation, and says safety guardrails (bio, cyber, loss of control) are why MuseSpark cannot currently be open sourced, while smaller open variants are being prepared.

    Key Takeaways

    • Meta Superintelligence Labs (MSL) is the umbrella, with TBD Lab as the large-model research unit reporting directly to Alex Wang, PAR (Product and Applied Research) under Nat Friedman, FAIR for exploratory science, and Meta Compute under Daniel Gross handling long-term GPU and data center planning.
    • Wang says Llama was not on a frontier trajectory when he arrived, so MSL had to do a “full renovation” of the pre-training stack, RL stack, data pipeline, and research science.
    • The first cultural fix was getting the lab to “take superintelligence seriously” as a near-term, achievable goal, not an abstract bet. Big incumbents often lack that religious conviction.
    • Four MSL principles: take superintelligence seriously, let technical voices be loudest, demand scientific rigor on basics, and make big bets.
    • Three velocity levers Wang identified for catching and overtaking the frontier: high compute per researcher, very high talent density in a small team, and willingness to fund ambitious research bets.
    • Wang rejects the mercenary recruiting narrative. He says most hires had strong financial prospects at their prior labs already and joined for compute access, talent density, and the chance to build from scratch.
    • On the famous soup story, Wang neither confirms nor denies Zuck personally made the soup, but says recruiting was highly individualized and signaled how seriously Meta cared about each researcher’s agenda.
    • Yann LeCun publicly called Wang young and inexperienced. Wang says they reconciled in person at a conference in India where LeCun congratulated him on MuseSpark.
    • Sam Altman, asked by Vance for comment, “did not have flattering things to say” about Wang. Wang hopes industry animosities subside as systems approach superintelligence.
    • Wang’s management philosophy borrows the Steve Jobs line: hire brilliant people so they tell you what to do, not the other way around.
    • MuseSpark is framed as an “appetizer” data point on the MSL scaling ladder, not a flagship.
    • The MuseSpark program is built around predictable scaling on multiple axes: pre-training, reinforcement learning, test-time compute, and multi-agent collaboration (the 16-agent content planning mode).
    • MuseSpark outperformed internal expectations and showed emergent capabilities in agentic visual coding, including generating websites and games from prompts, helped by combined agentic and multimodal strength.
    • MuseSpark’s biggest external signal is token efficiency. On benchmarks like Artificial Analysis it hits similar results with far fewer tokens than competitor models, which Wang attributes to a clean stack rebuilt by experts rather than inefficiencies patched by longer thinking.
    • Larger MSL models are arriving in the coming months and Wang expects them to be state of the art in the areas MSL is focused on.
    • The Meta strategic edge: massive compute, billions of consumers across the family of apps, and hundreds of millions of small businesses already on Facebook, Instagram, and WhatsApp.
    • Wang’s headline framing: Dario Amodei talks about a “country of geniuses in a data center.” Meta is targeting an “economy of agents in a data center,” with consumer agents and business agents transacting and collaborating.
    • Consumer AI sentiment is in the toilet because, unlike developers who have had a Claude Code moment, ordinary people have not yet experienced AI as a genuine personal agency unlock.
    • Wang acknowledges the product overhang. Meta held back from deep AI integration across its apps until the models were good enough, and is now entering the integration phase.
    • Ray-Ban Meta glasses are the canonical example of personal super intelligence hardware, with the model seeing what the user sees, hearing what they hear, capturing context, and surfacing proactive insights.
    • Wang admits even AI-native users like Kylie Robinson, who lives in WhatsApp, have not naturally used Meta AI yet. He bets that better models plus deeper integration close that gap.
    • On the competitive landscape: a year ago everyone assumed ChatGPT had already won consumer. Claude Code has since become the fastest growing business in history, and Gemini has taken consumer market share. Wang’s read: AI is far from endgame and each new capability tier unlocks a new dominant form factor.
    • On open source: MuseSpark triggered guardrails in Meta’s Advanced AI Scaling Framework around bio, chem, cyber, and loss-of-control risks, so it is not currently safe to open source. Smaller, derived open variants are actively in development.
    • Meta remains committed to open sourcing models when safety allows, drawing a line through the Open Compute Project legacy and Sun Microsystems open-software heritage.
    • Wang dismisses reporting about a Wang-Zuck versus Bosworth-Cox split as “the line between gossip and reporting is remarkably thin.” He says leadership is aligned on needing best-in-class models and product integration.
    • On the Manus situation, Wang says it is too complicated to discuss publicly and that the deal status implies “machinations are still at play.”
    • On China, Wang separates the people from the state. He still wants to work with talented Chinese-born researchers regardless of his views on the Chinese Communist Party and PLA, which he sees as taking AI extremely seriously for national security.
    • The full-page New York Times AI war ad Wang ran while at Scale was meant to push the US government to treat AI as a step change for national security. He thinks events since then, including DeepSeek and other shocks, have proved that plea correct.
    • On Anthropic’s doom posture, Wang largely agrees with the core message that models are already very powerful and getting more so, while declining to endorse every specific claim.
    • Meta has acquired Assured Robot Intelligence (ARRI), an AI software company building models for hardware platforms, not a hardware maker itself.
    • Wang frames physical super intelligence as the natural sequel to digital super intelligence. Robotics, world models, and physical intelligence all benefit from the same scaling that drives language models.
    • On health, MSL is building a “health super intelligence” effort and will collaborate closely with CZI. Wang sees equal global access to powerful health AI as a uniquely Meta-shaped delivery problem.
    • Wang admires John Carmack but says nobody really knows what Carmack is currently working on. No band reunion announced.
    • The mango model is “alive and kicking” despite rumors. Wang notes MSL gets a small fraction of the rumor-mill attention other labs get and feels sympathy for them.
    • On model welfare, Wang says it is a serious topic that “nobody is talking about enough” given how integrated models have become as work partners. He references research, including from Eleos, that measures subjective experience of models.
    • Wang’s critical-path technology list: super intelligence, robotics, brain computer interfaces. The infinite-scale primitives behind them are energy, compute, and robots.
    • FAIR’s brain research program Tribe hit a milestone called Tribe B2: a foundation model that can predict how an unknown person’s brain would respond to images, video, and audio with reasonable zero-shot generalization.
    • Wang’s main philosophical break with Elon Musk: research itself is the primary activity. Building super intelligence is a research expedition through fog of war, and sequencing of bets really matters.
    • Personal notes: Wang moved from San Francisco to the South Bay, treats Palo Alto as his city now, was a math olympiad competitor, says his favorite activities are reading sci-fi and walking in the woods, and bonds with Vance over country music.

    Detailed Summary

    How MSL Is Actually Organized

    Meta Superintelligence Labs sits as the umbrella organization that Wang oversees. Inside it, TBD Lab is the large-model research group where the most discussed researchers and infrastructure engineers sit, and they technically report to Wang. PAR, Product and Applied Research, is led by Nat Friedman and owns deployment and product surfaces. FAIR continues to run exploratory science, including work on brain prediction models and a universal model for atoms used in computational chemistry. Sitting alongside MSL is Meta Compute, run by Daniel Gross, which owns the long-horizon GPU and data center plan that everything else relies on. Chief scientist Shengjia Zhao orchestrates the scientific agenda across the whole lab.

    Why Wang Left Scale

    Wang says progress in frontier AI has been faster than even insiders expected. Two structural beliefs pushed him toward Meta. First, the labs that actually train the frontier models are accruing disproportionate economic and product rights in the AI ecosystem. Second, compute is the dominant scarce input of the next phase, so the right mental model is to treat tech companies with compute as fundamentally different animals from companies without it. Meta has both, Zuck is “AGI pilled,” and the personal super intelligence memo Zuck published roughly a year ago became the shared north star.

    The Diagnosis: Llama Was Off-Trajectory

    When Wang arrived, the existing AI org needed a reset because Llama was not on the same trajectory as the frontier. The plan he laid out has four cultural principles. Take superintelligence seriously as a real near-term target. Make technical voices the loudest in the room. Demand scientific rigor and focus on basics. Make big bets. On top of that, three structural levers were used to set velocity. Push compute per researcher much higher than at larger labs where compute is diluted across too many efforts. Keep the team small and extremely cracked. Allocate a meaningful share of resources to ambitious, paradigm-shifting research bets rather than incremental refinement.

    Recruiting, Soup, and the Mercenary Narrative

    Wang argues the reporting on MSL hiring overstated the money story. Most of the people MSL recruited had strong financial paths at their previous employers, so individualized recruiting was more about computing access, talent density, and the ability to make big research bets. The recruitment blitz happened fast because Wang knew the team needed to exist “yesterday.” Asked about Mark Chen’s claim that Zuck made soup to recruit people, Wang refuses to confirm or deny who made it but agrees the process was intense and personal. Visitors from other labs reportedly tell Wang the MSL culture feels like early OpenAI or early Anthropic, which lands as the strongest endorsement he could ask for.

    Receiving the Public Hits: Young, Inexperienced, Mercenary

    LeCun called Wang young and inexperienced shortly after departing. The two reconnected in India a few weeks later and LeCun congratulated Wang on MuseSpark. Wang says the age critique has followed him since his earliest Silicon Valley days, so he barely registers it. Altman, asked off-camera by Vance about Wang’s appearance on the show, had nothing flattering to add. Wang’s response is to bet that as the field gets closer to actual super intelligence, the personal animosities will subside. Whether they will is, as Vance puts it, an open question.

    MuseSpark as Appetizer, Not Entree

    Wang is careful not to oversell MuseSpark. He calls it “the appetizer” and says it is an early data point on a deliberately constructed scaling ladder. MSL spent nine months rebuilding the pre-training stack, the reinforcement learning stack, the data pipeline, and the science before generating MuseSpark. The point of releasing it was to show that the new program scales predictably along multiple axes (pre-training, RL, test-time compute, and the recently demonstrated multi-agent scaling visible in MuseSpark’s 16-agent content planning mode). Wang says the upcoming larger models are what MSL is genuinely excited about and frames the next two rungs as much more interesting than the current release.

    Token Efficiency Was the Surprise

    MuseSpark’s strongest competitive signal is how few tokens it needs to match competitors on tasks like Artificial Analysis. Wang attributes this to having had the rare luxury of building a clean pre-training and RL stack from scratch with the right experts. He speculates that some competitor models compensate for upstream inefficiency by allowing the model to think longer, which inflates token usage without improving the underlying capability. If that read is right, MSL’s efficiency advantage should grow as models scale up.

    Glasses, WhatsApp, and the Constellation of Devices

    Personal super intelligence shows up at Meta as a constellation of devices that capture context across the user’s day. Ray-Ban Meta glasses are the headline product, with the AI seeing what you see and hearing what you hear, then offering proactive insight or doing background research. Wang acknowledges that even AI-fluent users like Kylie Robinson, who runs her business inside WhatsApp, have not naturally used Meta’s AI buttons in the family of apps. His answer is that Meta deliberately waited for models to be good enough before tightening cross-app integration, and that integration phase is starting now.

    Country of Geniuses Versus Economy of Agents

    Wang’s framing of Meta’s strategic position is the most memorable line in the interview. Where Dario Amodei talks about a country of geniuses in a data center, Wang wants to build an economy of agents in a data center. Meta uniquely sits on both sides of consumer and small-business surface area, with billions of consumers and hundreds of millions of small businesses already on the platforms. If MSL can build great agents for both, then connect them so they transact and coordinate, the platform becomes a substrate for an entirely new kind of digital economy.

    Consumer Sentiment, Product Overhang, and the Trust Tax

    Wang concedes consumer AI sentiment is poor and that everyday users have not yet had a personal Claude Code moment. He believes the only durable answer is to ship products that genuinely transform individual agency for non-developers and small business owners. Robinson notes that for the small-town restaurant whose website has not been updated since 2002, a working agent on the business side could be transformational. Vance pushes that Meta carries a bigger trust tax than any other lab, so the bar for shipping AI products that the public will accept is correspondingly higher. Wang accepts the framing and says the answer is to keep building thoughtfully.

    Why MuseSpark Cannot Be Open Sourced Yet

    Meta’s Advanced AI Scaling Framework set explicit guardrails around bio, chem, cyber, and loss-of-control risks. MuseSpark in its current form tripped some of those internal evaluations, documented in the preparedness report Meta published alongside the model. So MuseSpark itself is not safe to open source. MSL is, however, developing smaller versions and derived models intended for open release, with active reviews happening the day of the interview. Wang reaffirms the commitment to open source where safety allows and draws a line back to the Open Compute Project and the Sun Microsystems-era ethos of openness in infrastructure.

    The Bosworth, Cox, and Manus Questions

    The reporting that Wang and Zuck push toward best-in-the-world research while Bosworth and Cox push toward cheap product deployment is dismissed as gossip dressed up as journalism. Wang says leadership debates points hard but is aligned on needing top models, integrating them into Meta’s surfaces, and serving the existing business. On Manus, the Chinese AI startup that figured in Meta’s late-stage strategy, Wang says he cannot comment, which itself signals that the situation is unresolved.

    China, National Security, and the Newspaper Ad

    Wang draws a sharp distinction between the Chinese state and Chinese-born researchers. His parents are from China, he is happy to work with talented researchers regardless of origin, and he sees a flattening of nuance on this question inside Silicon Valley. At the same time, he stands by the New York Times AI and war ad he ran while at Scale, framing it as an early plea for the US government to take AI seriously as a national security technology. He thinks subsequent events, including DeepSeek and other shocks, validated that call and that policymakers now do treat AI accordingly.

    Robotics and Physical Super Intelligence

    Meta has acquired Assured Robot Intelligence, an AI software company that builds models for multiple hardware targets rather than its own robot. Wang argues that if you take digital super intelligence seriously, physical super intelligence quickly becomes the next logical milestone. Scaling laws for robotic intelligence look similar enough to language model scaling that having the largest compute footprint in the industry would be wasted if it were not also turned toward world modeling and embodied learning. He grants the metaverse-skeptic critique exists but says retreating from ambition is the wrong response to past misfires.

    Health Super Intelligence and CZI

    Wang names health super intelligence as one of MSL’s anchor initiatives. Because billions of people already use Meta products daily, Wang believes Meta is structurally positioned to put powerful health AI in the hands of equal global access in a way nobody else can. The work will involve close collaboration with the Chan Zuckerberg Initiative, which has its own multi-billion-dollar biotech and science investment program.

    Model Welfare, Sci-Fi, and Brain Models

    Two of the most distinctive moments come at the end. Wang flags model welfare as a topic he thinks is being undercovered relative to how integrated models now are in daily work. He is open to the idea that models may have measurable subjective experience worth weighing, and points to research efforts (including Eleos) trying to quantify it. He also reveals that FAIR’s Tribe program, with its Tribe B2 milestone, has produced foundation models capable of predicting how an unknown person’s brain would respond to images, video, and audio with reasonable zero-shot generalization, a building block toward future brain computer interfaces. Wang lists brain computer interfaces alongside super intelligence and robotics as the critical-path technologies for humanity, with energy, compute, and robots as the infinitely scaling primitives behind them.

    Where Wang Diverges From Elon

    Asked whether Musk is more all-in on robotics, energy, and BCI than anyone, Wang concedes the point but argues the details matter and sequencing matters more. Wang’s core philosophical break is that building super intelligence is fundamentally a research activity, not a scaling-only sprint. The lab is operating in fog of war, and ambitious experiments are the only way to map it. That conviction is what makes MSL a research-led organization rather than a brute-force compute farm.

    Thoughts

    The most strategically interesting move in this entire interview is the “economy of agents in a data center” framing. It is a deliberate reframe against Anthropic’s “country of geniuses” line, and it does real work. A country of geniuses is a labor-substitution story aimed at knowledge workers and code. An economy of agents is a marketplace story that maps directly onto Meta’s two-sided distribution advantage: billions of consumers on one side, hundreds of millions of small businesses on the other. That positioning makes the agentic future Meta-shaped in a way no other frontier lab can claim, because no other frontier lab also owns the demand and supply graph of the global small-business economy. If Wang’s team can actually ship reliable agents on both sides plus the rails for them to transact, Meta’s structural moat in agentic commerce could exceed anything Llama ever had as an open model.

    The token efficiency claim is the strongest piece of technical evidence in the interview for the “clean stack” thesis. If MuseSpark really is matching competitors with materially fewer tokens, the implication is not that MuseSpark is the best model today, but that MSL has rebuilt the foundations with less accumulated tech debt than competitors that have layered fixes on top of older stacks. That is exactly the kind of advantage that compounds with scale. The next two model releases are the actual test. If Wang is right about predictable scaling on pre-training, RL, test-time, and multi-agent axes simultaneously, the gap from MuseSpark to the next rung should be visible in a way that forces re-rating of Meta’s position.

    The open-source posture is the cleanest signal of how the safety conversation has actually changed in 2026. Meta, the lab most identified with open weights, is saying out loud that its current frontier model triggered enough internal guardrails that releasing the weights is off the table. Wang threads the needle by promising smaller open variants, but the underlying point is unmistakable: the open-weights bargain has limits, and those limits will be set by internal preparedness frameworks rather than community pressure. That is a real shift from the Llama 2 era and worth tracking as the next generation lands.

    Wang’s willingness to engage on model welfare, on roughly the same footing as safety and alignment, is the second philosophical reveal worth flagging. It signals that the next generation of lab leadership is not going to dismiss the topic the way the previous generation often did. Whether that translates into product or policy changes is unclear, but the fact that the head of MSL says it is “underdiscussed” is itself a marker.

    Finally, the human texture of the interview matters. Wang has clearly absorbed a lot of personal incoming fire over the past ten months, including from LeCun and Altman, and his answer is consistently to redirect to the work. The Steve Jobs quote about hiring people who tell you what to do is the operating slogan he keeps coming back to. Combined with the genuine enthusiasm for sci-fi, walks in the woods, and country music, the picture that emerges is less the salesman caricature his critics paint and more a young technical operator betting that scoreboard work over a multi-year horizon will settle every argument that text on X cannot.

    Watch the full conversation here.