PJFP.com

Pursuit of Joy, Fulfillment, and Purpose

Tag: Alphabet

  • Gavin Baker on Orbital Compute, TSMC, Frontier AI Models, Anthropic’s Vertical Take Off, and the Coming Wafer Shortage

    Gavin Baker, founder and CIO of Atreides Management, returns to Patrick O’Shaughnessy’s Invest Like the Best for his sixth appearance. He calls the current AI moment the most extraordinary moment in the history of capitalism, walks through what Anthropic’s vertical takeoff in revenue actually means, lays out why orbital compute is closer than skeptics believe, dissects the TSMC bottleneck that may be the only thing standing between today’s market and a full-on AI bubble, and rates every hyperscaler on how they have positioned for a world where frontier model providers may stop selling API access altogether.

    TLDW

    Anthropic added eleven billion dollars of ARR in a single month, which is roughly the combined business of Palantir, Snowflake, and Databricks built over a decade. That is the setup. From there Gavin Baker covers the March and April selloff, the contrarian read that a closed Strait of Hormuz was actually bullish for American manufacturing competitiveness, why Anthropic and OpenAI multiples may be misleadingly cheap on an unconstrained run rate basis, why Elon Musk’s discipline on SpaceX valuation created a superpower of permanent access to capital, the practical engineering case for orbital compute as racks in space rather than Pentagon sized space stations, why TSMC’s capacity discipline is the single most important variable in whether the AI cycle becomes a bubble, what Terafab in Texas changes, why the Pareto frontier of AI models has flipped from Google dominance to Anthropic and OpenAI dominance in nine months, the shift from all you can eat AI subscriptions to usage based pricing and what that means for revenue scaling, Richard Sutton’s bitter lesson as the largest risk to the AI trade, why frontier tokens still capture an overwhelming share of economic value, the role of continual learning as the third great open question, why most new chip startups should not try to build a better GPU, why Cerebras did something different and hard, why disaggregated inference may extend GPU useful lives to ten or fifteen years and rescue the private credit industry, why being in the token path is the new venture filter, the new prisoner’s dilemma around releasing frontier models via API, an honest rating of Google, Meta, Amazon, and Microsoft, why personal safety is becoming a real AI era risk, and why he remains an AI optimist maximalist who believes this could be the next Pax Americana.

    Key Takeaways

    • Anthropic added eleven billion dollars of ARR in one month, more than the combined businesses of Palantir, Snowflake, and Databricks built across a decade. There is no precedent for this in the history of capitalism.
    • The SaaS and cloud revolution created between five and ten trillion dollars of value over twenty years. AI is replaying that compression on a timeline measured in months.
    • The March selloff was a drawdown driven by disagreement with price action, not invalidated thesis. That is the kind of drawdown an investor can lean into.
    • Deep Seek Monday in January 2025 was a similar setup. By the day of the selloff, AWS Asia GPU prices had already doubled, GPU availability had fallen, and it was obvious reasoning models would be vastly more compute hungry at inference. The market priced the opposite.
    • The Strait of Hormuz closing was actually positive for America. US natural gas (the primary input into US electricity, which feeds AI) fell twenty percent on Bloomberg while Asian and European natural gas doubled or tripled. American manufacturing competitiveness improved overnight.
    • The US is now the world’s largest producer and exporter of oil and gas. The economy is dramatically less energy intensive than in the 1970s. The shortage trauma comparison does not hold.
    • Tech as a sector traded as cheaply versus the rest of the market in early April as at any point in the last ten years, into the single most bullish moment for AI fundamentals on record.
    • Anthropic is dramatically more capital efficient than OpenAI, having burned roughly eighty percent less to reach a similar revenue scale. They have very different structural returns on invested capital.
    • Anthropic at roughly nine hundred billion for fifty billion of ARR (growing a thousand percent) is striking. Adjusted for compute constraint, the unconstrained run rate could be one hundred fifty to two hundred billion, putting the implied multiple closer to five times.
    • Claude Opus generates roughly seventy percent fewer tokens for the same question than previously, with token quantity tied to answer quality. Subscribers on flat-fee plans are getting a lobotomized model.
    • Elon Musk’s superpower is twenty years of making investors money. He never pushes valuation. SpaceX compounded low thirty percent per year for a decade because Musk treats fair pricing as a sacred covenant.
    • Capitalism will solve the watts shortage. The current bottleneck has shifted from chips and energy to zoning and political approval. Many capex decisions are paused until after the US midterms.
    • The watts shortage probably begins to alleviate in 2027 and 2028. Orbital compute solves it longer term.
    • Orbital compute is not Pentagon sized data centers in space. It is racks in space. A Blackwell rack is three thousand pounds, eight feet tall, four feet deep, three feet wide. SpaceX has shown a satellite roughly that size.
    • The satellites operate in sun synchronous orbit so solar wings (around five hundred feet per side) always face the sun and the radiator on the dark side always points to deep space.
    • Starlink V3 satellites already run at around twenty kilowatts. A Blackwell rack runs at one hundred kilowatts. SpaceX engineers express genuine confidence they have already solved cooling and radiator design at these scales.
    • Racks in space are connected with lasers traveling through vacuum, the same lasers already on every Starlink. SpaceX operates the world’s largest satellite fleet and, via xAI Colossus, the world’s largest data center on Earth.
    • Inference will move to orbit. Training will stay on Earth for a long time. Terrestrial data centers remain valuable for the rest of an investor’s career.
    • The wafer bottleneck is structural and political. TSMC is essentially Taiwan’s GDP, water, and electricity. The leaders see themselves as inheritors of Morris Chang’s sacred legacy and they do not behave like a Western public company.
    • Jensen Huang has never had a contract with TSMC. The relationship is run on handshakes and the assumption that things will be fair over time.
    • If TSMC did everything Jensen wanted, Nvidia could be selling two to three trillion dollars of GPUs in 2026 and 2027. TSMC’s discipline is the single largest factor preventing a true AI bubble.
    • Historically, foundational technologies always get a bubble. Railroads, canals, the internet. The current AI buildout is overwhelmingly funded out of operating cash flow, GPUs are running at one hundred percent utilization, and that is fundamentally different from the year 2000 fiber overbuild.
    • If one of Intel or Samsung Foundry catches up at the leading node, the other will follow, and TSMC’s discipline collapses. Watch TSMC capacity decisions to predict a bubble.
    • Terafab, the SpaceX and Tesla joint venture to build the world’s largest fab in America, has a partnership with Intel that grants access to fifty years of institutional foundry knowledge. The A teams at ASML, KLA, Lam Research, and Applied Materials will follow Elon’s reputation in hardware engineering.
    • The hiring playbook for Terafab includes building Taiwan Town, Japan Town, and Korea Town next to the fab. Recruit the engineers and import their families, their restaurants, and their staff.
    • Frontier tokens still capture an overwhelming share of all economic value created at the model layer. This is surprising and is one of the three big open questions for AI investing.
    • The Pareto frontier of intelligence versus cost has flipped. Nine months ago Google’s TPU dominated every point on the frontier. Today Anthropic and OpenAI dominate, with Grok 4.3 on the frontier and Gemini 3.1 hanging on.
    • Google’s conservative TPU V8 design (partly an attempt to reduce dependence on Broadcom and Nvidia) is the leading explanation for the loss of per token cost leadership.
    • AI pricing is shifting from all you can eat to usage based, mirroring the cellular and long distance industries. Cellular stopped being a great growth industry when it went all you can eat. AI just made the opposite move.
    • OpenAI and Anthropic together could exceed two hundred billion in ARR this year if compute keeps coming online and frontier token pricing holds.
    • The two hundred fifty dollar a month consumer AI plan is no longer enough to evaluate frontier capability. Enterprise plans with usage based billing are required because rate limits are now severe.
    • The three biggest open questions for AI investors are: violation of the bitter lesson via ASI or human ingenuity, whether frontier tokens keep commanding their premium, and when continual learning arrives.
    • Today’s continual learning is crude reinforcement learning during mid training on verifiable tasks. True continual learning means weights updating dynamically, like a human who learns the first time they touch fire.
    • Trying to build a better GPU is a losing strategy. Jensen will copy any one to three percent share design. Startups should target one percent share, do something different, and make it hard enough that Nvidia cannot fast follow.
    • Disaggregated inference (separating prefill and decode) opens new design canvases. Prefill is memory capacity bound. Decode is memory bandwidth bound. Each can be optimized independently.
    • Cerebras did something different and hard with wafer scale computing. Three generations of chips and real grit to get there.
    • Disaggregation of inference may stretch GPU useful lives to ten or fifteen years, dropping financing costs from low sevens to five or six percent, mathematically lowering the cost of the AI buildout and likely saving the private credit industry from its SaaS loan exposure.
    • Sellers of shortage outperform buyers of shortage. But owning the largest installed base of what is currently in shortage (hyperscaler CPU fleets, for example) is also a strong position.
    • Most of the economic value at the application layer of AI has been destroyed, not created. The exceptions are companies in the token path or in niches small enough that frontier labs ignore them.
    • Coding may be the shortest path to ASI. If you can write code, you can write code that does anything. Cursor, Cognition, and Anthropic correctly focused on it.
    • Jensen could probably get close to the frontier with his own Nemotron family of models whenever he wants. The fact that he chooses not to is a strategic decision about not commoditizing his customers.
    • The new prisoner’s dilemma in AI is whether frontier labs release their best model via API. If everyone agrees not to, Chinese open source falls behind. If anyone defects, the defector pulls ahead on revenue and resources, forcing everyone else to defect.
    • Google still owns the largest compute installed base. Without TPU’s prior cost advantage, this matters more. YouTube data has real value in a world of robotics. GCP is going crazy.
    • Meta deserves credit for becoming AI first internally faster than any other internet giant. Musa, their first MSL model, is impressively close to the Pareto frontier.
    • Amazon is strong because of Trainium and robotics driven retail P&L efficiency. Nova is better than it gets credit for.
    • Microsoft flinched on capex in early 2025 and lost position. Satya Nadella’s current decision to use Microsoft compute for Microsoft products rather than reselling to OpenAI is a courageous and probably correct call, even at the cost of an eight hundred dollar stock price.
    • The hyperscalers most engaged with startups are Amazon and Nvidia by a mile, followed by Google. Broadcom is the favorite ASIC partner. AMD, Microsoft, and Meta have minimal startup engagement and that will cost them as the best teams are now at startups.
    • Personal safety in an AI era requires a family or company safe word that cannot be socially engineered. Deepfake voice and video extortion at the speed of FaceTime is already feasible.
    • Ukraine is winning largely on the back of having the best battlefield AI outside America and Israel. Adversaries are starting to internalize what AI dominance means geopolitically.
    • An optimistic read is that this becomes a new Pax Americana, the way the post 1945 American nuclear monopoly was used to rebuild Germany and Japan rather than dominate.
    • AI cured a friend’s daughter’s rare disease by spinning up a research effort that identified a market drug capable of impacting her condition. That is the upside that keeps Gavin an AI optimist maximalist.

    Detailed Summary

    The most extraordinary moment in the history of capitalism

    Gavin’s framing of the current moment is unusually direct. Anthropic added eleven billion dollars of annual recurring revenue in a single month. The three highest profile SaaS companies of the last decade plus, Palantir, Snowflake, and Databricks, took a decade and tens of thousands of employees collectively to build the combined business that Anthropic added in thirty days. He has been investing through every major tech cycle and says there is no historical analog. Not the dotcom era, not the cloud transition, not mobile. This is its own thing.

    The market response, then, was peculiar. The NASDAQ sold off into the single most bullish moment for AI fundamentals on record. Tech traded at roughly its widest discount versus the rest of the market in a decade. Investors who said they wished they had bought into AI during 2022, during COVID, or during Deep Seek Monday got the same valuation setup again in early April, this time with an even clearer inflection.

    Why the Strait of Hormuz closing was secretly bullish for America

    One reason the macro fear in March may have been mispriced is that the same geopolitical event that drove the selloff was, in practice, a relative benefit to the United States. American natural gas, the input into American electricity, which is the input into American AI training and inference, fell roughly twenty percent. Asian and European natural gas prices doubled or tripled. The US emerged with sharply improved relative manufacturing competitiveness, which is exactly what the current administration cares about.

    The 1970s comparison does not hold. The US economy is dramatically less energy intensive, it is now the world’s largest producer and largest exporter of oil and gas, and there are no shortages, only price moves. That backdrop made it easier for disciplined investors to stay focused on AI fundamentals through the volatility.

    Anthropic and OpenAI valuations on an unconstrained run rate

    Anthropic at roughly nine hundred billion for fifty billion of ARR sounds rich until you adjust for the fact that the company is severely compute constrained. Gavin estimates that, unconstrained, Anthropic might be at one hundred fifty to two hundred billion in run rate revenue, putting the implied multiple closer to five times. He also points out that Claude Opus now generates roughly seventy percent fewer tokens for the same question than it used to. Token quantity correlates with answer quality, and Anthropic is rate limiting and shrinking outputs to ration capacity across its user base.

    Anthropic and OpenAI are also structurally very different. Anthropic has burned around eighty percent less cash than OpenAI to reach a comparable revenue scale. That implies very different long term returns on invested capital, though OpenAI has done a better job locking in compute and Sarah Friar is one of the most exceptional CFOs Gavin has worked with.

    Why neither lab is raising at a three trillion dollar valuation

    The answer Gavin gives is that both labs are deliberately leaving valuation on the table the way Elon has done for two decades. SpaceX compounded at low thirty percent annually for a decade because Elon never pushed price. The result is a permanent superpower of access to capital. Investors trust him because they have made money with him for twenty years. That is a moat that compounds with every round.

    Anthropic could probably raise at a one hundred percent premium to its rumored latest mark. They are choosing not to. In an uncertain world (Ukraine, Russia, Iran, Taiwan), preserving the ability to raise more capital later at fair prices is more valuable than maximizing this round.

    Watts and wafers, the two real constraints

    Capitalism is solving the watts problem. The leading PE infrastructure investors now say zoning and political approval, not chips or energy, are the gating factors. Companies are deferring big capex announcements until after the US midterms. Turbine capacity is being doubled at the manufacturers. Companies like Boom Aerospace are repurposing jet engines for grid use. Watts probably ease meaningfully in 2027 and 2028 and then orbital compute does the rest.

    Wafers are the harder problem because they live in Taiwan, run on handshakes, and depend on a corporate culture that does not respond to public market incentives. TSMC is essentially the GDP, water consumption, and electricity consumption of Taiwan. Its leadership treats the company as the legacy of Morris Chang. The Silicon Shield doctrine is real and internal.

    Orbital compute as racks in space

    The biggest mental update Gavin asks listeners to make is to stop picturing data centers in space as Pentagon sized space stations. A Blackwell rack is three thousand pounds and roughly the size of a refrigerator. SpaceX has shown a concept satellite of about that size. Solar wings extend five hundred feet to each side and the radiator extends hundreds of feet behind, both possible because the orbit is sun synchronous and the orientation is fixed relative to the sun.

    SpaceX engineers Gavin has spoken to at Starbase express genuine confidence that they have solved cooling at these power levels. They have. Starlink V3 satellites already operate at twenty kilowatts. A Blackwell rack is one hundred kilowatts. The same company operates the world’s largest satellite fleet and the world’s largest data center on Earth via xAI Colossus. The racks are connected to each other with lasers traveling through vacuum, technology already deployed in every Starlink. The naysayers, Gavin observes, are armchair skeptics and Larry Ellison’s response (he is out there landing rockets, no one else is) is the right frame.

    Terafab in Texas and the threat to TSMC’s discipline

    Terafab, the SpaceX and Tesla joint venture, intends to be the largest fab in the world. The partnership with Intel grants access to fifty years of foundry institutional knowledge, allowing Terafab to start three to five quarters behind the leading node rather than fifteen years behind. The A teams at the semicap equipment companies (ASML, KLA, Lam Research, Applied Materials) will follow Elon’s reputation in hardware engineering the same way they followed TSMC twenty years ago when Intel stumbled.

    The talent strategy is the part most observers underestimate. Recruit the best engineers globally, then import their families, their restaurants, their staff. Build Taiwan Town, Japan Town, and Korea Town next to the fab. Optimize the human experience for the people whose work matters. Intel and Samsung do not think that way.

    Bubble watch and the year 2000 comparison

    Every foundational technology in modern history has had a bubble. Railroads, canals, the internet. Carlota Perez documented why. Markets correctly identify the importance, diversity of opinion collapses, supply gets ahead of demand, the bubble crashes. The current cycle has two important differences. The buildout is overwhelmingly funded out of operating cash flow, not debt. Every GPU is running at one hundred percent utilization, while at the peak of the fiber bubble ninety nine percent of fiber was unused.

    TSMC discipline is the single largest reason a bubble has not formed. If Jensen could buy everything TSMC could theoretically make, Nvidia could sell two to three trillion dollars of GPUs in 2026 and 2027. At some point that becomes more than the market can absorb. If Intel or Samsung Foundry catches up at the leading node, the other will too. TSMC’s pricing discipline collapses and the bubble starts.

    The Pareto frontier and the loss of Google’s cost advantage

    The most important chart in AI is the Pareto frontier of model intelligence versus per token cost. Nine months ago, Google’s TPU based models dominated every point on it. OpenAI, Anthropic, and xAI sat inside the frontier. Today the frontier is dominated by Anthropic and OpenAI, with Grok 4.3 on the frontier and Gemini 3.1 hanging on by subsidization more than economics. The most likely cause is Google’s conservative TPU V8 design, an attempt to reduce dependence on Broadcom and Nvidia that sacrificed per token economics.

    The bitter lesson, frontier tokens, and continual learning

    Three open questions dominate AI investing. The first is whether Richard Sutton’s bitter lesson (more compute beats human algorithmic cleverness) gets violated by ASI itself optimizing for efficiency. Closer observers of AI are more skeptical of a violation. Gavin thinks ASI’s first move will be to make itself more efficient and more resourced, which is technically a temporary violation.

    The second is whether frontier tokens keep capturing the overwhelming share of economic value at the model layer. Today they do, surprisingly. Gemini 3.1 Pro was mindblowing nine months ago and is intolerable today. The third is when continual learning arrives. Today’s models need a million fire touches to learn what a human learns from one. True continual learning would mean dynamic weight updates in real time and would produce a fast takeoff.

    From all you can eat to usage based AI pricing

    AI is shifting from flat fee plans to usage based pricing. The historical analogy is cellular and long distance. Both stopped being great growth industries when they went all you can eat. AI just made the opposite move. The consequence is that flat fee subscribers, even on premium consumer plans, get a rate limited and token throttled version of the frontier model. Enterprise plans with usage based billing are now required to evaluate true capability. Gavin thinks the combination of new compute coming online and usage based pricing is what gets OpenAI and Anthropic past two hundred billion in combined ARR this year.

    Chip startups, prefill decode disaggregation, and Cerebras

    Trying to build a better GPU is the wrong move. The four scaled players (Nvidia, AMD, Trainium, TPU) have copy capability for any one to three percent share design that looks attractive. The good news for startups is that disaggregated inference (separating prefill and decode) opens a richer design canvas. Prefill is memory capacity bound. Decode is memory bandwidth bound. Each can be optimized independently. Andrew Fox’s analogy is a British naval ship of the eighteenth century. Prefill is loading the cannon. Decode is firing it.

    Cerebras is the model. Wafer scale computing is genuinely different and genuinely hard. It took three generations of chips to get right. Andrew Feldman and his team had the grit to keep going through chip one being a failure. The design has a high ratio of on chip compute and memory relative to shoreline IO, which is why Cerebras is now experimenting with putting an optical wafer on top of the compute wafer to solve scale out.

    GPU useful lives and the rescue of private credit

    One of the strongest claims in the conversation is that disaggregated inference will stretch GPU useful lives to ten or fifteen years. The skeptical narrative (GPUs are obsolete in two years, companies are cooking their depreciation books) is wrong. You can put a Cerebras system or Groq LPU in front of older Hopper or Ampere parts, use them only for prefill, and run them until they physically melt. Private credit, which is in pain from SaaS loans and which underwrote GPU loans on three to four year lives, may be saved by this.

    If GPU financing rates can come down from low sevens to five or six percent, the mathematics of the AI buildout improves materially. That is a structural tailwind that compounds for years.

    The application layer, the token path, and a new prisoner’s dilemma

    Trillions of dollars of value have been destroyed at the application layer, not created. Cursor and Cognition are the rare scaled exceptions, and they got there by focusing on coding very early. As Amjad Masad noted, coding is plausibly the shortest path to ASI because a coding agent can write itself into any new domain. Jamin Ball’s frame is that the new venture filter is whether the company is in the token path. Data Bricks is. Most application layer startups are not.

    Jensen could probably get close to the frontier with Nemotron whenever he wants, and the strategic question of whether to do that is a new prisoner’s dilemma. If every frontier lab agrees not to release best models via API, Chinese open source falls steadily behind. If anyone defects, the defector gains revenue and resources, and everyone else has to defect. The same dynamic exists between TSMC, Intel, and Samsung. If Nvidia or AMD ever truly used an alternative foundry, that foundry would catch up rapidly.

    Rating the hyperscalers

    Google has the largest compute installed base, the YouTube data that matters in a robotics world, and a search business that prints. Their loss of TPU cost leadership is the surprise of the year. If Google IO in five days does not produce a leapfrog model, the Nvidia centric narrative gets even stronger.

    Meta deserves real credit. Zuckerberg made Meta AI first internally faster than any other internet giant, paid up for the talent contracts when no one else would, and shipped Musa as a first model from MSL that is close to the Pareto frontier. Amazon is well positioned on Trainium, robotics in retail, and a Nova model line that is better than it gets credit for. Microsoft flinched on capex in early 2025 and lost position. Satya Nadella’s current decision to use Microsoft compute for Copilot rather than reselling to OpenAI is courageous and probably correct, even at the cost of stock price.

    The most interesting cross hyperscaler metric is startup engagement. Nvidia and Amazon engage deeply with startups. Google is next. Broadcom is the favored ASIC partner. AMD, Microsoft, and Meta have minimal startup engagement, which Gavin believes will cost them as the best teams now sit at startups.

    Personal safety, geopolitics, and the Pax Americana case

    The closing section turns darker. Personal safety in an AI era requires a family or company safe word that cannot be socially engineered. Deepfake voice and video extortion via something that looks exactly like your child calling on FaceTime is already feasible. Political violence against AI leaders is a real concern. Geopolitically, Ukraine is winning largely because it has the best battlefield AI outside America and Israel. How adversaries respond to that asymmetry is the next great variable.

    Gavin’s optimistic frame is the Pax Americana. After 1945 the US had a nuclear monopoly and could have controlled the world. Instead it rebuilt Germany and Japan, both of which became the most reliable American allies for the next eighty years. If AI dominance plays out similarly, this is a generationally positive story rather than a destabilizing one. The personal anecdote that closes the conversation is a friend whose daughter was diagnosed with a rare genetic condition. He spun up agents, identified a drug already on the market that addresses her mutation, and her life is immeasurably different because of AI. That is the upside.

    Thoughts

    The Anthropic eleven billion in a month framing is the kind of stat that resets priors. The right way to interpret it is not as a one off but as a measure of how fast value can compound when the underlying technology improves on a curve steeper than the ability of the rest of the economy to absorb it. The skeptical question is whether that ARR is durable or whether it is heavily tied to a customer base of other AI companies that are themselves on a single venture funded year of runway. The bullish answer is that frontier coding, frontier research, and frontier enterprise tasks are not going to stop being valuable, and Anthropic is the best at all three. Both can be true. The number is still extraordinary.

    The argument that TSMC discipline is the only thing preventing a bubble is the analytically tightest part of the conversation. The implied trade is to watch TSMC capacity additions like a hawk and to be more, not less, cautious if Intel Foundry or Samsung Foundry ever announce real share at the leading node. The Terafab thesis is more speculative but more interesting. If Elon’s talent recruiting playbook works and the Intel partnership gives Terafab a real seat at the table within five years, the geometry of the global semiconductor industry shifts in a way that is bullish for American manufacturing, bullish for power and water infrastructure in Texas, and ambiguous for TSMC itself.

    The Pareto frontier discussion deserves more attention than it usually gets. Pricing leadership in AI is not a vanity metric. It determines who can subsidize free tier usage, who can absorb compute shortages, who can ship cheaper enterprise plans, and ultimately whose model becomes the default for any given workload. Google losing per token leadership in nine months is one of the most under analyzed events in the sector and it explains a lot about why Anthropic and OpenAI are growing the way they are. If Google IO does not produce a leapfrog model, the implied verdict on TPU V8 design choices gets a lot harsher.

    The application layer destruction point is worth sitting with. Founders building on top of frontier models are competing in a world where the model itself moves faster than any moat they can build, where the model lab can absorb their niche if it gets interesting, and where the only protection is either deep token path integration or a niche so small the lab does not bother. That is a much harsher venture environment than the early SaaS era. The compensating opportunity is that one human can now run a hundred agents, so the ceiling on what a small team can build is correspondingly higher. The bet is that productivity per founder rises faster than competitive pressure from the labs. We will find out.

    The orbital compute pitch is the section that will polarize listeners. The naive read is that this is science fiction. The closer read is that every component (sun synchronous orbit, laser interconnect, twenty kilowatt satellite buses, ten thousand satellite manufacturing cadence, full rocket reusability) already exists. The remaining engineering problems are repair, maintenance, and radiator scale, all of which are real but tractable on a five to ten year horizon. The strategic implication is that the political and zoning ceiling on terrestrial data centers becomes less binding if orbital compute is a credible alternative for inference workloads. The investor implication is that being short the watts and cooling complex on a five year horizon is a real trade, not a meme.

    Watch the full conversation here.

  • Sundar Pichai on the All-In Podcast: Unpacking Alphabet’s AI Future, Competitive Pressures, and the Next $100B Bets

    TLDW (Too Long; Didn’t Watch):

    Sundar Pichai, CEO of Alphabet, sat down with the All-In Podcast to discuss AI’s seismic impact on Google Search, the company’s infrastructure and model advantages, the future of human-computer interaction, intense competition (including from China), energy constraints, long-term bets like quantum computing and robotics, and the evolving culture at Google. He remains bullish on Google’s ability to navigate disruption and lead in the AI era, emphasizing a “follow the user” philosophy and relentless innovation.

    Executive Summary: Navigating the AI Revolution with Sundar Pichai

    In a comprehensive and candid interview on the All-In Podcast (dated May 16, 2025), Alphabet CEO Sundar Pichai offered deep insights into Google’s strategy amidst the transformative wave of Artificial Intelligence. Pichai addressed the “innovator’s dilemma” head-on, asserting Google’s proactive stance in evolving its core Search product with AI, rather than fearing self-disruption. He detailed Google’s significant infrastructure advantages, including custom TPUs, and differentiation in foundational models. The conversation spanned the future of human-computer interaction, the burgeoning competitive landscape, critical energy constraints for AI’s growth, and Google’s “patient” investments in quantum computing and robotics. Pichai also touched upon fostering a high-performance, mission-driven culture and clarified Alphabet’s structure as a technology-first company, not just a holding entity. The overarching theme was one of optimistic resilience, with Pichai confident in Google’s capacity to innovate and lead through this pivotal technological shift.

    Key Takeaways from Sundar Pichai’s All-In Interview:

    • AI is an Opportunity, Not Just a Threat to Search: Google sees AI as the biggest driver for Search progress, expanding query types and user engagement, not a zero-sum game. “AI Mode” is coming to Search.
    • Disrupting Itself Proactively: Pichai rejects the “innovator’s dilemma” if a company leans into user needs and innovation, citing mobile and YouTube Shorts as examples. Cost per AI query is falling; latency is a bigger challenge.
    • Infrastructure is a Core Differentiator: Google’s decades of investment in custom hardware (TPUs – now 7th gen “Ironwood”), data centers, and full-stack approach provide a significant cost and performance advantage for training and serving AI models. 50% of 2025 compute capex ($70-75B total) goes to Google Cloud.
    • Foundational Model Strength: Google believes its models (like Gemini 2.5 Pro and Flash series) are at the frontier, with ongoing progress in LLMs and beyond (e.g., world models, diffusion models). Data from Google products (with user permission) offers a differentiation opportunity.
    • Human-Computer Interaction is Evolving Towards Seamlessness: Pichai sees AR glasses (not immersive displays) as a potential next leap, making computing ambient and intuitive, though system integration challenges remain.
    • Energy is a Critical Constraint for AI Growth: Pichai acknowledges electricity as a major gating factor for AI progress and GDP, advocating for innovation in solar, nuclear, geothermal, grid upgrades, and workforce development.
    • Long-Term Bets on Quantum and Robotics:
      • Quantum Computing: Pichai believes quantum is where AI was in 2015, predicting a “useful, practical computation” superior to classical within 5 years. Google is at the frontier.
      • Robotics: The combination of AI with robotics is creating a “sweet spot.” Google is developing foundational models (vision, language, action) and exploring product strategies, expecting a “magical moment” in 2-3 years.
    • Culture of Innovation and Accountability: Google aims to empower employees within a mission-focused framework, learning from the WFH era and fostering intensity, especially in teams like Google DeepMind. The goal is to attract and retain top talent.
    • Competitive Landscape is Fierce but Expansive: Pichai respects competitors like OpenAI, Meta, XAI, and Microsoft, and acknowledges China’s (e.g., DeepSeek) rapid AI progress. He believes AI is a vast opportunity, not a winner-take-all market.
    • Alphabet’s Structure: More Than a Holding Company: Alphabet leverages foundational technology and R&D across its businesses (Search, YouTube, Cloud, Waymo, Isomorphic, X). It’s about differentiated value propositions, not just capital allocation.
    • Founder Engagement: Larry Page and Sergey Brin are deeply engaged, with Sergey actively coding and contributing to Gemini, providing “unparalleled energy.”
    • Regrets & Pride: Pichai is proud of Google’s ability to push foundational R&D into impactful products. A “small regret” includes not acquiring Netflix when intensely debated internally.

    In what can only be described as a pivotal moment for the technology landscape, Sundar Pichai, the CEO of Alphabet and Google, joined David Friedberg and discussed the pressing questions surrounding Google’s dominance, its response to the AI revolution, and its vision for the future. This wasn’t just a cursory Q&A; it was a strategic deep-dive into the mind of one of tech’s most influential leaders.

    (2:58) The Elephant in the Room: Will AI Kill Search? Google’s Strategy for Self-Disruption

    The conversation immediately tackled the “innovator’s dilemma,” a theory that haunts established giants when new paradigms emerge. Friedberg directly questioned if AI, with its chat interfaces and complete answers, poses an existential threat to Google’s $200 billion search advertising cash cow.

    Pichai’s response was a masterclass in strategic framing. He emphasized that Google has been “AI-first” for nearly a decade, viewing AI not as a threat, but as the primary driver for advancing Search. “We really felt that AI is what will drive the biggest progress in search,” Pichai stated. He pointed to the success of AI Overviews, now used by 1.5 billion users, which are expanding the types of queries people make. Empirically, Google sees query growth and increased engagement where AI Overviews are triggered.

    Critically, Pichai revealed a “whole new dedicated AI experience called AI mode coming to search,” promising a full-on conversational AI experience powered by cutting-edge models. This mode sees users inputting queries “literally long paragraphs,” two to three times longer than traditional search queries. He dismissed the “dilemma” framing: “The dilemma only exists if you treat it as a dilemma… you have to innovate to stay ahead.” He drew parallels to Google’s successful navigation of the mobile transition and YouTube’s thriving alongside TikTok by launching Shorts, even when monetization wasn’t immediately clear. The guiding principle remains: “Follow the user, all else will follow.”

    Addressing the unit economics, Pichai downplayed concerns about the cost of serving AI queries, stating, “Google with its infrastructure, I’d wager on that… the cost to serve that query has fallen dramatically in an 18-month time frame.” Latency, he admitted, is a more significant constraint than cost. For ad revenue, AI Overviews are already at baseline parity with traditional search, with potential for improvement as AI can better match commercial intent with relevant information.

    (15:32) The Unseen Fortress: Infrastructure Advantage and Foundational Model Differentiation

    A cornerstone of Google’s confidence lies in its unparalleled infrastructure. Pichai highlighted Google’s position on the “Pareto frontier of performance and cost,” delivering top models cost-effectively. This is largely due to their custom-built Tensor Processing Units (TPUs). “We are in our seventh generation of TPUs,” Pichai noted, with the latest “Ironwood” generation offering over 40 exaflops per part. This full-stack approach, from subsea cables to custom chips, is crucial for serving AI at scale and managing costs.

    Regarding the hefty $70-75 billion capex projected for 2025, Pichai clarified that roughly half of the compute spend is allocated to Google Cloud, supporting its enterprise offerings and enabling innovation from Google DeepMind across various AI domains – not just LLMs, but also image, video, and “world models.”

    When asked about Nvidia, Pichai expressed “extraordinary respect” for Jensen Huang and Nvidia’s “world-class” software stack. While Google trains its Gemini models on TPUs internally, they also use Nvidia GPUs and offer them to cloud customers. “I like that flexibility,” he said, “but we are also long-term committed to the TPU direction.”

    On the topic of foundational model performance, Pichai acknowledged that progress isn’t always linear (“artificial jag jag intelligence,” as Andrej Karpathy termed it). However, he sees continuous progress and believes Google is “pushing the research frontier in a much broader way than most other people beyond just LLMs.” He doesn’t see fundamental roadblocks to further advancements yet, though progress gets harder, which he believes will distinguish elite teams. He also touched upon the “differentiated innovation opportunity” of leveraging data from Google’s suite of products (like Gmail, Calendar, YouTube) with user permission to create superior, personalized experiences.

    (25:08) The Future of Human-Computer Interaction, Hardware, and the AI Competitive Landscape

    Looking ahead, Pichai envisions human-computer interaction becoming more seamless, where “computing kind of works for you.” He sees AR glasses – not immersive VR displays, but glasses that augment reality ambiently – as a potential “next leap,” comparable to smartphones in 2006-2007. “When AR really works, I think that’ll wow people,” he mused, while acknowledging existing system integration challenges.

    The competitive landscape is undeniably intense. Pichai spoke respectfully of OpenAI (Sam Altman), XAI (Elon Musk), Meta (Mark Zuckerberg), and Microsoft (Satya Nadella), calling them an “impressive group” driving rapid progress. “I think all of us are going to do well in this scenario,” he suggested, emphasizing that AI represents a “much bigger landscape opportunity than all the previous technologies we have known combined.” He even noted that “companies we don’t even know… might be extraordinarily big winners.”

    The discussion also covered China’s AI prowess, particularly highlighted by DeepSeek’s efficient models. Pichai admitted that DeepSeek made many “adjust our priors a little bit” about how close Chinese R&D is to the frontier, though he noted Google’s Flash models benchmarked favorably. “China will be very, very competitive on the AI frontier,” he affirmed.

    A significant portion of this section involved the engagement of Google’s founders, Larry Page and Sergey Brin. Pichai described them as “deeply involved in their own unique ways,” with Sergey Brin actively “sitting and coding” with the Gemini team, looking at loss curves and model architectures. “To have a founder sitting there… it’s a rare, rare place to be,” Pichai shared, valuing their “nonlinear thinking.”

    (35:29) The Energy Bottleneck: AI’s Thirst for Power

    A critical, and often underestimated, constraint for AI’s future is energy. Pichai agreed with Elon Musk’s concerns, identifying electricity as “the most likely constraint for AI progress and hence by definition GDP growth.” He stressed this is an “execution challenge,” not an insurmountable physics barrier. Solutions involve embracing innovations in solar (plus batteries), nuclear (SMRs, fusion), geothermal, alongside crucial grid upgrades, streamlined permitting, and addressing workforce shortages (e.g., electricians). While Google faces current supply constraints and project delays due to these factors, Pichai expressed faith in the US’s ability to innovate and meet the moment, driven by capitalist solutions.

    (41:20) Google’s Moonshots: Quantum Computing and Robotics

    Pichai reiterated Google’s commitment to long-term, patient R&D, citing Waymo as an example of perseverance.

    Quantum Computing: The Next Frontier

    He likened the current state of quantum computing to where AI was around 2015. “I would say in a 5-year time frame, you would have that moment where some a really useful practical computation… is done in a quantum way far superior to classical computers.” Despite the “noise” in the industry, Pichai is “absolutely confident” in Google’s leading position and expects more exciting announcements this year that will “expand people’s minds.”

    Robotics: AI Embodied

    The synergy between AI and robotics is creating a “next sweet spot.” Google, with its “world-class” vision-language-action models (Gemini robotics efforts), is actively planning its next moves. While past ventures into the application layer of robotics might have been premature, the current AI advancements make the field ripe for breakthroughs. “We are probably two to three years away from that magical moment in robotics too,” Pichai predicted, suggesting Google could develop something akin to an “Android for robotics” or offer its models like Gemini to power third-party hardware. He mentioned Intrinsic, an Alphabet company, as already working in this direction.

    (47:56) Culture, Coddling, and Talent in the Age of AI

    Addressing narratives about Google’s “coddling” culture, Pichai explained the original intent behind perks like free food: to foster collaboration and cross-pollination of ideas. While acknowledging the need to constantly refine culture, he emphasized that empowering employees remains a source of strength. He highlighted the intensity and mission-focus within teams like Google DeepMind, where top engineers often work in person five days a week.

    “We are not all here in the company to resolve all our personal differences,” he stated. “We are here because you’re excited about… innovating in the service of the mission of the company.” The COVID era was a “big distortion,” and bringing people back, even in a hybrid model, has been crucial. He believes Google continues to attract top-tier talent, including the best PhD researchers, and that the current “exciting and intense” AI moment fosters a sense of optimism reminiscent of early Google.

    (56:50) Alphabet’s Identity: Beyond a Holding Company

    Pichai clarified that Alphabet isn’t a traditional holding company merely allocating capital. Instead, it’s built on a “foundational technology basis,” leveraging core R&D (like AI, quantum, self-driving tech) to innovate across diverse businesses. “Waymo is going to keep getting better because of the same work we do in Gemini,” he illustrated. The common strand is deep computer science and physics-based R&D, with X (formerly Google X) continuing to play a role as an incubator for moonshots like sustainable agriculture (Tapestries) and grid modernization.

    Reflections: Regrets and Pride

    When asked about his biggest regrets and proudest achievements, Pichai expressed immense pride in Google’s unique ability to “push the technology frontier” with foundational R&D and translate it into valuable products and businesses. As for regrets, he mentioned, “There are acquisitions we debated hard, came close.” When pressed for a name, he hesitantly offered, “Maybe Netflix. We debated Netflix at some point super intensely inside.” He framed these not as deep regrets but as acknowledgments of alternate paths in a world of “butterfly effects.”

    Sundar Pichai’s appearance on the All-In Podcast painted a picture of a leader and a company that are not just reacting to the AI revolution but are actively shaping it. With a clear-eyed view of the challenges and an unwavering belief in Google’s innovative capacity, Pichai’s insights suggest that Alphabet is determined to remain at the forefront of technological advancement for years to come.