PJFP.com

Pursuit of Joy, Fulfillment, and Purpose

Tag: Google

  • Anthropic Raises $65 Billion Series H at $965 Billion Valuation to Fund AI Safety Research and Massive Compute Expansion

    Anthropic has closed one of the largest private financing rounds in the history of technology, raising $65 billion in Series H funding at a $965 billion post-money valuation. The round, announced on May 28, 2026, lands as demand for Claude reaches what the company calls historic levels, and it positions Anthropic to pour fresh capital into safety research, compute, and the products that enterprises now lean on every day.

    TLDR

    Anthropic raised $65 billion in its Series H at a $965 billion post-money valuation, with Altimeter Capital, Dragoneer, Greenoaks, and Sequoia Capital leading and Capital Group, Coatue, D1 Capital Partners, GIC, ICONIQ, and XN co-leading, alongside $15 billion in previously committed hyperscaler investment that includes $5 billion from Amazon. The raise follows Anthropic crossing $47 billion in run-rate revenue earlier in May 2026, and it funds three priorities named by CFO Krishna Rao: advancing safety and interpretability research, expanding compute capacity to meet growing Claude demand, and scaling the products and partnerships customers depend on. On the infrastructure side, the company is locking in gigawatt-scale compute through 5 gigawatts with Amazon, 5 gigawatts of TPU capacity via Google and Broadcom, GPU access from SpaceX, and supply from partners Micron, Samsung, and SK hynix, while Claude remains available across all three major cloud platforms, AWS, Google Cloud, and Microsoft Azure, with widespread enterprise adoption across industries.

    Thoughts

    Start with the number that everyone will fixate on. A $965 billion post-money valuation against $47 billion in run-rate revenue is roughly 20 times sales, and for a company growing this fast that multiple is not the interesting part. The interesting part is that run-rate revenue crossed $47 billion earlier this month, which means the denominator is moving so quickly that the multiple is already stale. Investors are not pricing the business Anthropic is today. They are pricing the slope. A 20x multiple on a number that may double again inside a year is a very different bet than 20x on a flat line, and the lead names here (Altimeter, Dragoneer, Greenoaks, Sequoia, with Capital Group, Coatue, GIC and others co-leading) are not the kind of capital that pays for nostalgia. They are paying for the second derivative.

    But the real story is not the valuation. It is the compute. Read the infrastructure list carefully and you see the actual problem this round solves: 5 gigawatts from Amazon, 5 gigawatts of TPU capacity through Google and Broadcom, GPU access from SpaceX, and memory supply locked down with Micron, Samsung, and SK hynix. That is more than 10 gigawatts of secured power and silicon. The constraint on frontier AI in 2026 is no longer talent or even algorithms. It is electricity, land, and the multi-year queue for advanced packaging and high-bandwidth memory. You cannot buy 10 gigawatts on a quarterly basis. You reserve it years out, and you need the balance sheet to make those commitments credible. A $65 billion raise is, in plain terms, the down payment that lets Anthropic sign for capacity nobody can conjure on demand. The money is downstream of the megawatts.

    The diversification across that compute stack matters as much as the size. By splitting between Amazon’s infrastructure, Google and Broadcom’s custom TPUs, and SpaceX-supplied GPUs, Anthropic is refusing to become hostage to any single supplier’s roadmap or pricing. Custom silicon through Broadcom in particular is a bet on bending the cost curve, because the long-term economics of serving Claude at this scale depend on dollars per token, not just on raw availability. Anyone who has watched cloud lock-in play out over the last decade understands the move. Optionality at the hardware layer is leverage, and leverage is what keeps margins from being dictated by whoever owns the only fab slot you can reach.

    It is worth pausing on the fact that the round explicitly funds safety and interpretability research alongside scaling, and not as a footnote. Most companies treat safety spend as a cost center to be minimized once growth kicks in. Naming it first, ahead of compute and products, is a statement about where Anthropic believes its durable advantage sits. If models keep getting more capable, the binding constraint on deployment inside regulated industries (finance, healthcare, government) becomes trust, not intelligence. Interpretability is the work that turns a black box into something an enterprise risk committee can actually sign off on. Framed that way, safety research is not philanthropy subtracted from the bottom line. It is the thing that unlocks the most lucrative and defensible parts of the market, and pairing it with the scaling budget is the tell.

    Finally, look at distribution. Claude now ships on all three major clouds at once: AWS, Google Cloud, and Microsoft Azure. In a market where most frontier labs are tethered to a single hyperscaler, being available everywhere enterprises already run their workloads is a structural edge. It removes the procurement friction of asking a customer to adopt a new vendor relationship, and it means Anthropic competes on the merits of the model rather than on which cloud a buyer happened to standardize on years ago. Combine that omnipresent distribution with the compute reservations and the explicit safety mandate, and the shape of the strategy is clear. This is not a company buying time. It is a company buying the three things that actually compound: capacity that cannot be rushed, trust that cannot be faked, and reach into every place where work already happens.

    Key Takeaways

    • Anthropic raised $65 billion in its Series H funding round, one of the largest private financings in the history of the technology industry.
    • The round set Anthropic’s post-money valuation at $965 billion, placing the company within reach of the $1 trillion mark.
    • Altimeter Capital, Dragoneer, Greenoaks, and Sequoia Capital led the Series H round.
    • Capital Group, Coatue, D1 Capital Partners, GIC, ICONIQ, and XN served as co-leads on the investment.
    • The new capital builds on $15 billion in previously committed hyperscaler investments, which includes $5 billion from Amazon.
    • Anthropic crossed $47 billion in run-rate revenue earlier in May 2026, reflecting the surging commercial demand for Claude.
    • A core priority for the funding is to advance Anthropic’s safety and interpretability research.
    • The company will use the capital to expand compute capacity in order to meet growing demand for Claude.
    • Anthropic plans to scale the products and partnerships that customers depend on across its business.
    • CFO Krishna Rao said the funding will help Anthropic serve the historic demand it is experiencing, stay at the research frontier, and bring Claude to more of the places where work happens.
    • Amazon is providing 5 gigawatts of compute capacity as part of Anthropic’s infrastructure expansion.
    • Google and Broadcom are supplying 5 gigawatts of TPU capacity to power Claude’s growth.
    • SpaceX is contributing GPU access to Anthropic’s compute footprint.
    • Micron, Samsung, and SK hynix are partnering with Anthropic on memory and infrastructure to support its scaling needs.
    • Claude is available on all three major cloud platforms, AWS, Google Cloud, and Microsoft Azure.
    • Anthropic reports widespread enterprise adoption of Claude across a broad range of industries.

    Detailed Summary

    The Raise and the Valuation

    Anthropic has raised $65 billion in Series H funding, a round that values the company at $965 billion on a post-money basis. The size of the raise places it among the largest private financing events the technology industry has ever seen, and the valuation pushes Anthropic to the doorstep of the trillion dollar mark. The capital arrives at a moment when demand for the company’s Claude models has accelerated sharply, and the round is built to fund the response to that demand rather than simply mark a milestone. Anthropic framed the financing in its Series H announcement as the fuel for staying at the research frontier while scaling the infrastructure and products that customers increasingly rely on.

    Who Put In the Money

    The Series H was led by Altimeter Capital, Dragoneer, Greenoaks, and Sequoia Capital, a group that combines deep growth-stage technology experience with conviction in Anthropic’s long-term trajectory. Joining as co-leads were Capital Group, Coatue, D1 Capital Partners, GIC, ICONIQ, and XN, a roster that spans crossover funds, sovereign wealth, and institutional investors. Beyond the new equity, Anthropic pointed to $15 billion in previously committed hyperscaler investment, including $5 billion from Amazon. Taken together, the investor base reflects a mix of financial backers and strategic partners with a direct stake in seeing Claude reach more customers and more compute.

    Revenue at $47 Billion Run-Rate

    Underpinning the valuation is a business that has scaled with unusual speed. Anthropic crossed a $47 billion run-rate revenue figure earlier in May 2026, a number that signals how quickly enterprises and developers have adopted Claude across their workflows. Run-rate revenue annualizes the company’s most recent performance, and at this level it puts Anthropic firmly among the fastest growing software businesses on record. That financial momentum is the practical justification for both the round’s size and the near trillion dollar valuation investors were willing to support.

    The Compute Buildout

    A large share of the strategy behind the raise centers on securing compute at enormous scale. Anthropic detailed a set of infrastructure partnerships designed to keep pace with Claude demand. Amazon is providing 5 gigawatts of capacity, while Google and Broadcom together are supplying 5 gigawatts of TPU capacity. SpaceX is contributing GPU access, broadening the range of silicon Anthropic can draw on. Supporting the buildout on the hardware supply side are Micron, Samsung, and SK hynix, the memory and component partners whose output is essential to standing up data centers at this magnitude. The combined picture is a company assembling power, chips, and supply chain commitments measured in gigawatts rather than racks.

    Where the Money Goes

    Anthropic outlined three priorities for the new capital. The first is to advance safety and interpretability research, continuing the work of understanding how models behave and ensuring they remain reliable as they grow more capable. The second is to expand compute capacity to meet the growing demand for Claude, the practical engine behind the infrastructure commitments above. The third is to scale the products and partnerships that customers depend on, deepening the company’s reach into the tools and platforms where work actually happens. Krishna Rao, Anthropic’s chief financial officer, said the funding “will help us serve the historic demand we are experiencing, stay at the research frontier, and bring Claude to more of the places where work happens.”

    Claude Everywhere

    The funding lands on top of a distribution footprint that already spans the major cloud ecosystems. Claude is available on all three leading cloud platforms, AWS, Google Cloud, and Microsoft Azure, which means enterprises can reach the models through whichever provider they have standardized on. That availability has translated into widespread enterprise adoption across industries, from software and finance to healthcare and beyond. By being present everywhere developers and businesses already operate, Anthropic positions Claude not as a destination customers must travel to but as a capability woven into the platforms they use every day.

    Notable Quotes

    This funding will help us serve the historic demand we are experiencing, stay at the research frontier, and bring Claude to more of the places where work happens.

    Krishna Rao, CFO at Anthropic, on the purpose of the Series H round.

    Advance safety and interpretability research, expand compute capacity to meet growing Claude demand, and scale products and partnerships customers depend on.

    How Anthropic describes its use of funds from the round.

    For the full details on the round, the lead and co-lead investors, and how Anthropic plans to deploy the capital across safety research, compute, and products, read the full announcement here.

    Related Reading

    • Anthropic, the AI safety and research company behind Claude that raised this Series H round.
    • Sequoia Capital, one of the lead investors anchoring the financing.
    • Amazon Web Services, one of the three major cloud platforms where Claude is available and the source of a $5 billion investment.
    • Google Cloud TPUs, the tensor processing units behind the 5 gigawatts of TPU capacity in the Google and Broadcom partnership.
    • AI safety, the research field at the center of how Anthropic says it will use the new funding.
  • Gavin Baker on Orbital Compute, TSMC, Frontier AI Models, Anthropic’s Vertical Take Off, and the Coming Wafer Shortage

    Gavin Baker, founder and CIO of Atreides Management, returns to Patrick O’Shaughnessy’s Invest Like the Best for his sixth appearance. He calls the current AI moment the most extraordinary moment in the history of capitalism, walks through what Anthropic’s vertical takeoff in revenue actually means, lays out why orbital compute is closer than skeptics believe, dissects the TSMC bottleneck that may be the only thing standing between today’s market and a full-on AI bubble, and rates every hyperscaler on how they have positioned for a world where frontier model providers may stop selling API access altogether.

    TLDW

    Anthropic added eleven billion dollars of ARR in a single month, which is roughly the combined business of Palantir, Snowflake, and Databricks built over a decade. That is the setup. From there Gavin Baker covers the March and April selloff, the contrarian read that a closed Strait of Hormuz was actually bullish for American manufacturing competitiveness, why Anthropic and OpenAI multiples may be misleadingly cheap on an unconstrained run rate basis, why Elon Musk’s discipline on SpaceX valuation created a superpower of permanent access to capital, the practical engineering case for orbital compute as racks in space rather than Pentagon sized space stations, why TSMC’s capacity discipline is the single most important variable in whether the AI cycle becomes a bubble, what Terafab in Texas changes, why the Pareto frontier of AI models has flipped from Google dominance to Anthropic and OpenAI dominance in nine months, the shift from all you can eat AI subscriptions to usage based pricing and what that means for revenue scaling, Richard Sutton’s bitter lesson as the largest risk to the AI trade, why frontier tokens still capture an overwhelming share of economic value, the role of continual learning as the third great open question, why most new chip startups should not try to build a better GPU, why Cerebras did something different and hard, why disaggregated inference may extend GPU useful lives to ten or fifteen years and rescue the private credit industry, why being in the token path is the new venture filter, the new prisoner’s dilemma around releasing frontier models via API, an honest rating of Google, Meta, Amazon, and Microsoft, why personal safety is becoming a real AI era risk, and why he remains an AI optimist maximalist who believes this could be the next Pax Americana.

    Key Takeaways

    • Anthropic added eleven billion dollars of ARR in one month, more than the combined businesses of Palantir, Snowflake, and Databricks built across a decade. There is no precedent for this in the history of capitalism.
    • The SaaS and cloud revolution created between five and ten trillion dollars of value over twenty years. AI is replaying that compression on a timeline measured in months.
    • The March selloff was a drawdown driven by disagreement with price action, not invalidated thesis. That is the kind of drawdown an investor can lean into.
    • Deep Seek Monday in January 2025 was a similar setup. By the day of the selloff, AWS Asia GPU prices had already doubled, GPU availability had fallen, and it was obvious reasoning models would be vastly more compute hungry at inference. The market priced the opposite.
    • The Strait of Hormuz closing was actually positive for America. US natural gas (the primary input into US electricity, which feeds AI) fell twenty percent on Bloomberg while Asian and European natural gas doubled or tripled. American manufacturing competitiveness improved overnight.
    • The US is now the world’s largest producer and exporter of oil and gas. The economy is dramatically less energy intensive than in the 1970s. The shortage trauma comparison does not hold.
    • Tech as a sector traded as cheaply versus the rest of the market in early April as at any point in the last ten years, into the single most bullish moment for AI fundamentals on record.
    • Anthropic is dramatically more capital efficient than OpenAI, having burned roughly eighty percent less to reach a similar revenue scale. They have very different structural returns on invested capital.
    • Anthropic at roughly nine hundred billion for fifty billion of ARR (growing a thousand percent) is striking. Adjusted for compute constraint, the unconstrained run rate could be one hundred fifty to two hundred billion, putting the implied multiple closer to five times.
    • Claude Opus generates roughly seventy percent fewer tokens for the same question than previously, with token quantity tied to answer quality. Subscribers on flat-fee plans are getting a lobotomized model.
    • Elon Musk’s superpower is twenty years of making investors money. He never pushes valuation. SpaceX compounded low thirty percent per year for a decade because Musk treats fair pricing as a sacred covenant.
    • Capitalism will solve the watts shortage. The current bottleneck has shifted from chips and energy to zoning and political approval. Many capex decisions are paused until after the US midterms.
    • The watts shortage probably begins to alleviate in 2027 and 2028. Orbital compute solves it longer term.
    • Orbital compute is not Pentagon sized data centers in space. It is racks in space. A Blackwell rack is three thousand pounds, eight feet tall, four feet deep, three feet wide. SpaceX has shown a satellite roughly that size.
    • The satellites operate in sun synchronous orbit so solar wings (around five hundred feet per side) always face the sun and the radiator on the dark side always points to deep space.
    • Starlink V3 satellites already run at around twenty kilowatts. A Blackwell rack runs at one hundred kilowatts. SpaceX engineers express genuine confidence they have already solved cooling and radiator design at these scales.
    • Racks in space are connected with lasers traveling through vacuum, the same lasers already on every Starlink. SpaceX operates the world’s largest satellite fleet and, via xAI Colossus, the world’s largest data center on Earth.
    • Inference will move to orbit. Training will stay on Earth for a long time. Terrestrial data centers remain valuable for the rest of an investor’s career.
    • The wafer bottleneck is structural and political. TSMC is essentially Taiwan’s GDP, water, and electricity. The leaders see themselves as inheritors of Morris Chang’s sacred legacy and they do not behave like a Western public company.
    • Jensen Huang has never had a contract with TSMC. The relationship is run on handshakes and the assumption that things will be fair over time.
    • If TSMC did everything Jensen wanted, Nvidia could be selling two to three trillion dollars of GPUs in 2026 and 2027. TSMC’s discipline is the single largest factor preventing a true AI bubble.
    • Historically, foundational technologies always get a bubble. Railroads, canals, the internet. The current AI buildout is overwhelmingly funded out of operating cash flow, GPUs are running at one hundred percent utilization, and that is fundamentally different from the year 2000 fiber overbuild.
    • If one of Intel or Samsung Foundry catches up at the leading node, the other will follow, and TSMC’s discipline collapses. Watch TSMC capacity decisions to predict a bubble.
    • Terafab, the SpaceX and Tesla joint venture to build the world’s largest fab in America, has a partnership with Intel that grants access to fifty years of institutional foundry knowledge. The A teams at ASML, KLA, Lam Research, and Applied Materials will follow Elon’s reputation in hardware engineering.
    • The hiring playbook for Terafab includes building Taiwan Town, Japan Town, and Korea Town next to the fab. Recruit the engineers and import their families, their restaurants, and their staff.
    • Frontier tokens still capture an overwhelming share of all economic value created at the model layer. This is surprising and is one of the three big open questions for AI investing.
    • The Pareto frontier of intelligence versus cost has flipped. Nine months ago Google’s TPU dominated every point on the frontier. Today Anthropic and OpenAI dominate, with Grok 4.3 on the frontier and Gemini 3.1 hanging on.
    • Google’s conservative TPU V8 design (partly an attempt to reduce dependence on Broadcom and Nvidia) is the leading explanation for the loss of per token cost leadership.
    • AI pricing is shifting from all you can eat to usage based, mirroring the cellular and long distance industries. Cellular stopped being a great growth industry when it went all you can eat. AI just made the opposite move.
    • OpenAI and Anthropic together could exceed two hundred billion in ARR this year if compute keeps coming online and frontier token pricing holds.
    • The two hundred fifty dollar a month consumer AI plan is no longer enough to evaluate frontier capability. Enterprise plans with usage based billing are required because rate limits are now severe.
    • The three biggest open questions for AI investors are: violation of the bitter lesson via ASI or human ingenuity, whether frontier tokens keep commanding their premium, and when continual learning arrives.
    • Today’s continual learning is crude reinforcement learning during mid training on verifiable tasks. True continual learning means weights updating dynamically, like a human who learns the first time they touch fire.
    • Trying to build a better GPU is a losing strategy. Jensen will copy any one to three percent share design. Startups should target one percent share, do something different, and make it hard enough that Nvidia cannot fast follow.
    • Disaggregated inference (separating prefill and decode) opens new design canvases. Prefill is memory capacity bound. Decode is memory bandwidth bound. Each can be optimized independently.
    • Cerebras did something different and hard with wafer scale computing. Three generations of chips and real grit to get there.
    • Disaggregation of inference may stretch GPU useful lives to ten or fifteen years, dropping financing costs from low sevens to five or six percent, mathematically lowering the cost of the AI buildout and likely saving the private credit industry from its SaaS loan exposure.
    • Sellers of shortage outperform buyers of shortage. But owning the largest installed base of what is currently in shortage (hyperscaler CPU fleets, for example) is also a strong position.
    • Most of the economic value at the application layer of AI has been destroyed, not created. The exceptions are companies in the token path or in niches small enough that frontier labs ignore them.
    • Coding may be the shortest path to ASI. If you can write code, you can write code that does anything. Cursor, Cognition, and Anthropic correctly focused on it.
    • Jensen could probably get close to the frontier with his own Nemotron family of models whenever he wants. The fact that he chooses not to is a strategic decision about not commoditizing his customers.
    • The new prisoner’s dilemma in AI is whether frontier labs release their best model via API. If everyone agrees not to, Chinese open source falls behind. If anyone defects, the defector pulls ahead on revenue and resources, forcing everyone else to defect.
    • Google still owns the largest compute installed base. Without TPU’s prior cost advantage, this matters more. YouTube data has real value in a world of robotics. GCP is going crazy.
    • Meta deserves credit for becoming AI first internally faster than any other internet giant. Musa, their first MSL model, is impressively close to the Pareto frontier.
    • Amazon is strong because of Trainium and robotics driven retail P&L efficiency. Nova is better than it gets credit for.
    • Microsoft flinched on capex in early 2025 and lost position. Satya Nadella’s current decision to use Microsoft compute for Microsoft products rather than reselling to OpenAI is a courageous and probably correct call, even at the cost of an eight hundred dollar stock price.
    • The hyperscalers most engaged with startups are Amazon and Nvidia by a mile, followed by Google. Broadcom is the favorite ASIC partner. AMD, Microsoft, and Meta have minimal startup engagement and that will cost them as the best teams are now at startups.
    • Personal safety in an AI era requires a family or company safe word that cannot be socially engineered. Deepfake voice and video extortion at the speed of FaceTime is already feasible.
    • Ukraine is winning largely on the back of having the best battlefield AI outside America and Israel. Adversaries are starting to internalize what AI dominance means geopolitically.
    • An optimistic read is that this becomes a new Pax Americana, the way the post 1945 American nuclear monopoly was used to rebuild Germany and Japan rather than dominate.
    • AI cured a friend’s daughter’s rare disease by spinning up a research effort that identified a market drug capable of impacting her condition. That is the upside that keeps Gavin an AI optimist maximalist.

    Detailed Summary

    The most extraordinary moment in the history of capitalism

    Gavin’s framing of the current moment is unusually direct. Anthropic added eleven billion dollars of annual recurring revenue in a single month. The three highest profile SaaS companies of the last decade plus, Palantir, Snowflake, and Databricks, took a decade and tens of thousands of employees collectively to build the combined business that Anthropic added in thirty days. He has been investing through every major tech cycle and says there is no historical analog. Not the dotcom era, not the cloud transition, not mobile. This is its own thing.

    The market response, then, was peculiar. The NASDAQ sold off into the single most bullish moment for AI fundamentals on record. Tech traded at roughly its widest discount versus the rest of the market in a decade. Investors who said they wished they had bought into AI during 2022, during COVID, or during Deep Seek Monday got the same valuation setup again in early April, this time with an even clearer inflection.

    Why the Strait of Hormuz closing was secretly bullish for America

    One reason the macro fear in March may have been mispriced is that the same geopolitical event that drove the selloff was, in practice, a relative benefit to the United States. American natural gas, the input into American electricity, which is the input into American AI training and inference, fell roughly twenty percent. Asian and European natural gas prices doubled or tripled. The US emerged with sharply improved relative manufacturing competitiveness, which is exactly what the current administration cares about.

    The 1970s comparison does not hold. The US economy is dramatically less energy intensive, it is now the world’s largest producer and largest exporter of oil and gas, and there are no shortages, only price moves. That backdrop made it easier for disciplined investors to stay focused on AI fundamentals through the volatility.

    Anthropic and OpenAI valuations on an unconstrained run rate

    Anthropic at roughly nine hundred billion for fifty billion of ARR sounds rich until you adjust for the fact that the company is severely compute constrained. Gavin estimates that, unconstrained, Anthropic might be at one hundred fifty to two hundred billion in run rate revenue, putting the implied multiple closer to five times. He also points out that Claude Opus now generates roughly seventy percent fewer tokens for the same question than it used to. Token quantity correlates with answer quality, and Anthropic is rate limiting and shrinking outputs to ration capacity across its user base.

    Anthropic and OpenAI are also structurally very different. Anthropic has burned around eighty percent less cash than OpenAI to reach a comparable revenue scale. That implies very different long term returns on invested capital, though OpenAI has done a better job locking in compute and Sarah Friar is one of the most exceptional CFOs Gavin has worked with.

    Why neither lab is raising at a three trillion dollar valuation

    The answer Gavin gives is that both labs are deliberately leaving valuation on the table the way Elon has done for two decades. SpaceX compounded at low thirty percent annually for a decade because Elon never pushed price. The result is a permanent superpower of access to capital. Investors trust him because they have made money with him for twenty years. That is a moat that compounds with every round.

    Anthropic could probably raise at a one hundred percent premium to its rumored latest mark. They are choosing not to. In an uncertain world (Ukraine, Russia, Iran, Taiwan), preserving the ability to raise more capital later at fair prices is more valuable than maximizing this round.

    Watts and wafers, the two real constraints

    Capitalism is solving the watts problem. The leading PE infrastructure investors now say zoning and political approval, not chips or energy, are the gating factors. Companies are deferring big capex announcements until after the US midterms. Turbine capacity is being doubled at the manufacturers. Companies like Boom Aerospace are repurposing jet engines for grid use. Watts probably ease meaningfully in 2027 and 2028 and then orbital compute does the rest.

    Wafers are the harder problem because they live in Taiwan, run on handshakes, and depend on a corporate culture that does not respond to public market incentives. TSMC is essentially the GDP, water consumption, and electricity consumption of Taiwan. Its leadership treats the company as the legacy of Morris Chang. The Silicon Shield doctrine is real and internal.

    Orbital compute as racks in space

    The biggest mental update Gavin asks listeners to make is to stop picturing data centers in space as Pentagon sized space stations. A Blackwell rack is three thousand pounds and roughly the size of a refrigerator. SpaceX has shown a concept satellite of about that size. Solar wings extend five hundred feet to each side and the radiator extends hundreds of feet behind, both possible because the orbit is sun synchronous and the orientation is fixed relative to the sun.

    SpaceX engineers Gavin has spoken to at Starbase express genuine confidence that they have solved cooling at these power levels. They have. Starlink V3 satellites already operate at twenty kilowatts. A Blackwell rack is one hundred kilowatts. The same company operates the world’s largest satellite fleet and the world’s largest data center on Earth via xAI Colossus. The racks are connected to each other with lasers traveling through vacuum, technology already deployed in every Starlink. The naysayers, Gavin observes, are armchair skeptics and Larry Ellison’s response (he is out there landing rockets, no one else is) is the right frame.

    Terafab in Texas and the threat to TSMC’s discipline

    Terafab, the SpaceX and Tesla joint venture, intends to be the largest fab in the world. The partnership with Intel grants access to fifty years of foundry institutional knowledge, allowing Terafab to start three to five quarters behind the leading node rather than fifteen years behind. The A teams at the semicap equipment companies (ASML, KLA, Lam Research, Applied Materials) will follow Elon’s reputation in hardware engineering the same way they followed TSMC twenty years ago when Intel stumbled.

    The talent strategy is the part most observers underestimate. Recruit the best engineers globally, then import their families, their restaurants, their staff. Build Taiwan Town, Japan Town, and Korea Town next to the fab. Optimize the human experience for the people whose work matters. Intel and Samsung do not think that way.

    Bubble watch and the year 2000 comparison

    Every foundational technology in modern history has had a bubble. Railroads, canals, the internet. Carlota Perez documented why. Markets correctly identify the importance, diversity of opinion collapses, supply gets ahead of demand, the bubble crashes. The current cycle has two important differences. The buildout is overwhelmingly funded out of operating cash flow, not debt. Every GPU is running at one hundred percent utilization, while at the peak of the fiber bubble ninety nine percent of fiber was unused.

    TSMC discipline is the single largest reason a bubble has not formed. If Jensen could buy everything TSMC could theoretically make, Nvidia could sell two to three trillion dollars of GPUs in 2026 and 2027. At some point that becomes more than the market can absorb. If Intel or Samsung Foundry catches up at the leading node, the other will too. TSMC’s pricing discipline collapses and the bubble starts.

    The Pareto frontier and the loss of Google’s cost advantage

    The most important chart in AI is the Pareto frontier of model intelligence versus per token cost. Nine months ago, Google’s TPU based models dominated every point on it. OpenAI, Anthropic, and xAI sat inside the frontier. Today the frontier is dominated by Anthropic and OpenAI, with Grok 4.3 on the frontier and Gemini 3.1 hanging on by subsidization more than economics. The most likely cause is Google’s conservative TPU V8 design, an attempt to reduce dependence on Broadcom and Nvidia that sacrificed per token economics.

    The bitter lesson, frontier tokens, and continual learning

    Three open questions dominate AI investing. The first is whether Richard Sutton’s bitter lesson (more compute beats human algorithmic cleverness) gets violated by ASI itself optimizing for efficiency. Closer observers of AI are more skeptical of a violation. Gavin thinks ASI’s first move will be to make itself more efficient and more resourced, which is technically a temporary violation.

    The second is whether frontier tokens keep capturing the overwhelming share of economic value at the model layer. Today they do, surprisingly. Gemini 3.1 Pro was mindblowing nine months ago and is intolerable today. The third is when continual learning arrives. Today’s models need a million fire touches to learn what a human learns from one. True continual learning would mean dynamic weight updates in real time and would produce a fast takeoff.

    From all you can eat to usage based AI pricing

    AI is shifting from flat fee plans to usage based pricing. The historical analogy is cellular and long distance. Both stopped being great growth industries when they went all you can eat. AI just made the opposite move. The consequence is that flat fee subscribers, even on premium consumer plans, get a rate limited and token throttled version of the frontier model. Enterprise plans with usage based billing are now required to evaluate true capability. Gavin thinks the combination of new compute coming online and usage based pricing is what gets OpenAI and Anthropic past two hundred billion in combined ARR this year.

    Chip startups, prefill decode disaggregation, and Cerebras

    Trying to build a better GPU is the wrong move. The four scaled players (Nvidia, AMD, Trainium, TPU) have copy capability for any one to three percent share design that looks attractive. The good news for startups is that disaggregated inference (separating prefill and decode) opens a richer design canvas. Prefill is memory capacity bound. Decode is memory bandwidth bound. Each can be optimized independently. Andrew Fox’s analogy is a British naval ship of the eighteenth century. Prefill is loading the cannon. Decode is firing it.

    Cerebras is the model. Wafer scale computing is genuinely different and genuinely hard. It took three generations of chips to get right. Andrew Feldman and his team had the grit to keep going through chip one being a failure. The design has a high ratio of on chip compute and memory relative to shoreline IO, which is why Cerebras is now experimenting with putting an optical wafer on top of the compute wafer to solve scale out.

    GPU useful lives and the rescue of private credit

    One of the strongest claims in the conversation is that disaggregated inference will stretch GPU useful lives to ten or fifteen years. The skeptical narrative (GPUs are obsolete in two years, companies are cooking their depreciation books) is wrong. You can put a Cerebras system or Groq LPU in front of older Hopper or Ampere parts, use them only for prefill, and run them until they physically melt. Private credit, which is in pain from SaaS loans and which underwrote GPU loans on three to four year lives, may be saved by this.

    If GPU financing rates can come down from low sevens to five or six percent, the mathematics of the AI buildout improves materially. That is a structural tailwind that compounds for years.

    The application layer, the token path, and a new prisoner’s dilemma

    Trillions of dollars of value have been destroyed at the application layer, not created. Cursor and Cognition are the rare scaled exceptions, and they got there by focusing on coding very early. As Amjad Masad noted, coding is plausibly the shortest path to ASI because a coding agent can write itself into any new domain. Jamin Ball’s frame is that the new venture filter is whether the company is in the token path. Data Bricks is. Most application layer startups are not.

    Jensen could probably get close to the frontier with Nemotron whenever he wants, and the strategic question of whether to do that is a new prisoner’s dilemma. If every frontier lab agrees not to release best models via API, Chinese open source falls steadily behind. If anyone defects, the defector gains revenue and resources, and everyone else has to defect. The same dynamic exists between TSMC, Intel, and Samsung. If Nvidia or AMD ever truly used an alternative foundry, that foundry would catch up rapidly.

    Rating the hyperscalers

    Google has the largest compute installed base, the YouTube data that matters in a robotics world, and a search business that prints. Their loss of TPU cost leadership is the surprise of the year. If Google IO in five days does not produce a leapfrog model, the Nvidia centric narrative gets even stronger.

    Meta deserves real credit. Zuckerberg made Meta AI first internally faster than any other internet giant, paid up for the talent contracts when no one else would, and shipped Musa as a first model from MSL that is close to the Pareto frontier. Amazon is well positioned on Trainium, robotics in retail, and a Nova model line that is better than it gets credit for. Microsoft flinched on capex in early 2025 and lost position. Satya Nadella’s current decision to use Microsoft compute for Copilot rather than reselling to OpenAI is courageous and probably correct, even at the cost of stock price.

    The most interesting cross hyperscaler metric is startup engagement. Nvidia and Amazon engage deeply with startups. Google is next. Broadcom is the favored ASIC partner. AMD, Microsoft, and Meta have minimal startup engagement, which Gavin believes will cost them as the best teams now sit at startups.

    Personal safety, geopolitics, and the Pax Americana case

    The closing section turns darker. Personal safety in an AI era requires a family or company safe word that cannot be socially engineered. Deepfake voice and video extortion via something that looks exactly like your child calling on FaceTime is already feasible. Political violence against AI leaders is a real concern. Geopolitically, Ukraine is winning largely because it has the best battlefield AI outside America and Israel. How adversaries respond to that asymmetry is the next great variable.

    Gavin’s optimistic frame is the Pax Americana. After 1945 the US had a nuclear monopoly and could have controlled the world. Instead it rebuilt Germany and Japan, both of which became the most reliable American allies for the next eighty years. If AI dominance plays out similarly, this is a generationally positive story rather than a destabilizing one. The personal anecdote that closes the conversation is a friend whose daughter was diagnosed with a rare genetic condition. He spun up agents, identified a drug already on the market that addresses her mutation, and her life is immeasurably different because of AI. That is the upside.

    Thoughts

    The Anthropic eleven billion in a month framing is the kind of stat that resets priors. The right way to interpret it is not as a one off but as a measure of how fast value can compound when the underlying technology improves on a curve steeper than the ability of the rest of the economy to absorb it. The skeptical question is whether that ARR is durable or whether it is heavily tied to a customer base of other AI companies that are themselves on a single venture funded year of runway. The bullish answer is that frontier coding, frontier research, and frontier enterprise tasks are not going to stop being valuable, and Anthropic is the best at all three. Both can be true. The number is still extraordinary.

    The argument that TSMC discipline is the only thing preventing a bubble is the analytically tightest part of the conversation. The implied trade is to watch TSMC capacity additions like a hawk and to be more, not less, cautious if Intel Foundry or Samsung Foundry ever announce real share at the leading node. The Terafab thesis is more speculative but more interesting. If Elon’s talent recruiting playbook works and the Intel partnership gives Terafab a real seat at the table within five years, the geometry of the global semiconductor industry shifts in a way that is bullish for American manufacturing, bullish for power and water infrastructure in Texas, and ambiguous for TSMC itself.

    The Pareto frontier discussion deserves more attention than it usually gets. Pricing leadership in AI is not a vanity metric. It determines who can subsidize free tier usage, who can absorb compute shortages, who can ship cheaper enterprise plans, and ultimately whose model becomes the default for any given workload. Google losing per token leadership in nine months is one of the most under analyzed events in the sector and it explains a lot about why Anthropic and OpenAI are growing the way they are. If Google IO does not produce a leapfrog model, the implied verdict on TPU V8 design choices gets a lot harsher.

    The application layer destruction point is worth sitting with. Founders building on top of frontier models are competing in a world where the model itself moves faster than any moat they can build, where the model lab can absorb their niche if it gets interesting, and where the only protection is either deep token path integration or a niche so small the lab does not bother. That is a much harsher venture environment than the early SaaS era. The compensating opportunity is that one human can now run a hundred agents, so the ceiling on what a small team can build is correspondingly higher. The bet is that productivity per founder rises faster than competitive pressure from the labs. We will find out.

    The orbital compute pitch is the section that will polarize listeners. The naive read is that this is science fiction. The closer read is that every component (sun synchronous orbit, laser interconnect, twenty kilowatt satellite buses, ten thousand satellite manufacturing cadence, full rocket reusability) already exists. The remaining engineering problems are repair, maintenance, and radiator scale, all of which are real but tractable on a five to ten year horizon. The strategic implication is that the political and zoning ceiling on terrestrial data centers becomes less binding if orbital compute is a credible alternative for inference workloads. The investor implication is that being short the watts and cooling complex on a five year horizon is a real trade, not a meme.

    Watch the full conversation here.

  • Sundar Pichai on the All-In Podcast: Unpacking Alphabet’s AI Future, Competitive Pressures, and the Next $100B Bets

    TLDW (Too Long; Didn’t Watch):

    Sundar Pichai, CEO of Alphabet, sat down with the All-In Podcast to discuss AI’s seismic impact on Google Search, the company’s infrastructure and model advantages, the future of human-computer interaction, intense competition (including from China), energy constraints, long-term bets like quantum computing and robotics, and the evolving culture at Google. He remains bullish on Google’s ability to navigate disruption and lead in the AI era, emphasizing a “follow the user” philosophy and relentless innovation.

    Executive Summary: Navigating the AI Revolution with Sundar Pichai

    In a comprehensive and candid interview on the All-In Podcast (dated May 16, 2025), Alphabet CEO Sundar Pichai offered deep insights into Google’s strategy amidst the transformative wave of Artificial Intelligence. Pichai addressed the “innovator’s dilemma” head-on, asserting Google’s proactive stance in evolving its core Search product with AI, rather than fearing self-disruption. He detailed Google’s significant infrastructure advantages, including custom TPUs, and differentiation in foundational models. The conversation spanned the future of human-computer interaction, the burgeoning competitive landscape, critical energy constraints for AI’s growth, and Google’s “patient” investments in quantum computing and robotics. Pichai also touched upon fostering a high-performance, mission-driven culture and clarified Alphabet’s structure as a technology-first company, not just a holding entity. The overarching theme was one of optimistic resilience, with Pichai confident in Google’s capacity to innovate and lead through this pivotal technological shift.

    Key Takeaways from Sundar Pichai’s All-In Interview:

    • AI is an Opportunity, Not Just a Threat to Search: Google sees AI as the biggest driver for Search progress, expanding query types and user engagement, not a zero-sum game. “AI Mode” is coming to Search.
    • Disrupting Itself Proactively: Pichai rejects the “innovator’s dilemma” if a company leans into user needs and innovation, citing mobile and YouTube Shorts as examples. Cost per AI query is falling; latency is a bigger challenge.
    • Infrastructure is a Core Differentiator: Google’s decades of investment in custom hardware (TPUs – now 7th gen “Ironwood”), data centers, and full-stack approach provide a significant cost and performance advantage for training and serving AI models. 50% of 2025 compute capex ($70-75B total) goes to Google Cloud.
    • Foundational Model Strength: Google believes its models (like Gemini 2.5 Pro and Flash series) are at the frontier, with ongoing progress in LLMs and beyond (e.g., world models, diffusion models). Data from Google products (with user permission) offers a differentiation opportunity.
    • Human-Computer Interaction is Evolving Towards Seamlessness: Pichai sees AR glasses (not immersive displays) as a potential next leap, making computing ambient and intuitive, though system integration challenges remain.
    • Energy is a Critical Constraint for AI Growth: Pichai acknowledges electricity as a major gating factor for AI progress and GDP, advocating for innovation in solar, nuclear, geothermal, grid upgrades, and workforce development.
    • Long-Term Bets on Quantum and Robotics:
      • Quantum Computing: Pichai believes quantum is where AI was in 2015, predicting a “useful, practical computation” superior to classical within 5 years. Google is at the frontier.
      • Robotics: The combination of AI with robotics is creating a “sweet spot.” Google is developing foundational models (vision, language, action) and exploring product strategies, expecting a “magical moment” in 2-3 years.
    • Culture of Innovation and Accountability: Google aims to empower employees within a mission-focused framework, learning from the WFH era and fostering intensity, especially in teams like Google DeepMind. The goal is to attract and retain top talent.
    • Competitive Landscape is Fierce but Expansive: Pichai respects competitors like OpenAI, Meta, XAI, and Microsoft, and acknowledges China’s (e.g., DeepSeek) rapid AI progress. He believes AI is a vast opportunity, not a winner-take-all market.
    • Alphabet’s Structure: More Than a Holding Company: Alphabet leverages foundational technology and R&D across its businesses (Search, YouTube, Cloud, Waymo, Isomorphic, X). It’s about differentiated value propositions, not just capital allocation.
    • Founder Engagement: Larry Page and Sergey Brin are deeply engaged, with Sergey actively coding and contributing to Gemini, providing “unparalleled energy.”
    • Regrets & Pride: Pichai is proud of Google’s ability to push foundational R&D into impactful products. A “small regret” includes not acquiring Netflix when intensely debated internally.

    In what can only be described as a pivotal moment for the technology landscape, Sundar Pichai, the CEO of Alphabet and Google, joined David Friedberg and discussed the pressing questions surrounding Google’s dominance, its response to the AI revolution, and its vision for the future. This wasn’t just a cursory Q&A; it was a strategic deep-dive into the mind of one of tech’s most influential leaders.

    (2:58) The Elephant in the Room: Will AI Kill Search? Google’s Strategy for Self-Disruption

    The conversation immediately tackled the “innovator’s dilemma,” a theory that haunts established giants when new paradigms emerge. Friedberg directly questioned if AI, with its chat interfaces and complete answers, poses an existential threat to Google’s $200 billion search advertising cash cow.

    Pichai’s response was a masterclass in strategic framing. He emphasized that Google has been “AI-first” for nearly a decade, viewing AI not as a threat, but as the primary driver for advancing Search. “We really felt that AI is what will drive the biggest progress in search,” Pichai stated. He pointed to the success of AI Overviews, now used by 1.5 billion users, which are expanding the types of queries people make. Empirically, Google sees query growth and increased engagement where AI Overviews are triggered.

    Critically, Pichai revealed a “whole new dedicated AI experience called AI mode coming to search,” promising a full-on conversational AI experience powered by cutting-edge models. This mode sees users inputting queries “literally long paragraphs,” two to three times longer than traditional search queries. He dismissed the “dilemma” framing: “The dilemma only exists if you treat it as a dilemma… you have to innovate to stay ahead.” He drew parallels to Google’s successful navigation of the mobile transition and YouTube’s thriving alongside TikTok by launching Shorts, even when monetization wasn’t immediately clear. The guiding principle remains: “Follow the user, all else will follow.”

    Addressing the unit economics, Pichai downplayed concerns about the cost of serving AI queries, stating, “Google with its infrastructure, I’d wager on that… the cost to serve that query has fallen dramatically in an 18-month time frame.” Latency, he admitted, is a more significant constraint than cost. For ad revenue, AI Overviews are already at baseline parity with traditional search, with potential for improvement as AI can better match commercial intent with relevant information.

    (15:32) The Unseen Fortress: Infrastructure Advantage and Foundational Model Differentiation

    A cornerstone of Google’s confidence lies in its unparalleled infrastructure. Pichai highlighted Google’s position on the “Pareto frontier of performance and cost,” delivering top models cost-effectively. This is largely due to their custom-built Tensor Processing Units (TPUs). “We are in our seventh generation of TPUs,” Pichai noted, with the latest “Ironwood” generation offering over 40 exaflops per part. This full-stack approach, from subsea cables to custom chips, is crucial for serving AI at scale and managing costs.

    Regarding the hefty $70-75 billion capex projected for 2025, Pichai clarified that roughly half of the compute spend is allocated to Google Cloud, supporting its enterprise offerings and enabling innovation from Google DeepMind across various AI domains – not just LLMs, but also image, video, and “world models.”

    When asked about Nvidia, Pichai expressed “extraordinary respect” for Jensen Huang and Nvidia’s “world-class” software stack. While Google trains its Gemini models on TPUs internally, they also use Nvidia GPUs and offer them to cloud customers. “I like that flexibility,” he said, “but we are also long-term committed to the TPU direction.”

    On the topic of foundational model performance, Pichai acknowledged that progress isn’t always linear (“artificial jag jag intelligence,” as Andrej Karpathy termed it). However, he sees continuous progress and believes Google is “pushing the research frontier in a much broader way than most other people beyond just LLMs.” He doesn’t see fundamental roadblocks to further advancements yet, though progress gets harder, which he believes will distinguish elite teams. He also touched upon the “differentiated innovation opportunity” of leveraging data from Google’s suite of products (like Gmail, Calendar, YouTube) with user permission to create superior, personalized experiences.

    (25:08) The Future of Human-Computer Interaction, Hardware, and the AI Competitive Landscape

    Looking ahead, Pichai envisions human-computer interaction becoming more seamless, where “computing kind of works for you.” He sees AR glasses – not immersive VR displays, but glasses that augment reality ambiently – as a potential “next leap,” comparable to smartphones in 2006-2007. “When AR really works, I think that’ll wow people,” he mused, while acknowledging existing system integration challenges.

    The competitive landscape is undeniably intense. Pichai spoke respectfully of OpenAI (Sam Altman), XAI (Elon Musk), Meta (Mark Zuckerberg), and Microsoft (Satya Nadella), calling them an “impressive group” driving rapid progress. “I think all of us are going to do well in this scenario,” he suggested, emphasizing that AI represents a “much bigger landscape opportunity than all the previous technologies we have known combined.” He even noted that “companies we don’t even know… might be extraordinarily big winners.”

    The discussion also covered China’s AI prowess, particularly highlighted by DeepSeek’s efficient models. Pichai admitted that DeepSeek made many “adjust our priors a little bit” about how close Chinese R&D is to the frontier, though he noted Google’s Flash models benchmarked favorably. “China will be very, very competitive on the AI frontier,” he affirmed.

    A significant portion of this section involved the engagement of Google’s founders, Larry Page and Sergey Brin. Pichai described them as “deeply involved in their own unique ways,” with Sergey Brin actively “sitting and coding” with the Gemini team, looking at loss curves and model architectures. “To have a founder sitting there… it’s a rare, rare place to be,” Pichai shared, valuing their “nonlinear thinking.”

    (35:29) The Energy Bottleneck: AI’s Thirst for Power

    A critical, and often underestimated, constraint for AI’s future is energy. Pichai agreed with Elon Musk’s concerns, identifying electricity as “the most likely constraint for AI progress and hence by definition GDP growth.” He stressed this is an “execution challenge,” not an insurmountable physics barrier. Solutions involve embracing innovations in solar (plus batteries), nuclear (SMRs, fusion), geothermal, alongside crucial grid upgrades, streamlined permitting, and addressing workforce shortages (e.g., electricians). While Google faces current supply constraints and project delays due to these factors, Pichai expressed faith in the US’s ability to innovate and meet the moment, driven by capitalist solutions.

    (41:20) Google’s Moonshots: Quantum Computing and Robotics

    Pichai reiterated Google’s commitment to long-term, patient R&D, citing Waymo as an example of perseverance.

    Quantum Computing: The Next Frontier

    He likened the current state of quantum computing to where AI was around 2015. “I would say in a 5-year time frame, you would have that moment where some a really useful practical computation… is done in a quantum way far superior to classical computers.” Despite the “noise” in the industry, Pichai is “absolutely confident” in Google’s leading position and expects more exciting announcements this year that will “expand people’s minds.”

    Robotics: AI Embodied

    The synergy between AI and robotics is creating a “next sweet spot.” Google, with its “world-class” vision-language-action models (Gemini robotics efforts), is actively planning its next moves. While past ventures into the application layer of robotics might have been premature, the current AI advancements make the field ripe for breakthroughs. “We are probably two to three years away from that magical moment in robotics too,” Pichai predicted, suggesting Google could develop something akin to an “Android for robotics” or offer its models like Gemini to power third-party hardware. He mentioned Intrinsic, an Alphabet company, as already working in this direction.

    (47:56) Culture, Coddling, and Talent in the Age of AI

    Addressing narratives about Google’s “coddling” culture, Pichai explained the original intent behind perks like free food: to foster collaboration and cross-pollination of ideas. While acknowledging the need to constantly refine culture, he emphasized that empowering employees remains a source of strength. He highlighted the intensity and mission-focus within teams like Google DeepMind, where top engineers often work in person five days a week.

    “We are not all here in the company to resolve all our personal differences,” he stated. “We are here because you’re excited about… innovating in the service of the mission of the company.” The COVID era was a “big distortion,” and bringing people back, even in a hybrid model, has been crucial. He believes Google continues to attract top-tier talent, including the best PhD researchers, and that the current “exciting and intense” AI moment fosters a sense of optimism reminiscent of early Google.

    (56:50) Alphabet’s Identity: Beyond a Holding Company

    Pichai clarified that Alphabet isn’t a traditional holding company merely allocating capital. Instead, it’s built on a “foundational technology basis,” leveraging core R&D (like AI, quantum, self-driving tech) to innovate across diverse businesses. “Waymo is going to keep getting better because of the same work we do in Gemini,” he illustrated. The common strand is deep computer science and physics-based R&D, with X (formerly Google X) continuing to play a role as an incubator for moonshots like sustainable agriculture (Tapestries) and grid modernization.

    Reflections: Regrets and Pride

    When asked about his biggest regrets and proudest achievements, Pichai expressed immense pride in Google’s unique ability to “push the technology frontier” with foundational R&D and translate it into valuable products and businesses. As for regrets, he mentioned, “There are acquisitions we debated hard, came close.” When pressed for a name, he hesitantly offered, “Maybe Netflix. We debated Netflix at some point super intensely inside.” He framed these not as deep regrets but as acknowledgments of alternate paths in a world of “butterfly effects.”

    Sundar Pichai’s appearance on the All-In Podcast painted a picture of a leader and a company that are not just reacting to the AI revolution but are actively shaping it. With a clear-eyed view of the challenges and an unwavering belief in Google’s innovative capacity, Pichai’s insights suggest that Alphabet is determined to remain at the forefront of technological advancement for years to come.

  • The Future We Can’t Ignore: Google’s Ex-CEO on the Existential Risks of AI and How We Must Control It

    The Future We Can’t Ignore: Google’s Ex-CEO on the Existential Risks of AI and How We Must Control It

    AI isn’t just here to serve you the next viral cat video—it’s on the verge of revolutionizing or even dismantling everything from our jobs to global security. Eric Schmidt, former Google CEO, isn’t mincing words. For him, AI is both a spark and a wildfire, a force that could make life better or burn us down to the ground. Here’s what Schmidt sees on the horizon, from the thrilling to the bone-chilling, and why it’s time for humanity to get a grip.

    Welcome to the AI Arms Race: A Future Already in Motion

    AI is scaling up fast. And Schmidt’s blunt take? If you’re not already integrating AI into your business, you’re not just behind the times—you’re practically obsolete. But there’s a catch. It’s not enough to blindly ride the AI wave; Schmidt warns that without strong ethics, AI can drag us into dystopian territory. AI might build your company’s future, or it might drive you into a black hole of misinformation and manipulation. The choice is ours—if we’re ready to make it.

    The Good, The Bad, and The Insidious: AI in Our Daily Lives

    Schmidt pulls no punches when he points to social media as a breeding ground for AI-driven disasters. Algorithms amplify outrage, keep people glued to their screens, and aren’t exactly prioritizing users’ mental health. He sees AI as a master of manipulation, and social platforms are its current playground, locking people into feedback loops that drive anxiety, depression, and tribalism. For Schmidt, it’s not hard to see how AI could be used to undermine truth and democracy, one algorithmic nudge at a time.

    AI Isn’t Just a Tool—It’s a Weapon

    Think AI is limited to Silicon Valley’s labs? Think again. Schmidt envisions a future where AI doesn’t just enhance technology but militarizes it. Drones, cyberattacks, and autonomous weaponry could redefine warfare. Schmidt talks about “zero-day” cyber attacks—threats AI can discover and exploit before anyone else even knows they exist. In the wrong hands, AI becomes a weapon as dangerous as any in history. It’s fast, it’s ruthless, and it’s smarter than you.

    AI That Outpaces Humanity? Schmidt Says, Pull the Plug

    The elephant in the room is AGI, or artificial general intelligence. Schmidt is clear: if AI gets smart enough to make decisions independently of us—especially decisions we can’t understand or control—then the only option might be to shut it down. He’s not paranoid; he’s pragmatic. AGI isn’t just hypothetical anymore. It could evolve faster than we can keep up, making choices for us in ways that could irreversibly alter human life. Schmidt’s message is as stark as it gets: if AGI starts rewriting the rules, humanity might not survive the rewrite.

    Big Tech, Meet Big Brother: Why AI Needs Regulation

    Here’s the twist. Schmidt, a tech icon, says AI development can’t be left to the tech world alone. Government regulation, once considered a barrier to innovation, is now essential to prevent the weaponization of AI. Without oversight, we could see AI running rampant—from autonomous viral engineering to mass surveillance. Schmidt is calling for laws and ethical boundaries to rein in AI, treating it like the next nuclear power. Because without rules, this tech won’t just bend society; it might break it.

    Humanity’s Play for Survival

    Schmidt’s perspective isn’t all doom. AI could solve problems we’re still struggling with—like giving every kid a personal tutor or giving every doctor the latest life-saving insights. He argues that, used responsibly, AI could reshape education, healthcare, and economic equality for the better. But it all hinges on whether we build ethical guardrails now or wait until the Pandora’s box of AI is too wide open to shut.

    Bottom Line: The Clock’s Ticking

    AI isn’t waiting for us to get comfortable. Schmidt’s clear-eyed view is that we’re facing a choice. Either we control AI, or AI controls us. There’s no neutral ground here, no happy middle. If we don’t have the courage to face the risks head-on, AI could be the invention that ends us—or the one that finally makes us better than we ever were.

  • Gemini: Google’s Multimodal AI Breakthrough Sets New Standards in Cross-Domain Mastery

    Google’s recent unveiling of the Gemini family of multimodal models marks a significant leap in artificial intelligence. The Gemini models are not just another iteration of AI technology; they represent a paradigm shift in how machines can understand and interact with the world around them.

    What Makes Gemini Standout?

    Gemini models, developed by Google, are unique in their ability to simultaneously process and understand text, images, audio, and video. This multimodal approach allows them to excel across a broad spectrum of tasks, outperforming existing models in 30 out of 32 benchmarks. Notably, the Gemini Ultra model has achieved human-expert performance on the MMLU exam benchmark, a feat that has never been accomplished before.

    How Gemini Works

    At the core of Gemini’s architecture are Transformer decoders, which have been enhanced for stable large-scale training and optimized performance on Google’s Tensor Processing Units. These models can handle a context length of up to 32,000 tokens, incorporating efficient attention mechanisms. This capability enables them to process complex and lengthy data sequences more effectively than previous models.

    The Gemini family comprises three models: Ultra, Pro, and Nano. Ultra is designed for complex tasks requiring high-level reasoning and multimodal understanding. Pro offers enhanced performance and deployability at scale, while Nano is optimized for on-device applications, providing impressive capabilities despite its smaller size.

    Diverse Applications and Performance

    Gemini’s excellence is demonstrated through its performance on various academic benchmarks, including those in STEM, coding, and reasoning. For instance, in the MMLU exam benchmark, Gemini Ultra scored an accuracy of 90.04%, exceeding human expert performance. In mathematical problem-solving, it achieved 94.4% accuracy in the GSM8K benchmark and 53.2% in the MATH benchmark, outperforming all competitor models. These results showcase Gemini’s superior analytical capabilities and its potential as a tool for education and research.

    The model family has been evaluated across more than 50 benchmarks, covering capabilities like factuality, long-context, math/science, reasoning, and multilingual tasks. This wide-ranging evaluation further attests to Gemini’s versatility and robustness across different domains.

    Multimodal Reasoning and Generation

    Gemini’s capability extends to understanding and generating content across different modalities. It excels in tasks like VQAv2 (visual question-answering), TextVQA, and DocVQA (text reading and document understanding), demonstrating its ability to grasp both high-level concepts and fine-grained details. These capabilities are crucial for applications ranging from automated content generation to advanced information retrieval systems.

    Why Gemini Matters

    Gemini’s breakthrough lies not just in its technical prowess but in its potential to revolutionize multiple fields. From improving educational tools to enhancing coding and problem-solving platforms, its impact could be vast and far-reaching. Furthermore, its ability to understand and generate content across various modalities opens up new avenues for human-computer interaction, making technology more accessible and efficient.

    Google’s Gemini models stand at the forefront of AI development, pushing the boundaries of what’s possible in machine learning and artificial intelligence. Their ability to seamlessly integrate and reason across multiple data types makes them a formidable tool in the AI landscape, with the potential to transform how we interact with technology and how technology understands the world.


  • Microsoft Transitions from Bing Chat to Copilot: A Strategic Rebranding

    Microsoft Transitions from Bing Chat to Copilot: A Strategic Rebranding

    In a significant shift in its AI strategy, Microsoft has announced the rebranding of Bing Chat to Copilot. This move underscores the tech giant’s ambition to make a stronger imprint in the AI-assisted search market, a space currently dominated by ChatGPT.

    The Evolution from Bing Chat to Copilot

    Microsoft introduced Bing Chat earlier this year, integrating a ChatGPT-like interface within its Bing search engine. The initiative marked a pivotal moment in Microsoft’s AI journey, pitting it against Google in the search engine war. However, the landscape has evolved rapidly, with the rise of ChatGPT gaining unprecedented attention. Microsoft’s rebranding to Copilot comes in the wake of OpenAI’s announcement that ChatGPT boasts a weekly user base of 100 million.

    A Dual-Pronged Strategy: Copilot for Consumers and Businesses

    Colette Stallbaumer, General Manager of Microsoft 365, clarified that Bing Chat and Bing Chat Enterprise would now collectively be known as Copilot. This rebranding extends beyond a mere name change; it represents a strategic pivot towards offering tailored AI solutions for both consumers and businesses.

    The Standalone Experience of Copilot

    In a departure from its initial integration within Bing, Copilot is set to become a more autonomous experience. Users will no longer need to navigate through Bing to access its features. This shift highlights Microsoft’s intent to offer a distinct, streamlined AI interaction platform.

    Continued Integration with Microsoft’s Ecosystem

    Despite the rebranding, Bing continues to play a crucial role in powering the Copilot experience. The tech giant emphasizes that Bing remains integral to their overall search strategy. Moreover, Copilot will be accessible in Bing and Windows, with a dedicated domain at copilot.microsoft.com, parallel to ChatGPT’s model.

    Competitive Landscape and Market Dynamics

    The rebranding decision arrives amid a competitive AI market. Microsoft’s alignment with Copilot signifies its intention to directly compete with ChatGPT and other AI platforms. However, the company’s partnership with OpenAI, worth billions, adds a complex layer to this competitive landscape.

    The Future of AI-Powered Search and Assistance

    As AI continues to revolutionize search and digital assistance, Microsoft’s Copilot is poised to be a significant player. The company’s ability to adapt and evolve in this dynamic field will be crucial to its success in challenging the dominance of Google and other AI platforms.

  • Amazon Charts New Territory with ‘Vega’: A Homegrown OS for Smart Devices

    Amazon, the global e-commerce behemoth, is reportedly taking a bold step away from Android with the development of its own operating system for Fire TVs and smart displays. According to sources and internal discussions, the project, internally dubbed ‘Vega’, is set to revolutionize the software backbone of Amazon’s suite of connected devices.

    The initiative, which has been under the radar since as early as 2017, has gained traction recently with the involvement of notable industry professionals like former Mozilla engineer Zibi Braniecki. With Vega, Amazon aims to shed the technical limitations imposed by Android’s legacy code, which was originally designed for mobile phones, not the burgeoning smart home market.

    Vega is poised to offer a Linux-based, web-forward operating system, pivoting towards React Native for app development. This shift promises a more unified and efficient development environment, enabling programmers to create versatile apps that are operable across a myriad of devices and operating systems.

    This strategic move by Amazon seems twofold: gaining technological independence from Google’s Android, and establishing a more robust platform for reaching consumers through various devices, potentially increasing revenue through targeted ads and services.

    As Vega’s development continues, with a possible rollout on select Fire TV devices by next year, Amazon sets the stage for a new era in smart device interaction, aligning itself for greater control over its technological destiny and consumer reach.

  • Leveraging Efficiency: The Promise of Compact Language Models

    Leveraging Efficiency: The Promise of Compact Language Models

    In the world of artificial intelligence chatbots, the common mantra is “the bigger, the better.”

    Large language models such as ChatGPT and Bard, renowned for generating authentic, interactive text, progressively enhance their capabilities as they ingest more data. Daily, online pundits illustrate how recent developments – an app for article summaries, AI-driven podcasts, or a specialized model proficient in professional basketball questions – stand to revolutionize our world.

    However, developing such advanced AI demands a level of computational prowess only a handful of companies, including Google, Meta, OpenAI, and Microsoft, can provide. This prompts concern that these tech giants could potentially monopolize control over this potent technology.

    Further, larger language models present the challenge of transparency. Often termed “black boxes” even by their creators, these systems are complicated to decipher. This lack of clarity combined with the fear of misalignment between AI’s objectives and our own needs, casts a shadow over the “bigger is better” notion, underscoring it as not just obscure but exclusive.

    In response to this situation, a group of burgeoning academics from the natural language processing domain of AI – responsible for linguistic comprehension – initiated a challenge in January to reassess this trend. The challenge urged teams to construct effective language models utilizing data sets that are less than one-ten-thousandth of the size employed by the top-tier large language models. This mini-model endeavor, aptly named the BabyLM Challenge, aims to generate a system nearly as competent as its large-scale counterparts but significantly smaller, more user-friendly, and better synchronized with human interaction.

    Aaron Mueller, a computer scientist at Johns Hopkins University and one of BabyLM’s organizers, emphasized, “We’re encouraging people to prioritize efficiency and build systems that can be utilized by a broader audience.”

    Alex Warstadt, another organizer and computer scientist at ETH Zurich, expressed that the challenge redirects attention towards human language learning, instead of just focusing on model size.

    Large language models are neural networks designed to predict the upcoming word in a given sentence or phrase. Trained on an extensive corpus of words collected from transcripts, websites, novels, and newspapers, they make educated guesses and self-correct based on their proximity to the correct answer.

    The constant repetition of this process enables the model to create networks of word relationships. Generally, the larger the training dataset, the better the model performs, as every phrase provides the model with context, resulting in a more intricate understanding of each word’s implications. To illustrate, OpenAI’s GPT-3, launched in 2020, was trained on 200 billion words, while DeepMind’s Chinchilla, released in 2022, was trained on a staggering trillion words.

    Ethan Wilcox, a linguist at ETH Zurich, proposed a thought-provoking question: Could these AI language models aid our understanding of human language acquisition?

    Traditional theories, like Noam Chomsky’s influential nativism, argue that humans acquire language quickly and effectively due to an inherent comprehension of linguistic rules. However, language models also learn quickly, seemingly without this innate understanding, suggesting that these established theories may need to be reevaluated.

    Wilcox admits, though, that language models and humans learn in fundamentally different ways. Humans are socially engaged beings with tactile experiences, exposed to various spoken words and syntaxes not typically found in written form. This difference means that a computer trained on a myriad of written words can only offer limited insights into our own linguistic abilities.

    However, if a language model were trained only on the vocabulary a young human encounters, it might interact with language in a way that could shed light on our own cognitive abilities.

    With this in mind, Wilcox, Mueller, Warstadt, and a team of colleagues launched the BabyLM Challenge, aiming to inch language models towards a more human-like understanding. They invited teams to train models on roughly the same amount of words a 13-year-old human encounters – around 100 million. These models would be evaluated on their ability to generate and grasp language nuances.

    Eva Portelance, a linguist at McGill University, views the challenge as a pivot from the escalating race for bigger language models towards more accessible, intuitive AI.

    Large industry labs have also acknowledged the potential of this approach. Sam Altman, the CEO of OpenAI, recently stated that simply increasing the size of language models wouldn’t yield the same level of progress seen in recent years. Tech giants like Google and Meta have also been researching more efficient language models, taking cues from human cognitive structures. After all, a model that can generate meaningful language with less training data could potentially scale up too.

    Despite the commercial potential of a successful BabyLM, the challenge’s organizers emphasize that their goals are primarily academic. And instead of a monetary prize, the reward lies in the intellectual accomplishment. As Wilcox puts it, the prize is “Just pride.”