PJFP.com

Pursuit of Joy, Fulfillment, and Purpose

Tag: Google

Lloyd Blankfein on the 3 Sectors Where He Puts His Money Now: Big Tech, Energy, and Financial Services, Day Trading From an iPad, and the Warren Buffett Handshake That Backed Goldman in 2008
Lloyd Blankfein spent almost 40 years at Goldman Sachs, the last dozen as its chairman and chief executive, and he still trades almost every day from an iPad. In this wide ranging conversation on the My First Million podcast, the former Goldman boss lays out exactly where he is putting his own money right now, why a supportive spouse beats nearly any investment, how Warren Buffett wired five billion dollars into Goldman on a handshake during the 2008 crisis, and why he reads medieval history to stay calm about the present. It is part stock picking, part risk philosophy, and part a frank accounting of money, marriage, and the scars of growing up in the projects.

TLDW

Blankfein says he is roughly 98 percent in risky assets, almost all equities, and concentrated in three sectors he knows cold: big tech, energy, and financial services. His personal book leans heavily into single stocks over ETFs, weighted toward the big hyperscalers and a few second tier names, and he trades daily, alone, from an iPad and a phone, using calls and texts as his research network. Yet the advice he gives a normal investor is the boring opposite: a diversified S&P 500 fund like VOO, more risk when you are young because you will outlive your mistakes, the same thing Warren Buffett would tell you. The conversation ranges across the 2008 Buffett investment in Goldman, the cost of trying to legislate risk out of markets, the thin margin between the best and the rest, luck and the myth of the genius, why reputation is the real contract on Wall Street, why a supportive spouse is the highest return asset he knows, the money anxiety he carried out of a Brooklyn housing project, the dignity of a 500 dollar financial aid check, giving with a warm hand versus a cold one, the dangers of gamified investing, the big misses like SpaceX and early cellular, the obituary test a senior partner once gave him, and why reading history keeps the present in proportion.

Thoughts

The most useful tension in this interview is the gap between what Blankfein practices and what he preaches. He tells young people to buy a diversified S&P 500 index fund, he holds VOO himself, and he calls the host’s plain 90 percent stocks and 10 percent bonds split sensible. Then he admits his own portfolio is something like 90 percent single stocks that he trades by hand every day. The honest read is that his edge is not a transferable tip. It is a 40 year information network of phone calls and a tolerance for risk that most people neither have nor should want. The replicable lesson is the boring half, not the day trading half.

The most contrarian idea here is not a stock pick, it is his defense of risk itself. His argument that regulators trying to prevent the hundred year storm also forfeit the 99 normal years of growth in between is a serious claim about the price of safety, and it travels far beyond Wall Street. The same goes for his point that a good risk manager sometimes has to push people to take more risk, not less. The moment after a loss, when everyone goes gunshy, is exactly when the best operators lean back in. That is an uncomfortable thing for a former bank CEO to say out loud, and it is the part of the conversation most worth sitting with.

The Warren Buffett story is a master class in what actually moves markets, and it is not cash. Goldman did not need the five billion dollars. Blankfein says the money was almost irrelevant because the firm already had money. What it could not manufacture was confidence, and Buffett’s name supplied it. The handshake, the commitment with no paperwork, the line about worrying enough for the both of us, all point to the same thing. At the top, reputation is the collateral. His aside that most trades are never written down because you will never eat lunch in this town again is the same idea wearing street clothes.

Quietly, the personal finance thread may be the most valuable part for a normal listener. A former Goldman CEO saying that a supportive partner is more game changing than any investment, that a bad marriage is financially worse than being lonely, and that he has not paid a bill in over 40 years because his wife runs the household economy, is a reminder that household stability is itself an asset class. The 500 dollar financial aid check he still remembers half a century later, and his give with your warm hand philosophy, reframe wealth as something measured by how it feels to give and to receive, not just by the size of a pie chart.

Finally, the history obsession is not a side hobby, it is his risk model. Reading about the black plague, the McCarthy era, and the Vietnam draft is how he keeps the present in proportion. His Mark Twain line, that history does not repeat but it rhymes, is the direct antidote to the in this economy defeatism he and the host both complain about. For an investor, that long view is close to the whole game. It is what lets you hold through the drawdowns that scare everyone else out of the market.

Key Takeaways
- Blankfein estimates he is about 98 percent in risky assets, with roughly 95 of those 98 points in equities, and the rest spread thin. He invests in risky assets because, in his words, that is what is fun for him.
- Within his equities, he is heavily tilted toward single stocks rather than ETFs. He frames it as roughly a quarter to a third in ETFs and the rest in single names, and concedes it could be as lopsided as 90 percent single stocks because picking names is what he enjoys.
- The three sectors he has concentrated in for years are big tech, energy, and financial services, and he says his outperformance comes from where he focused, not from any special genius.
- On tech he owns the big hyperscalers, the Googles, Microsofts, and Nvidias of the world, plus a tier just below them, naming Oracle and Larry Ellison as an example of a slightly riskier second tier name. He thinks in categories, not fixed tickers, because he changes positions constantly.
- He says he has a background in trading energy, which is why energy is a core sleeve, and he knows financial services from the inside after almost 40 years at Goldman, so those are natural areas of edge.
- He still owns a lot of Goldman Sachs stock, out of affection for the firm he spent his career building.
- He is bullish on big tech and plans to stay bullish until it stops going up. His foreseeable future, he jokes, lasts until he finishes the conversation and checks the screen again.
- He trades every single day, alone, with no team. He does it from an iPad and a phone, not a computer, and treats the market like background music rather than a job.
- His research is human, not algorithmic. He chats and texts with people, then calls them because he is tired of fixing typos, and he reads the New York Post, the Wall Street Journal, the New York Times, the Financial Times, and Bloomberg.
- The advice he gives ordinary investors is deliberately boring and different from his own behavior: hold a diversified equity portfolio like an S&P 500 fund, with VOO as his own example, and tilt more aggressively when you are young because you have time to outlive mistakes.
- He notes that broad indexes are already heavily weighted toward tech because of market cap, so a plain index gives meaningful tech exposure, and a tech focused ETF on top can add a disproportionate tilt for believers.
- He calls the host’s simple 90 percent index and 10 percent bonds allocation sensible, and says this is essentially the same advice Warren Buffett would give a normal person.
- The older you get, the more conservative you should become, shifting from maximizing gains toward not losing what you have. Young people can afford more risk precisely because they will outlive their errors.
- During the 2008 financial crisis, Warren Buffett invested about five billion dollars in Goldman through a preferred stock structure, essentially on a phone call and a handshake, with no demand for due diligence.
- Buffett’s real value was confidence, not capital. Goldman already had money, but it had lost the confidence of the market while peers were failing. Buffett’s name signaled the firm was a good investment being beaten down by circumstances that would reverse.
- Buffett asked for a verbal commitment that Goldman would not sell shares before he did, and declined to put it in writing. He waved off the worry with the line that five billion dollars going bad would not even be a bad hurricane for Berkshire, an insurer.
- Most trading is done on reputation, not paper. Blankfein says people buy and sell bonds worth enormous sums without written contracts, relying on probity, because anyone who reneges will never eat lunch in this town again.
- On risk and regulation, he argues you cannot legislate risk away. Trying to prevent the hundred year storm also forgoes the 99 in between years of growth, and a good risk manager sometimes has to encourage people to take risk, not suppress it.
- The best traders have resilience. They bounce back, focus on new information rather than the past, and adapt quickly instead of staying gunshy after a loss.
- The difference between someone who is really good and someone who cannot make it is small. He compares it to a golf tournament won by one stroke with six people tied for second, and notes much of life is winner take all at razor thin margins.
- Luck matters enormously. He became Goldman CEO partly because his predecessor was nominated to be Treasury Secretary, a reference to Hank Paulson, and the timing of opportunities is often out of your control.
- He is skeptical of the word genius. He says he can usually see how successful people do what they do, with Elon Musk as a rare exception, and that powerful people are more normal, more insecure, and more flawed than outsiders assume.
- On democratized investing, he thinks apps that make markets accessible are good in their own terms, but gamifying trading with confetti and high fives can mask real danger for people who can lose more than they can afford.
- He has missed plenty. He thought SpaceX was overpriced at a 100 billion dollar valuation, now discussed near a trillion and three quarters, and passed on early cellular because he could not imagine why anyone would carry a bulky phone when payphones existed. He says he missed far more than he got.
- He frames a supportive spouse as more game changing than almost any investment, and warns that a bad marriage, with custody fights and property settlements, is financially and personally worse than being lonely.
- He has not paid a bill in over 40 years. His wife Laura, a former lawyer he says now chairs Barnard College, runs a bill paying service and manages the household economy. He generates the money, she distributes it.
- He grew up in an East New York, Brooklyn housing project, the son of a postal worker, and carried money anxiety well into his 30s. He recalls buying a vacation home that cost more than all their savings, with his wife unable to make the math work until they remembered the down payment.
- A 500 dollar financial aid check, handed to him without shame as a college freshman around 1971, shaped his philosophy on giving. He learned it is not enough to give people what they need, you have to give it in a way that feels dignified.
- He embraces the give with your warm hand, not your cold hand idea, the notion of giving while alive so you can experience the joy, which connects to the spirit of the book Die With Zero.
- He admits ambivalence about giving to his kids, the strange feeling of resenting that they have what he provided, and notes the heavy burden carried by children of prominent people who must prove they earned their place.
- He describes himself as wired for anxiety, inherited from his father, and says looking around corners for what could go wrong actually suited a career in a risky business with a big balance sheet.
- When he made partner, a senior partner gave him rules of the road, including avoiding misconduct, being conservative on taxes, setting up a charitable foundation, and living so that no more than three of the nine paragraphs in his eventual obituary would be about Goldman. He says he stayed too long to pass that test.
- He reads history as a discipline, favoring Barbara Tuchman, Robert Caro’s The Power Broker, Ron Chernow, Rick Atkinson, and Stephen Ambrose. His core belief, borrowed from Mark Twain, is that history does not repeat but it rhymes, which is why he would not bet against America.
Detailed Summary

The three sectors he actually invests in

The headline answer to where the former Goldman CEO is putting his money is simple: big tech, energy, and financial services. He says he has been focused on those three areas for a long time, and that his outperformance is a function of where he aimed rather than any unusual investing gift. Energy is natural because he has a background trading it. Financial services is natural because he spent nearly 40 years inside the industry. Tech is where he is most heavily concentrated, and he expects to stay there for good reason, citing the threshold of large changes in technology. He owns the major hyperscalers by category, the Googles, Microsofts, and Nvidias, plus a tier just below, offering Oracle and Larry Ellison as a polite example of a slightly riskier second tier name. He is careful to say he thinks in categories rather than fixed tickers because he changes his positions all the time.

How the portfolio is really built: single stocks over ETFs

Asked to describe his portfolio as a pie chart, Blankfein says he is about 98 percent in risky assets, with roughly 95 of those points in equities. He pushes back on the idea that index funds are safe, pointing out that a diversified equity ETF is still equities and still risky, just spread out, and very different from debt or short term money markets. Within his equity sleeve he leans into single stocks, framing it as somewhere between a quarter and a third in ETFs and the rest in individual names, and conceding it might be as extreme as 10 percent ETFs and 90 percent single stocks. The reason is preference, not theory. Picking and trading names is what he likes to do, and he is honest that this is a hobby pursued by a professional, not a model for someone investing for a living.

How he actually trades: an iPad, a phone, and a network

He trades every day, by himself, with no team. There is no Bloomberg terminal and no desk of analysts. He uses an iPad and a phone, and admits it takes discipline not to glance at his screen mid conversation. The market, he says, is like music playing in the background while he does other things. His information edge is relational. People text him, he texts back, and then he calls because he is tired of fixing typos with what he calls his fat fingers. He follows general and business news, reads a stack of newspapers starting with the New York Post, and treats companies like little stories, almost like gossip. He even notes, with some delight, that he still watches commercials on Netflix, a small window into a frugality that never fully left him.

The advice he gives young investors, and what Buffett would say

For a normal person, his counsel is the opposite of his own behavior. He would hold a diversified portfolio of equities like an S&P 500 fund, naming the SPY and VOO tickers and saying he personally uses VOO. Because of the importance of technology, he might add a tech oriented ETF for extra tilt, while noting the broad index is already tech heavy by market cap. He endorses the host’s plain 90 percent index and 10 percent bonds split as sensible and says it mirrors what Warren Buffett would advise. His one piece of age based guidance is that younger investors should accept more risk through equities, because they have time to recover, while older investors should grow more conservative and focus on not losing what they have rather than maximizing returns.

The Warren Buffett handshake that backed Goldman in 2008

The most cinematic story in the conversation is Buffett’s roughly five billion dollar investment in Goldman during the financial crisis, structured as a preferred stock that sits between a loan and equity. Blankfein describes a deal done largely on trust. When he offered to walk Buffett through everything he was worried about, Buffett replied that he knew Lloyd well enough to know he worried enough for the both of them. Buffett also asked, verbally and without writing, for a commitment that Goldman would not sell shares before he did. Blankfein is clear that the cash itself was almost irrelevant, since Goldman had money. What the firm lacked was the confidence of a frightened market, and Buffett’s willingness to invest before things improved supplied exactly that signal. Buffett, he stresses, was acting for his own shareholders, not as a rescuer, which is precisely what made the vote of confidence credible.

Why you cannot legislate risk out of the system

Reflecting on the post crisis regulatory push to make sure 2008 never happened again, Blankfein makes a careful argument about the price of safety. Once you are in the business of taking risk, anything can happen, and trying to legislate it away has a hidden cost. You may think you are protecting the world from the hundred year storm, but you also forgo the 99 years of growth in between. He extends this inside the firm too. After a period of big losses, partners had become gunshy and were talking themselves out of every idea. A good risk manager, he argues, sometimes has to promote risk taking rather than repress it, because without risk there is no growth, no entrepreneurship, and no progress. The flip side is real: take risk and there is a meaningful chance you fail and lose other people’s money, which is a terrible outcome. But the alternative, never risking anything, buys comfort at the cost of ever moving forward.

Small margins, big outcomes, and the role of luck

Asked what separated the traders who could not outperform from the rest, Blankfein says the gap between the very good and those who cannot make it is surprisingly small. He likens it to a golf tournament decided by a single stroke with six players tied for second, and to acting, where the best performer gets every role and the second best waits tables. Much of life, he says, is winner take all at tiny margins. Luck compounds this. He freely credits fortune for his own rise, noting he became CEO in part because his predecessor was tapped to be Treasury Secretary. He is also skeptical of the genius label. He can usually see how accomplished people do what they do, with Elon Musk a rare exception, and insists the powerful are more normal, more insecure, and more driven by their flaws than outsiders imagine.

Reputation is the real contract

A recurring theme is that the financial world runs on reputation more than paperwork. Blankfein notes that most of what traders do is not written down. People buy and sell bonds and other instruments that settle days later, relying on probity rather than signed contracts, because anyone who lies or reneges will never eat lunch in this town again. He references the casual texts between Elon Musk and Larry Ellison around the Twitter acquisition as proof that big does not mean complicated. There are big things that are simple and little things that are complicated. Documentation is good when execution is far off, but when a deal will be performed in two days, dotting every i is often pointless. The point is not that documents do not matter, it is that trust and reputation are the load bearing structure.

A supportive spouse as the highest return asset

The conversation turns personal when both men agree that a supportive partner may be the single most game changing factor in a life, more than any investment. Blankfein adds the inverse warning: a bad marriage, with breakups, custody battles, and property settlements, is worse than loneliness. He credits his wife Laura, a former big firm lawyer he says now chairs Barnard College, with handling everything when his career moved the family overseas, from the car to the house to the kids’ schooling, while he took the visible victory laps at work. He has not paid a bill in over 40 years. Laura manages a bill paying service and runs the household finances. As he puts it, he is in charge of generating the money and she is in charge of distributing it. The host contrasts this with his own monthly money meetings with his wife, a discipline he picked up from a personal finance author friend.

Money scars, the 500 dollar check, and giving with a warm hand

Blankfein grew up in an East New York housing project, the son of a postal worker who had earlier lost a job, in a household where rent was scarce. He calls himself an urban hick who barely left Brooklyn as a kid. That scarcity left a mark that lasted into his 30s. He tells the story of buying a small beach house that cost more than all their savings, and of his wife driving 30 miles while failing to make the closing math work, until they realized she had forgotten to count the 10 percent down payment. The most resonant memory is a 500 dollar financial aid check handed to him as a freshman around 1971, made out on the spot by a clerk with a generosity of spirit that let him receive it without shame. That experience shaped a lifelong view that giving well means preserving dignity, and he now co chairs a financial aid campaign at his university. It also connects to his embrace of the idea of giving with your warm hand rather than your cold hand, giving while alive so you can feel the joy, the same spirit as the book Die With Zero. He is candid about a strange ambivalence, the way he can resent that his kids enjoy what he himself gave them.

Robinhood, confetti, and the misses

On apps like Robinhood, Blankfein takes a balanced view. Democratizing investing and making assets accessible is good in its own terms, and advertising can pull people toward markets they would otherwise ignore. But if you make trading too much like a video game, with confetti and high fives, you can mask the danger and lure people who cannot afford to lose into losing more than they can. He is equally frank about his own misses. He thought SpaceX was overpriced at a 100 billion dollar valuation, a figure now discussed near a trillion and three quarters. He passed on early cellular because he could not imagine why anyone would carry a bulky phone with payphones everywhere. His blunt summary is that he missed far more than he got, and that nobody is great at predicting the future.

The obituary test, thick skin, and staying too long

When Blankfein made partner, a senior partner assigned to acculturate new partners gave him rules of the road: avoid anything that would today be called misconduct, be rigorous and conservative on taxes, set up and actually use a charitable foundation, and keep enough balance that, if your obituary runs nine paragraphs, no more than three are about Goldman. Blankfein says he failed that last test by staying too long, even titling his memoir around the firm. He also reflects on having a thick skin, recalling unflattering press and concluding that he could take a punch, a trait not everyone has and one he did not know he possessed until he was tested. He is careful to say this does not make people who cannot take a punch bad, just differently wired.

Why he reads history: it rhymes

The final stretch is a love letter to reading history. Blankfein favors Barbara Tuchman, whose A Distant Mirror he has read twice and whose Guns of August he calls fantastic and influential, along with Robert Caro’s The Power Broker on Robert Moses, Ron Chernow’s biographies, Rick Atkinson’s Revolution series, and Stephen Ambrose’s Undaunted Courage. He describes rereading the Robert Moses book after 40 years of trying to get things done and finding his appreciation for the achievements rise, even as the flaws stayed the same, because he had changed. He ties history directly to markets through the Mark Twain line that history does not repeat but it rhymes. Patterns recur, every generation maximizes its own crises and minimizes resolved ones, and reading about the black plague, the McCarthy era, or the Vietnam draft is how he stays calm. His conclusion, echoing a sentiment often attributed to Buffett, is that he would not bet against America, a country he describes as mostly good and able to improve.

Notable Quotes

“I invest in risky assets. That’s what’s fun for me.”
Lloyd Blankfein, describing his own portfolio, which he says is roughly 98 percent risky assets

“It’s been good to be bullish on big tech, and I’ll stop being bullish on it when it stops going up.”
Lloyd Blankfein, on why he stays concentrated in technology

“I’m not at a computer. I don’t have a computer. I have an iPad.”
Lloyd Blankfein, on how he day trades every day, alone and with no team

“To me, the market is like music. It’s out there. It’s going on.”
Lloyd Blankfein, on why trading daily feels like a hobby rather than work

“Look, $5 billion if it all goes bad, that’s not even a bad hurricane on the East Coast.”
Warren Buffett to Lloyd Blankfein, waving off the risk of his 2008 investment in Goldman Sachs

“The difference between somebody who’s really, really good and somebody who can’t make it is not that great.”
Lloyd Blankfein, on the thin margin between the best and the rest

“You may think you’re protecting the world from the hundred-year storm, but you’re also going to forego the 99 years of in between when there was growth.”
Lloyd Blankfein, on the cost of trying to legislate risk out of markets after 2008

“I’m in charge of generating the money, and she’s in charge of distributing it.”
Lloyd Blankfein, on his 40-plus-year marriage to Laura and why he has not paid a bill in decades

“History doesn’t repeat, but to paraphrase Mark Twain, it rhymes.”
Lloyd Blankfein, on why reading history keeps the present in proportion

Watch the full conversation with Lloyd Blankfein on the My First Million podcast here.

Related Reading
- Lloyd Blankfein (Wikipedia) background on the former Goldman Sachs chairman and CEO whose investing views anchor the conversation.
- My First Million podcast the show where this interview took place, for the full back catalog of investor and founder conversations.
- Berkshire Hathaway primary source on Warren Buffett’s company, which made the roughly five billion dollar Goldman investment in 2008.
- Vanguard S&P 500 ETF (VOO) the diversified index fund Blankfein names as the sensible core holding for a normal investor.
- Die With Zero by Bill Perkins the book behind the give with your warm hand, not your cold hand philosophy discussed near the end.
June 16, 2026
Anthropic Raises $65 Billion Series H at $965 Billion Valuation to Fund AI Safety Research and Massive Compute Expansion
Anthropic has closed one of the largest private financing rounds in the history of technology, raising $65 billion in Series H funding at a $965 billion post-money valuation. The round, announced on May 28, 2026, lands as demand for Claude reaches what the company calls historic levels, and it positions Anthropic to pour fresh capital into safety research, compute, and the products that enterprises now lean on every day.

TLDR

Anthropic raised $65 billion in its Series H at a $965 billion post-money valuation, with Altimeter Capital, Dragoneer, Greenoaks, and Sequoia Capital leading and Capital Group, Coatue, D1 Capital Partners, GIC, ICONIQ, and XN co-leading, alongside $15 billion in previously committed hyperscaler investment that includes $5 billion from Amazon. The raise follows Anthropic crossing $47 billion in run-rate revenue earlier in May 2026, and it funds three priorities named by CFO Krishna Rao: advancing safety and interpretability research, expanding compute capacity to meet growing Claude demand, and scaling the products and partnerships customers depend on. On the infrastructure side, the company is locking in gigawatt-scale compute through 5 gigawatts with Amazon, 5 gigawatts of TPU capacity via Google and Broadcom, GPU access from SpaceX, and supply from partners Micron, Samsung, and SK hynix, while Claude remains available across all three major cloud platforms, AWS, Google Cloud, and Microsoft Azure, with widespread enterprise adoption across industries.

Thoughts

Start with the number that everyone will fixate on. A $965 billion post-money valuation against $47 billion in run-rate revenue is roughly 20 times sales, and for a company growing this fast that multiple is not the interesting part. The interesting part is that run-rate revenue crossed $47 billion earlier this month, which means the denominator is moving so quickly that the multiple is already stale. Investors are not pricing the business Anthropic is today. They are pricing the slope. A 20x multiple on a number that may double again inside a year is a very different bet than 20x on a flat line, and the lead names here (Altimeter, Dragoneer, Greenoaks, Sequoia, with Capital Group, Coatue, GIC and others co-leading) are not the kind of capital that pays for nostalgia. They are paying for the second derivative.

But the real story is not the valuation. It is the compute. Read the infrastructure list carefully and you see the actual problem this round solves: 5 gigawatts from Amazon, 5 gigawatts of TPU capacity through Google and Broadcom, GPU access from SpaceX, and memory supply locked down with Micron, Samsung, and SK hynix. That is more than 10 gigawatts of secured power and silicon. The constraint on frontier AI in 2026 is no longer talent or even algorithms. It is electricity, land, and the multi-year queue for advanced packaging and high-bandwidth memory. You cannot buy 10 gigawatts on a quarterly basis. You reserve it years out, and you need the balance sheet to make those commitments credible. A $65 billion raise is, in plain terms, the down payment that lets Anthropic sign for capacity nobody can conjure on demand. The money is downstream of the megawatts.

The diversification across that compute stack matters as much as the size. By splitting between Amazon’s infrastructure, Google and Broadcom’s custom TPUs, and SpaceX-supplied GPUs, Anthropic is refusing to become hostage to any single supplier’s roadmap or pricing. Custom silicon through Broadcom in particular is a bet on bending the cost curve, because the long-term economics of serving Claude at this scale depend on dollars per token, not just on raw availability. Anyone who has watched cloud lock-in play out over the last decade understands the move. Optionality at the hardware layer is leverage, and leverage is what keeps margins from being dictated by whoever owns the only fab slot you can reach.

It is worth pausing on the fact that the round explicitly funds safety and interpretability research alongside scaling, and not as a footnote. Most companies treat safety spend as a cost center to be minimized once growth kicks in. Naming it first, ahead of compute and products, is a statement about where Anthropic believes its durable advantage sits. If models keep getting more capable, the binding constraint on deployment inside regulated industries (finance, healthcare, government) becomes trust, not intelligence. Interpretability is the work that turns a black box into something an enterprise risk committee can actually sign off on. Framed that way, safety research is not philanthropy subtracted from the bottom line. It is the thing that unlocks the most lucrative and defensible parts of the market, and pairing it with the scaling budget is the tell.

Finally, look at distribution. Claude now ships on all three major clouds at once: AWS, Google Cloud, and Microsoft Azure. In a market where most frontier labs are tethered to a single hyperscaler, being available everywhere enterprises already run their workloads is a structural edge. It removes the procurement friction of asking a customer to adopt a new vendor relationship, and it means Anthropic competes on the merits of the model rather than on which cloud a buyer happened to standardize on years ago. Combine that omnipresent distribution with the compute reservations and the explicit safety mandate, and the shape of the strategy is clear. This is not a company buying time. It is a company buying the three things that actually compound: capacity that cannot be rushed, trust that cannot be faked, and reach into every place where work already happens.

Key Takeaways
- Anthropic raised $65 billion in its Series H funding round, one of the largest private financings in the history of the technology industry.
- The round set Anthropic’s post-money valuation at $965 billion, placing the company within reach of the $1 trillion mark.
- Altimeter Capital, Dragoneer, Greenoaks, and Sequoia Capital led the Series H round.
- Capital Group, Coatue, D1 Capital Partners, GIC, ICONIQ, and XN served as co-leads on the investment.
- The new capital builds on $15 billion in previously committed hyperscaler investments, which includes $5 billion from Amazon.
- Anthropic crossed $47 billion in run-rate revenue earlier in May 2026, reflecting the surging commercial demand for Claude.
- A core priority for the funding is to advance Anthropic’s safety and interpretability research.
- The company will use the capital to expand compute capacity in order to meet growing demand for Claude.
- Anthropic plans to scale the products and partnerships that customers depend on across its business.
- CFO Krishna Rao said the funding will help Anthropic serve the historic demand it is experiencing, stay at the research frontier, and bring Claude to more of the places where work happens.
- Amazon is providing 5 gigawatts of compute capacity as part of Anthropic’s infrastructure expansion.
- Google and Broadcom are supplying 5 gigawatts of TPU capacity to power Claude’s growth.
- SpaceX is contributing GPU access to Anthropic’s compute footprint.
- Micron, Samsung, and SK hynix are partnering with Anthropic on memory and infrastructure to support its scaling needs.
- Claude is available on all three major cloud platforms, AWS, Google Cloud, and Microsoft Azure.
- Anthropic reports widespread enterprise adoption of Claude across a broad range of industries.
Detailed Summary

The Raise and the Valuation

Anthropic has raised $65 billion in Series H funding, a round that values the company at $965 billion on a post-money basis. The size of the raise places it among the largest private financing events the technology industry has ever seen, and the valuation pushes Anthropic to the doorstep of the trillion dollar mark. The capital arrives at a moment when demand for the company’s Claude models has accelerated sharply, and the round is built to fund the response to that demand rather than simply mark a milestone. Anthropic framed the financing in its Series H announcement as the fuel for staying at the research frontier while scaling the infrastructure and products that customers increasingly rely on.

Who Put In the Money

The Series H was led by Altimeter Capital, Dragoneer, Greenoaks, and Sequoia Capital, a group that combines deep growth-stage technology experience with conviction in Anthropic’s long-term trajectory. Joining as co-leads were Capital Group, Coatue, D1 Capital Partners, GIC, ICONIQ, and XN, a roster that spans crossover funds, sovereign wealth, and institutional investors. Beyond the new equity, Anthropic pointed to $15 billion in previously committed hyperscaler investment, including $5 billion from Amazon. Taken together, the investor base reflects a mix of financial backers and strategic partners with a direct stake in seeing Claude reach more customers and more compute.

Revenue at $47 Billion Run-Rate

Underpinning the valuation is a business that has scaled with unusual speed. Anthropic crossed a $47 billion run-rate revenue figure earlier in May 2026, a number that signals how quickly enterprises and developers have adopted Claude across their workflows. Run-rate revenue annualizes the company’s most recent performance, and at this level it puts Anthropic firmly among the fastest growing software businesses on record. That financial momentum is the practical justification for both the round’s size and the near trillion dollar valuation investors were willing to support.

The Compute Buildout

A large share of the strategy behind the raise centers on securing compute at enormous scale. Anthropic detailed a set of infrastructure partnerships designed to keep pace with Claude demand. Amazon is providing 5 gigawatts of capacity, while Google and Broadcom together are supplying 5 gigawatts of TPU capacity. SpaceX is contributing GPU access, broadening the range of silicon Anthropic can draw on. Supporting the buildout on the hardware supply side are Micron, Samsung, and SK hynix, the memory and component partners whose output is essential to standing up data centers at this magnitude. The combined picture is a company assembling power, chips, and supply chain commitments measured in gigawatts rather than racks.

Where the Money Goes

Anthropic outlined three priorities for the new capital. The first is to advance safety and interpretability research, continuing the work of understanding how models behave and ensuring they remain reliable as they grow more capable. The second is to expand compute capacity to meet the growing demand for Claude, the practical engine behind the infrastructure commitments above. The third is to scale the products and partnerships that customers depend on, deepening the company’s reach into the tools and platforms where work actually happens. Krishna Rao, Anthropic’s chief financial officer, said the funding “will help us serve the historic demand we are experiencing, stay at the research frontier, and bring Claude to more of the places where work happens.”

Claude Everywhere

The funding lands on top of a distribution footprint that already spans the major cloud ecosystems. Claude is available on all three leading cloud platforms, AWS, Google Cloud, and Microsoft Azure, which means enterprises can reach the models through whichever provider they have standardized on. That availability has translated into widespread enterprise adoption across industries, from software and finance to healthcare and beyond. By being present everywhere developers and businesses already operate, Anthropic positions Claude not as a destination customers must travel to but as a capability woven into the platforms they use every day.

Notable Quotes

This funding will help us serve the historic demand we are experiencing, stay at the research frontier, and bring Claude to more of the places where work happens.

Krishna Rao, CFO at Anthropic, on the purpose of the Series H round.

Advance safety and interpretability research, expand compute capacity to meet growing Claude demand, and scale products and partnerships customers depend on.

How Anthropic describes its use of funds from the round.

For the full details on the round, the lead and co-lead investors, and how Anthropic plans to deploy the capital across safety research, compute, and products, read the full announcement here.

Related Reading
- Anthropic, the AI safety and research company behind Claude that raised this Series H round.
- Sequoia Capital, one of the lead investors anchoring the financing.
- Amazon Web Services, one of the three major cloud platforms where Claude is available and the source of a $5 billion investment.
- Google Cloud TPUs, the tensor processing units behind the 5 gigawatts of TPU capacity in the Google and Broadcom partnership.
- AI safety, the research field at the center of how Anthropic says it will use the new funding.
May 28, 2026
Gavin Baker on Orbital Compute, TSMC, Frontier AI Models, Anthropic’s Vertical Take Off, and the Coming Wafer Shortage
Gavin Baker, founder and CIO of Atreides Management, returns to Patrick O’Shaughnessy’s Invest Like the Best for his sixth appearance. He calls the current AI moment the most extraordinary moment in the history of capitalism, walks through what Anthropic’s vertical takeoff in revenue actually means, lays out why orbital compute is closer than skeptics believe, dissects the TSMC bottleneck that may be the only thing standing between today’s market and a full-on AI bubble, and rates every hyperscaler on how they have positioned for a world where frontier model providers may stop selling API access altogether.

TLDW

Anthropic added eleven billion dollars of ARR in a single month, which is roughly the combined business of Palantir, Snowflake, and Databricks built over a decade. That is the setup. From there Gavin Baker covers the March and April selloff, the contrarian read that a closed Strait of Hormuz was actually bullish for American manufacturing competitiveness, why Anthropic and OpenAI multiples may be misleadingly cheap on an unconstrained run rate basis, why Elon Musk’s discipline on SpaceX valuation created a superpower of permanent access to capital, the practical engineering case for orbital compute as racks in space rather than Pentagon sized space stations, why TSMC’s capacity discipline is the single most important variable in whether the AI cycle becomes a bubble, what Terafab in Texas changes, why the Pareto frontier of AI models has flipped from Google dominance to Anthropic and OpenAI dominance in nine months, the shift from all you can eat AI subscriptions to usage based pricing and what that means for revenue scaling, Richard Sutton’s bitter lesson as the largest risk to the AI trade, why frontier tokens still capture an overwhelming share of economic value, the role of continual learning as the third great open question, why most new chip startups should not try to build a better GPU, why Cerebras did something different and hard, why disaggregated inference may extend GPU useful lives to ten or fifteen years and rescue the private credit industry, why being in the token path is the new venture filter, the new prisoner’s dilemma around releasing frontier models via API, an honest rating of Google, Meta, Amazon, and Microsoft, why personal safety is becoming a real AI era risk, and why he remains an AI optimist maximalist who believes this could be the next Pax Americana.

Key Takeaways
- Anthropic added eleven billion dollars of ARR in one month, more than the combined businesses of Palantir, Snowflake, and Databricks built across a decade. There is no precedent for this in the history of capitalism.
- The SaaS and cloud revolution created between five and ten trillion dollars of value over twenty years. AI is replaying that compression on a timeline measured in months.
- The March selloff was a drawdown driven by disagreement with price action, not invalidated thesis. That is the kind of drawdown an investor can lean into.
- Deep Seek Monday in January 2025 was a similar setup. By the day of the selloff, AWS Asia GPU prices had already doubled, GPU availability had fallen, and it was obvious reasoning models would be vastly more compute hungry at inference. The market priced the opposite.
- The Strait of Hormuz closing was actually positive for America. US natural gas (the primary input into US electricity, which feeds AI) fell twenty percent on Bloomberg while Asian and European natural gas doubled or tripled. American manufacturing competitiveness improved overnight.
- The US is now the world’s largest producer and exporter of oil and gas. The economy is dramatically less energy intensive than in the 1970s. The shortage trauma comparison does not hold.
- Tech as a sector traded as cheaply versus the rest of the market in early April as at any point in the last ten years, into the single most bullish moment for AI fundamentals on record.
- Anthropic is dramatically more capital efficient than OpenAI, having burned roughly eighty percent less to reach a similar revenue scale. They have very different structural returns on invested capital.
- Anthropic at roughly nine hundred billion for fifty billion of ARR (growing a thousand percent) is striking. Adjusted for compute constraint, the unconstrained run rate could be one hundred fifty to two hundred billion, putting the implied multiple closer to five times.
- Claude Opus generates roughly seventy percent fewer tokens for the same question than previously, with token quantity tied to answer quality. Subscribers on flat-fee plans are getting a lobotomized model.
- Elon Musk’s superpower is twenty years of making investors money. He never pushes valuation. SpaceX compounded low thirty percent per year for a decade because Musk treats fair pricing as a sacred covenant.
- Capitalism will solve the watts shortage. The current bottleneck has shifted from chips and energy to zoning and political approval. Many capex decisions are paused until after the US midterms.
- The watts shortage probably begins to alleviate in 2027 and 2028. Orbital compute solves it longer term.
- Orbital compute is not Pentagon sized data centers in space. It is racks in space. A Blackwell rack is three thousand pounds, eight feet tall, four feet deep, three feet wide. SpaceX has shown a satellite roughly that size.
- The satellites operate in sun synchronous orbit so solar wings (around five hundred feet per side) always face the sun and the radiator on the dark side always points to deep space.
- Starlink V3 satellites already run at around twenty kilowatts. A Blackwell rack runs at one hundred kilowatts. SpaceX engineers express genuine confidence they have already solved cooling and radiator design at these scales.
- Racks in space are connected with lasers traveling through vacuum, the same lasers already on every Starlink. SpaceX operates the world’s largest satellite fleet and, via xAI Colossus, the world’s largest data center on Earth.
- Inference will move to orbit. Training will stay on Earth for a long time. Terrestrial data centers remain valuable for the rest of an investor’s career.
- The wafer bottleneck is structural and political. TSMC is essentially Taiwan’s GDP, water, and electricity. The leaders see themselves as inheritors of Morris Chang’s sacred legacy and they do not behave like a Western public company.
- Jensen Huang has never had a contract with TSMC. The relationship is run on handshakes and the assumption that things will be fair over time.
- If TSMC did everything Jensen wanted, Nvidia could be selling two to three trillion dollars of GPUs in 2026 and 2027. TSMC’s discipline is the single largest factor preventing a true AI bubble.
- Historically, foundational technologies always get a bubble. Railroads, canals, the internet. The current AI buildout is overwhelmingly funded out of operating cash flow, GPUs are running at one hundred percent utilization, and that is fundamentally different from the year 2000 fiber overbuild.
- If one of Intel or Samsung Foundry catches up at the leading node, the other will follow, and TSMC’s discipline collapses. Watch TSMC capacity decisions to predict a bubble.
- Terafab, the SpaceX and Tesla joint venture to build the world’s largest fab in America, has a partnership with Intel that grants access to fifty years of institutional foundry knowledge. The A teams at ASML, KLA, Lam Research, and Applied Materials will follow Elon’s reputation in hardware engineering.
- The hiring playbook for Terafab includes building Taiwan Town, Japan Town, and Korea Town next to the fab. Recruit the engineers and import their families, their restaurants, and their staff.
- Frontier tokens still capture an overwhelming share of all economic value created at the model layer. This is surprising and is one of the three big open questions for AI investing.
- The Pareto frontier of intelligence versus cost has flipped. Nine months ago Google’s TPU dominated every point on the frontier. Today Anthropic and OpenAI dominate, with Grok 4.3 on the frontier and Gemini 3.1 hanging on.
- Google’s conservative TPU V8 design (partly an attempt to reduce dependence on Broadcom and Nvidia) is the leading explanation for the loss of per token cost leadership.
- AI pricing is shifting from all you can eat to usage based, mirroring the cellular and long distance industries. Cellular stopped being a great growth industry when it went all you can eat. AI just made the opposite move.
- OpenAI and Anthropic together could exceed two hundred billion in ARR this year if compute keeps coming online and frontier token pricing holds.
- The two hundred fifty dollar a month consumer AI plan is no longer enough to evaluate frontier capability. Enterprise plans with usage based billing are required because rate limits are now severe.
- The three biggest open questions for AI investors are: violation of the bitter lesson via ASI or human ingenuity, whether frontier tokens keep commanding their premium, and when continual learning arrives.
- Today’s continual learning is crude reinforcement learning during mid training on verifiable tasks. True continual learning means weights updating dynamically, like a human who learns the first time they touch fire.
- Trying to build a better GPU is a losing strategy. Jensen will copy any one to three percent share design. Startups should target one percent share, do something different, and make it hard enough that Nvidia cannot fast follow.
- Disaggregated inference (separating prefill and decode) opens new design canvases. Prefill is memory capacity bound. Decode is memory bandwidth bound. Each can be optimized independently.
- Cerebras did something different and hard with wafer scale computing. Three generations of chips and real grit to get there.
- Disaggregation of inference may stretch GPU useful lives to ten or fifteen years, dropping financing costs from low sevens to five or six percent, mathematically lowering the cost of the AI buildout and likely saving the private credit industry from its SaaS loan exposure.
- Sellers of shortage outperform buyers of shortage. But owning the largest installed base of what is currently in shortage (hyperscaler CPU fleets, for example) is also a strong position.
- Most of the economic value at the application layer of AI has been destroyed, not created. The exceptions are companies in the token path or in niches small enough that frontier labs ignore them.
- Coding may be the shortest path to ASI. If you can write code, you can write code that does anything. Cursor, Cognition, and Anthropic correctly focused on it.
- Jensen could probably get close to the frontier with his own Nemotron family of models whenever he wants. The fact that he chooses not to is a strategic decision about not commoditizing his customers.
- The new prisoner’s dilemma in AI is whether frontier labs release their best model via API. If everyone agrees not to, Chinese open source falls behind. If anyone defects, the defector pulls ahead on revenue and resources, forcing everyone else to defect.
- Google still owns the largest compute installed base. Without TPU’s prior cost advantage, this matters more. YouTube data has real value in a world of robotics. GCP is going crazy.
- Meta deserves credit for becoming AI first internally faster than any other internet giant. Musa, their first MSL model, is impressively close to the Pareto frontier.
- Amazon is strong because of Trainium and robotics driven retail P&L efficiency. Nova is better than it gets credit for.
- Microsoft flinched on capex in early 2025 and lost position. Satya Nadella’s current decision to use Microsoft compute for Microsoft products rather than reselling to OpenAI is a courageous and probably correct call, even at the cost of an eight hundred dollar stock price.
- The hyperscalers most engaged with startups are Amazon and Nvidia by a mile, followed by Google. Broadcom is the favorite ASIC partner. AMD, Microsoft, and Meta have minimal startup engagement and that will cost them as the best teams are now at startups.
- Personal safety in an AI era requires a family or company safe word that cannot be socially engineered. Deepfake voice and video extortion at the speed of FaceTime is already feasible.
- Ukraine is winning largely on the back of having the best battlefield AI outside America and Israel. Adversaries are starting to internalize what AI dominance means geopolitically.
- An optimistic read is that this becomes a new Pax Americana, the way the post 1945 American nuclear monopoly was used to rebuild Germany and Japan rather than dominate.
- AI cured a friend’s daughter’s rare disease by spinning up a research effort that identified a market drug capable of impacting her condition. That is the upside that keeps Gavin an AI optimist maximalist.
Detailed Summary

The most extraordinary moment in the history of capitalism

Gavin’s framing of the current moment is unusually direct. Anthropic added eleven billion dollars of annual recurring revenue in a single month. The three highest profile SaaS companies of the last decade plus, Palantir, Snowflake, and Databricks, took a decade and tens of thousands of employees collectively to build the combined business that Anthropic added in thirty days. He has been investing through every major tech cycle and says there is no historical analog. Not the dotcom era, not the cloud transition, not mobile. This is its own thing.

The market response, then, was peculiar. The NASDAQ sold off into the single most bullish moment for AI fundamentals on record. Tech traded at roughly its widest discount versus the rest of the market in a decade. Investors who said they wished they had bought into AI during 2022, during COVID, or during Deep Seek Monday got the same valuation setup again in early April, this time with an even clearer inflection.

Why the Strait of Hormuz closing was secretly bullish for America

One reason the macro fear in March may have been mispriced is that the same geopolitical event that drove the selloff was, in practice, a relative benefit to the United States. American natural gas, the input into American electricity, which is the input into American AI training and inference, fell roughly twenty percent. Asian and European natural gas prices doubled or tripled. The US emerged with sharply improved relative manufacturing competitiveness, which is exactly what the current administration cares about.

The 1970s comparison does not hold. The US economy is dramatically less energy intensive, it is now the world’s largest producer and largest exporter of oil and gas, and there are no shortages, only price moves. That backdrop made it easier for disciplined investors to stay focused on AI fundamentals through the volatility.

Anthropic and OpenAI valuations on an unconstrained run rate

Anthropic at roughly nine hundred billion for fifty billion of ARR sounds rich until you adjust for the fact that the company is severely compute constrained. Gavin estimates that, unconstrained, Anthropic might be at one hundred fifty to two hundred billion in run rate revenue, putting the implied multiple closer to five times. He also points out that Claude Opus now generates roughly seventy percent fewer tokens for the same question than it used to. Token quantity correlates with answer quality, and Anthropic is rate limiting and shrinking outputs to ration capacity across its user base.

Anthropic and OpenAI are also structurally very different. Anthropic has burned around eighty percent less cash than OpenAI to reach a comparable revenue scale. That implies very different long term returns on invested capital, though OpenAI has done a better job locking in compute and Sarah Friar is one of the most exceptional CFOs Gavin has worked with.

Why neither lab is raising at a three trillion dollar valuation

The answer Gavin gives is that both labs are deliberately leaving valuation on the table the way Elon has done for two decades. SpaceX compounded at low thirty percent annually for a decade because Elon never pushed price. The result is a permanent superpower of access to capital. Investors trust him because they have made money with him for twenty years. That is a moat that compounds with every round.

Anthropic could probably raise at a one hundred percent premium to its rumored latest mark. They are choosing not to. In an uncertain world (Ukraine, Russia, Iran, Taiwan), preserving the ability to raise more capital later at fair prices is more valuable than maximizing this round.

Watts and wafers, the two real constraints

Capitalism is solving the watts problem. The leading PE infrastructure investors now say zoning and political approval, not chips or energy, are the gating factors. Companies are deferring big capex announcements until after the US midterms. Turbine capacity is being doubled at the manufacturers. Companies like Boom Aerospace are repurposing jet engines for grid use. Watts probably ease meaningfully in 2027 and 2028 and then orbital compute does the rest.

Wafers are the harder problem because they live in Taiwan, run on handshakes, and depend on a corporate culture that does not respond to public market incentives. TSMC is essentially the GDP, water consumption, and electricity consumption of Taiwan. Its leadership treats the company as the legacy of Morris Chang. The Silicon Shield doctrine is real and internal.

Orbital compute as racks in space

The biggest mental update Gavin asks listeners to make is to stop picturing data centers in space as Pentagon sized space stations. A Blackwell rack is three thousand pounds and roughly the size of a refrigerator. SpaceX has shown a concept satellite of about that size. Solar wings extend five hundred feet to each side and the radiator extends hundreds of feet behind, both possible because the orbit is sun synchronous and the orientation is fixed relative to the sun.

SpaceX engineers Gavin has spoken to at Starbase express genuine confidence that they have solved cooling at these power levels. They have. Starlink V3 satellites already operate at twenty kilowatts. A Blackwell rack is one hundred kilowatts. The same company operates the world’s largest satellite fleet and the world’s largest data center on Earth via xAI Colossus. The racks are connected to each other with lasers traveling through vacuum, technology already deployed in every Starlink. The naysayers, Gavin observes, are armchair skeptics and Larry Ellison’s response (he is out there landing rockets, no one else is) is the right frame.

Terafab in Texas and the threat to TSMC’s discipline

Terafab, the SpaceX and Tesla joint venture, intends to be the largest fab in the world. The partnership with Intel grants access to fifty years of foundry institutional knowledge, allowing Terafab to start three to five quarters behind the leading node rather than fifteen years behind. The A teams at the semicap equipment companies (ASML, KLA, Lam Research, Applied Materials) will follow Elon’s reputation in hardware engineering the same way they followed TSMC twenty years ago when Intel stumbled.

The talent strategy is the part most observers underestimate. Recruit the best engineers globally, then import their families, their restaurants, their staff. Build Taiwan Town, Japan Town, and Korea Town next to the fab. Optimize the human experience for the people whose work matters. Intel and Samsung do not think that way.

Bubble watch and the year 2000 comparison

Every foundational technology in modern history has had a bubble. Railroads, canals, the internet. Carlota Perez documented why. Markets correctly identify the importance, diversity of opinion collapses, supply gets ahead of demand, the bubble crashes. The current cycle has two important differences. The buildout is overwhelmingly funded out of operating cash flow, not debt. Every GPU is running at one hundred percent utilization, while at the peak of the fiber bubble ninety nine percent of fiber was unused.

TSMC discipline is the single largest reason a bubble has not formed. If Jensen could buy everything TSMC could theoretically make, Nvidia could sell two to three trillion dollars of GPUs in 2026 and 2027. At some point that becomes more than the market can absorb. If Intel or Samsung Foundry catches up at the leading node, the other will too. TSMC’s pricing discipline collapses and the bubble starts.

The Pareto frontier and the loss of Google’s cost advantage

The most important chart in AI is the Pareto frontier of model intelligence versus per token cost. Nine months ago, Google’s TPU based models dominated every point on it. OpenAI, Anthropic, and xAI sat inside the frontier. Today the frontier is dominated by Anthropic and OpenAI, with Grok 4.3 on the frontier and Gemini 3.1 hanging on by subsidization more than economics. The most likely cause is Google’s conservative TPU V8 design, an attempt to reduce dependence on Broadcom and Nvidia that sacrificed per token economics.

The bitter lesson, frontier tokens, and continual learning

Three open questions dominate AI investing. The first is whether Richard Sutton’s bitter lesson (more compute beats human algorithmic cleverness) gets violated by ASI itself optimizing for efficiency. Closer observers of AI are more skeptical of a violation. Gavin thinks ASI’s first move will be to make itself more efficient and more resourced, which is technically a temporary violation.

The second is whether frontier tokens keep capturing the overwhelming share of economic value at the model layer. Today they do, surprisingly. Gemini 3.1 Pro was mindblowing nine months ago and is intolerable today. The third is when continual learning arrives. Today’s models need a million fire touches to learn what a human learns from one. True continual learning would mean dynamic weight updates in real time and would produce a fast takeoff.

From all you can eat to usage based AI pricing

AI is shifting from flat fee plans to usage based pricing. The historical analogy is cellular and long distance. Both stopped being great growth industries when they went all you can eat. AI just made the opposite move. The consequence is that flat fee subscribers, even on premium consumer plans, get a rate limited and token throttled version of the frontier model. Enterprise plans with usage based billing are now required to evaluate true capability. Gavin thinks the combination of new compute coming online and usage based pricing is what gets OpenAI and Anthropic past two hundred billion in combined ARR this year.

Chip startups, prefill decode disaggregation, and Cerebras

Trying to build a better GPU is the wrong move. The four scaled players (Nvidia, AMD, Trainium, TPU) have copy capability for any one to three percent share design that looks attractive. The good news for startups is that disaggregated inference (separating prefill and decode) opens a richer design canvas. Prefill is memory capacity bound. Decode is memory bandwidth bound. Each can be optimized independently. Andrew Fox’s analogy is a British naval ship of the eighteenth century. Prefill is loading the cannon. Decode is firing it.

Cerebras is the model. Wafer scale computing is genuinely different and genuinely hard. It took three generations of chips to get right. Andrew Feldman and his team had the grit to keep going through chip one being a failure. The design has a high ratio of on chip compute and memory relative to shoreline IO, which is why Cerebras is now experimenting with putting an optical wafer on top of the compute wafer to solve scale out.

GPU useful lives and the rescue of private credit

One of the strongest claims in the conversation is that disaggregated inference will stretch GPU useful lives to ten or fifteen years. The skeptical narrative (GPUs are obsolete in two years, companies are cooking their depreciation books) is wrong. You can put a Cerebras system or Groq LPU in front of older Hopper or Ampere parts, use them only for prefill, and run them until they physically melt. Private credit, which is in pain from SaaS loans and which underwrote GPU loans on three to four year lives, may be saved by this.

If GPU financing rates can come down from low sevens to five or six percent, the mathematics of the AI buildout improves materially. That is a structural tailwind that compounds for years.

The application layer, the token path, and a new prisoner’s dilemma

Trillions of dollars of value have been destroyed at the application layer, not created. Cursor and Cognition are the rare scaled exceptions, and they got there by focusing on coding very early. As Amjad Masad noted, coding is plausibly the shortest path to ASI because a coding agent can write itself into any new domain. Jamin Ball’s frame is that the new venture filter is whether the company is in the token path. Data Bricks is. Most application layer startups are not.

Jensen could probably get close to the frontier with Nemotron whenever he wants, and the strategic question of whether to do that is a new prisoner’s dilemma. If every frontier lab agrees not to release best models via API, Chinese open source falls steadily behind. If anyone defects, the defector gains revenue and resources, and everyone else has to defect. The same dynamic exists between TSMC, Intel, and Samsung. If Nvidia or AMD ever truly used an alternative foundry, that foundry would catch up rapidly.

Rating the hyperscalers

Google has the largest compute installed base, the YouTube data that matters in a robotics world, and a search business that prints. Their loss of TPU cost leadership is the surprise of the year. If Google IO in five days does not produce a leapfrog model, the Nvidia centric narrative gets even stronger.

Meta deserves real credit. Zuckerberg made Meta AI first internally faster than any other internet giant, paid up for the talent contracts when no one else would, and shipped Musa as a first model from MSL that is close to the Pareto frontier. Amazon is well positioned on Trainium, robotics in retail, and a Nova model line that is better than it gets credit for. Microsoft flinched on capex in early 2025 and lost position. Satya Nadella’s current decision to use Microsoft compute for Copilot rather than reselling to OpenAI is courageous and probably correct, even at the cost of stock price.

The most interesting cross hyperscaler metric is startup engagement. Nvidia and Amazon engage deeply with startups. Google is next. Broadcom is the favored ASIC partner. AMD, Microsoft, and Meta have minimal startup engagement, which Gavin believes will cost them as the best teams now sit at startups.

Personal safety, geopolitics, and the Pax Americana case

The closing section turns darker. Personal safety in an AI era requires a family or company safe word that cannot be socially engineered. Deepfake voice and video extortion via something that looks exactly like your child calling on FaceTime is already feasible. Political violence against AI leaders is a real concern. Geopolitically, Ukraine is winning largely because it has the best battlefield AI outside America and Israel. How adversaries respond to that asymmetry is the next great variable.

Gavin’s optimistic frame is the Pax Americana. After 1945 the US had a nuclear monopoly and could have controlled the world. Instead it rebuilt Germany and Japan, both of which became the most reliable American allies for the next eighty years. If AI dominance plays out similarly, this is a generationally positive story rather than a destabilizing one. The personal anecdote that closes the conversation is a friend whose daughter was diagnosed with a rare genetic condition. He spun up agents, identified a drug already on the market that addresses her mutation, and her life is immeasurably different because of AI. That is the upside.

Thoughts

The Anthropic eleven billion in a month framing is the kind of stat that resets priors. The right way to interpret it is not as a one off but as a measure of how fast value can compound when the underlying technology improves on a curve steeper than the ability of the rest of the economy to absorb it. The skeptical question is whether that ARR is durable or whether it is heavily tied to a customer base of other AI companies that are themselves on a single venture funded year of runway. The bullish answer is that frontier coding, frontier research, and frontier enterprise tasks are not going to stop being valuable, and Anthropic is the best at all three. Both can be true. The number is still extraordinary.

The argument that TSMC discipline is the only thing preventing a bubble is the analytically tightest part of the conversation. The implied trade is to watch TSMC capacity additions like a hawk and to be more, not less, cautious if Intel Foundry or Samsung Foundry ever announce real share at the leading node. The Terafab thesis is more speculative but more interesting. If Elon’s talent recruiting playbook works and the Intel partnership gives Terafab a real seat at the table within five years, the geometry of the global semiconductor industry shifts in a way that is bullish for American manufacturing, bullish for power and water infrastructure in Texas, and ambiguous for TSMC itself.

The Pareto frontier discussion deserves more attention than it usually gets. Pricing leadership in AI is not a vanity metric. It determines who can subsidize free tier usage, who can absorb compute shortages, who can ship cheaper enterprise plans, and ultimately whose model becomes the default for any given workload. Google losing per token leadership in nine months is one of the most under analyzed events in the sector and it explains a lot about why Anthropic and OpenAI are growing the way they are. If Google IO does not produce a leapfrog model, the implied verdict on TPU V8 design choices gets a lot harsher.

The application layer destruction point is worth sitting with. Founders building on top of frontier models are competing in a world where the model itself moves faster than any moat they can build, where the model lab can absorb their niche if it gets interesting, and where the only protection is either deep token path integration or a niche so small the lab does not bother. That is a much harsher venture environment than the early SaaS era. The compensating opportunity is that one human can now run a hundred agents, so the ceiling on what a small team can build is correspondingly higher. The bet is that productivity per founder rises faster than competitive pressure from the labs. We will find out.

The orbital compute pitch is the section that will polarize listeners. The naive read is that this is science fiction. The closer read is that every component (sun synchronous orbit, laser interconnect, twenty kilowatt satellite buses, ten thousand satellite manufacturing cadence, full rocket reusability) already exists. The remaining engineering problems are repair, maintenance, and radiator scale, all of which are real but tractable on a five to ten year horizon. The strategic implication is that the political and zoning ceiling on terrestrial data centers becomes less binding if orbital compute is a credible alternative for inference workloads. The investor implication is that being short the watts and cooling complex on a five year horizon is a real trade, not a meme.

Watch the full conversation here.
May 20, 2026
Sundar Pichai on the All-In Podcast: Unpacking Alphabet’s AI Future, Competitive Pressures, and the Next $100B Bets
TLDW (Too Long; Didn’t Watch):

Sundar Pichai, CEO of Alphabet, sat down with the All-In Podcast to discuss AI’s seismic impact on Google Search, the company’s infrastructure and model advantages, the future of human-computer interaction, intense competition (including from China), energy constraints, long-term bets like quantum computing and robotics, and the evolving culture at Google. He remains bullish on Google’s ability to navigate disruption and lead in the AI era, emphasizing a “follow the user” philosophy and relentless innovation.

Executive Summary: Navigating the AI Revolution with Sundar Pichai

In a comprehensive and candid interview on the All-In Podcast (dated May 16, 2025), Alphabet CEO Sundar Pichai offered deep insights into Google’s strategy amidst the transformative wave of Artificial Intelligence. Pichai addressed the “innovator’s dilemma” head-on, asserting Google’s proactive stance in evolving its core Search product with AI, rather than fearing self-disruption. He detailed Google’s significant infrastructure advantages, including custom TPUs, and differentiation in foundational models. The conversation spanned the future of human-computer interaction, the burgeoning competitive landscape, critical energy constraints for AI’s growth, and Google’s “patient” investments in quantum computing and robotics. Pichai also touched upon fostering a high-performance, mission-driven culture and clarified Alphabet’s structure as a technology-first company, not just a holding entity. The overarching theme was one of optimistic resilience, with Pichai confident in Google’s capacity to innovate and lead through this pivotal technological shift.

Key Takeaways from Sundar Pichai’s All-In Interview:
- AI is an Opportunity, Not Just a Threat to Search: Google sees AI as the biggest driver for Search progress, expanding query types and user engagement, not a zero-sum game. “AI Mode” is coming to Search.
- Disrupting Itself Proactively: Pichai rejects the “innovator’s dilemma” if a company leans into user needs and innovation, citing mobile and YouTube Shorts as examples. Cost per AI query is falling; latency is a bigger challenge.
- Infrastructure is a Core Differentiator: Google’s decades of investment in custom hardware (TPUs – now 7th gen “Ironwood”), data centers, and full-stack approach provide a significant cost and performance advantage for training and serving AI models. 50% of 2025 compute capex ($70-75B total) goes to Google Cloud.
- Foundational Model Strength: Google believes its models (like Gemini 2.5 Pro and Flash series) are at the frontier, with ongoing progress in LLMs and beyond (e.g., world models, diffusion models). Data from Google products (with user permission) offers a differentiation opportunity.
- Human-Computer Interaction is Evolving Towards Seamlessness: Pichai sees AR glasses (not immersive displays) as a potential next leap, making computing ambient and intuitive, though system integration challenges remain.
- Energy is a Critical Constraint for AI Growth: Pichai acknowledges electricity as a major gating factor for AI progress and GDP, advocating for innovation in solar, nuclear, geothermal, grid upgrades, and workforce development.
- Long-Term Bets on Quantum and Robotics:
  - Quantum Computing: Pichai believes quantum is where AI was in 2015, predicting a “useful, practical computation” superior to classical within 5 years. Google is at the frontier.
  - Robotics: The combination of AI with robotics is creating a “sweet spot.” Google is developing foundational models (vision, language, action) and exploring product strategies, expecting a “magical moment” in 2-3 years.
- Culture of Innovation and Accountability: Google aims to empower employees within a mission-focused framework, learning from the WFH era and fostering intensity, especially in teams like Google DeepMind. The goal is to attract and retain top talent.
- Competitive Landscape is Fierce but Expansive: Pichai respects competitors like OpenAI, Meta, XAI, and Microsoft, and acknowledges China’s (e.g., DeepSeek) rapid AI progress. He believes AI is a vast opportunity, not a winner-take-all market.
- Alphabet’s Structure: More Than a Holding Company: Alphabet leverages foundational technology and R&D across its businesses (Search, YouTube, Cloud, Waymo, Isomorphic, X). It’s about differentiated value propositions, not just capital allocation.
- Founder Engagement: Larry Page and Sergey Brin are deeply engaged, with Sergey actively coding and contributing to Gemini, providing “unparalleled energy.”
- Regrets & Pride: Pichai is proud of Google’s ability to push foundational R&D into impactful products. A “small regret” includes not acquiring Netflix when intensely debated internally.
In what can only be described as a pivotal moment for the technology landscape, Sundar Pichai, the CEO of Alphabet and Google, joined David Friedberg and discussed the pressing questions surrounding Google’s dominance, its response to the AI revolution, and its vision for the future. This wasn’t just a cursory Q&A; it was a strategic deep-dive into the mind of one of tech’s most influential leaders.

(2:58) The Elephant in the Room: Will AI Kill Search? Google’s Strategy for Self-Disruption

The conversation immediately tackled the “innovator’s dilemma,” a theory that haunts established giants when new paradigms emerge. Friedberg directly questioned if AI, with its chat interfaces and complete answers, poses an existential threat to Google’s $200 billion search advertising cash cow.

Pichai’s response was a masterclass in strategic framing. He emphasized that Google has been “AI-first” for nearly a decade, viewing AI not as a threat, but as the primary driver for advancing Search. “We really felt that AI is what will drive the biggest progress in search,” Pichai stated. He pointed to the success of AI Overviews, now used by 1.5 billion users, which are expanding the types of queries people make. Empirically, Google sees query growth and increased engagement where AI Overviews are triggered.

Critically, Pichai revealed a “whole new dedicated AI experience called AI mode coming to search,” promising a full-on conversational AI experience powered by cutting-edge models. This mode sees users inputting queries “literally long paragraphs,” two to three times longer than traditional search queries. He dismissed the “dilemma” framing: “The dilemma only exists if you treat it as a dilemma… you have to innovate to stay ahead.” He drew parallels to Google’s successful navigation of the mobile transition and YouTube’s thriving alongside TikTok by launching Shorts, even when monetization wasn’t immediately clear. The guiding principle remains: “Follow the user, all else will follow.”

Addressing the unit economics, Pichai downplayed concerns about the cost of serving AI queries, stating, “Google with its infrastructure, I’d wager on that… the cost to serve that query has fallen dramatically in an 18-month time frame.” Latency, he admitted, is a more significant constraint than cost. For ad revenue, AI Overviews are already at baseline parity with traditional search, with potential for improvement as AI can better match commercial intent with relevant information.

(15:32) The Unseen Fortress: Infrastructure Advantage and Foundational Model Differentiation

A cornerstone of Google’s confidence lies in its unparalleled infrastructure. Pichai highlighted Google’s position on the “Pareto frontier of performance and cost,” delivering top models cost-effectively. This is largely due to their custom-built Tensor Processing Units (TPUs). “We are in our seventh generation of TPUs,” Pichai noted, with the latest “Ironwood” generation offering over 40 exaflops per part. This full-stack approach, from subsea cables to custom chips, is crucial for serving AI at scale and managing costs.

Regarding the hefty $70-75 billion capex projected for 2025, Pichai clarified that roughly half of the compute spend is allocated to Google Cloud, supporting its enterprise offerings and enabling innovation from Google DeepMind across various AI domains – not just LLMs, but also image, video, and “world models.”

When asked about Nvidia, Pichai expressed “extraordinary respect” for Jensen Huang and Nvidia’s “world-class” software stack. While Google trains its Gemini models on TPUs internally, they also use Nvidia GPUs and offer them to cloud customers. “I like that flexibility,” he said, “but we are also long-term committed to the TPU direction.”

On the topic of foundational model performance, Pichai acknowledged that progress isn’t always linear (“artificial jag jag intelligence,” as Andrej Karpathy termed it). However, he sees continuous progress and believes Google is “pushing the research frontier in a much broader way than most other people beyond just LLMs.” He doesn’t see fundamental roadblocks to further advancements yet, though progress gets harder, which he believes will distinguish elite teams. He also touched upon the “differentiated innovation opportunity” of leveraging data from Google’s suite of products (like Gmail, Calendar, YouTube) with user permission to create superior, personalized experiences.

(25:08) The Future of Human-Computer Interaction, Hardware, and the AI Competitive Landscape

Looking ahead, Pichai envisions human-computer interaction becoming more seamless, where “computing kind of works for you.” He sees AR glasses – not immersive VR displays, but glasses that augment reality ambiently – as a potential “next leap,” comparable to smartphones in 2006-2007. “When AR really works, I think that’ll wow people,” he mused, while acknowledging existing system integration challenges.

The competitive landscape is undeniably intense. Pichai spoke respectfully of OpenAI (Sam Altman), XAI (Elon Musk), Meta (Mark Zuckerberg), and Microsoft (Satya Nadella), calling them an “impressive group” driving rapid progress. “I think all of us are going to do well in this scenario,” he suggested, emphasizing that AI represents a “much bigger landscape opportunity than all the previous technologies we have known combined.” He even noted that “companies we don’t even know… might be extraordinarily big winners.”

The discussion also covered China’s AI prowess, particularly highlighted by DeepSeek’s efficient models. Pichai admitted that DeepSeek made many “adjust our priors a little bit” about how close Chinese R&D is to the frontier, though he noted Google’s Flash models benchmarked favorably. “China will be very, very competitive on the AI frontier,” he affirmed.

A significant portion of this section involved the engagement of Google’s founders, Larry Page and Sergey Brin. Pichai described them as “deeply involved in their own unique ways,” with Sergey Brin actively “sitting and coding” with the Gemini team, looking at loss curves and model architectures. “To have a founder sitting there… it’s a rare, rare place to be,” Pichai shared, valuing their “nonlinear thinking.”

(35:29) The Energy Bottleneck: AI’s Thirst for Power

A critical, and often underestimated, constraint for AI’s future is energy. Pichai agreed with Elon Musk’s concerns, identifying electricity as “the most likely constraint for AI progress and hence by definition GDP growth.” He stressed this is an “execution challenge,” not an insurmountable physics barrier. Solutions involve embracing innovations in solar (plus batteries), nuclear (SMRs, fusion), geothermal, alongside crucial grid upgrades, streamlined permitting, and addressing workforce shortages (e.g., electricians). While Google faces current supply constraints and project delays due to these factors, Pichai expressed faith in the US’s ability to innovate and meet the moment, driven by capitalist solutions.

(41:20) Google’s Moonshots: Quantum Computing and Robotics

Pichai reiterated Google’s commitment to long-term, patient R&D, citing Waymo as an example of perseverance.

Quantum Computing: The Next Frontier

He likened the current state of quantum computing to where AI was around 2015. “I would say in a 5-year time frame, you would have that moment where some a really useful practical computation… is done in a quantum way far superior to classical computers.” Despite the “noise” in the industry, Pichai is “absolutely confident” in Google’s leading position and expects more exciting announcements this year that will “expand people’s minds.”

Robotics: AI Embodied

The synergy between AI and robotics is creating a “next sweet spot.” Google, with its “world-class” vision-language-action models (Gemini robotics efforts), is actively planning its next moves. While past ventures into the application layer of robotics might have been premature, the current AI advancements make the field ripe for breakthroughs. “We are probably two to three years away from that magical moment in robotics too,” Pichai predicted, suggesting Google could develop something akin to an “Android for robotics” or offer its models like Gemini to power third-party hardware. He mentioned Intrinsic, an Alphabet company, as already working in this direction.

(47:56) Culture, Coddling, and Talent in the Age of AI

Addressing narratives about Google’s “coddling” culture, Pichai explained the original intent behind perks like free food: to foster collaboration and cross-pollination of ideas. While acknowledging the need to constantly refine culture, he emphasized that empowering employees remains a source of strength. He highlighted the intensity and mission-focus within teams like Google DeepMind, where top engineers often work in person five days a week.

“We are not all here in the company to resolve all our personal differences,” he stated. “We are here because you’re excited about… innovating in the service of the mission of the company.” The COVID era was a “big distortion,” and bringing people back, even in a hybrid model, has been crucial. He believes Google continues to attract top-tier talent, including the best PhD researchers, and that the current “exciting and intense” AI moment fosters a sense of optimism reminiscent of early Google.

(56:50) Alphabet’s Identity: Beyond a Holding Company

Pichai clarified that Alphabet isn’t a traditional holding company merely allocating capital. Instead, it’s built on a “foundational technology basis,” leveraging core R&D (like AI, quantum, self-driving tech) to innovate across diverse businesses. “Waymo is going to keep getting better because of the same work we do in Gemini,” he illustrated. The common strand is deep computer science and physics-based R&D, with X (formerly Google X) continuing to play a role as an incubator for moonshots like sustainable agriculture (Tapestries) and grid modernization.

Reflections: Regrets and Pride

When asked about his biggest regrets and proudest achievements, Pichai expressed immense pride in Google’s unique ability to “push the technology frontier” with foundational R&D and translate it into valuable products and businesses. As for regrets, he mentioned, “There are acquisitions we debated hard, came close.” When pressed for a name, he hesitantly offered, “Maybe Netflix. We debated Netflix at some point super intensely inside.” He framed these not as deep regrets but as acknowledgments of alternate paths in a world of “butterfly effects.”

Sundar Pichai’s appearance on the All-In Podcast painted a picture of a leader and a company that are not just reacting to the AI revolution but are actively shaping it. With a clear-eyed view of the challenges and an unwavering belief in Google’s innovative capacity, Pichai’s insights suggest that Alphabet is determined to remain at the forefront of technological advancement for years to come.
May 16, 2025
The Future We Can’t Ignore: Google’s Ex-CEO on the Existential Risks of AI and How We Must Control It

AI isn’t just here to serve you the next viral cat video—it’s on the verge of revolutionizing or even dismantling everything from our jobs to global security. Eric Schmidt, former Google CEO, isn’t mincing words. For him, AI is both a spark and a wildfire, a force that could make life better or burn us down to the ground. Here’s what Schmidt sees on the horizon, from the thrilling to the bone-chilling, and why it’s time for humanity to get a grip.

Welcome to the AI Arms Race: A Future Already in Motion

AI is scaling up fast. And Schmidt’s blunt take? If you’re not already integrating AI into your business, you’re not just behind the times—you’re practically obsolete. But there’s a catch. It’s not enough to blindly ride the AI wave; Schmidt warns that without strong ethics, AI can drag us into dystopian territory. AI might build your company’s future, or it might drive you into a black hole of misinformation and manipulation. The choice is ours—if we’re ready to make it.

The Good, The Bad, and The Insidious: AI in Our Daily Lives

Schmidt pulls no punches when he points to social media as a breeding ground for AI-driven disasters. Algorithms amplify outrage, keep people glued to their screens, and aren’t exactly prioritizing users’ mental health. He sees AI as a master of manipulation, and social platforms are its current playground, locking people into feedback loops that drive anxiety, depression, and tribalism. For Schmidt, it’s not hard to see how AI could be used to undermine truth and democracy, one algorithmic nudge at a time.

AI Isn’t Just a Tool—It’s a Weapon

Think AI is limited to Silicon Valley’s labs? Think again. Schmidt envisions a future where AI doesn’t just enhance technology but militarizes it. Drones, cyberattacks, and autonomous weaponry could redefine warfare. Schmidt talks about “zero-day” cyber attacks—threats AI can discover and exploit before anyone else even knows they exist. In the wrong hands, AI becomes a weapon as dangerous as any in history. It’s fast, it’s ruthless, and it’s smarter than you.

AI That Outpaces Humanity? Schmidt Says, Pull the Plug

The elephant in the room is AGI, or artificial general intelligence. Schmidt is clear: if AI gets smart enough to make decisions independently of us—especially decisions we can’t understand or control—then the only option might be to shut it down. He’s not paranoid; he’s pragmatic. AGI isn’t just hypothetical anymore. It could evolve faster than we can keep up, making choices for us in ways that could irreversibly alter human life. Schmidt’s message is as stark as it gets: if AGI starts rewriting the rules, humanity might not survive the rewrite.

Big Tech, Meet Big Brother: Why AI Needs Regulation

Here’s the twist. Schmidt, a tech icon, says AI development can’t be left to the tech world alone. Government regulation, once considered a barrier to innovation, is now essential to prevent the weaponization of AI. Without oversight, we could see AI running rampant—from autonomous viral engineering to mass surveillance. Schmidt is calling for laws and ethical boundaries to rein in AI, treating it like the next nuclear power. Because without rules, this tech won’t just bend society; it might break it.

Humanity’s Play for Survival

Schmidt’s perspective isn’t all doom. AI could solve problems we’re still struggling with—like giving every kid a personal tutor or giving every doctor the latest life-saving insights. He argues that, used responsibly, AI could reshape education, healthcare, and economic equality for the better. But it all hinges on whether we build ethical guardrails now or wait until the Pandora’s box of AI is too wide open to shut.

Bottom Line: The Clock’s Ticking

AI isn’t waiting for us to get comfortable. Schmidt’s clear-eyed view is that we’re facing a choice. Either we control AI, or AI controls us. There’s no neutral ground here, no happy middle. If we don’t have the courage to face the risks head-on, AI could be the invention that ends us—or the one that finally makes us better than we ever were.

November 14, 2024
Gemini: Google’s Multimodal AI Breakthrough Sets New Standards in Cross-Domain Mastery

Google’s recent unveiling of the Gemini family of multimodal models marks a significant leap in artificial intelligence. The Gemini models are not just another iteration of AI technology; they represent a paradigm shift in how machines can understand and interact with the world around them.

What Makes Gemini Standout?

Gemini models, developed by Google, are unique in their ability to simultaneously process and understand text, images, audio, and video. This multimodal approach allows them to excel across a broad spectrum of tasks, outperforming existing models in 30 out of 32 benchmarks. Notably, the Gemini Ultra model has achieved human-expert performance on the MMLU exam benchmark, a feat that has never been accomplished before.

How Gemini Works

At the core of Gemini’s architecture are Transformer decoders, which have been enhanced for stable large-scale training and optimized performance on Google’s Tensor Processing Units. These models can handle a context length of up to 32,000 tokens, incorporating efficient attention mechanisms. This capability enables them to process complex and lengthy data sequences more effectively than previous models.

The Gemini family comprises three models: Ultra, Pro, and Nano. Ultra is designed for complex tasks requiring high-level reasoning and multimodal understanding. Pro offers enhanced performance and deployability at scale, while Nano is optimized for on-device applications, providing impressive capabilities despite its smaller size.

Diverse Applications and Performance

Gemini’s excellence is demonstrated through its performance on various academic benchmarks, including those in STEM, coding, and reasoning. For instance, in the MMLU exam benchmark, Gemini Ultra scored an accuracy of 90.04%, exceeding human expert performance. In mathematical problem-solving, it achieved 94.4% accuracy in the GSM8K benchmark and 53.2% in the MATH benchmark, outperforming all competitor models. These results showcase Gemini’s superior analytical capabilities and its potential as a tool for education and research.

The model family has been evaluated across more than 50 benchmarks, covering capabilities like factuality, long-context, math/science, reasoning, and multilingual tasks. This wide-ranging evaluation further attests to Gemini’s versatility and robustness across different domains.

Multimodal Reasoning and Generation

Gemini’s capability extends to understanding and generating content across different modalities. It excels in tasks like VQAv2 (visual question-answering), TextVQA, and DocVQA (text reading and document understanding), demonstrating its ability to grasp both high-level concepts and fine-grained details. These capabilities are crucial for applications ranging from automated content generation to advanced information retrieval systems.

Why Gemini Matters

Gemini’s breakthrough lies not just in its technical prowess but in its potential to revolutionize multiple fields. From improving educational tools to enhancing coding and problem-solving platforms, its impact could be vast and far-reaching. Furthermore, its ability to understand and generate content across various modalities opens up new avenues for human-computer interaction, making technology more accessible and efficient.

Google’s Gemini models stand at the forefront of AI development, pushing the boundaries of what’s possible in machine learning and artificial intelligence. Their ability to seamlessly integrate and reason across multiple data types makes them a formidable tool in the AI landscape, with the potential to transform how we interact with technology and how technology understands the world.

December 6, 2023
Microsoft Transitions from Bing Chat to Copilot: A Strategic Rebranding

In a significant shift in its AI strategy, Microsoft has announced the rebranding of Bing Chat to Copilot. This move underscores the tech giant’s ambition to make a stronger imprint in the AI-assisted search market, a space currently dominated by ChatGPT.

The Evolution from Bing Chat to Copilot

Microsoft introduced Bing Chat earlier this year, integrating a ChatGPT-like interface within its Bing search engine. The initiative marked a pivotal moment in Microsoft’s AI journey, pitting it against Google in the search engine war. However, the landscape has evolved rapidly, with the rise of ChatGPT gaining unprecedented attention. Microsoft’s rebranding to Copilot comes in the wake of OpenAI’s announcement that ChatGPT boasts a weekly user base of 100 million.

A Dual-Pronged Strategy: Copilot for Consumers and Businesses

Colette Stallbaumer, General Manager of Microsoft 365, clarified that Bing Chat and Bing Chat Enterprise would now collectively be known as Copilot. This rebranding extends beyond a mere name change; it represents a strategic pivot towards offering tailored AI solutions for both consumers and businesses.

The Standalone Experience of Copilot

In a departure from its initial integration within Bing, Copilot is set to become a more autonomous experience. Users will no longer need to navigate through Bing to access its features. This shift highlights Microsoft’s intent to offer a distinct, streamlined AI interaction platform.

Continued Integration with Microsoft’s Ecosystem

Despite the rebranding, Bing continues to play a crucial role in powering the Copilot experience. The tech giant emphasizes that Bing remains integral to their overall search strategy. Moreover, Copilot will be accessible in Bing and Windows, with a dedicated domain at copilot.microsoft.com, parallel to ChatGPT’s model.

Competitive Landscape and Market Dynamics

The rebranding decision arrives amid a competitive AI market. Microsoft’s alignment with Copilot signifies its intention to directly compete with ChatGPT and other AI platforms. However, the company’s partnership with OpenAI, worth billions, adds a complex layer to this competitive landscape.

The Future of AI-Powered Search and Assistance

As AI continues to revolutionize search and digital assistance, Microsoft’s Copilot is poised to be a significant player. The company’s ability to adapt and evolve in this dynamic field will be crucial to its success in challenging the dominance of Google and other AI platforms.

November 15, 2023
Amazon Charts New Territory with ‘Vega’: A Homegrown OS for Smart Devices

Amazon, the global e-commerce behemoth, is reportedly taking a bold step away from Android with the development of its own operating system for Fire TVs and smart displays. According to sources and internal discussions, the project, internally dubbed ‘Vega’, is set to revolutionize the software backbone of Amazon’s suite of connected devices.

The initiative, which has been under the radar since as early as 2017, has gained traction recently with the involvement of notable industry professionals like former Mozilla engineer Zibi Braniecki. With Vega, Amazon aims to shed the technical limitations imposed by Android’s legacy code, which was originally designed for mobile phones, not the burgeoning smart home market.

Vega is poised to offer a Linux-based, web-forward operating system, pivoting towards React Native for app development. This shift promises a more unified and efficient development environment, enabling programmers to create versatile apps that are operable across a myriad of devices and operating systems.

This strategic move by Amazon seems twofold: gaining technological independence from Google’s Android, and establishing a more robust platform for reaching consumers through various devices, potentially increasing revenue through targeted ads and services.

As Vega’s development continues, with a possible rollout on select Fire TV devices by next year, Amazon sets the stage for a new era in smart device interaction, aligning itself for greater control over its technological destiny and consumer reach.

November 9, 2023
Leveraging Efficiency: The Promise of Compact Language Models

In the world of artificial intelligence chatbots, the common mantra is “the bigger, the better.”

Large language models such as ChatGPT and Bard, renowned for generating authentic, interactive text, progressively enhance their capabilities as they ingest more data. Daily, online pundits illustrate how recent developments – an app for article summaries, AI-driven podcasts, or a specialized model proficient in professional basketball questions – stand to revolutionize our world.

However, developing such advanced AI demands a level of computational prowess only a handful of companies, including Google, Meta, OpenAI, and Microsoft, can provide. This prompts concern that these tech giants could potentially monopolize control over this potent technology.

Further, larger language models present the challenge of transparency. Often termed “black boxes” even by their creators, these systems are complicated to decipher. This lack of clarity combined with the fear of misalignment between AI’s objectives and our own needs, casts a shadow over the “bigger is better” notion, underscoring it as not just obscure but exclusive.

In response to this situation, a group of burgeoning academics from the natural language processing domain of AI – responsible for linguistic comprehension – initiated a challenge in January to reassess this trend. The challenge urged teams to construct effective language models utilizing data sets that are less than one-ten-thousandth of the size employed by the top-tier large language models. This mini-model endeavor, aptly named the BabyLM Challenge, aims to generate a system nearly as competent as its large-scale counterparts but significantly smaller, more user-friendly, and better synchronized with human interaction.

Aaron Mueller, a computer scientist at Johns Hopkins University and one of BabyLM’s organizers, emphasized, “We’re encouraging people to prioritize efficiency and build systems that can be utilized by a broader audience.”

Alex Warstadt, another organizer and computer scientist at ETH Zurich, expressed that the challenge redirects attention towards human language learning, instead of just focusing on model size.

Large language models are neural networks designed to predict the upcoming word in a given sentence or phrase. Trained on an extensive corpus of words collected from transcripts, websites, novels, and newspapers, they make educated guesses and self-correct based on their proximity to the correct answer.

The constant repetition of this process enables the model to create networks of word relationships. Generally, the larger the training dataset, the better the model performs, as every phrase provides the model with context, resulting in a more intricate understanding of each word’s implications. To illustrate, OpenAI’s GPT-3, launched in 2020, was trained on 200 billion words, while DeepMind’s Chinchilla, released in 2022, was trained on a staggering trillion words.

Ethan Wilcox, a linguist at ETH Zurich, proposed a thought-provoking question: Could these AI language models aid our understanding of human language acquisition?

Traditional theories, like Noam Chomsky’s influential nativism, argue that humans acquire language quickly and effectively due to an inherent comprehension of linguistic rules. However, language models also learn quickly, seemingly without this innate understanding, suggesting that these established theories may need to be reevaluated.

Wilcox admits, though, that language models and humans learn in fundamentally different ways. Humans are socially engaged beings with tactile experiences, exposed to various spoken words and syntaxes not typically found in written form. This difference means that a computer trained on a myriad of written words can only offer limited insights into our own linguistic abilities.

However, if a language model were trained only on the vocabulary a young human encounters, it might interact with language in a way that could shed light on our own cognitive abilities.

With this in mind, Wilcox, Mueller, Warstadt, and a team of colleagues launched the BabyLM Challenge, aiming to inch language models towards a more human-like understanding. They invited teams to train models on roughly the same amount of words a 13-year-old human encounters – around 100 million. These models would be evaluated on their ability to generate and grasp language nuances.

Eva Portelance, a linguist at McGill University, views the challenge as a pivot from the escalating race for bigger language models towards more accessible, intuitive AI.

Large industry labs have also acknowledged the potential of this approach. Sam Altman, the CEO of OpenAI, recently stated that simply increasing the size of language models wouldn’t yield the same level of progress seen in recent years. Tech giants like Google and Meta have also been researching more efficient language models, taking cues from human cognitive structures. After all, a model that can generate meaningful language with less training data could potentially scale up too.

Despite the commercial potential of a successful BabyLM, the challenge’s organizers emphasize that their goals are primarily academic. And instead of a monetary prize, the reward lies in the intellectual accomplishment. As Wilcox puts it, the prize is “Just pride.”

May 31, 2023