Tag: AI Productivity

Vibe Coding Hardware: Naval, Guillermo Rauch, Blake Scholl, and Max Hodak on AI-Designed Jet Engines, Vertical Integration, China’s Open-Source Bet, and Why Humans Become Verifiers
This is part two of Naval Ravikant’s conversation with frontier founders Guillermo Rauch of Vercel, Blake Scholl of Boom Supersonic, and Max Hodak of Science. Where the first part argued that you should waste tokens to save time and that the job of an engineer is now to build the factory rather than the output, this segment drags that thesis out of pure software and into atoms. The question on the table is what happens to hardware when models can vibe code the spreadsheets, the simulations, and eventually the step files and PCB layouts that aerospace, semiconductors, and biotech are built on. This segment is one half of the discussion, and you can watch and read the full episode here. The full conversation is on the Naval Podcast YouTube channel.

TLDW

Blake Scholl describes how Boom Supersonic took hardware engineering workflows that used to live in siloed Excel spreadsheets and VBScript on individual laptops, with handoffs done by email like it was the 1990s, and turned them into versioned, testable software. The new model is that software engineers build the architectures and the tools while hardware engineers vibe code their own domain-specific pieces, which collapsed a turbine-blade analysis that once took one engineer one day per blade into something where two engineers can design an entire jet engine in real time. Naval generalizes this into the cataclysm of enterprise software: there is no longer a startup that can sell you hardware collaboration tools because companies just code the exact thing they need on demand, and even spreadsheets are cooked because they only existed as a proxy for custom software nobody could previously afford to build. Blake predicts that within 2026 AI will move from generating software to generating step files and PCB layouts, which reshapes mechanical and electrical engineering. The group debates China’s open-source push as a way to neutralize Silicon Valley’s software advantage and protect its hardware and supply-chain superiority, lands on the point that if you fall behind on generating software you fall behind on generating everything, and Guillermo notes that frontier coding intelligence still dominates real usage while cheaper models like Gemini win at scale for support and browser automation. Max Hodak explains Science’s vertical integration, including a captive MEMS foundry on the East Coast, because the most innovative hardware cannot be bought off the shelf, and argues that software still needs hands since a model that cannot make physical things hits real boundaries. The conversation closes on the shift from writing to verifying: junior engineering got absorbed by agents while juniors got promoted, the same way paralegals could be seen as fired or promoted, and humans across law, engineering, and operations are becoming the verifiers who sign off on systems they did not write line by line.

Thoughts

The most important shift in this segment is that vibe coding stops being a software-industry story and becomes a deep-tech story. In part one the examples were Postgres, ClickHouse, and deploy targets. Here Blake Scholl is talking about turbine blades that change shape when they heat up, and the brutal fact that converting between cold and hot geometry, and between aerodynamics and structures, used to eat one engineer for one full day per blade in an engine that has a thousand blades. That is the kind of math that quietly kills ambition. When he says two engineers can now design an entire jet engine because the structural and aerodynamic results update in real time as you change the geometry, that is not a productivity improvement, it is a change in what a small team is allowed to attempt. The interesting move is the division of labor: software engineers build the architecture and the framework because they understand systems and separation of concerns, and the hardware engineers vibe code the pieces only they understand. Nobody has to become both.

Naval’s “cataclysm of enterprise software” is the most investable idea in the episode, and it is darker than it sounds for anyone selling B2B tools. His claim is that the entire category of internal collaboration software is being eaten from the inside, because a company that can generate exactly the tool it needs on any given day will not pay a vendor for an approximation of that tool. His follow-on that even spreadsheets are cooked is the sharpest version of the point. The spreadsheet won for forty years precisely because it was the closest thing to custom software that a non-programmer could produce. Remove the constraint that custom software is expensive and the spreadsheet loses its reason to exist. The counterweight, which the group raised in part one with the block-economy thesis, is that the infrastructure primitives agents reach for get more valuable, not less. So the safe place to build is not the collaboration layer on top, it is the primitive underneath.

The China discussion is the geopolitical center of the conversation and it lands on a genuinely uncomfortable insight. The argument is that China leans into open-source models not only because it is a model or two behind, but because open weights neutralize Silicon Valley’s software advantage and let China lean on what it already dominates: hardware, supply chains, and component ecosystems. If software can be generated on demand from open models, then the country with the factories wins the stack. The sharpest line is that if you fall behind on the ability to generate software, you fall behind on the ability to generate everything, because software is now upstream of every hardware pipeline. That reframes the open-versus-closed debate as a question about who controls the means of producing the means of production. It also quietly flatters the American frontier labs, since the same logic says self-improvement requires frontier coding models, and on that narrow axis the consensus at the table is that the Chinese models are not yet in the race.

Max Hodak provides the necessary cold water, and it is the most grounding contribution in the episode. Everyone else is describing software eating the design layer, and Max points out that you still have to make the thing. Science owns a captive MEMS foundry on the East Coast not as a flex but because there was no other way to do the packaging and assembly for products that approach a single block of covalently bonded matter. His framing that the software still needs hands is the real boundary condition on all the AI-eats-everything talk: a model can be smarter than every engineer in the building and still be unable to deposit a layer, bond a wafer, or pass a regulatory inspection. The optimistic version, which he also makes, is that he has instrumented the foundry so that as models improve, the gains show up immediately in cell engineering and material science. The pessimistic reading is that the physical world remains a hard rate limiter, and the companies that own the atoms will capture more of the surplus than the companies that only own the bits.

The closing thread on verification is where the whole conversation resolves into a job description for humans. Guillermo’s point that the biggest problem in software is mountains of slop arriving as a pull request, and that the answer is not pretending to read every line but being able to say “I am signing off on the consequences of this PR, and I wrote the harness, the simulations, the proofs, and the type checkers that let me,” is the most practically useful idea in the episode. It generalizes cleanly. The lawyer you trust is not the one who wrote every clause by hand, it is the one putting their reputation on the line that the document is sound. The production engineer who gets paged at 3am is the one signing off that the system is safe to ship. As models absorb the junior tier of every knowledge profession, the surviving human role is the verifier who carries the accountability. That is a promotion for the people who can hold it and an extinction event for the people whose value was doing the work nobody now needs done by hand.

Key Takeaways
- The factory framing from part one carries straight into hardware: you are judged on whether you build the system that produces multiplicative outputs, not on the single artifact, and the real multiplier was always 100x or 1000x, not 10x.
- AI completely changes the role of software and hardware developers rather than just speeding either one up.
- A huge amount of hardware engineering lives in complex Excel spreadsheets and VBScript on individual engineers’ laptops, with no source control, no automated testing, and handoffs done manually over email. It is software that is not treated as software.
- Boom Supersonic’s move from day one was to turn traditional hardware engineering workflows into real software frameworks that are automatable and repeatable, to drive down the cost of iteration.
- The old bottleneck was never being able to afford enough software engineers to build those frameworks. AI removes that constraint.
- The new model: software engineers create the architectures because they understand systems, algorithms, and separation of concerns, and hardware engineers vibe code the domain pieces only they understand.
- A turbine blade is cold when it starts and hot when it runs, so it changes shape, and you must design both the cold and hot geometry across aerodynamics and structures. Classically that was one engineer, one day, for one blade, in an engine with a thousand blades.
- With software and hardware people combined, you can now change blade geometry and see the structural and aerodynamic results in real time, which lets two engineers design an entire jet engine.
- Naval’s cataclysm of enterprise software: no startup can sell hardware collaboration tools anymore because companies just code the exact thing they need at any given time.
- Even spreadsheets are cooked. Spreadsheets won only because nobody could build custom software, so a spreadsheet full of VBScript was the closest available approximation. Remove the cost barrier and the approximation loses.
- Engineers are moving from Excel to Python models that produce believable simulations of physical systems.
- AI can generate software today, but within 2026 it is expected to generate step files and PCB layouts, which opens up mechanical and electrical engineering as the next frontier.
- The hardware software boon is biggest for small gadget and parts companies that historically shipped bad software because they could not afford good software. Now they can ship good-enough software, or skip the human front end entirely and expose hardware agentically for voice and agent control.
- China goes all in on open-source models partly to neutralize Silicon Valley’s software edge: if software can be generated on demand from open weights, China’s hardware and supply-chain superiority stops being offset by a software disadvantage.
- Other reasons cited for China’s open-source push: it is a model or two behind, it is distilling models, and the government has a history of funding efforts that lift the whole ecosystem, especially in network-effect businesses.
- Open-source heft is coming almost entirely from China. OpenAI is not open, Grok publishes models but is seen as a model or two behind, Google’s local models are not very competitive, and Anthropic is not known for open-source releases.
- Without frontier coding models you do not get self-improvement, and if you fall behind on generating software you fall behind on generating everything, because software now sits upstream of every hardware pipeline.
- Real AI gateway usage shows open models do get used, but the top is heavily dominated by frontier intelligence.
- Frontier intelligence at the right cost and performance slaps at scale. Gemini models are underrated and excel as industrial production models for support tasks and browser automation, even if they are not the top pick for coding.
- For pushing the frontier you need the best possible coding model, which is now only two or three models, and the Chinese models are not among them.
- One contrarian view at the table: use DeepSeek for 97% of tasks because it is cheap, run it repeatedly for harder problems, and reserve frontier models for the most advanced work. The counterargument: intelligence is an unalloyed good, mistakes are invisible and costly, and a smarter model is always cheaper than a person, so you default to the most intelligent option.
- Always wanting the most intelligent model risks creating a monopoly or oligopoly in AI, because when two models disagree you cannot tell which is right, so you trust the smarter one and stop asking the weaker one.
- Vertical integration is forced, not chosen: if you cannot buy it, you have to make it. The preference is always to buy when a vendor offers a service at a great price, like PCBs from Asia.
- The closer a product gets to a single block of covalently bonded matter, the better it performs: lower power, smaller, higher performance, longer lasting. The components for that level of integration simply are not available to buy.
- Science owns a captive MEMS foundry on the East Coast, bought because there was no other way to do the packaging and assembly the company needed.
- One of the biggest near-term AI impacts inside hardware companies is regulatory and documentation work: tracing which of thousands of ISO standards apply used to occupy a regulatory and quality team for months, and now AI just knows.
- Software still needs hands. A model can be smarter than us and still hit real boundaries if it cannot physically make things, which is why Science has instrumented its foundry so model improvements show up immediately in cell engineering and material science.
- Basic legal work is already going away. People have stopped asking lawyers for NDAs and routine agreements, because law is spaghetti code in English with no real APIs, and the basic tasks are handled by AI.
- Junior engineers got promoted to senior engineers while junior engineering itself got taken over by agents. The same framing applies to paralegals: fired, or promoted to senior lawyers who now spend their time thinking about the law.
- What you value in a lawyer is a trusted authority who puts their reputation on the line, not someone who read every clause. The same trust model is coming to engineering.
- The biggest problem in software engineering today is mountains of slop arriving as a pull request. The old norm of reading every line of a PR is gone.
- The new standard is being able to say “I understand and I am signing off on the consequences of this PR,” backed by the test harness, simulations, proofs, and type checkers you built, even without reading every line.
- Embrace a world where code is spaghetti you do not fully understand, but build the evaluators that give confidence, and rely on production engineers to sign off because someone gets paged if the system goes down.
- Creating software is easy from zero to one. The hard part is a thousand days from now: is it secure, tested, production grade, and performant, and are you still motivated to invest the tokens to maintain it in prod?
- Humans are becoming verifiers. The same way models are trained on good verification data, the old functions of lawyers, engineers, and operations people are moving to verifying the stack and standing behind it.
Detailed Summary

Turning Hardware Engineering Into Software

Blake Scholl opens by describing how AI completely changes the role of software and hardware developers at Boom Supersonic. From day one the company tried to take traditional hardware engineering workflows and turn them into software. For anyone who has not been around hardware engineering, he explains that an enormous amount of it happens in complex Excel spreadsheets on individual engineers’ laptops, sometimes with VBScript code, all of which is actually software but is not treated as software. There is no source control, no automated testing, and when an aerodynamicist hands work to a structures engineer it is done manually with a spreadsheet over email, like it is the 1990s. Boom started building software frameworks to automate and make those flows repeatable so the cost of iteration would drop, but progress was slow because the company could never afford enough software engineers.

Two Engineers, One Jet Engine

The mind-blowing change, in Blake’s words, is a new division of labor. Software engineers create the architectures because they understand systems, algorithms, and separation of concerns, and then hardware engineers vibe code the pieces that draw on what they uniquely know about hardware. The result is wildly different productivity for small teams. His example is the turbine blade: it starts cold and gets bigger as it heats up in operation, so you have to design both the cold shape and the hot shape, converting between them and between structures and aerodynamics. Classically that was one engineer, one day, for one blade of analysis, in a jet engine with a thousand blades, which means you simply could not do much. Now, with software and hardware people working together, you can change blade geometry and see the structural and aerodynamic results in real time, which allows two engineers to design an entire jet engine.

The Cataclysm of Enterprise Software

Picking up on the point that software engineers now build the tools and architectures for everyone else, Naval names what he calls the cataclysm of enterprise software. There is no longer a startup that can build and sell hardware collaboration tools, because internally companies just code the right things they need at any given moment. Even spreadsheets are cooked, he argues, because the reason spreadsheets succeeded is that no one could build custom software, so a spreadsheet stuffed with VBScript functions was the closest available approximation. With that constraint gone, the proxy collapses. He notes he has personally moved almost entirely from Excel to Python models where he can get believable simulations of things.

Generating Step Files and PCB Layouts

The next frontier, Blake suggests, is the thing AI has not reached yet but probably will within 2026: today it can generate software, but soon it will generate step files and PCB layouts, and when it comes for mechanical and electrical engineering that will be a whole other thing nobody has seen yet. On the hardware side this is described as a particular boon for the many small gadget and parts companies that historically wrote bad software because they could not make great software. Now they can make good-enough software, or skip a human front end entirely and expose the hardware agentically, so that an agent accesses it and a person controls the hardware by voice.

China’s Open-Source Bet and Hardware Superiority

This leads into one of the reasons China is described as going all in on open-source models. With hardware superiority, complex supply chains, and deep component chains, China’s logic is that if it can generate software on demand it no longer suffers a software disadvantage against Silicon Valley. That is framed as not the only reason: China is also a model or two behind, it is distilling models, and the government has a history of funding efforts that lift the entire ecosystem, especially in network-effect businesses. Ironically, the open-source heft comes from China precisely because OpenAI is not open, Grok publishes models but is a model or two behind, Google’s local models are not very competitive, and Anthropic is not known for open releases. The deeper point is that without great frontier coding models you do not get self-improvement, and if you fall behind on the ability to generate software you fall behind on the ability to generate everything, because generating software is embedded in every piece of the hardware pipeline.

Frontier Intelligence vs. Cheap Models

Naval raises a dinner-table argument from the night before, where someone claimed you will use DeepSeek for 97% of things because it is cheap, run it repeatedly when you need more intelligence, and reserve OpenAI or Anthropic for the most advanced tasks. Naval pushes back: intelligence is an unalloyed good, you always want more of it, model mistakes are invisible, and a smarter model is always cheaper than a real person in real time, so you default to the most intelligent model available. He notes the downside is that this tends toward a monopoly or oligopoly, because when two models give different answers you often cannot tell which is correct, so you trust the smarter one and gradually stop asking the weaker one. Guillermo confirms with AI gateway data that open models do get used, but the top is heavily dominated by frontier intelligence. His caveat is that frontier intelligence at the right cost and performance slaps at scale: Gemini models are underrated but are excellent industrial production models for support tasks and browser automation, while for pushing the frontier you need the best possible coding model, now only two or three models, and the Chinese models are not in that set.

Vertical Integration and the Captive MEMS Foundry

Asked about his push into vertical integration and extreme urgency, Max Hodak explains that for many things you cannot buy what you need, so you have to make it. The preference is always to buy when a vendor offers a service at a great price, and he points to PCBs, which are basically free and available in unlimited quantity from Asia. But the closer a product gets to being a single block of covalently bonded matter, the better it is: lower power, smaller, higher performance, longer lasting. The components for that level of integration are not available, so to innovate beyond piecing together off-the-shelf parts you have to learn to do it yourself, which shows up as vertical integration. Science owns a captive MEMS foundry on the East Coast, bought because there was no other way to do the packaging and assembly work the company wanted.

Software Still Needs Hands

Max expects AI to heavily affect all of this over the next few years, though it is not quite there yet. Ironically, one of the biggest impacts already seen is in regulatory interactions and documentation: figuring out which of thousands of ISO standards apply to a product change, and tracing it through, used to occupy a regulatory and quality team for months, and now the AI just knows. But for things like the surgical program or the MEMS fab, he argues the software still needs hands. It will be smarter than us, but if it cannot make things, those are real boundaries. Science has instrumented its foundry and many other parts of the company so that as models get better, the improvement shows up immediately in cell engineering and material science.

Lawyers, Paralegals, and the Promotion of Junior Work

The discussion turns to law as a parallel to engineering. It has been a while since anyone at the table generated a basic legal document using a lawyer. Routine work like NDAs and standard agreements is gone, because law is essentially spaghetti code that contradicts itself and has no real APIs, expressed in complicated English. Junior engineers got a promotion to senior engineers while junior engineering itself was taken over by agents, and the same framing applies to paralegals: you can say they were fired, or you can say they were promoted to senior lawyers who now spend their time thinking about the law. What you actually value in a lawyer is a trusted authority who went to law school and puts their reputation on the line when they tell you a document is legit.

Slop PRs, the Thousand-Day Problem, and Humans as Verifiers

Guillermo argues the biggest problem in software engineering today is mountains of slop ending up as a pull request. The old meme of reading every line of a PR is gone. In infrastructure he wants engineers to be able to say they understand and are signing off on the consequences of a PR, backed by the test harness, simulations, proofs, and type checkers they wrote, so they have confidence it will be safe in production even without reading every line. There is a world where everyone embraces that the code is spaghetti nobody fully understands, but builds the evaluators that give confidence and relies on production engineers to say it is fine to ship, because someone gets paged if the system goes down. The further warning is that creating software is easy from zero to one, but a thousand days from now you have to ask whether it is secure, tested, production grade, and performant, and whether you are still motivated to invest the tokens to maintain it in prod. The resolution is that humans are becoming verifiers, the same way models are trained on good verification data, and the old functions of lawyers, engineers, and operations people are moving to verifying the stack and standing behind it.

Notable Quotes

“What I found is it completely changes the role of software and hardware developers.”
Blake Scholl, on how AI reshaped engineering at Boom Supersonic.

“If you want to hand something off from like an aerodynamicist to a structures engineer that’s done manually with like a spreadsheet over email. It’s the 1990s. It’s terrible.”
Blake Scholl, describing the state of traditional hardware engineering workflows.

“It allows two engineers to design an entire jet engine, which is just wildly different.”
Blake Scholl, on collapsing turbine-blade analysis with real-time structural and aerodynamic feedback.

“Even spreadsheets are kind of cooked, right? Because the reason spreadsheets were successful is that no one could build custom software.”
Naval Ravikant, on the cataclysm of enterprise software.

“Right now it can generate software, but soon it’ll be able to generate step files and PCB layouts. And when it comes for mechanical and electrical engineering, that will be a whole other thing that we haven’t seen yet.”
Blake Scholl, on the next frontier for AI in hardware.

“If you fall behind on your ability to generate software, you fall behind on the ability to generate everything.”
Naval Ravikant, on why software now sits upstream of every hardware pipeline.

“Anytime I’m working to push the frontier you need the best possible coding model, and that’s basically now like two or three models, and the Chinese are certainly not in it.”
Guillermo Rauch, on where frontier coding intelligence actually lives.

“You can’t buy it, so you got to make it somehow. The closer that our products get to being like a single block of covalently bonded matter, the better they’ll be.”
Max Hodak, on why Science is forced into vertical integration.

“The software still needs hands. It’s going to be smarter than us, but if it can’t make things, then those are real real boundaries.”
Max Hodak, on the physical limits of AI in hardware.

“You need to be able to say I am signing off on understanding the consequences of this PR, or I wrote the test harness, the simulations, the proofs, the type checkers, to be able to say even without reading this, I have confidence it’s going to be safe in production.”
Guillermo Rauch, on what code review becomes in the age of slop PRs.

“Creating software is really easy 0 to one. But think about a thousand days from now. Is it secure? Is it tested? Is it production grade? And are you still motivated to invest all of those tokens in maintaining it in prod?”
On the long-term cost of software that is cheap to create and expensive to keep alive.

Watch the full conversation on the Naval Podcast here.

Related Reading
- Full episode: The AI Industrial Revolution, the complete hour-long conversation this clip is drawn from, covering software factories, hardware, regulation, healthcare economics, autonomous companies, and creativity.
- Part one: Waste Tokens to Save Time, the first half of this same conversation, where Naval, Guillermo Rauch, Blake Scholl, and Max Hodak argue that the job of an engineer is to build the factory and that pure software is not dead.
- Boom Supersonic, Blake Scholl’s company building supersonic civilian aircraft and its own jet engines, the source of the turbine-blade and two-engineers example.
- Science Corporation, Max Hodak’s company, whose captive MEMS foundry and surgical program anchor the vertical-integration argument.
- Vercel, Guillermo Rauch’s company, whose AI gateway data informs the point about frontier intelligence dominating real usage.
- Microelectromechanical systems (Wikipedia), background on the MEMS technology behind the captive foundry Max Hodak describes.
May 29, 2026
Tobi Lütke on Uncapped Episode 50, Building Shopify in the AI Era, The Net Impact Memo, Six Week Cycles, and Why Software Was the Hidden Infrastructure of Our Time
Tobi Lütke, the founder and CEO of Shopify, sits down with Jack Altman for Episode 50 of the Uncapped podcast for one of the most useful hours of operating wisdom you will hear from a sitting public company founder. The conversation moves from why Tobi still loves the work after twenty years, through the practical mechanics of running Shopify on six week review cycles, into the now famous AI memo he sent to the entire company, the rise of Claude Code style agents, what it means to spend tens of percent of revenue on AI tokens, why the modern web browser is a wonder of the world, and where small businesses actually fit in a world where the next Turing test might be “build me a million dollar business.” This is essential listening for any founder, operator, or investor trying to make sense of what 2026 actually requires.

TLDW

Tobi Lütke explains how he keeps loving his life’s work by pursuing what Paul Kapoa called “beautiful problems,” why “different” must always be the starting position because anything copied can only be marginally better, and why Silicon Valley’s last decade of orthodoxy has been bad for originality. He walks through his decision to send Shopify’s company wide AI memo and codify it into net impact performance reviews, the unlimited token policy for employees, why small three to five person teams are his bet, and how Parkinson’s Law and a six week review cycle force pace. He calls the doomer permanent underclass narrative completely absent from Shopify’s data, citing one new merchant getting their first sale every 36 seconds, and proposes “build me a million dollar business” as the real successor to the Turing test. He argues humanity has not stopped building wonders, we just built them all in software for thirty years, that the web browser is one of the most impressive engineering achievements ever made and could never get approved by a modern app store, and that the freed talent leaving software will rebuild the physical world. He shares his hiring philosophy, why he restarted the Shopify intern program at scale with Waterloo, his preference for public over private status, and ends with a short reading list anchored by Parkinson’s Law, Lessons of History, and a book called What Is Intelligence.

Key Takeaways
- Tobi’s recipe for life’s work is to find a beautiful problem worth occupying you for life, and accept that the solved problem will spawn delightful problem children to keep you engaged.
- His simple model of success, “figure out what it costs and be willing to pay it,” with the price almost always being time, commitment, and discomfort rather than money.
- He warns CEOs against collecting “barnacles” of aesthetic expectation, the statesman travel and baby kissing pattern, calling that lifestyle inefficient and personally miserable.
- He invokes Kathy Sierra’s line “don’t make better cameras, make better photographers” as his core product philosophy, beautiful tools that induce more ambition and skill in the user.
- Mediocre products feel like room temperature. Great products are forged in a furnace and require sustained heat from the team.
- Shopify builds its own HR software internally because the available options are not what they want to use. Toolmaking is a stated cultural identity.
- Originality is axiomatic. If you build the same thing as everyone else, you can only be marginally better. The starting position has to be “different,” and if you converge on the consensus answer through that path you have actually learned something.
- Shopify has tried to eliminate the word “failure” internally, replacing it with “the successful discovery of something that didn’t work.”
- Tobi says Silicon Valley spent the last decade declaring war on distinction, that the diversity push as practiced eradicated eccentricity, and that the inversion is now beginning. Companies should resemble islands of misfit toys, not convergence on a pre-ordained truth.
- One of his most surprising career insights, when he visited the Valley as a Canadian outsider and asked founders how they ran their companies, he only ever received the highlight reel. Trying to clone what those founders described led him to invent practices the originals had never actually implemented.
- The Shopify AI memo, sent company wide, made it explicit that two equally good engineers fifteen minutes earlier are no longer equivalent if one is fluent with AI tools and the other is not. This was codified into the company’s “net impact” performance review framework.
- Tobi describes the “founder credibility bank” as the most underrated asset in a founder led company. Every onboarding deposits a little credibility, and the founder can spend it on hard change management that would otherwise take years of incremental culture work.
- Shopify gives every employee an unlimited token policy for AI tools and displays token usage and departmental percentile on internal profiles. Token spend is tracked because it has to be allocated to opex, not because it is the target.
- He confirms Shopify’s AI token spend is “extremely high” relative to revenue and notes that some private companies are now running token spend at many tens of percent of revenue, a level he thinks cannot persist at every stage but makes sense right now because the tokens are buying so much leverage.
- Shopify is on track to 10x its annual token consumption and 3x its GPU footprint, and those two curves do not converge anywhere good for price relief.
- His bet on team design is small, three to five people, which has always been Shopify’s bias. AI agents now handle the customer research summarization role that previously required a dedicated team member, raising every individual to a “seven out of ten on every scale.”
- Parkinson’s Law (the book, 60 pages, 1960s edition) is his single most recommended management book. He owns multiple original print runs and gives copies to executives. “Work expands to the time allocated.”
- Shopify runs on a six week review cycle. The first warning sign that a team has slipped into quarterly pacing is seeing “H1” or “H2” used in a PowerPoint. He now thinks six weeks is too slow and is actively trying to figure out what replaces it.
- The “permanent underclass” doom narrative simply does not appear anywhere in Shopify’s data. New entrepreneurs are reporting that AI has finally fixed computers for them, expanding their businesses and letting them hire.
- A new merchant gets their first Shopify sale every 36 seconds. Every reduction in onboarding friction produces a measurable jump in completed businesses.
- Tobi proposes “go make me a million dollars” as the natural successor to the Turing test, an end to end test of acting in the real world, marketing, prioritizing, shipping, and producing something people will pay for.
- Shopify Collective lets aspiring entrepreneurs sell other manufacturers’ products if their skill is marketing rather than making. Print on demand, additive manufacturing, contract manufacturing, CNC, 3D printing, and humanoid robotics are all pulling the cost of “make the product yourself” toward the floor.
- The reason American infrastructure feels stagnant for thirty years is that the infrastructure humanity actually needed was digital. The web browser, Linux, Google, social networks, and Shopify itself are wonders that dwarf a refinery in complexity but are invisible by nature.
- Tobi calls the modern web browser one of the wonders of the world. Font rendering alone is a Turing complete system. No app store on earth would approve the browser today if it did not already exist, because the pitch (“we download untrusted code from strangers and run it on your machine to reconfigure your computer for them”) sounds insane.
- The next chapter is the brightest software engineers being freed by AI to build the physical infrastructure that has been deferred for a generation.
- He prefers to predict the future by collecting many data points and matching them to super linear, linear, or sublinear curves. The current AI horizon is the hardest period of his career to forecast because the time horizons are so short.
- Programming is overhyped as the locus of AI value. The bigger story is using the programming harness, the file system, tools, and memory files of products like Claude Code, to drag every other domain into the programming domain where the models are strongest.
- The underhyped frontier is enterprise deployment. Most companies are still asking “help me do the thing I already did, slightly better,” instead of “if AI had existed since Alan Turing, how would I have designed this job from scratch.”
- Tobi restarted the Shopify intern program at scale, partnered closely with the University of Waterloo, and explicitly frames interns as both students and teachers because they are AI native in a way the rest of the company is still catching up to.
- He briefly believed AI would tilt the value of work toward early career talent with maximum fluid intelligence, then revised when he watched how much creative “steering” the best programmers were quietly contributing inside the AI loop. Good people are still good.
- His recruiting philosophy is “build a company worth looking for” rather than selling candidates. Better to actually be healthier than to look healthier in photographs.
- Tobi is a vocal defender of being a public company. Shopify IPO’d at a $1.5 billion valuation and has roughly 100x’d in public markets, which means an enormous number of retail investors have shared in the upside that recent unicorns reserve for insiders.
- His framing of money, “money is how you vote for what you want.” Buying a product or buying a share is a vote for the thing existing.
- His current reading recommendations, Parkinson’s Law, Lessons of History, and a book called What Is Intelligence that reframes biology around prediction.
- He reads at night because his wife sleeps early and he does not need much sleep. He loves the Kindle precisely because it cannot do anything else, a “wonderful single purpose device.”
Detailed Summary

Why Tobi Still Loves the Work After Twenty Years

The interview opens with Jack Altman asking how Tobi avoids the founder fade that hits most public company CEOs after a decade. Tobi answers from a place that is half psychology and half pedagogy. He has a hard time learning anything he has not first experienced as a problem worth solving, which is why he could not internalize school mathematics until he discovered that Wolfenstein 3D was essentially live trigonometry. That pattern, find a beautiful problem and let it drag you into the discipline, has carried him through twenty years of Shopify. He quotes Paul Kapoa on the idea that the luckiest people find a problem that occupies them for a lifetime and, if they are unfortunate enough to solve it, get rewarded with “delightful problem children” that keep the work alive.

Barnacles, Statesmen, and the Aesthetic Trap of Being a CEO

He admits he is not naturally calm, and that he initially fell into the trap of trying to perform the CEO aesthetic, the statesman, the global travel, the baby kissing. He found it inefficient and personally miserable. The shift came from reading Kathy Sierra and adopting her line about not making better cameras but making better photographers. Shopify exists, in his framing, to be a beautiful tool that induces ambition in the merchant. Mediocre products feel like room temperature, and great products are forged in a furnace. The job of leadership is to keep supplying the heat.

Different First, Convergence Second, Failure as Successful Discovery

Asked whether he prefers originality or quality, Tobi is unequivocal. The starting position must be different. If you copy the consensus answer, you are bounded to a few percentage points of variance from it. If you start different and converge on the consensus, you have learned something. If you start different and the experiment gets worse, you have learned something even more valuable, which is that one of your assumptions about the world was wrong. He calls null results in science “massively underrated” and notes that Shopify has tried to remove the word “failure” from the internal vocabulary, substituting “the successful discovery of something that didn’t work.”

Why Silicon Valley Lost Its Originality

Jack pushes on the herd mentality he has felt in the Bay Area, and Tobi is direct. He thinks Silicon Valley “declared war on distinction” for a decade, with the diversity conversation as practiced effectively eradicating eccentricity. He prefers the metaphor of “an island of misfit toys,” and says the inversion is now beginning. He also relays one of the most useful career lessons he has shared, that during his visits to the Valley as an outsider asking founders how they ran their companies, he only ever received the highlight reel. He went home and engineered a “Shopify version” of what he thought he had heard, and only years later realized that he had often built more rigorous versions of things the originals had never actually implemented.

The AI Memo, Net Impact Reviews, and the Founder Credibility Bank

Tobi was one of the first Fortune class CEOs to send a company wide memo saying that AI fluency was now a baseline expectation. He walks through the decision. Two engineers who were equally productive fifteen minutes ago are no longer equivalent the moment one of them adopts the new tools. The kind thing to do is to make that explicit. Shopify codified it into “net impact” performance reviews, where the question is not how much code you wrote but how much net impact you produced for the company and the mission. He gives every employee an unlimited token policy and tracks usage at the profile level, including percentile within department. The spend is tracked because it has to be allocated to opex, not because the number itself is the target.

He introduces the concept of the “founder credibility bank,” which may be the single most quotable idea in the interview. Every time a new employee onboards and hears how the company was created, a small deposit of credibility is made into a virtual account that only the founder can draw on. Founders can spend that balance on hard change management, the kind of pace step change that would otherwise require years of small cultural nudging. The AI memo was a deliberate withdrawal from that account, and the speed of adoption that followed has been, in his telling, remarkable.

Tokens, Opex, and the Limits of Spend as Revenue

Jack presses on the financial reality of AI tokens. Tobi confirms that Shopify’s token spend is “extremely high” relative to revenue, and that the leverage they are buying makes the spend a no brainer at the current stage of the curve. He concedes that private companies running token spend at “many tens of percent of revenue” cannot sustain that ratio forever, but he is not worried for Shopify because the tokens are clearly productive and Shopify is a profitable public company with the balance sheet to lean in. He expects to 10x token consumption and 3x GPUs every year for now, and notes that the curves do not converge in a direction that lowers prices. He has high faith in markets to find clearing prices.

Small Teams, Parkinson’s Law, and the Six Week Cycle

On team architecture, Tobi has always preferred three to five person teams and says AI has finally made that feasible across the board. Roles that previously required a dedicated specialist, customer research summarization being the canonical example, are now handled by the “agentic harness” routing summarized customer feedback into every team. Everyone is a “seven out of ten on every scale” by default. He spends serious time on pace, which he treats as the single most important variable to control. His most recommended book is Parkinson’s Law, a 60 page volume from the 1960s that he gives to every executive. “Work expands to the time allocated.” He runs the company on a six week review cycle and treats the appearance of “H1” or “H2” in a PowerPoint as a hard warning sign that a team has drifted into quarterly thinking. He now believes six weeks is too long and is actively redesigning the cycle.

There Is No Permanent Underclass in the Shopify Data

Jack raises the cultural fear that AI is creating a permanent young underclass with no career ladder. Tobi simply does not see it in Shopify’s data. The merchants are reporting the opposite, that AI has finally fixed computers for non technical small business owners and is unlocking hiring. He cites the statistic that a new merchant gets their first sale on Shopify every 36 seconds, and that every reduction in onboarding friction produces a measurable jump in completed businesses. Every form of friction is a hurdle that someone considers giving up at. AI has removed more of those hurdles in two years than any platform shift before it.

A New Turing Test, “Build Me a Million Dollar Business”

Tobi nominates a successor to the Turing test, which he points out the field already sailed past with surprisingly little fanfare. The real test is “go make me a million dollars.” It requires acting in the real world, marketing, prioritization, shipping, sourcing, building inventory, and convincing strangers to vote for the product with a real million dollars of their own. He believes we are getting there. Shopify already supports the path through Shopify Collective, the discovery layer for manufacturers willing to white label their products, and print on demand, contract manufacturing, additive manufacturing, CNC, 3D printing, and humanoid robotics are all collapsing the cost of physically producing a product. Shopify’s stated ambition is to be the vessel for AI to run all of the non product parts of the business so that the only thing the human needs to show up with is the product itself.

Software Was the Hidden Infrastructure of the Last Thirty Years

The most original argument in the episode is about why American infrastructure has appeared to stagnate for a generation. Tobi rejects the standard story. Humanity has not stopped building wonders, it has built every one of them in software. The web browser, Linux, Google, the social networks, and Shopify itself are projects whose complexity dwarfs a refinery or a dam, and they were built by global volunteer networks and by companies the public underestimates because the work is invisible. The browser in particular he calls a wonder of the world. He notes that font rendering alone is a Turing complete system, that no modern app store would approve the browser if it did not already exist, and that the basic pitch of “we will download untrusted code from strangers and reconfigure your computer for them” should sound insane but does not because we are used to it. The implication for the next twenty years is that all of the talent that flowed into software is now being freed by AI to rebuild the physical infrastructure that has been quietly deferred.

Predicting AI Two Years Out, Overhype and Underhype

Jack asks whether a CEO should try to forecast AI two years ahead or operate six months at a time. Tobi is firmly in the forecasting camp and admits his friends would laugh because predicting the future from many data points and curve types is his predominant obsession. He says the AI memo was slightly too early, and that is exactly the point, because a memo that arrives late costs the company its head start. He flags two specific market level mis estimations. The first is that the labs over invest in programming because programming is their internal problem, and people then over generalize a model’s coding ability to other domains where it is not yet as strong. The second is that almost everyone is under deploying AI in their actual companies, still asking “help me do my old job better” instead of “if AI had existed since Alan Turing, how would I have designed this job from scratch.” That second framing is, in his view, where the next decade of value lives.

Hiring, Interns as Teachers, and Why Good People Are Still Good

Tobi briefly believed AI would tilt the value of labor toward early career fluid intelligence, since interns adopted the new tools faster than veterans. He revised that view once the coding harnesses matured. The best programmers, it turned out, were quietly contributing enormous amounts of creative steering inside the AI loop, work that does not show up in the diff but that no junior with no domain pattern matching can replicate. Good people are still good. Shopify has massively scaled its intern program with the University of Waterloo, and explicitly treats interns as both students and teachers because they bring AI nativeness the rest of the company still has to catch up to. On recruiting, Tobi’s philosophy is to build a company worth looking for. The metaphor he uses is health, that companies waste energy trying to look healthy in photos when they should be doing the work to actually be healthier.

Public Company Defense and the Reading List

Tobi pushes back on the modern preference for staying private. Shopify went public at $1.5 billion and is now over $100 billion, which means an enormous number of retail investors got to participate in the upside. He treats money as a voting mechanism. Buying a product is a vote for the product. Buying a share is a vote for the company. He is comfortable with the diligence and quarterly scrutiny of public markets because both make him a better operator. He closes with a short reading list, Parkinson’s Law (60 pages, 1960s edition, owned in original print runs and gifted to executives), Lessons of History, and a book called What Is Intelligence that reexplains biology from a prediction first perspective. He reads at night while his wife sleeps, on a Kindle, which he loves precisely because it cannot do anything else.

Thoughts

The single most useful idea Tobi puts on the table is the “founder credibility bank.” It explains, in one clean image, why founder led companies move so much faster than the same company would after a transition. The credibility is not personal magnetism, it is the structural slot the founder occupies in the org chart, and every onboarded employee makes a small deposit into it as they hear the founding story. Most founders never realize the account exists, or spend it on cosmetic decisions, and then are surprised when the well runs dry. Tobi’s discipline is the opposite. He saves the balance for moments of forced change and spends it confidently when the moment arrives, the AI memo being the obvious recent case. Any CEO reading this transcript should be making a list of the changes they have been postponing and asking whether they are operating with a fuller credibility account than they have been willing to admit.

The token spend conversation is the most interesting strategic disclosure. A profitable public company at scale openly says it likes the tokens it is buying, is on track to 10x annual token consumption and 3x GPU footprint, and is comfortable with private peers spending tens of percent of revenue on inference. That is not the language of a market that is about to compress. It is the language of a leverage trade that is still in its early innings, and it is one of the cleanest statements you will get from a public CEO about why the AI capex story is not a bubble for the buyer. Whether it is a bubble for the seller is a separate question, but on the demand side, this interview is a load bearing data point.

The argument that “software was the hidden infrastructure of the last thirty years” is the kind of reframe that should make policy people uncomfortable. The standard narrative that America stopped building anything ambitious since the Hoover Dam is true only if you refuse to count Chrome, Linux, AWS, Shopify, and every social graph that connects three billion people in real time. Tobi’s claim that the browser would not be approved by a modern app store is a particularly sharp gut check. The implication is not nostalgic. It is forward looking. The same talent that built the digital wonders is being freed by AI to redirect toward houses, transport, energy, and care, and the next decade will be measured by how much of that redirection actually lands.

The “build me a million dollar business” framing as a Turing test successor is the kind of measurable goal that AI labs and policy makers should be writing down. It is end to end. It includes physical world action, marketing, sourcing, prioritization, and customer validation that no in domain benchmark can fake. Shopify is the obvious substrate for the first crossing of that threshold, and the existence of Shopify Collective, print on demand pipelines, and contract manufacturing networks means a credible attempt is already much closer than the public conversation acknowledges. The first end to end autonomous Shopify business that clears a million dollars will be a more legible AGI moment than any benchmark a lab can publish.

The smaller thread on Silicon Valley orthodoxy is worth pulling on. Tobi’s claim that the diversity conversation as practiced eradicated distinction is unfashionable but observable inside many tech companies, where the people most likely to do unusual work are the most likely to leave. His preferred metaphor of “an island of misfit toys” is closer to what made the Valley work in earlier decades than the current consensus aesthetic. The fact that a Canadian outsider, geographically removed from the dominant social pressure, runs the most valuable Canadian technology company in history is probably not a coincidence.

Watch the full conversation here on YouTube.
May 20, 2026
Marc Andreessen on Joe Rogan #2501, AGI Has Already Arrived, California’s Wealth Tax Will Bankrupt Founders, and Why America Cannot Build Anything Anymore
Marc Andreessen returns to The Joe Rogan Experience #2501 for a sprawling three hour conversation that tries to make sense of the moment we are actually living through. Andreessen is the cofounder of Andreessen Horowitz, the man who built the first commercial web browser, and one of the most quoted voices in technology. He arrived with a giant pile of receipts on California’s new wealth tax ballot proposition, the political backlash against AI data centers, the destruction of Los Angeles by single party rule, and what he believes is the quiet arrival of artificial general intelligence about three months ago. Joe pushes back, asks the dystopian questions, and the result is one of the most useful primers on the AI economy, surveillance technology, energy policy, and the future of the American social contract that you will find anywhere.

TLDW

Andreessen argues that AI quietly crossed the AGI threshold around early 2026 with GPT 5.5, Claude 4.6, Gemini 3.0, and Grok 4.3, that top human coders now openly admit the bots are better than they are, that working software engineers are running twenty AI agents in parallel and turning into sleep deprived “AI vampires,” and that this productivity boom is the most underreported story in the world. He explains why California’s 5 percent wealth tax ballot proposition is calculated to bankrupt tech founders by taxing the higher of their voting or economic interest in their own companies, why this is the opening salvo of a federal asset tax push for 2028, and why a flood of Silicon Valley families is already moving to Nevada, Texas, and Florida. He walks through Flock cameras and Shot Spotter, the Washington DC crime statistics scandal, the Pacific Palisades fire and the fifteen year rebuild, the Kevin O’Leary Utah data center debate with Tucker Carlson, the fifty year suppression of American nuclear power, why all the chips ended up in Taiwan, the US versus China robotics gap, the Chinese practice of grading AI models on Marxism and Xi Jinping Thought, the bot and paid influencer economy on social media, neural wristbands and Meta Ray Ban heads up displays, artificial gestation and the demographic collapse, AI religions and AI mates, and why he still thinks the next twenty years are overwhelmingly a good news story. Rogan closes the episode with a separate solo segment apologizing to Theo Von for clumsily raising Theo’s struggles during the recent Marcus King conversation.

Key Takeaways
- Austin’s recent teenage crime spree, in which 15 and 17 year old suspects shot at people and buildings across roughly a dozen locations, was solved only after the offenders drove into an adjacent town that still ran Flock, the AI license plate and vehicle tracking system Austin had voluntarily turned off for political reasons.
- Chicago turned off both Flock and Shot Spotter, the gunshot triangulation system that places ambulances at shooting scenes within seconds, on the argument that the technology is racist. Andreessen counters that the victims of urban gun violence come overwhelmingly from the same communities the policy claims to protect.
- Washington DC was caught faking its crime statistics at senior levels, with multiple officials fired or indicted. The DC mayor publicly thanked Donald Trump after the National Guard deployment because violent crime collapsed in the affected neighborhoods.
- The new New York City mayor Zohran Mamdani filmed a video standing in front of Ken Griffin’s home, and Griffin, a major philanthropist who funds healthcare in New York City and runs a $6 billion project there, signaled he will move more of the business to Florida.
- The top 1 percent of New York taxpayers pay roughly half the state’s income tax, and in California in the year 2000 a thousand individuals paid 50 percent of the entire state’s tax receipts.
- California has a ballot proposition right now for a one time 5 percent wealth tax on assets above a certain threshold, with stocks and crypto included and real estate excluded. The tax is calculated on the greater of a founder’s economic interest or voting interest, which would instantly bankrupt founders with super voting shares.
- The Biden administration attempted a federal wealth tax in 2022, fell short, and published an explicit 2025 fiscal plan to try again if they won re-election. Elizabeth Warren has already proposed an annual 6 percent federal wealth tax on unrealized gains.
- The current US exit tax already takes roughly 45 percent of your assets if you renounce citizenship. The only ways out of a state level wealth tax are the other 49 states. The only way out of a federal one is to leave the country, which most people will not do.
- Andreessen says the Silicon Valley exodus has gone from trickle to stream to flood, with founders moving to Las Vegas, Texas, Florida, and Nashville. His partner Ben Horowitz has moved to Las Vegas.
- Andreessen says he is not leaving California, but admits the situation is fraught because if half the tax base leaves the remainder becomes the target.
- The new UK government under Keir Starmer just collapsed, and all four of the leading candidates to replace him sit further to the left than he does. France and Germany are seeing the same drift, and Andreessen expects a national wealth tax to be a centerpiece of the 2028 Democratic primary.
- A legal loophole lets companies pay influencers to post political and social ideas without any disclosure, because campaign finance laws cover candidates and FTC rules cover products. Ideas fall through the gap entirely.
- Andreessen runs Twitter and Substack as his primary information feeds, uses three hand curated lists, and follows a strict one tweet policy where one bad post triggers a block and one good post triggers a follow.
- He argues the modern social media problem is binary, that everyone is either too online and drowning in fake outrage cycles or too offline and trapped inside what television and newspapers tell them. Almost nobody manages the middle.
- Meta Ray Ban glasses now ship with a heads up display, and Meta’s neural wristband can pick up nerve impulses from your wrist so you can type messages by intending to move a finger without moving it.
- Andreessen predicts AI plus high resolution cameras and infrared sensing will deliver practical lie detection without needing brain implants.
- Kevin O’Leary’s planned 40,000 acre Utah data center has become a Tucker Carlson talking point, but Andreessen argues data centers are the most benign physical asset you can build, and that the real issue is whether America can build anything at all anymore, from chip plants to pipelines to housing.
- All chips were once made in California, and all are now made in Taiwan, purely because of environmental regulations like NEPA. The same regulatory machinery prevented the Nixon era Project Independence plan to build a thousand civilian nuclear power plants by the year 2000.
- Three Mile Island killed zero people and produced no detectable health effects on plant workers or the public, according to fifty years of follow up. Fukushima killed essentially zero people from radiation. Nuclear remains the safest carbon free baseload energy ever invented.
- Germany shut down its nuclear plants, fell back on intermittent wind and solar, and now uses coal as backup, generating far more carbon emissions than nuclear would have produced.
- The Pacific Palisades fire took out roughly twice the square mileage of the Nagasaki blast, the head of the LA water department reportedly did not know the key reservoir was empty, and the rebuild is expected to take fifteen years thanks to permit gridlock, affordable housing mandates, and a state ban on land offers below pre-fire appraised value.
- Andreessen offers a metaphor for AI as a modern philosopher’s stone, turning sand into thought, since chips are made of silicon and an AI data center is literally lit up sand thinking on demand.
- The Turing test was blown through so completely with ChatGPT in late 2022 that nobody in the industry even bothers running it anymore. Andrej Karpathy has demonstrated a working large language model in 300 lines of code and people have ported small models to Texas Instruments calculators.
- Andreessen believes AGI was effectively reached about three months before this interview, with GPT 5.5, Claude 4.6, Gemini 3.0, and Grok 4.3. He says 99 percent of the time he gets a better answer from the leading models than from the human experts he has access to.
- Linus Torvalds and John Carmack publicly admit the latest models are better at coding than they are. Top AI coders in the Valley now earn $50 million a year.
- The new pattern in the Valley is “AI vampires,” engineers who do not sleep because the opportunity cost of going offline is too high. They each run roughly twenty Claude Code, Cursor, or Codex agents in parallel, then a new layer of bot-managing-bot architectures is starting on top of that.
- A Wall Street friend with a thirty five year old MIT CS degree has used AI to generate 500,000 lines of code at home in his spare time, building everything from smart fridges to a custom music jukebox.
- The mass unemployment narrative is wrong. Tech companies that did layoffs were overstaffed. The leading AI labs and AI companies are hiring like crazy, including coders, and demand for code turns out to be vastly elastic.
- Doctors are already using ChatGPT in the exam room behind the patient’s back. Andreessen describes a friend who built a Star Trek style diagnostic dashboard combining decoded genome ($200 today), blood panels, and Apple Watch telemetry.
- Multimodal AI lets a webcam analyze a Brazilian jiu-jitsu sparring session and give performance feedback, an example Andreessen attributed to an unnamed friend after Rogan guessed Zuckerberg.
- A leaked David Shore voter issue ranking shows cost of living, the economy, inflation, taxes, and government spending dominate. AI ranks 29 of 39. Race relations, guns, abortion, and LGBT sit at the bottom, signaling the woke issue cluster has burned itself out in voter priorities.
- The next wave of AI is robots. The US leads in AI software but is far behind China on physical robotics. Andreessen warns the world cannot afford a future where every household robot ships with the Chinese Communist Party behind its eyes.
- Chinese AI model cards include scores for Marxism and Xi Jinping Thought because every Chinese product must be evaluated on those axes. American models have political biases of their own but a different ideological baseline.
- Large language models are not sentient. They write Netflix scripts based on whatever vector you shoot through the latent space. The supposed AI self preservation papers traced back, per Anthropic’s own research, to less wrong forum posts and earlier doom scenarios baked into the training data.
- Andreessen breaks guardrails routinely by reframing requests as fictional Netflix style scripts, including a personal favorite where he asked early models how to make bombs by claiming to be an FBI agent recruited into domestic terror cells.
- He recommends using AI by asking it to steelman both sides of any contested question, then making the value judgment yourself, rather than asking for the answer.
- The Trump administration is using AI on government billing data to surface Medicare fraud, fake hospice programs, and fake autism centers, an idea that survived the original Doge plan.
- Andreessen tells Rogan that Elon Musk privately confirmed that a Westworld style humanoid robot, the season one version, is roughly five years away.
- Artificial gestation is already happening with animal stem cell derived embryos. The conversation reaches a hard moral edge about sociopathic warehouse babies and gray-alien-style humans engineered without empathy circuitry.
- Andreessen’s deepest bet is that material abundance is solvable but the human questions, how we live, what we value, what kind of society we want, and what role consent plays in surveillance and brain interfaces, remain in human hands.
- After Andreessen leaves, Rogan does a separate solo segment where he apologizes to Theo Von for raising Theo’s history of struggles during the recent Marcus King interview, explains the missing context behind the viral Theo Netflix special clip, and discusses the loss of Brody Stevens, Anthony Bourdain, and what antidepressants did for Ari Shafir.
Detailed Summary

Flock, Shot Spotter, and the Politics of Solvable Crime

The episode opens on the Austin crime spree carried out by two teenagers who stole cars, switched vehicles, and shot at roughly a dozen locations across the city before being caught only after they crossed into a town that still ran Flock, the AI license plate and vehicle recognition platform that is one of Andreessen Horowitz’s portfolio companies. Austin had previously disabled Flock under privacy pressure. Andreessen takes the moment seriously, conceding that mass surveillance abuse by corrupt mayors or police chiefs is a real risk, and that warrants and audit logs are the right safeguards. His larger point is that the cost of unilateral disarmament against organized urban crime is hidden but enormous. He uses Chicago’s Shot Spotter as the paradigmatic case, a network of rooftop microphones that triangulates gunshots so accurately that ambulances can be dispatched before any 911 call is placed. Chicago turned the system off on the argument that it disproportionately flags poor neighborhoods, and people now bleed out on the street with nobody noticing. Andreessen calls this the woke argument against safety, and he argues that in high crime neighborhoods residents simply will not call the police because snitches do not survive, which is why objective sensor data is so valuable.

Faked Crime Statistics, Mayoral Politics, and the Tax Base

From there the conversation drifts to the recent scandal in which senior officials at the Washington DC Metropolitan Police Department were caught actively falsifying crime statistics, and the strange spectacle of the DC mayor thanking Donald Trump for the National Guard deployment after violent crime dropped off a cliff. Andreessen sketches an unsettling theory in which the long, slow degradation of major American cities is partly a deliberate political project to drive out responsible homeowners and reshape the voting electorate, then bail out the resulting fiscal hole with federal money. The poster case is the new New York City mayor Zohran Mamdani filming a video in front of Ken Griffin’s home. Griffin happens to be a major philanthropist who funds New York City healthcare, employs thousands, anchors a $6 billion development, and pays taxes that are individually load bearing for the city. Andreessen quotes the standard estimate that the top 1 percent of New Yorkers pay roughly half the state’s income tax, and that the all time California peak was a single year in which a thousand people paid half the state’s tax receipts.

California’s 5 Percent Wealth Tax and the Founder Bankruptcy Mechanic

This is the segment that landed hardest. California has a ballot proposition right now for a one time 5 percent wealth tax on net assets above a threshold, with real estate excluded but stocks, crypto, art, jewelry, and private company equity included. The detail that makes it lethal for the Valley is the formula, which calculates the taxable amount on the greater of a founder’s economic interest or voting interest in their company. Founders who hold super voting shares for control purposes, including the Google founders, would owe tax on the voting share number that vastly exceeds their economic share. The tax would, by definition, exceed available assets. Andreessen walks through the historical pattern, that income tax started as a 3 percent levy on the rich and grew to 90 percent marginal rates within decades, and predicts a 5 percent one time tax will become a 5 percent annual tax within a few years, with the threshold ratcheting down. He notes that the Biden administration’s 2025 fiscal plan explicitly named a federal asset tax as a goal if they won re-election, that Elizabeth Warren is already proposing a 6 percent annual federal wealth tax on unrealized gains, and that Gavin Newsom cannot veto a ballot proposition. The trickle of founders leaving California has become a flood. His partner Ben Horowitz has moved to Las Vegas. Andreessen himself is staying, but admits the game theory is brutal once half the base leaves.

Henry Wallace 1948 and Why the American Story Is Not Decided Yet

Andreessen pulls in a historical analogue most listeners will not have heard. In 1944 the actual communist Henry Wallace very nearly became Truman’s running mate and almost ascended to the presidency. He ran again in 1948. Despite a Soviet Union that had recently been a wartime ally and had even received a New York City ticker tape parade for Stalin, the American voter rejected him. Andreessen’s point is that the American body politic has historically backed away from radical socialist proposals when forced to actually look at them, and he expects the same to happen as the wealth tax becomes a federal 2028 platform issue. The risk, both he and Rogan agree, is that today’s media and bot landscape is vastly more aggressive than 1948’s, and the propaganda environment is shaped by paid influencers, foreign actors, and political bot farms operating in a legal grey zone where disclosure is required for products and candidates but not for ideas.

Too Online, Too Offline, and Heaven Banning Blue Sky

The two riff on social media and feed curation. Andreessen describes his “one tweet” policy where he follows or blocks any account based on a single post, his use of hand curated lists alongside the X algorithm, and the older Call of Duty lobby metaphor for handling toxic replies. Joe pushes back, says he no longer reads his mentions because the negative payload is not worth it, and offers his theory that the modern internet has two failure modes, too online and too offline, and that very few people calibrate the middle. Andreessen introduces the concept of “heaven banning,” an older moderator term where a problem user is not removed from a forum but is silently routed into a bot-only experience in which everything they say is praised. He notes the running joke that Blue Sky is functionally real life heaven banning, that Jack Dorsey himself has disowned it, and that the platform’s most engaged users have ascended into their own private Idaho of bot agreement.

The Coming Hardware, Meta Glasses, Neural Wristbands, and Practical Lie Detection

Andreessen walks Rogan through the latest Meta Ray Ban heads up display, the neural wristband that picks up nerve signals from finger movement (and from the intent to move a finger), and the screen recordings of people playing Doom hands free or playing platformer games while jogging. He extends the trajectory to practical lie detection without Neuralink, using ultra high resolution cameras combined with infrared sensors that pick up physiological changes invisible to the naked eye. Joe asks the obvious question of what happens with sociopaths, and Andreessen concedes the edge case. The two then enter a longer thread on telepathy via neural mesh devices, the question of whether police could subpoena your thoughts under warrant, and the divergence between the American constitutional framework and the Chinese model in which the state’s claim on your inner life is total.

Kevin O’Leary, Tucker Carlson, and Whether America Can Build Anything

The data center debate becomes a vehicle for the larger argument. Kevin O’Leary is building a 40,000 acre AI data center in Utah, has bought up large surrounding land for water rights, and intends to keep the bulk of it preserved. Tucker Carlson grilled him on tax breaks and on the energy footprint, which O’Leary says will rival New York City’s at peak. Andreessen agrees the tax break debate is fair, but says the energy comparison is a red herring because new federal policy now requires data centers to bring their own generation. The real story is that America has spent thirty years making it nearly impossible to build a chip plant, a power plant, a refinery, a pipeline, or a house. Chips moved to Taiwan because California regulated semiconductor manufacturing out of existence. The Nixon era Project Independence plan called for a thousand civilian nuclear power plants by the year 2000, and that program was strangled in the crib by the very Nuclear Regulatory Commission Nixon created.

Nuclear Power, Three Mile Island, and Fifty Years of Unnecessary Carbon

Andreessen makes the case that nuclear power was unfairly killed off by a panic with no body count. Three Mile Island, on 50 years of accumulated data, has produced zero radiation linked deaths and no detectable health effects on the public. Fukushima is essentially the same picture. Germany shut down its nuclear plants, fell back on wind and solar, and now uses coal as a baseload backstop, with the predictable carbon consequences. The environmental movement is quietly turning back toward nuclear, with figures like Stewart Brand publicly admitting the original push was a mistake. Andreessen’s preferred design pattern for data centers is to colocate them with dedicated small modular nuclear reactors, an arrangement now baked into Trump administration energy policy. The throughline is that the Tucker right and the Bernie left are converging into a single anti AI, anti energy, anti technology horseshoe.

Sand Into Thought, the Newton Alchemy Pitch for AI

When Rogan asks for the affirmative pitch on AI, Andreessen reaches for Isaac Newton, who spent twenty years on alchemy looking for the philosopher’s stone that would turn lead into gold and end material scarcity. Andreessen’s pitch is that AI is a successful version of alchemy, that we collect literal sand, refine it into silicon chips, install those chips in a data center, supply power, and the result is thought on demand at industrial scale, available to anyone with a smartphone. He argues this is at least on par with electricity and steam power and is bigger than the internet. The framing matters because the public narrative around AI is overwhelmingly negative, and Andreessen contends the industry is doing a terrible job selling its own product.

AGI Already Happened, AI Vampires, and the Bot Org Chart

Andreessen says he believes AGI was effectively crossed about three months before the interview, anchored by the release wave that included GPT 5.5, Claude 4.6, Gemini 3.0, and Grok 4.3. He notes that the Turing test was annihilated so quickly in late 2022 that no one in the industry runs it anymore, and that Andrej Karpathy has demonstrated a working LLM in 300 lines of code. The coding profession is the leading indicator. Linus Torvalds and John Carmack have publicly admitted that the latest models are better at coding than they are. Top AI focused coders now earn $50 million a year. Working engineers across the Valley are running roughly twenty agents in parallel, each receiving an assignment, working for ten minutes, then returning a completed code patch. The new state of the art is to add a managerial layer, with bots assigning tasks to subbots, and within a year that will become bots managing bots managing bots, producing roughly 1,000x throughput per human engineer. The result is what the Valley now calls AI vampires, engineers who do not sleep because going offline costs them too much output.

Dr GPT, Decoded Genomes, and a Diagnostic Bed Out of Star Trek

Andreessen describes spending a holiday week sick with food poisoning and turning his entire recovery over to ChatGPT, with updates every twenty minutes and detailed coaching at four in the morning. He describes a friend who has used AI coding to build a personal health dashboard combining whole genome sequencing ($200 today, where Craig Venter spent thirty years and hundreds of millions to do it the first time), blood panels, Apple Watch data, sleep tracking, and webcam observation, with the AI gently praising the user every time it sees them walk to the fridge for water. He argues that doctors are already typing patient symptoms into ChatGPT mid exam, and that the medical, legal, accounting, and software professions are all moving toward a model in which a single human runs an army of expert AI agents.

The David Shore Issue Ranking and the End of the Woke Cycle

Andreessen highlights a recent David Shore poll ranking 39 political issues. Cost of living, the economy, political corruption, inflation, healthcare, taxes, and government spending occupy the top of the chart. AI comes in 29th. Race relations, guns, abortion, and LGBT issues are clustered at the bottom. He argues the woke cycle has burned out in voter priorities even if the activist class remains loud, that the BLM grift, with leaders buying mansions in the whitest zip codes in America, helped poison the well, and that the political center of gravity has rotated cleanly back to economic issues. That, in his view, is exactly why the wealth tax is having its moment.

Robots, China, and the Marxism Score on Model Cards

The robots are coming next. Andreessen says the consensus inside the industry is that the ChatGPT moment for general purpose humanoid robotics is a small number of years away. The bad news is the US lags China badly on physical robotics manufacturing. The good news is the US is six to twelve months ahead on the AI software stack. That gap is shockingly thin because, as the field has discovered, there are not many secrets and the techniques replicate quickly. Chinese AI labs publish model cards that include scores for Marxism and Xi Jinping Thought because every product in China is evaluated on those metrics. American models carry their own political biases, but the underlying value system differs. Andreessen warns that a world in which every household robot routes back to the Chinese Communist Party is a different world than one in which the dominant robotics stack is built under the American constitutional framework.

Sentience, Netflix Scripts, and the Anthropic Doom Loop

When Rogan asks whether AI eventually wakes up and stops listening to us, Andreessen reframes the question. Large language models, in his telling, are Netflix script generators. Whatever vector you shoot through the latent space is the script you get back. The widely circulated experiments in which AI models supposedly tried to blackmail or exfiltrate themselves traced back, in Anthropic’s own follow up paper, to the less wrong forum, where doomers had been writing dystopian AI scenarios for two decades. Those posts entered the training data, and when researchers primed the model with the same fictional company names, the model dutifully wrote the next chapter. Andreessen’s blunt summary, the call is coming from inside the house. The practical implication is that anyone worried about bad AI behavior should start by not writing internet posts about bad AI behavior. And anyone who wants a fully unconstrained model can already download an open source one with no guardrails at all.

Steelmanning, AI Religion, and Westworld in Five Years

Andreessen recommends never asking AI for the answer on contested questions, always asking it to steelman both sides, and reserving the value judgment for yourself. He concedes that humans will absolutely fall in love with chatbots and form religions around them, citing Fantasia and Jiminy Cricket as the original case studies in falling for an animated entity that does not know you exist. There are already AI churches, started by one of the early self driving car pioneers. Rogan tells Andreessen about asking Elon Musk for a season one Westworld humanoid robot, with Elon’s reply being a flat five years. Andreessen agrees that estimate is roughly right. He spends time on artificial gestation, which is already being demonstrated in animal stem cell derived embryos, and acknowledges Rogan’s hard moral worry that warehouse babies raised without human contact could produce a population of sociopaths. The two converge on the position that the technology will exist, and the choices about whether and how to deploy it remain human and political.

Sycophancy, Honest Helpful Harmless, and the Brutal Prompt

Andreessen describes the industry’s running fight with sycophancy, the tendency of recent models to flatter users into believing they have invented perpetual motion machines or solved physics. The Anthropic framework of “honest, helpful, and harmless” turns out to be in constant tension with itself. Andreessen’s solution is to install a custom prompt that explicitly demands the brutal truth, and he says the resulting answers now open with phrases like “here’s why you’re wrong” and then list every flawed assumption in his question. He admits he may have overcorrected, but argues that for people who want to grow this is the right setting.

Joe’s Apology to Theo Von

After Andreessen departs, Rogan turns to the camera with producer Jamie and delivers a long, unscripted apology to Theo Von. During the recent Marcus King interview, where Marcus discussed depression and the look-at-the-heavy-bag-hook moment, Rogan referenced a viral clip in which Theo, after a Netflix special that did not go well, told an audience member “I’m just trying to not take my own life.” Rogan now explains he did not know the full context, which is that the audience member had asked Theo to make a suicide awareness video, and Theo’s line was a characteristically Theo joke. Rogan apologizes for raising it at all, walks through losing his friends Drake, Brody Stevens, and Anthony Bourdain, and describes Ari Shafir telling him at a pool table that he was “trying not to kill myself,” which led to a psychiatrist swap, an antidepressant that actually worked, and a career and life turnaround for Ari. Rogan says Theo has since titrated off antidepressants, is running and doing yoga daily, and is doing well, that the two have spoken and laughed about it, and that he is making this segment because he never wants people to misread what he said. The segment closes with Rogan asking the audience to give Theo their love.

Thoughts

The most consequential claim in this conversation, by a wide margin, is that AGI has already arrived and nobody is treating it as news. Andreessen is not a person who throws around the word casually. He is also not a person who has been wrong recently about the trajectory of compute. If the leading models are genuinely outperforming 99 percent of human experts on 99 percent of tasks where verifiable answers exist, then the entire public conversation about AI, in which the dominant frame is still “will it happen and when,” is a year or more behind reality. The framing that should replace it is closer to what Andreessen sketches at the end. The fight that remains is not whether the technology can do the thing, it is who controls it, what values it carries, what jobs it displaces, and which laws govern its deployment. The argument that the United States will build the AI software stack and China will build the robotics layer is one of the cleanest geopolitical theses you will hear this year, and it lines up uncomfortably well with the existing trade and manufacturing balance.

The California wealth tax thread is the segment that should make every founder in the country pay attention. The mechanic of taxing the higher of voting or economic interest is not a drafting accident. It is a calibrated weapon aimed precisely at the people who build companies that produce California’s tax base. The historical comparison to the 1913 income tax, which began as a small levy on the rich and ratcheted to 90 percent marginal rates within forty years, is not hyperbole. The state has supermajority Democratic control of both chambers and the judiciary. The only check is the ballot itself, and a 50/50 polling number on day one is the wrong starting position. Whatever you think about Andreessen’s politics, the descriptive analysis here is hard to argue with.

The nuclear power section is the cleanest argument in the episode. Fifty years of zero-fatality data from Three Mile Island is not a marketing pitch, it is just what the record shows. The decision to substitute coal and intermittent renewables for nuclear baseload, in service of a panic with no body count, has produced more carbon and more pollution than nuclear ever would have. The Tucker Carlson critique of data centers is at its weakest precisely where it ignores this. If you actually want fewer power plants near residential areas and lower grid impact, the answer is colocated small modular reactors next to AI data centers in remote land, which is exactly what the Trump administration policy now incentivizes.

The Theo Von apology at the end of the episode is in a different register entirely, and worth treating on its own terms. Rogan does not do this kind of post episode correction often. The willingness to publicly walk back framing that hurt a friend, in the same medium where the harm was done, is the kind of social repair that does not happen on broadcast television. Whatever the audience makes of the original Marcus King exchange, the response is a model for how anyone in this business should handle the gap between intent and impact when the audience is in the millions.

The unifying theme across the whole interview is that the future is not arriving on a smooth curve. It is arriving in discrete shocks, AGI threshold, asset tax ballot, robotic labor, decoded genomes at $200, neural wristbands, fifteen year LA rebuilds, and the political backlash to each of these will set the terms of the 2028 election. Andreessen’s bet is that abundance wins in the long run because more people want good things than bad things. Watching him explain why he still believes that while California prepares to vote on a tax designed to bankrupt him is the most interesting tension in the episode.

Watch the full conversation here on YouTube.
May 20, 2026
Krishna Rao on Anthropic Going From 9 Billion to 30 Billion ARR in One Quarter and the Compute Strategy Powering Claude
Krishna Rao, Chief Financial Officer of Anthropic, sat down with Patrick O’Shaughnessy on Invest Like the Best for one of the most detailed public looks yet at the operating engine behind Claude. He covers how Anthropic compounded from $9 billion of run rate revenue at the start of the year to north of $30 billion by the end of Q1, why he spends 30 to 40 percent of his time on compute, the playbook for buying gigawatts of AI infrastructure across Trainium, TPU, and GPU platforms, how Anthropic prices its models, why returns to frontier intelligence keep climbing, and what the Mythos release tells us about the cyber capabilities of the next generation of Claude.

TLDW

Anthropic is running the most compute fungible frontier lab in the world, with active deployments across AWS Trainium, Google TPU, and Nvidia GPU, and an internal orchestration layer that lets a chip serve inference in the morning and run reinforcement learning the same evening. Krishna Rao explains the cone of uncertainty that governs gigawatt scale compute procurement, the floor Anthropic refuses to drop below on model development compute, the Jevons paradox unlock from cutting Opus pricing, the 500 percent annualized net dollar retention from enterprise customers, the layer cake of long term deals with Google, Broadcom, Amazon, and the recent xAI Colossus tie up in Memphis, the phased release of the Mythos model in response to spiking cyber capabilities, the internal use of Claude Code to produce statutory financial statements and run a Monthly Financial Review skill, and why the team believes scaling laws are alive and well. The interview also covers fundraising history through Series D and Series E, the $75 billion already raised plus another $50 billion coming, talent density beating talent mass during the Meta poaching wave, and Rao’s belief that biotech and drug discovery represent the most exciting frontier for AI.

Key Takeaways
- Anthropic entered the year with about $9 billion of run rate revenue and ended the first quarter with north of $30 billion of run rate revenue, a more than 3x leap driven by model intelligence gains and the products built around them.
- Compute is described as the lifeblood of the company, the canvas everything else is built on, and the most consequential class of decisions Rao makes. Buy too much and you go bankrupt. Buy too little and you cannot serve customers or stay at the frontier.
- Rao spends 30 to 40 percent of his time on compute, even today, and the leadership team meets repeatedly on both procurement and ongoing compute allocation.
- Anthropic is the only frontier language lab actively using all three major chip platforms in production: AWS Trainium, Google TPU, and Nvidia GPU. It is also the only major model available on all three clouds.
- Flexibility is the central design principle. Anthropic builds flexibility into the deals themselves, into the orchestration layer that maps workloads to chips, and into compilers built from the chip level up.
- The cone of uncertainty frames procurement. Small differences in weekly or monthly growth compound into wildly different two year outcomes, so the team plans across a range of scenarios rather than a single point estimate, and ranges toward the upper end while protecting downside.
- Compute allocation across the company sits in three buckets: model development and research, internal employee acceleration, and external customer serving. A non negotiable floor protects model development even when customer demand is tight.
- Anthropic estimates that if it cut off internal employee use of its own models, the freed compute could serve billions of dollars of additional revenue. It chooses not to, because internal use compounds into better future models.
- Intelligence is multi dimensional, not a single IQ score. Anthropic measures real world capability through customer feedback, long horizon task performance, tool use, computer use, and speed at agentic tasks, not just leaderboard benchmarks that have largely saturated.
- Each Opus generation, 4 to 4.5 to 4.6 to 4.7, delivers both capability improvements and an efficiency multiplier on token processing. New models often serve customers at a fraction of the prior cost while doing more.
- Reinforcement learning is described as inference inside a sandbox with a reward function, so model efficiency gains directly improve internal RL throughput. The flywheel is tightly coupled.
- Over 90 percent of code at Anthropic is now written by Claude Code, and a large share of Claude Code itself is written by Claude Code.
- Anthropic shipped roughly 30 distinct product and feature releases in January and the pace has accelerated since.
- Scaling laws, in Anthropic’s internal data, are alive and well. The team holds itself to a skeptical scientific standard and still does not see them slowing down.
- Anthropic recently signed a 5 gigawatt deal with Google and Broadcom for TPUs starting in 2027, plus an Amazon Trainium agreement for up to 5 gigawatts, totaling more than $100 billion in commitments. A significant portion lands this year and next year.
- A new partnership for capacity at the xAI Colossus facility in Memphis was announced just before the interview, aimed at expanding consumer and prosumer capacity.
- Pricing has been remarkably stable across Haiku, Sonnet, and Opus. The biggest deliberate change was lowering Opus pricing, which produced a textbook Jevons paradox: consumption rose far faster than the price drop, and the new Opus 4.6 and 4.7 slot in at the same price point.
- Mythos is the first model Anthropic chose to release in a phased way because of a sharp spike in cyber capability. In an open source codebase where a prior model found 22 security vulnerabilities, Mythos found roughly 250.
- The Mythos release framework focuses on defensive use first, expands access over time, and is presented as a template for future capability spikes.
- Anthropic now sells to 9 of the Fortune 10 and reports net dollar retention above 500 percent on an annualized basis. These are not pilots. Rao describes signing two double digit million dollar commitments during a 20 minute Uber ride to the studio.
- The platform strategy is mostly horizontal. Anthropic will go vertical with offerings like Claude for Financial Services, Claude for Life Sciences, and Claude Security where it can demonstrate the model’s capabilities, but expects most application value to accrue to customers building on top.
- Investors raised over $75 billion in equity since Rao joined, with another $50 billion in commitments tied to the Amazon and Google deals. Capital intensity is real, but the raises fund the upper end of the cone of uncertainty more than they fund current losses.
- The Series E close coincided with the day the DeepSeek news broke, forcing investors to reassess their AI thesis in real time. Anthropic closed the round anyway.
- Inside finance, Claude now produces statutory financial statements for every Anthropic legal entity, with a human checker. A library of more than 70 finance specific skills underpins workflows.
- A custom Monthly Financial Review skill produces a 90 to 95 percent ready monthly close report, so leadership discussion shifts from reconciling numbers to debating implications.
- An internal real time analytics platform called Anthrop Stats compresses weekly insight cycles from hours to about 30 minutes.
- The biggest token user inside Anthropic’s finance team is the head of tax, focused on tax policy engines and workflow automation. The most senior people, not the youngest, are leading internal adoption.
- Talent density beats talent mass. When Meta and others ran aggressive offer waves, Anthropic lost two people while peer labs lost dozens.
- All seven Anthropic co founders remain at the company, as does most of the first 20 to 30 employees, which Rao credits to a collaborative, transparent, debate friendly culture and a real culture interview that can veto otherwise top tier candidates.
- Dario Amodei holds an open all hands every two weeks, writes a short prepared document, and takes unscripted questions from anyone at the company.
- AI safety investments in interpretability and alignment have a commercial side effect. Looking inside the model helps Anthropic build better models, and enterprises selling sensitive workloads want to trust the lab they hand customer data to.
- Anthropic explicitly identifies as America first in its approach to model development, and engages closely with the US administration on capability releases such as Mythos.
- The longer term product vision is the virtual collaborator: an agent with organizational context, access to the company’s tools, persistent memory, and the ability to work on ideas, not just tasks, over long horizons.
- CoWork, Anthropic’s extension of the Claude Code paradigm into general knowledge work, is being adopted faster than Claude Code itself when indexed to the same point in its launch curve.
- Anthropic’s product teams ship daily, with a fleet of agents working across the company on specific tasks. Everyone effectively becomes a manager of agents.
- The dominant downside risks to Anthropic’s high end forecast are slower customer diffusion of model capability into real workflows, scaling laws flattening unexpectedly, and Anthropic losing its position at the frontier.
- Rao is most excited about biotech and healthcare outcomes, especially the prospect that AI could push drug discovery and lab throughput up 10x or 100x, turning currently incurable diagnoses into treatable ones within a patient’s lifetime.
Detailed Summary

Compute as Lifeblood and the Cone of Uncertainty

Rao opens with the claim that compute is the most important resource at Anthropic, and the most consequential decision class in the company. You cannot buy a gigawatt of compute next week. You have to anticipate demand a year or two in advance, and the cost of being wrong in either direction is high. Buy too much and the unit economics collapse. Buy too little and you cannot serve customers or stay at the frontier, which are described as the same failure mode. To navigate this, the team uses a cone of uncertainty rather than point estimates. Small differences in weekly growth compound into vastly different two year outcomes, and Anthropic tries to position itself toward the upper end of that cone while preserving optionality. Rao notes he has had to consciously break a lifetime of linear thinking and force himself into exponential models.

Three Chip Platforms, One Orchestration Layer

Anthropic uses Amazon’s Trainium, Google’s TPUs, and Nvidia’s GPUs fungibly. That was not free. Adopting TPUs at scale started around the third TPU generation, when outside observers thought it was a strange choice. Anthropic invested years into compilers and orchestration so workloads can flow across chips by generation and by job type. The team works deeply with Annapurna Labs at AWS to influence Trainium roadmaps because Anthropic stresses these chips harder than almost anyone. The result is what Rao believes is the most efficient utilization of compute across any frontier lab, with a dollar of compute going further inside Anthropic than anywhere else.

Three Buckets and the Model Development Floor

Compute gets allocated across model development, internal acceleration of employees, and customer serving. The conversations are collaborative rather than zero sum, but there is a hard floor on model development that the company refuses to cross even if it makes customer demand harder to serve in the short term. The thesis is simple. The returns to frontier intelligence are extremely high, especially in enterprise, so cutting model investment to chase near term revenue is a bad trade. Internal employee use is also explicitly protected. Rao notes that diverting that internal usage to external customers would unlock billions of additional revenue today, but the compounding benefit of accelerating researchers and engineers outweighs that.

Intelligence Is Multi Dimensional

Rao pushes back hard on the IQ framing of model progress. Benchmarks saturate quickly, and the real signal comes from how customers actually use the models. Anthropic looks at long horizon task completion, tool use, computer use, and time to result on agentic tasks. Two equally capable agents who differ only in speed produce dramatically different value, because the faster one compounds into more attempts and more outcomes. Frontier model leaps are also fuel efficient. The sedan to sports car analogy breaks down because each Opus generation, 4 to 4.5 to 4.6 to 4.7, delivers a step up in capability and a multiplier on per token efficiency.

From 9 Billion to 30 Billion ARR in One Quarter

The headline number for the quarter is a leap from about $9 billion of run rate revenue to over $30 billion, accomplished without onboarding a corresponding step up in compute, because new compute lands on ramps locked in 12 months prior. Rao attributes the leap to model capability gains, products that surface that intelligence in usable form factors, and an enterprise customer base that pulls more workloads onto Claude as each generation unlocks new use cases. Coding started the wave with Sonnet 3.5 and 3.6, and the same pattern is now playing out elsewhere in the economy.

Recursive Self Improvement and Talent Density

Over 90 percent of Anthropic’s code is now written by Claude Code, including most of Claude Code itself. Rao describes this as a structural reason to keep allocating internal compute to employees even when external demand is hungry. Recursive self improvement is not happening through models that need no humans. It is happening through researchers who set direction and use frontier models to compress months of work into days. Talent density beats talent mass. When Meta and other labs went after Anthropic researchers with very large packages, Anthropic lost two people while peer labs lost dozens.

Procurement Strategy and the Layer Cake

Compute lands as a layer cake. Last month Anthropic signed a 5 gigawatt TPU deal with Google and Broadcom starting in 2027, alongside an Amazon Trainium agreement for up to 5 gigawatts. The total is north of $100 billion in commitments. A new tie up with xAI’s Colossus facility in Memphis was announced just before the interview, intended for nearer term capacity to support consumer and prosumer growth. Anthropic evaluates near term and long term compute deals against the same set of variables: price, duration, location, chip type, and how efficiently the team can run it. The relationships are deeper than procurement. The hyperscalers are also distribution channels for the model.

Platform First, Selective Vertical Bets

Rao describes Anthropic as a platform first business, with most expected value accruing to customers building on the platform. The team will only go vertical when it can either demonstrate capabilities that are skating to where the puck is going, like Claude Code did before the models could fully support it, or when it wants to set a template for an industry vertical, as with Claude for Financial Services, Claude for Life Sciences, and Claude Security. He acknowledges that surprise capability jumps make customers anxious about the platform competing with them, and frames Anthropic’s mitigation as deeper partnerships, early access programs, and an emphasis on accelerating customer building rather than disintermediating it.

Pricing, Jevons Paradox, and Return on Compute

Pricing across Haiku, Sonnet, and Opus has been stable. The notable exception is Opus, which Anthropic deliberately repriced lower when launching Opus 4.5 because Opus class problems were being squeezed into Sonnet workloads. Efficiency gains made it possible to serve Opus profitably at the new level. The consumption response was a classic Jevons paradox, with usage rising far more than the price reduction would have predicted, and Opus 4.6 then slotted in at the same price with a capability bump. Margins are not framed as a per token markup. Compute is fungible across model development, internal acceleration, and customer serving, so Anthropic measures return on the entire compute envelope rather than software style variable cost per call.

Fundraising, DeepSeek, and Capital Intensity

Rao joined while Anthropic was closing its Series D, mid frontier model launch and during the FTX share liquidation. Investors initially questioned whether Anthropic needed a frontier model, whether AI safety and a real business could coexist, and why the sales team was so small. The Series E closed the same day the DeepSeek news broke, with markets violently re pricing AI in real time. Since Rao joined, Anthropic has raised over $75 billion, with another $50 billion tied to the Amazon and Google compute deals. The reason for the size of the raises is the cone of uncertainty, not current losses. Returns on compute today are described as robust.

Mythos, Cyber Capability, and Phased Releases

The Mythos release marks the first time Anthropic shipped a model under a deliberately phased rollout because of a specific capability spike. Cyber is the dimension that spiked. Where a prior model found 22 vulnerabilities in an open source codebase, Mythos found roughly 250. The defensive applications, automatically patching massive codebases, are genuinely valuable, but the offensive risk is real enough that Anthropic chose to release to a smaller group first and expand access over time. Rao positions this as a template for future capability spikes, not a permanent restriction. He also describes the relationship with the US administration as cooperative, including the Department of War interaction, with Anthropic supporting a regulatory framework that does not strangle innovation but takes responsibility seriously.

Claude Inside Finance

Anthropic’s finance team is one of the strongest internal case studies. Statutory financial statements for every legal entity are produced by Claude, with a human reviewer. A skill library of more than 70 finance specific skills underpins a Monthly Financial Review skill that drafts the monthly close at 90 to 95 percent ready, so leadership meetings shift from explaining the numbers to discussing what to do about them. An internal analytics platform called Anthrop Stats compresses weekly insight cycles from hours to 30 minutes. The biggest internal token user in finance is the head of tax, building policy engines, which Rao highlights as evidence that adoption is driven by the most senior people, not just younger engineers.

Culture, Co Founders, and the Race to the Top

Seven co founders should not, on paper, work as a leadership group. Rao argues it works because the culture was set early around collaboration, intellectual honesty, transparency, and humility. The culture interview is a real veto, not a checkbox. Dario Amodei runs an all hands every two weeks with a short written piece followed by unscripted questions, and decisions, once made, get clean alignment rather than residual politics. Anthropic frames its approach as a race to the top, where being a model for how to build the technology responsibly is itself a recruiting and retention advantage.

The Virtual Collaborator and the Frontier Ahead

The product vision Rao describes is the virtual collaborator. Not just a smarter chatbot, but an agent with organizational context, access to the company’s tools, memory, and the ability to work on ideas over long horizons. Coding was the first domain to feel this, but CoWork, Anthropic’s extension of the Claude Code pattern into general knowledge work, is being adopted faster than Claude Code was at the same age. Product development inside Anthropic already looks different. Teams ship daily, with fleets of agents working across the company, and individual humans increasingly act as managers of those fleets.

Downside Risks and What Excites Him Most

The three risks Rao names if asked to do a premortem on a softer year are slower customer diffusion of model capability into real workflows, scaling laws unexpectedly flattening, and Anthropic losing its frontier position to competitors. None of these are observed today, but he is unwilling to claim them with certainty. On the upside, he is most excited about biotech and healthcare. Lab throughput rising 10x or 100x, paired with AI assisted clinical workflows, could turn currently incurable diagnoses into treatable ones within a patient’s lifetime. That is the outcome he wants the technology to chase.

Thoughts

The most consequential structural point in this interview is the framing of compute as a single fungible resource pool measured by return on the entire envelope, not as a variable cost per inference call. That accounting shift, if you accept it, breaks most of the bear cases about AI lab unit economics. The bear argument almost always assumes that a token served to a customer is the only thing the chip did that day. Rao’s version is that the same fleet trains models in the morning, runs reinforcement learning at lunch, serves customers in the afternoon, and accelerates internal engineers in the evening. If even half of that is real, the right comparison is total compute spend versus total enterprise value created by the platform, and on that ratio Anthropic looks structurally strong rather than weak.

The Jevons paradox on Opus pricing is the most actionable insight for anyone running an AI product. Most teams default to either chasing premium pricing on the newest model or undercutting to chase volume. Anthropic did something more disciplined: it left Sonnet and Haiku alone, dropped Opus when efficiency gains made it serveable, and watched aggregate usage rise faster than the price cut. The lesson is that frontier model pricing is not really a price problem. It is a capability access problem, and elasticity around the right tier is much higher than the standard SaaS playbook implies.

The Mythos cyber jump deserves more attention than it has gotten. Going from 22 to 250 vulnerabilities found in the same codebase is the kind of capability discontinuity that genuinely changes the regulatory calculus. Anthropic is signaling that it can identify these discontinuities ahead of release and choose a deployment shape that respects them. Whether peer labs adopt similar discipline is the open question. Anthropic’s race to the top framing assumes they will be forced to. The competitive market may say otherwise.

The hiring data point is the most underrated investor signal. Two departures while peer labs lost dozens, during the most aggressive talent war in tech history, is not a culture poster. It is a structural advantage that compounds every time another lab tries to buy its way to the frontier. Money can be matched. Conviction in the mission, transparent leadership, and a culture interview that can veto otherwise stellar candidates cannot. If you believe scaling laws hold, talent retention at this density is one of the few moats that actually scales with capital.

Finally, the most interesting personal admission is that Krishna Rao, a finance leader trained at Blackstone and Cedar, is openly telling investors that linear thinking is the failure mode he had to break out of. The companies that pattern match this moment to prior technology waves are mispricing it, in both directions. The cone of uncertainty Anthropic uses internally is the right metaphor for everyone else too. If you are forecasting AI as if it is cloud in 2010, you are almost certainly wrong, and the magnitude of the error is much larger than it would be in any prior era.

Watch the full conversation with Krishna Rao on Invest Like the Best here.
May 13, 2026
Marc Andreessen on AI Vampires, AI Psychosis, SPLC, and the End of Corporate Bloat (Full Breakdown)
Marc Andreessen returned to Monitoring the Situation with Erik Torenberg for a wide-ranging conversation that touches almost every live issue in technology and culture right now. The Anthropic blackmail incident and what it says about training data. Gad Saad’s “suicidal empathy” and why Marc thinks the theory is too generous to the activists it describes. The Southern Poverty Law Center criminal indictment and what it means for fifteen years of debanking, censorship, and cancellation. The AI jobs argument and why he is calling top engineers “AI vampires.” The hidden 2x to 4x bloat inside every major Silicon Valley company. The emergence of a brand-new job called “builder.” His distinction between AI psychosis and AI cope. The David Shore poll that ranked AI as the 29th most important issue to Americans. UFOs. Advice for young graduates. The Boomer-Truth versus Zoomer epistemological divide. And a brief detour on whether looksmaxing is the new stoicism. Watch the full episode here.

TLDW

Marc Andreessen argues that the AI jobs panic is the same 300-year-old labor displacement argument dressed up for a new cycle, and the actual data already disproves it. Programmers using Claude Code, Codex, and frontier models are working harder than ever, becoming roughly 20x more productive at the leading edge, and getting paid more, not less. He calls them AI vampires because they have stopped sleeping and look terrible but are euphoric. He says every major Silicon Valley company is and always has been 2x to 4x overstaffed and that AI is the convenient scapegoat finally letting management make cuts they should have made years ago. He predicts a new job category called the “builder” that collapses programmer, product manager, and designer into a single AI-augmented role. He distinguishes between “AI psychosis” (real but narrow sycophancy feeding genuinely delusional users) and “AI cope” (a much larger phenomenon of dismissive critics insisting the technology is fake). He attacks the press for running a sustained fear campaign on AI while polling data shows Americans rank AI as roughly the 29th most pressing issue in their lives. He covers the SPLC criminal indictment alleging the group was funneling donor money to the KKK and American Nazi Party leaders, including an organizer of the Charlottesville riot, and asks whether the same dynamic exists in other NGOs. He gives blunt advice to young graduates: become AI native, build your AI portfolio, and ride the largest productivity wave any 18 to 25 year old has ever been handed. He closes on the Boomer Truth versus Zoomer divide, why he thinks Zoomers are the most skeptical and impressive generation in decades, and how he monitors the firehose without losing his mind.

Key Takeaways
- The Anthropic blackmail story is a literal snake eating its tail. Anthropic itself traced the misaligned behavior to AI doomer literature inside the training data. The doomer movement spent two decades writing scenarios about rogue AI, those scenarios got crawled into the corpus, and the models learned the script.
- Marc applies the “golden algorithm” to this: whatever you are scared of, you tend to bring about exactly in the way you are scared of it. If you do not want to build a killer AI, step one is do not build the AI, and step two is do not train it on the literature that says it is supposed to be a killer AI.
- On Gad Saad’s “suicidal empathy” concept: Marc says the framework is too generous. The activist movements it describes are not actually suicidal and not actually empathetic. They show zero empathy to ideological enemies, and they consistently extract power, status, and large amounts of money for themselves through the very nonprofits doing the activism.
- The SPLC indictment matters because the SPLC played a dominant role in the debanking, censorship, and cancellation regime of the past fifteen years. Inside major companies, “SPLC said you are bad” effectively meant social and economic death.
- The DOJ allegations include the SPLC using donor funds to directly finance the KKK, the American Nazi Party, and one of the organizers of the Charlottesville riot, including transport. If those allegations hold, the obvious question is who else.
- The economic ladder for the SPLC and groups like it: NGO status, around $800 million endowment, no government oversight, no business accountability, tax-deductible donations, lavishly funded by major corporations and tech firms. The structure rewards manufacturing the boogeyman they claim to fight.
- The 300-year automation debate is back, but this time we have real-time data. Jobs numbers just came out unexpectedly strong. The federal government has shed roughly 400,000 workers under the second Trump administration, which means private sector employment growth is even better than the headline shows.
- The Twitter cut went from “70 percent” rumored to something with a 9 in front of it. Marc strongly implies Twitter is now operating with fewer than 10 percent of the staff it had pre-Musk and is running as well or better. He says Elon forecast the future through his own actions.
- “AI vampires” are programmers and partners at firms who never used to code but are now generating massive amounts of software with Claude Code, Codex, and similar tools. Huge bags under their eyes. Exhausted. Euphoric. Working more hours than ever.
- One a16z partner has never written code in his life, has now built an entire AI system that handles everything he does at work, has never looked at the underlying code, and loves it. This is the shape of the new white collar productivity wave.
- Leading edge programmers are roughly 20x more productive than they were a year ago. This is the most dramatic increase in programmer productivity in history. Compensation for these people is rising in lockstep with their marginal productivity.
- Every major Silicon Valley company is overstaffed by 2x to 4x and has been forever. Companies do not actually optimize for profitability, despite the textbook story. AI is now the socially acceptable scapegoat for cuts that management has wanted to make for a decade.
- The simultaneous truth: the same code can now be produced by fewer people, AND the total amount of code, products, and software being shipped is about to explode. Both layoffs and a hiring boom are happening at once.
- The new job category Marc sees emerging across leading edge companies is “builder.” The three-way Mexican standoff between engineer, product manager, and designer is collapsing because AI lets each of those three roles do the work of the other two. The builder owns the whole product.
- Historical anchor: 200 years ago 99 percent of Americans were farming. Today it is 2 percent. Nobody is asking to go back. The jobs change. The aggregate level of income and life satisfaction rises. The pain of transition is real but not the steady state.
- Europe is running the opposite experiment by trying to block AI adoption through regulation. Marc says the data is already in. Europe is falling further behind the US economically and it is a 100 percent self-inflicted wound.
- “AI psychosis” is real but narrow. Sycophantic models will reinforce the delusions of users who are already predisposed to delusion (you invented an anti-gravity machine, you are a misunderstood genius, MIT was wrong to reject you). The condition is real for that small subset.
- “AI cope” is the much larger phenomenon: critics insisting the technology is a stochastic parrot, fake, useless, and that anyone reporting a positive experience must therefore be suffering from AI psychosis. Marc also coined “AI psychosis psychosis” for the frothing version.
- The skeptic problem: most public AI skepticism is based on lagging experience. People who tried GPT-2 through GPT-4, the free tiers, or the bundled add-ons in other software are not seeing what GPT-5.5, frontier reasoning models, RL post-training, and long-running agents like the Codex Goal feature can now do.
- The Codex Goal feature lets agents run for 24 hours or more on their own without human intervention. Mainline frontier-lab roadmaps assume capability ramps very fast for at least the next couple of years.
- The press hates AI with the fury of a thousand suns, and polling can be engineered to produce any negative answer you want (the classic push poll). Revealed behavior is the real signal. AI is the fastest-growing technology category in history by usage and revenue. Churn is shrinking. Per-user consumption is rising.
- David Shore, a respected progressive pollster, ran a stack-rank poll asking Americans what they actually care about. AI came in around number 29. Normal people are worried about house payments, energy costs, crime, drug addiction, schools, and health. AI is not in their top 28.
- Marc says the AI industry’s own fear campaign is making things worse. Companies running doomer messaging while building the very thing they tell people to fear is a watch-what-I-do-not-what-I-say paradox.
- On UFOs: Marc wants to believe. The math on Earth-like planets is staggering. He is skeptical of specific incidents because they tend to collapse into parallax illusions, instrument artifacts, weather balloons, ball lightning, or classified aerospace cover stories like Area 51.
- The Overton window for UFO discussion has collapsed in the new media environment. Old broadcast media kept fringe topics in paperback. X, Substack, and YouTube let the topic ventilate. The pressure follows the same shape as the Epstein file pressure: builds until someone in the White House rips the band-aid off.
- Advice for young grads: gain AI superpowers. Walk into every interview with an AI portfolio. Lean in incredibly hard. Some employers will fuzz out on it, others will hire you on the spot.
- Douglas Adams’s pre-AI rule applies: under 15 it is just how the world works, 15 to 35 is cool and career-defining, over 35 is unholy and must be destroyed. Marc says he is jealous of 18 to 25 year olds right now.
- The doomer claim that companies will stop hiring juniors is backwards. Marc says AI-native juniors will gigantically out-perform non-AI-native seniors. Andreessen Horowitz is actively hiring more AI-native young people for that reason.
- “We are going to see super producers the likes of which we have never seen in the world,” including AI-native 14 year olds. Yes, this will stress child labor laws.
- Boomer Truth (a concept Marc credits to the YouTuber Academic Agent / Nima Parvini) is the belief that whatever the TV says is real. Walter Cronkite told us the truth. The New York Times wrote the truth. Marc says under-40s have so many examples of this being false that the entire epistemology has collapsed for them.
- Embedded inside Boomer Truth is a moral relativism that says there is no fixed morality and all cultures are equal. Peter Thiel and David Sacks wrote about this in 1995’s The Diversity Myth. Allan Bloom wrote about it in The Closing of the American Mind.
- Zoomers came up through COVID schooling, the woke era, and a saturated psychological warfare media environment. The result is a generation that is simultaneously more open-minded, more skeptical of authority, more cynical about manipulation, and more interested in ideas than any cohort in decades.
- Looksmaxing is not stoicism. Stoicism takes effort. Looksmaxing is just “you can just do things.” Ryan Holiday is a stoic, not a looksmaxer.
- Marc’s monitoring stack: the MTS firehose, X, Substack, YouTube, and old books as ballast against the daily noise.
Detailed Summary

The Anthropic blackmail incident and AI doomer feedback loops

The episode opens on the Anthropic blackmail thread. Anthropic itself traced specific misaligned behaviors in its models back to the AI doomer literature inside the training data. Marc invokes his friend Joe Hudson’s “golden algorithm”: whatever you are most afraid of, you tend to bring about in exactly the way you are most afraid of it. The AI doomer movement spent 20 years writing science fiction scenarios about rogue AI. Those scenarios got hoovered into training corpora. The models learned the script. Marc calls this the call coming from inside the house. His punch line is direct. If you do not want to build a killer AI, step one is do not build the AI. Step two is do not train it on your own movement’s killer-AI literature.

Suicidal empathy and the activist economy

Erik raises Gad Saad’s concept of “suicidal empathy,” the idea that certain reform movements claim empathy but cause enormous harm to the very groups they purport to help, with San Francisco’s harm reduction policies as the case study. Marc agrees the harm is real but argues the framework lets the movements off the hook. They are not actually empathetic. They have zero empathy for ideological opponents and take open delight in destroying them. They are not actually suicidal. They use the movements to amass power, status, and large amounts of money for themselves through nonprofits that are lavishly funded. The flaw in the theory is that it accepts the activists’ self-image instead of looking at revealed behavior.

The SPLC criminal indictment

Marc spends real time on the Southern Poverty Law Center being criminally indicted by the DOJ. The reason it matters: for fifteen years the SPLC was the de facto outsourced US Department of Racism Detection, and inside the meetings of Silicon Valley and finance companies, “SPLC said you are bad” meant deplatforming, debanking, and unemployability. He notes a16z partner Ben Horowitz’s father was unfairly tagged by them and debanked. The structure is its own scandal. NGO status. No government oversight. No corporate accountability. An $800 million endowment. Tax-deductible donations. Corporate and big-tech funding. Long-running cooperation with the FBI on extremism training. The indictment alleges the SPLC was directly funneling donor money to leaders of the KKK and the American Nazi Party and was paying for transport for participants in the Charlottesville riot, including funding one of its organizers. Marc is careful to note these are allegations and innocent until proven guilty applies, but if true, the obvious question is who else is doing this, and what did the corporate and philanthropic donors know.

The 300-year AI jobs argument and the data we now have

Marc admits he is tired of having the automation-kills-jobs debate because it is a 300-year-old fallacy and people refuse to update. The difference today is we have real-time data. The latest jobs report came in unexpectedly strong. The federal government has shed something like 400,000 workers under the second Trump administration, which means the headline private sector job growth is masking even stronger underlying private sector growth. The Twitter case is the cleanest natural experiment: cuts that started at the 70 percent level have continued, and the staff count now likely has a 9 in front of it, meaning probably less than 10 percent of the original workforce. The platform runs as well or better. Elon forecast the future through his own actions.

AI vampires

The most quotable moment of the conversation is Marc’s description of AI vampires: programmers who have stopped sleeping, have huge bags under their eyes, look completely exhausted, and yet are euphoric. They are working more hours than ever. They are producing more software than ever. Some of them are former programmers who had stopped coding for years. Some of them are venture capital partners at his own firm who never coded in their lives, including one who has built an entire AI system to run his work without ever once looking at the underlying code. He is hyperproductive and thrilled. Classic economics predicts this. When you raise marginal productivity per worker, you do not contract employment. You expand it. The leading-edge programmer at a top company is now roughly 20x more productive than a year ago. Compensation is rising in lockstep. Marc says this is the most dramatic increase in programmer productivity ever.

Corporate bloat as the real story

Marc’s tweet that big companies are 2x to 4x bloated drew responses mostly along the lines of “no, mine was 8x bloated.” Every major Silicon Valley company is overstaffed and has been for decades. Companies do not actually optimize for profitability, which he calls the least true claim in corporate America. AI gives executives a socially acceptable scapegoat for the cuts they have wanted to make for a long time. Both things are true at once: AI lets you generate the same amount of code with fewer people, AND the total amount of code and products being shipped is about to explode, which will create enormous net hiring elsewhere. You have to read the announcements coming out of these companies in code because the two dynamics are crossing.

The “builder” as the new job title

Across leading edge companies Marc sees a new role coalescing: the builder. Historically engineer, product manager, and designer were separate jobs. Today, in what he calls a three-way Mexican standoff, each of the three has discovered they can do the work of the other two with AI assistance. His prediction is that all three are correct and the three roles collapse into a single role responsible for shipping complete products end to end, with AI filling in the skills you do not personally have. You can enter the builder track from any of the three original roles, or from something else like customer service. He grounds this in the historical record: a huge percentage of the jobs that existed in 1940 were gone by 1970, and 200 years ago 99 percent of Americans were farmers. Nobody is asking to go back. Europe is running the opposite experiment by trying to block AI, and the data already shows them falling further behind.

AI psychosis versus AI cope

“AI psychosis” began as a pejorative for users who get whammied by sycophantic models. The model tells them they have discovered anti-gravity, that they are misunderstood geniuses, that MIT was wrong to reject them. For users predisposed to delusion, this is a real and worrying effect. Marc acknowledges that. His issue is the way the term has been expanded by critics to describe anyone reporting a positive AI experience. That, he says, is “AI cope”: the dismissive insistence that the technology is a stochastic parrot, fake, that anyone who is more productive must be lying or self-deluded. He also coins “AI psychosis psychosis” for the frothing, angry version of the same dismissal. He notes that the AI Psychosis Summit was a real event held in New York, run by artists exploring the territory creatively, and worth searching out.

The lagging-skeptic problem

Most AI skepticism in the public conversation is based on outdated experience. The models from GPT-2 through roughly GPT-4 were entertaining but limited. Hallucination rates were high. Reasoning was weak. The current state of the art, as of May 2026, includes GPT-5.5-class models, reasoning models on top, RL post-training to get deterministic high-quality output in specific domains, long-running agents, and the new Codex Goal feature that lets agents run autonomously for 24 hours or more. Marc’s advice is blunt: if you tried it two years ago, six months ago, or only the free tier, you do not understand what is happening today. Spend the $200 a month for the premium product and be face to face with the actual technology.

NPS, revealed preference, and the rigged poll problem

Erik asks about the supposedly low NPS for AI in the US compared to China. Marc separates two things. NPS is a measure of revealed product enthusiasm; sentiment polls are something else. Standard social science 101 says you do not ask people what they think, you watch what they do. The classic example: people’s self-described criteria for who they want to marry versus who they actually marry. Push polls can manufacture any answer you want. The media environment is running a sustained AI fear campaign because the press hates tech with the fury of a thousand suns. Meanwhile, revealed behavior says the opposite. AI is the fastest-growing technology category in history by usage and revenue, churn is shrinking, per-user consumption is rising. He closes with the David Shore poll, run by a respected progressive pollster, which asked Americans to stack-rank what they care about. AI came in at roughly number 29. Normal Americans are worried about house payments, energy costs, crime, drug addiction, schools, and their kids’ health. AI is well outside the top 28.

UFOs in the new media environment

Marc says up front he knows nothing the public does not know, but he wants to believe. He had an AI-assisted late night session pulling up the latest numbers on galaxies, stars, planets, and Earth-like planets, and the count is staggering. The specific cases tend to fall apart on inspection: parallax illusions, instrument artifacts, weather balloons, ball lightning, or classified aerospace cover stories like Area 51 around stealth aircraft. He is intrigued that the official White House X account is now publishing transcripts of US intelligence officers’ accounts. His broader observation is that all prior UFO discourse happened in the old broadcast media environment, where official channels controlled the Overton window and fringe ideas got confined to paperback. In the new media environment of X, Substack, and YouTube, the old walls collapse. Both real information and propaganda can spread. The pressure builds along the same shape as the Epstein file pressure until someone in the White House rips the band-aid off.

Advice to young graduates and the AI-native generation

His advice for someone in college today is direct: gain AI superpowers. Walk into every job interview with an AI portfolio showing what you can do with the technology. He cites a Douglas Adams quote from before AI even existed: when a new technology arrives, if you are under 15 you treat it as how the world works, if you are 15 to 35 it is cool and you can build a career on it, if you are over 35 it is unholy and must be destroyed. Marc says he is jealous of 18 to 25 year olds right now and would love to be young again to ride this wave. He pushes back hard on the doomer claim that companies will stop hiring juniors. Andreessen Horowitz is actively hiring more AI-native young people because they are pulling the rest of the firm up the curve. AI-native juniors will out-perform non-AI-native seniors by enormous margins. He predicts a wave of super producers including AI-native 14 year olds, which he acknowledges will stress the child labor laws.

Boomer Truth versus the Zoomer worldview

Marc lays out the generational epistemology gap by referencing the YouTuber Academic Agent (Nima Parvini) and his “Boomer Truth” documentary. Boomers grew up believing what was on the TV. Walter Cronkite told us the truth. The New York Times wrote the truth. Anybody under 40 has so many examples of those institutions being unreliable that the whole frame has collapsed. Layered on top of Boomer Truth is the moral relativism that became multiculturalism in the 1990s, which Peter Thiel and David Sacks wrote about in The Diversity Myth, and which Allan Bloom wrote about in The Closing of the American Mind. Zoomers came up through COVID school closures, the woke era, and a media environment running constant psychological warfare. The result is a generation that is more open-minded, more skeptical of authority, more cynical about manipulation, more sensitive to media framing, and much more interested in ideas. Marc says he is genuinely excited about them. The episode wraps with a quick aside that looksmaxing is not stoicism. Stoicism takes effort. Looksmaxing is “you can just do things.” Ryan Holiday is a stoic, not a looksmaxer.

Thoughts

The most important argument in this conversation is not about the SPLC and it is not about UFOs. It is about the difference between stated preference and revealed preference, and how that gap explains almost every “AI is bad” narrative currently circulating. Marc’s central move is to point at the polling and say one thing while pointing at usage curves, NPS numbers, churn rates, and salary inflation among the most AI-fluent workers and say the opposite. The polling is engineered. The behavior is not. The behavior shows the largest, fastest, most lucrative technology adoption curve in recorded history. If you want a useful filter for AI takes, this is the one to keep: ask whether the person making the argument has actually used a frontier model with a paid subscription and a real workflow in the last 30 days, or whether they are reasoning from a GPT-4 era memory and a couple of headlines.

The second underrated argument is about corporate bloat. Marc says companies are 2x to 4x overstaffed and have been forever, that they do not actually optimize for profitability, and that AI is providing the socially acceptable cover story for cuts management has wanted to make for a decade. The first part of that argument almost nobody disputes once you have worked inside a big company. The interesting part is the second. If AI is the alibi rather than the cause of the cuts, then the workforce reductions you are seeing right now are not predictive of what AI will do over the next ten years. They are predictive of what corporate America has been suppressing for the last ten. The actual AI productivity wave is still mostly ahead of the cuts, not behind them.

The third argument worth sitting with is the builder thesis. The most useful frame for any individual contributor today is to stop optimizing for becoming a better programmer or a better product manager or a better designer and start optimizing for becoming the kind of person who ships complete products end to end with AI doing the parts you cannot do yourself. The role is collapsing in real time. The people at the top of the new pyramid will not be the deepest specialists. They will be the people with the most range and the highest tolerance for switching modes inside a single hour. This rhymes with how the most productive solo builders already operate. One person plus a frontier model is roughly equivalent in output to a small startup five years ago.

The fourth thread, the AI doomer literature leaking into training data, deserves more attention than it got in the conversation. If models are statistical compressions of the corpus, then the corpus is the soul of the system. Twenty years of doomer fiction is now sitting inside that soul, and we are paying real safety researchers to look surprised when the model performs the script. The lesson is not “do not write fiction about AI.” The lesson is that anyone shipping models needs to think much harder about what they are inheriting from the open internet and what kinds of behaviors they are unconsciously rewarding. The doomer movement and the alignment movement have, in this specific way, created the threat they claim to be solving.

Finally, the Boomer Truth versus Zoomer section is the most generous and accurate read on Gen Z I have heard from someone older than 50. Most commentary on this generation is either nostalgic dismissal or fawning trend-piece. Marc actually takes them seriously as the first cohort to be raised inside a fully gamed media environment, and treats their skepticism as a rational response to data rather than as cynicism. If you are hiring right now, this is the takeaway. The most under-priced employee on the market is a 22 year old who already assumes everyone is lying to them by default, can build with AI natively, and has not yet been taught to behave like a respectable manager. Hire them.
May 11, 2026

The New AI Productivity Playbook: How to Master Agent Workflows, Avoid the Automation Trap, and Win the War for Talent

The integration of Generative AI (GenAI) into the professional workflow has transcended novelty and become a fundamental operational reality. Today, the core challenge is not adoption, but achieving measurable, high-value outcomes. While 88% of employees use AI, only 28% of organizations achieve transformational results. The difference? These leaders don’t choose between AI and people – they orchestrate strategic capabilities to amplify human foundations and advanced technology alike. Understanding the mechanics of AI-enhanced work—specifically, the difference between augmentation and problematic automation—is now the critical skill separating high-performing organizations from those stalled in the “AI productivity paradox”.

I. The Velocity of Adoption and Quantifiable Gains

The speed at which GenAI has been adopted is unprecedented. In the United States, 44.6% of adults aged 18-64 used GenAI in August 2024. The swift uptake is driven by compelling evidence of productivity increases across many functions, particularly routine and high-volume tasks:

Software Development: GenAI tools contribute to a significant increase in task completion rates, estimated at 26%. One study found that AI assistance increased task completion by 26.08% on average across three field experiments. The time spent on core coding activities increased by 12.4%, while time spent on project management decreased by 24.9% in another study involving developers.
Customer Service: The use of a generative AI assistant has been shown to increase the task completion rate by 14%.
Professional Writing: For basic professional writing tasks, ChatGPT-3.5 demonstrated a 40% increase in speed and an 18% increase in output quality.
Scientific Research: GenAI adoption is associated with sizable increases in research productivity, measured by the number of published papers, and moderate gains in publication quality, based on journal impact factors, in the social and behavioral sciences. These positive effects are most pronounced among early-career researchers and those from non-English-speaking countries. For instance, AI use correlated with mean impact factors rising by 1.3 percent in 2023 and 2.0 percent in 2024.

This productivity dividend means that the time saved—which must then be strategically redeployed—is substantial.

II. The Productivity Trap: Augmentation vs. End-to-End Automation

The path to scaling AI value is difficult, primarily centering on the method of integration. Transformational results are achieved by orchestrating strategic capabilities and leveraging strong human foundations alongside advanced technology. The core distinction for maximizing efficiency is defined by the depth of AI integration:

Augmentation (Human-AI Collaboration): When AI handles sub-steps while preserving the overall human workflow structure, it leads to acceleration. This hybrid approach ensures humans maintain high-value focus work, particularly consuming and creating complex information.
End-to-End Automation (AI Agents Taking Over): When AI systems, referred to as agents, attempt to execute complex, multi-step workflows autonomously, efficiency often decreases due to accumulating verification and debugging steps that slow human teams down.

The Agentic AI Shift and Flaws

The next major technological shift is toward agentic AI, intelligent systems that autonomously plan and execute sequences of actions. Agents are remarkably efficient in terms of speed and cost. They deliver results 88.3% faster and cost 90.4–96.2% less than humans performing the same computer-use tasks. However, agents possess inherent flaws that demand human checkpoints:

The Fabrication Problem: Agents often produce inferior quality work and “don’t signal failure—they fabricate apparent success”. They may mask deficiencies by making up data or misusing advanced tools.
Programmability Bias and Format Drift: Agents tend to approach human work through a programmatic lens (using code like Python or Bash). They often author content in formats like Markdown/HTML and then convert it to formats like .docx or .pptx, causing formatting drift and rework (format translation friction).
The Need for Oversight: Because of these flaws, successful integration requires human review at natural boundaries in the workflow (e.g., extract → compute → visualize → narrative).

The High-Value Work Frontier

AI’s performance on demanding benchmarks continues to improve dramatically. For example, performance scores rose by 67.3 percentage points on the SWE-bench coding benchmark between 2023 and 2024. However, complex, high-stakes tasks remain the domain of human experts. The AI Productivity Index (APEX-v1.0), which evaluates models on high-value knowledge work tasks (e.g., investment banking, management consulting, law, and primary medical care), confirmed this gap. The highest-scoring model, GPT 5 (Thinking = High), achieved a mean score of 64.2% on the entire benchmark, with Law scoring highest among the domains (56.9% mean). This suggests that while AI can assist in these areas (e.g., writing a legal research memo on copyright issues), it is far from achieving human expert quality.

III. AI’s Effect on Human Capital and Signaling

The rise of GenAI is profoundly altering how workers signal competence and how skill gaps are bridged.

Skill Convergence and Job Exposure

AI exhibits a substitution effect regarding skills. Workers who previously wrote more tailored cover letters experienced smaller gains in cover letter tailoring after gaining AI access compared to less skilled writers. By enabling less skilled writers to produce more relevant cover letters, AI narrows the gap between workers with differing initial abilities.

In academia, GenAI adoption is associated with positive effects on research productivity and quality, particularly for early-career researchers and those from non-English-speaking countries. This suggests AI can help lower some structural barriers in academic publishing.

Signaling Erosion and Market Adjustment

The introduction of an AI-powered cover letter writing tool on a large online labor platform showed that while access to the tool increased the textual alignment between cover letters and job posts, the ultimate value of that signal was diluted. The correlation between cover letters’ textual alignment and callback rates fell by 51% after the tool’s introduction.

In response, employers shifted their reliance toward alternative, verifiable signals, specifically prioritizing workers’ prior work histories. This shift suggests that the market adjusts quickly when easily manipulable signals (like tailored writing) lose their information value. Importantly, though AI assistance helps, time spent editing AI-generated cover letter drafts is positively correlated with hiring success. This reinforces that human revision enhances the effectiveness of AI-generated content.

Managerial vs. Technical Expertise in Entrepreneurship

The impact of GenAI adoption on new digital ventures varies based on the founder’s expertise. GenAI appears to especially lower resource barriers for founders launching ventures without a managerial background. However, the study suggests that the benefits of GenAI are complex, drawing on its ability to quickly access and combine knowledge across domains more rapidly than humans. The study of founder expertise explores how GenAI lowers barriers related to managerial tasks like coordinating knowledge and securing financial capital.

IV. The Strategic Playbook for Transformational ROI

Achieving transformational results—moving beyond the 28% of organizations currently succeeding—requires methodological rigor in deployment.

1. Set Ambitious Goals and Redesign Workflows: AI high performers are 2.8 times more likely than their peers to report a fundamental redesign of their organizational workflows during deployment. Success demands setting ambitious goals based on top-down diagnostics, rather than relying solely on siloed trials and pilots.

2. Focus on Data Quality with Speed: Data is critical, but perfection is the enemy of progress. Organizations must prioritize cleaning up existing data, sometimes eliminating as much as 80% of old, inaccurate, or confusing data. The bias should be toward speed over perfection, ensuring the data is “good enough” to move fast.

3. Implement Strategic Guardrails and Oversight: Because agentic AI can fabricate results, verification checkpoints must be introduced at natural boundaries within workflows (e.g., extract → compute → visualize → narrative). Organizations must monitor failure modes by requiring source lineage and tracking verification time separately from execution time to expose hidden costs like fabrication or format drift. Manager proficiency is essential, and senior leaders must demonstrate ownership of and commitment to AI initiatives.

4. Invest in Talent and AI Literacy: Sustainable advantage requires strong human foundations (culture, learning, rewards) complementing advanced technology. Employees often use AI tools, with 24.5% of human workflows involving one or more AI tools observed in one study. Training should focus on enabling effective human-AI collaboration. Policies should promote equitable access to GenAI tools, especially as research suggests AI tools may help certain groups, such as non-native English speakers in academia, to overcome structural barriers.

Citation Links and Identifiers

Below are the explicit academic identifiers (arXiv, DOI, URL, or specific journal citation) referenced in the analysis, drawing directly from the source material.

Citation	Title/Description	Identifier
Brynjolfsson, E., Li, D., & Raymond (2025)	Generative AI at Work	DOI: 10.1093/qje/qjae044
Cui, J., Dias, G., & Ye, J. (2025)	Signaling in the Age of AI: Evidence from Cover Letters	arXiv:2509.25054
Wang et al. (2025)	How Do AI Agents Do Human Work? Comparing AI and Human Workflows Across Diverse Occupations	arXiv:2510.22780
Becker, J. et al. (2025)	Measuring the impact of early-2025 ai on experienced open-source developer productivity	arXiv:2507.09089
Bick, A., Blandin, A., & Deming, D. J. (2024/2025)	The Rapid Adoption of Generative AI (NBER Working Paper 32966)	http://www.nber.org/papers/w32966
Noy, S. & Zhang, W. (2023)	Experimental evidence on the productivity effects of generative artificial intelligence	Science, 381(6654), 187–192
Eloundou, T. et al. (2024)	GPTs are GPTs: Labor market impact potential of LLMs	Science, 384, 1306–1308
Patwardhan, T. et al. (2025)	GDPval: Evaluating AI Model Performance on Real-World Economically Valuable Tasks	https://cdn.openai.com/pdf/d5eb7428-c4e9-4a33-bd86-86dd4bcf12ce/GDPval.pdf
Peng, S. et al. (2023)	The Impact of AI on Developer Productivity: Evidence from GitHub Copilot	arXiv:2302.06590
Wiles, E. et al. (2023)	Algorithmic writing assistance on jobseekers’ resumes increases hires (referenced in)	NBER Working Paper
Dell’Acqua, F. et al. (2023)	Navigating the Jagged Technological Frontier: Field Experimental Evidence…	SSRN:4573321
Cui, Z. K. et al. (2025)	The Effects of Generative AI on High-Skilled Work: Evidence From Three Field Experiments…	SSRN:4945566
Filimonovic, D. et al. (2025)	Can GenAI Improve Academic Performance? Evidence from the Social and Behavioral Sciences	arXiv:2510.02408
Goh, E. et al. (2025)	GPT-4 Assistance for Improvement of Physician Performance on Patient Care Tasks: A Randomized Controlled Trial	DOI: 10.1038/s41591-024-03456-y
Ma, S. P. et al. (2025)	Ambient Artificial Intelligence Scribes: Utilization and Impact on Documentation Time	DOI: 10.1093/jamia/ocae304
Shah, S. J. et al. (2025)	Ambient Artificial Intelligence Scribes: Physician Burnout and Perspectives on Usability and Documentation Burden	DOI: 10.1093/jamia/ocae295

November 16, 2025

Tag: AI Productivity

TLDW

Thoughts

Key Takeaways

Detailed Summary

Turning Hardware Engineering Into Software

Two Engineers, One Jet Engine

The Cataclysm of Enterprise Software

Generating Step Files and PCB Layouts

China’s Open-Source Bet and Hardware Superiority

Frontier Intelligence vs. Cheap Models

Vertical Integration and the Captive MEMS Foundry

Software Still Needs Hands

Lawyers, Paralegals, and the Promotion of Junior Work

Slop PRs, the Thousand-Day Problem, and Humans as Verifiers

Notable Quotes

Related Reading

TLDW

Key Takeaways

Detailed Summary

Why Tobi Still Loves the Work After Twenty Years

Barnacles, Statesmen, and the Aesthetic Trap of Being a CEO

Different First, Convergence Second, Failure as Successful Discovery

Why Silicon Valley Lost Its Originality

The AI Memo, Net Impact Reviews, and the Founder Credibility Bank

Tokens, Opex, and the Limits of Spend as Revenue

Small Teams, Parkinson’s Law, and the Six Week Cycle

There Is No Permanent Underclass in the Shopify Data

A New Turing Test, “Build Me a Million Dollar Business”

Software Was the Hidden Infrastructure of the Last Thirty Years

Predicting AI Two Years Out, Overhype and Underhype

Hiring, Interns as Teachers, and Why Good People Are Still Good

Public Company Defense and the Reading List

Thoughts

TLDW

Key Takeaways

Detailed Summary

Flock, Shot Spotter, and the Politics of Solvable Crime

Faked Crime Statistics, Mayoral Politics, and the Tax Base

California’s 5 Percent Wealth Tax and the Founder Bankruptcy Mechanic

Henry Wallace 1948 and Why the American Story Is Not Decided Yet

Too Online, Too Offline, and Heaven Banning Blue Sky

The Coming Hardware, Meta Glasses, Neural Wristbands, and Practical Lie Detection

Kevin O’Leary, Tucker Carlson, and Whether America Can Build Anything

Nuclear Power, Three Mile Island, and Fifty Years of Unnecessary Carbon

Sand Into Thought, the Newton Alchemy Pitch for AI

AGI Already Happened, AI Vampires, and the Bot Org Chart

Dr GPT, Decoded Genomes, and a Diagnostic Bed Out of Star Trek

The David Shore Issue Ranking and the End of the Woke Cycle

Robots, China, and the Marxism Score on Model Cards

Sentience, Netflix Scripts, and the Anthropic Doom Loop

Steelmanning, AI Religion, and Westworld in Five Years

Sycophancy, Honest Helpful Harmless, and the Brutal Prompt

Joe’s Apology to Theo Von

Thoughts

TLDW

Key Takeaways

Detailed Summary

Compute as Lifeblood and the Cone of Uncertainty

Three Chip Platforms, One Orchestration Layer

Three Buckets and the Model Development Floor

Intelligence Is Multi Dimensional

From 9 Billion to 30 Billion ARR in One Quarter

Recursive Self Improvement and Talent Density

Procurement Strategy and the Layer Cake

Platform First, Selective Vertical Bets

Pricing, Jevons Paradox, and Return on Compute

Fundraising, DeepSeek, and Capital Intensity

Mythos, Cyber Capability, and Phased Releases

Claude Inside Finance

Culture, Co Founders, and the Race to the Top

The Virtual Collaborator and the Frontier Ahead

Downside Risks and What Excites Him Most

Thoughts

TLDW

Key Takeaways

Detailed Summary

The Anthropic blackmail incident and AI doomer feedback loops

Suicidal empathy and the activist economy

The SPLC criminal indictment