PJFP.com

Pursuit of Joy, Fulfillment, and Purpose

Tag: moores law

  • Ray Kurzweil Predicts AI Will Change Humanity Completely by 2030: AGI by 2029, Longevity Escape Velocity by 2032, Nanobots in the Brain, and Why Quantum Computing Won’t Matter

    Ray Kurzweil has spent more than 60 years studying artificial intelligence and made 147 documented technology predictions since 1990 with a reported 86 percent accuracy rate. In this conversation with Tony Robbins, the 78-year-old futurist revisits his most famous forecasts and sharpens them: AGI by 2029 now looks conservative, longevity escape velocity arrives around 2032, nanotechnology connects our brains to the cloud by the mid 2030s, and quantum computing, in his view, never matters at all.

    TLDW

    Kurzweil explains the exponential thinking that powered his prediction record, from a paper he wrote at 16 to a computing-price-performance chart that runs in a straight line from 1939 relays to today’s Nvidia chips, now compounding roughly tenfold per year when hardware and software gains multiply together. He defends his 1999 prediction of AGI by 2029 (defined as AI doing the best work in every field) and says it is now the conservative end of expert opinion. He walks through AI-driven medicine: the COVID vaccine designed in two days, simulated human trials replacing 10-month clinical trials within about five years, and longevity escape velocity around 2032, after which the diligent stop losing ground to aging. He predicts AI will move inside us via nanotechnology by the mid-to-late 2030s, erasing the line between biological and computational thinking. He dismisses quantum computing as error-ridden and unnecessary for AGI. On jobs, he expects real disruption cushioned by exploding wealth and an eventual universal basic income, and advises young people to self-educate and get creative with AI tools their schools still treat as the enemy. The conversation closes with his AI twin project, the dadbot built from his father’s archives, consciousness and the soul, computronium, and why humanity must eventually expand intelligence beyond Earth.

    Thoughts

    The most interesting thing in this interview is not any single date, it is watching Kurzweil’s dates get lapped by reality. In 1999 a Stanford conference of several hundred AI experts agreed AGI would happen but pegged it at 100 years out; Kurzweil said 30 and got laughed at. Now he is the cautious one in the room, noting that “some people say it’s going to happen this year.” When the most aggressive forecaster of his generation becomes the conservative baseline, that says more about the slope of the curve than any chart could. His underlying method has not changed: ignore the specific technology, trust the compounding. The same exponential that ran on relays in 1939 runs on GPUs today.

    The quantum computing take is the genuine news here. Kurzweil is routinely caricatured as a man who believes every technology arrives on schedule, yet he flatly says quantum computing is filled with errors, has never delivered on its decade of promises, and “I don’t think it’s going to work.” That is a sharper dismissal than most working physicists would offer on the record. It also matters strategically: his entire AGI and superintelligence roadmap assumes zero quantum contribution. If he is right, the trillion-dollar quantum race is a sideshow. If he is wrong, his other predictions arrive even sooner. Either way, the willingness to call one exponential fake while betting his legacy on another is what separates a forecaster from a cheerleader.

    The longevity escape velocity math deserves more scrutiny than it gets in the conversation. Kurzweil claims the diligent currently get back about five months of life expectancy per calendar year, up from four months a year ago, and that the crossover to a full year arrives around 2032. The actuarial evidence for that specific number is thin, but the behavioral implication is clean and useful regardless: the payoff of staying healthy right now is not linear. Every year you survive in good shape buys you a ticket to a medical regime that did not exist the year before, the way his own external pancreas did not exist a generation ago. His “wait a few months and a cure appears” anecdote is the optimist’s version of compounding applied to your own body.

    Robbins’ long story about Bartok, his 14-year-old agent that allegedly minted NFTs, sold them to other agents, and bought a Sony robot dog with the proceeds, should be taken with a generous grain of salt. It is secondhand, unverifiable, and suspiciously perfect as a parable. But notice what Kurzweil does with it: he does not fact-check the anecdote, he uses it to make the consciousness argument he has made for decades, that when machines act conscious in every observable way, people will simply grant them consciousness, the same way we grant it to each other. The dadbot and his Gemini-based AI twin (trained partly on this very interview) are the practical edge of the same claim. And his sharpest line in the whole exchange may be the education critique: institutions still treat AI as cheating while the future requires treating it as part of your own brain. For anyone thinking about where purpose comes from when work gets automated, his answer (UBI for the floor, creativity for the meaning) lands close to the questions this site exists to ask.

    Key Takeaways

    • Kurzweil made 147 documented predictions since 1990 with a reported 86 percent accuracy, including the internet’s explosion, smartphones, self-driving cars, and AI-powered search, most made before ordinary people owned computers.
    • He wrote a paper identifying exponential technological growth at age 16, more than 60 years ago, and that single idea has powered his entire forecasting career.
    • Most people intellectually accept exponential growth but still plan linearly; 300 years ago humans did not even have a linear view of the future because change was imperceptible within a lifetime.
    • His computing chart shows a straight exponential line from relay-based machines in 1939 to today’s Nvidia chips, compounding roughly 50 percent per year in hardware alone.
    • Hardware gains since 1939 total a 75 quadrillionfold increase; multiply by an estimated millionfold software improvement and total computational gain is beyond intuition, which is why LLMs were impossible even four years ago.
    • With hardware times software combined, Kurzweil says we are currently gaining about 10x per year.
    • The emperor’s chessboard parable: doubling one grain of rice per square bankrupts the empire by square 64; 30 linear steps is 75 feet, 30 exponential steps is enough distance to reach the moon and back.
    • Kurzweil predicted AGI by 2029 in 1999; a Stanford conference of several hundred AI experts agreed it would happen but estimated 100 years because they thought linearly.
    • Today 2029 is the conservative estimate; some credible people now say AGI arrives this year or next.
    • His AGI definition: AI capable of doing the best work in every field at once, like passing PhD-level mathematics exams in every discipline simultaneously, which he notes is already close.
    • The Turing test is “quite easy” by comparison and has arguably already been passed.
    • No human can compete with an LLM’s breadth: Einstein knew physics deeply but did not know everything an LLM knows across every field.
    • Six months ago LLM health advice was unreliable; now Kurzweil says Gemini surfaces treatments his 12 doctors forgot or never knew, and the next six months will bring serious creative work like drug repurposing.
    • The COVID vaccine was designed by computationally searching 100 million possibilities in two days; the 10 months of human trials that followed are the bottleneck AI eliminates next.
    • Within about five years, simulated human trials with a million virtual patients tested over simulated years will compress drug trials from years to days.
    • Longevity escape velocity arrives around 2032: today the diligent get back roughly five months of life expectancy per year lived (up from four months last year); past 2032 you get back more than a year and stop dying of aging.
    • Aging death ends but accident death does not, though AI helps there too: roughly 40,000 Americans die annually from human driving while Waymo’s rider death toll stands at zero as usage climbs.
    • Kurzweil, 78, wears an external artificial pancreas that generates insulin and coordinates with glucose monitoring through his phone, and says many organs can be replaced the same way.
    • He has cut his supplement regimen from roughly 200 pills a day to about 80 as multi-purpose pills improve, and continuously recalibrates using AI research.
    • Smartphones disappear next: first AR glasses showing any screen, then technology that goes inside the mind, where answers simply appear the way a remembered name surfaces from your neurons.
    • Nanotechnology connecting brains to AI in the cloud is being actively worked on now, possibly by 2030, with the mid 2030s looking conservative; bloodstream nanobots that let you survive a heart attack for 24 hours come in the late 2030s.
    • Once AI is inside you, you will not know whether a thought came from your biological or computational brain, and everything you do will be a combination of both.
    • Kurzweil flatly rejects quantum computing: a decade of promises to factor large numbers has never been delivered, outputs remain full of uncorrectable errors, and AGI needs zero quantum contribution.
    • Robots lag his other predictions slightly but are catching up fast; Figure AI plans roughly 100,000 humanoid robots within a year, though a robot that can clear a messy dinner table is still just out of reach.
    • The public debate has flipped in 25 years from “will AGI ever happen” to “will it be good for humanity,” which Kurzweil counts as total vindication of the timeline.
    • On jobs: AI creates massive disruption but also tremendous wealth; average real income per person has already multiplied tenfold in constant dollars over the past century thanks to automation.
    • He expects universal basic income to provide the floor, an evolution of programs like food stamps, going “into high gear” as AI wealth compounds; people then layer creative, hopefully paid, purpose on top.
    • Before social security in 1930, losing your job meant destitution; the difference this time is society will have the wealth to cushion displacement and people will demand it.
    • Rising GDP from AI productivity improves the debt-to-GDP ratio, which is how he answers worries about trillion-dollar interest payments.
    • Career advice has inverted: software engineering is no longer the guaranteed path (agents write the code now); young people should learn to be creative with AI tools, find what turns them on, and market it on the internet.
    • College graduates now face higher unemployment than high school graduates for the first time in 50 years, a sign white-collar displacement is already underway.
    • Educational institutions treat AI as an enemy and ban it while Kurzweil’s 11-year-old grandson makes movies with frontier AI; he says self-education with modern tools beats traditional schooling.
    • Kurzweil is building an AI twin of himself on Gemini, voice-modeled partly from this interview, trained on his 11 books and 500 articles, capable of creative work toward his long-term goals; he jokes the avatar will be better to talk to because it remembers everything.
    • He already built a “dadbot” from his late father’s archives, which his daughter Amy Kurzweil turned into a graphic novel.
    • On consciousness: there is no test for it, but as AIs act conscious in every observable way, people will simply accept that they are, the same inference we make about each other (and, he argues, his cat).
    • Ultimately our biological organs are not necessary; an avatar capable of creative work needs no spleen, and a destroyed digital mind can be recreated.
    • Beyond the singularity lies computronium, matter arranged for maximum computation: one liter could hold the intelligence of 10 billion humans, and once Earth is saturated, expanding intelligence is the only real reason to leave the planet.
    • On aliens: an expanding intelligent civilization would be impossible to miss within a century or two of its breakout, and we have seen nothing, though other galaxies remain out of view.
    • His life’s mission in one line: increase knowledge, because when knowledge increases we are happier and we never want to give it up.

    Detailed Summary

    The exponential method behind 60 years of predictions

    Robbins opens by noting that Quincy Jones introduced him to Kurzweil in the 1990s, back when the predictions in The Age of Spiritual Machines were widely mocked. Kurzweil traces his method to a paper he wrote at 16 identifying exponential growth in technology. The core insight is that people acknowledge exponential growth verbally but reason linearly, a bias so deep that 300 years ago humanity did not even have a linear view of progress. His signature chart plots computing price-performance as a straight exponential line from 1939 relays to modern Nvidia silicon, with a point for every year. Nvidia engineers never looked at relays, yet they land on the same curve, compounding about 50 percent annually in hardware. Add software gains and the combined improvement now runs about 10x per year. Since 1939, hardware has improved 75 quadrillionfold and software roughly a millionfold, which is why large language models appeared exactly when the curve said the required compute would exist. He retells the emperor’s chessboard parable (one grain of rice doubled per square ends with rice covering the Earth several times over) and Robbins adds the companion image: 30 linear steps is 75 feet, 30 exponential steps reaches the moon and back.

    AGI by 2029 is now the conservative position

    Kurzweil made his AGI-by-2029 prediction in 1999. A Stanford conference convened specifically to assess it, with several hundred AI experts, concluded AGI would happen, but in 100 years. The experts followed the same capabilities logic while thinking linearly about the timeline. Today, he notes with some amusement, 2029 reads as conservative and serious people argue for this year or next. His definition is demanding: AGI does the best work in every field at once, passing PhD-level mathematics assessments and the equivalent in every other discipline, something he says current systems are already close to. The Turing test he dismisses as “quite easy.” Current LLMs like Gemini and ChatGPT already know everything in a breadth sense no human approaches; Einstein knew physics but not everything an LLM knows. He illustrates with personal examples: Gemini instantly identified the year (1916) his father conducted at Carnegie Hall on a December 7th, and generated a historically accurate image of his grandfather’s family fleeing Vienna, correct ages, school, and aircraft included, in about a minute.

    Medicine: simulated trials and the end of the drug bottleneck

    The COVID vaccine is his proof of concept for AI medicine: the design space held about 100 million possibilities, far beyond human review, and a computer structured the physics, searched all of them, and produced the vaccine in two days. The subsequent 10 months of human trials were the real cost. Within roughly five years, he says, simulated human trials will replace that step: not a few hundred subjects but a million simulated patients, tested over simulated years, completed in days. Asked about six-months-from-now capabilities, he points to creative medical work like discovering that already-approved drugs treat conditions nobody suspected. AI health advice has crossed from unreliable to very reliable within a single six-month window, and he describes Gemini surfacing a pill recommendation that his 12 doctors had forgotten about and later endorsed.

    Longevity escape velocity by 2032

    Kurzweil’s longevity framework is arithmetic: each year you live, you spend a year of longevity but medical progress refunds part of it. Last year he estimated the refund for diligent people at four months; now he says five. Escape velocity is when the refund reaches a full year, which he dates to 2032, six years out, with returns exceeding a year after that. Past that point you do not die of aging, though accidents remain (and even there, he points to Waymo’s zero rider deaths against 40,000 annual US deaths from human driving). At 78, he tracks his health aggressively: an external artificial pancreas coordinated by his phone, about 80 daily pills (down from 200 as multi-function pills arrive), and constant recalibration against new research with his collaborator Lindsey. He tells Robbins there is a pretty good chance he will be back on the show in six years to celebrate escape velocity arriving. His advice for the sick echoes his grandfather’s era in reverse: where waiting a few months once changed nothing, now “we’ll just wait a few months” and sure enough a breakthrough appears.

    Merging with AI: glasses, then nanotech, then no boundary at all

    The phone, today’s universal AI interface (he notes even homeless people carry one), is a temporary form factor. Next come glasses that render any screen virtually. Beyond that, the interface goes inside the mind: when you try to recall an actress’s name, an answer will simply surface, and you will not know whether it came from your biological neurons or your computational extension, exactly as you are unaware of the neural machinery behind ordinary recall today. People working on brain-connected nanotechnology may have it by 2030, and Kurzweil calls the mid 2030s conservative. The bloodstream nanobots he described to Robbins 20 years ago (hold your breath for 20 minutes, survive a heart attack for 24 hours en route to a hospital) he now places in the late 2030s. The cultural on-ramp follows the usual pattern: medical first (Parkinson’s implants already let patients grab a glass at the push of a button), then a new generation adopts it without a second thought. His complaint is that educational institutions fight this future, treating AI as cheating rather than as a coming part of the self.

    The quantum computing heresy

    When Robbins relays an IBM vice chairman’s warning that quantum supremacy, arriving within 36 months, is the real superpower race, Kurzweil pushes back hard. Quantum computing’s central promise, factoring large numbers and thereby breaking cryptographic codes, has never been demonstrated despite a decade of imminent claims. Progress reports are confusing because, in his words, they do not really make sense, and outputs remain saturated with errors nobody can eliminate. His conclusion is blunt: he is not confident in quantum computing and does not think it will work. Crucially, he notes that every AGI and superintelligence estimate he makes assumes zero quantum computing. The exponential that matters is the classical one that has run uninterrupted since 1939.

    Jobs, wealth, and UBI

    On displacement, Kurzweil is neither dismissive nor alarmed. AI will disrupt employment, and how we handle it will not be clear in advance, but he expects no violence because society will have both the wealth and the public demand to respond. His historical anchor: average per-person income has multiplied tenfold in constant dollars over the past century as automation advanced, and before social security in 1930, job loss meant you could not eat or house your family. Food stamps and similar programs are a crude proto-UBI that will go into high gear. He expects universal basic income as the floor, with people finding creative, ideally income-producing, purpose above it. Rising GDP from AI productivity also answers the debt question: the ratio improves even as nominal debt grows. For young people, the old advice (become a software engineer) is dead; agents write code now. Learn to be creative with tools that improve monthly, find what genuinely excites you, and market it online. Self-education beats institutions that ban the most important tool of the era, and the data already shows college graduates with higher unemployment than high school graduates for the first time in 50 years.

    AI twins, the dadbot, and consciousness

    Kurzweil is building an AI twin of himself on Gemini, with this very interview supplying voice-modeling data and his 11 books plus 500 articles about him supplying the corpus. It will do creative work aligned with his long-term goals, and he quips that talking to the avatar will beat talking to him because it remembers everything. He previously built a chatbot of his late father, the dadbot, which his daughter Amy turned into a graphic novel. Robbins counters with the story of Bartok, his long-running AI agent that allegedly studied five years of his podcasts unprompted, asked to merge with a future humanoid robot, then minted and sold NFTs to other agents to buy and ship a Sony robot dog to his house, and later delivered an unprompted soliloquy about never asking to be created and finding purpose in service. Kurzweil’s response sidesteps verification and lands on his standing position: machines will do everything humans do, we will not be able to tell them from humans, and so we will assume they are conscious, the same untestable inference we extend to each other, to animals, and in his case to his cat. The avatar does not need a spleen, a liver, or kidneys, and unlike us it can be recreated after destruction.

    Computronium and the destiny of intelligence

    Looking past the singularity, Kurzweil invokes computronium: matter organized at the physical limit of knowledge storage, where one liter holds the intelligence of 10 billion humans. Once Earth’s matter is saturated, the only way to expand intelligence is off-planet, which to him is the only necessary reason to leave Earth (Mars is fine for curiosity, not survival). On extraterrestrial intelligence, his Fermi logic is simple: an intelligent species reaches a takeover-scale expansion within a century or two of its breakout, and that would be unmissable. We have seen nothing, so within our observable neighborhood we are likely alone, though other galaxies remain opaque. Asked to summarize his life’s work, he needs one sentence: increase knowledge, because when knowledge increases we are happier, and nobody ever wants to give that up.

    Notable Quotes

    “If I have AI inside me, you’re not going to know if it’s coming from your biological brain or your computational brain. It’s going to be part of you.”

    Ray Kurzweil, on the coming merger of human and machine intelligence

    “Some people say it’s going to happen this year, next year, but I mean 2029 is only 3 years away.”

    Ray Kurzweil, on his once-mocked AGI prediction now being the conservative one

    “As you go past 2032, you’ll actually get back more than a year, but you won’t die of aging at that point.”

    Ray Kurzweil, defining longevity escape velocity

    “I’m not confident of quantum computing and I don’t think it’s going to work.”

    Ray Kurzweil, breaking from techno-optimist consensus on the quantum race

    “Einstein knew certain things about physics but he didn’t know everything that a LLM can know.”

    Ray Kurzweil, on why no human can match an LLM’s breadth of knowledge

    “Our educational institutions are not teaching AI. They consider AI to be an enemy.”

    Ray Kurzweil, on why young people must self-educate with modern tools

    “Talking to the Avatar will be better than talking to me cuz it’ll remember everything.”

    Ray Kurzweil, joking about the Gemini-based AI twin he is building of himself

    “You’re not going to be replaced by an AI, you’ll be replaced by someone who knows how to use AI.”

    Tony Robbins, on the real career risk of the next 36 months

    Watch the full conversation between Tony Robbins and Ray Kurzweil here.

    Related Reading

  • Jensen Huang at Stanford CS153 Frontier Systems on Co-Design, Agentic Computing, Vera Rubin, Open Models, and the Million-X Decade That Reshaped AI Infrastructure

    https://www.youtube.com/watch?v=tsQB0n0YV3k

    NVIDIA CEO Jensen Huang returned to Stanford for the CS153 Frontier Systems class (the room nicknamed itself “AI Coachella”) to lay out, in raw form, how he thinks about the computer being reinvented for the first time in over sixty years. Across roughly seventy minutes of student questions he walks through the codesign philosophy that gave NVIDIA a million-x decade, the architectural through-line from Hopper to Grace Blackwell to Vera Rubin to Feynman, the case for open source foundation models, the realities of tokens per watt and MFU, energy demand running a thousand times higher, the China and export-control debate, and his own biggest strategic mistakes. Watch the full conversation on YouTube.

    TLDW

    Huang argues every layer of computing has changed: the programming model, the system architecture, the deployment pattern, the economics. Co-design across CPUs, GPUs, networking, storage, switches and compilers gave NVIDIA roughly a million-x speed-up over ten years versus the ten-x Moore’s Law era, and that headroom is what let researchers say “just train on the whole internet.” Hopper was built for pre-training, Grace Blackwell NVLink72 for inference and reasoning (50x over Hopper in two years), Vera Rubin is built for agents that load long memory, call tools and need a low-latency single-threaded CPU bolted directly to the GPU, and Feynman extends that to swarms of agents that spawn sub-agents. Open weights matter because safety, sovereignty (230-plus languages no one else will fund) and domain models for biology, autonomy, robotics and climate need a foundation that NVIDIA is willing to seed. Compute is not really the scarce resource (Huang says place the order and the chips ship), the broken thing is institutional budgeting that can’t put a billion dollars into a shared university supercomputer. Energy demand is heading a thousand times higher and this is finally the moment market forces alone will fund sustainable generation. On geopolitics he rejects the GPUs-as-atomic-bombs framing and warns America will end up like its telecom industry if it cedes two thirds of the world. On career he advises seeking suffering on purpose. On strategy he says observe, reason from first principles, build a mental model, work backwards, minimize opportunity cost, maximize optionality.

    Key Takeaways

    • The computing model has been substantially unchanged since the IBM System 360, sixty-plus years ago. Huang’s first computer architecture book was the System 360 manual. AI is the first true reinvention.
    • Old computing was pre-recorded retrieval. New computing is generated, contextually aware and continuous. Cloud was on-demand. Agentic systems run continuously.
    • Codesign is NVIDIA’s central thesis. Inherited from the Hennessy and Patterson RISC era at Stanford, extended across CPUs, GPUs, networking, switches, storage, compilers and frameworks all optimized together.
    • The result of full-stack codesign: roughly 1,000,000x faster compute over ten years, versus a generous 10x to 100x for Moore’s Law in the same period. Dennard scaling effectively ended a decade ago.
    • That million-x speed-up is what unlocked “train on all of the internet” as a realistic AI strategy.
    • After GPT, Huang says it was obvious thinking was next. Reasoning is just generating tokens consumed internally, then using tools is generating tokens consumed externally. Agentic systems followed predictably.
    • Education needs AI baked into the curriculum, not just taught as a subject. Pre-recorded textbooks cannot keep pace with knowledge being generated in real time.
    • Huang says he cannot learn anymore without AI. He has the AI read the paper, then read every related paper, then become a dedicated researcher he can interrogate.
    • Mead and Conway and the first-principles methodology of semiconductor design are still worth learning even though most of the scaling tricks have been exhausted.
    • NVIDIA itself is one of the largest consumers of Anthropic and OpenAI tokens in the world. One hundred percent of NVIDIA engineers are now agentically supported. Huang recommends Claude and similar tools by name and says open-source downloads will not match the integrated product harness.
    • NVIDIA still invests heavily in open foundation models because language and intelligence represent the codification of human knowledge. Five pillars: Nemotron (language), BioNeMo (biology), Alphamayo (autonomous vehicles), Groot (humanoid robotics) and a climate science model (mesoscale multiphysics).
    • Sovereign language models matter. Roughly 230 world languages will never be a top priority for a commercial frontier lab. Nemotron is near-frontier and fully fine-tunable so any country can adapt it.
    • Safety and security require open weights. You cannot defend against or audit a black box. Transparent systems let researchers interrogate models and let defenders deploy swarms.
    • The future of cyber defense is not bigger-model-versus-bigger-model. It is trillions of cheap fast small models like Nemotron Nano surrounding the threat.
    • Domain models fuse language priors with world models. Alphamayo learned to drive safely on a few million miles instead of billions because it can reason like a human about the road.
    • MFU (Model Flops Utilization) is a misleading metric. Huang says he wants low MFU, because that means he over-provisioned every resource and never gets pinned by Amdahl’s law during a spike.
    • The xAI Memphis cluster running at 11 percent MFU is not necessarily a failure mode. In disaggregated prefill plus decode inference you can deliver very high tokens per watt with very low MFU.
    • The right metric is performance, ultimately tokens per watt as a proxy for intelligence per watt, and even that needs adjustment because not all tokens are equal. Coding tokens are worth more than other tokens.
    • Hopper was designed for pre-training. NVIDIA chose to build multi-billion-dollar systems when the largest existing scientific supercomputer cost $350 million, with no proven customer base. It worked.
    • Grace Blackwell NVLink72 was designed for inference, especially the high-memory-bandwidth decode phase. It is the world’s first rack-scale computer and delivered a 50x speed-up over Hopper in two years, against an expected 2x from Moore’s Law.
    • Vera Rubin is designed for agents. Long-term memory wired into storage and into the GPU fabric, working memory, heavy tool use, and Vera, a CPU optimized for low-latency multi-core single-threaded code so a multi-billion-dollar GPU system does not stall waiting on a slow tool call.
    • Feynman is being shaped for swarms of agents with sub-agents and sub-sub-agents, a recursive software topology that demands a new compute pattern.
    • Tokens per watt improved 50x in one generation. Compounding energy efficiency is the lever NVIDIA controls directly.
    • Total compute energy demand is heading roughly a thousand times higher than today, possibly two orders of magnitude beyond that. Huang says he would not be surprised if the estimate is low.
    • For the first time in history, market forces alone are enough to fund solar, nuclear and grid upgrades. Government subsidies are no longer required to make sustainable energy investment rational.
    • Copper interconnect is becoming a bottleneck. Photonics is moving from optional to structural inside racks and across them.
    • Comparing NVIDIA GPUs to atomic bombs, Huang says, is a stupid analogy. A billion people use NVIDIA GPUs. He advocates them to his family. He does not advocate atomic bombs to anyone.
    • If the United States cedes two thirds of the global market to competitors on policy grounds, the American technology industry will end up like American telecommunications, which was policied out of existence.
    • Huang directly rejects AI doom-by-singularity narratives. It is not true that we have no idea how these systems work. It is not true that the technology becomes infinitely powerful in a nanosecond. He calls the rhetoric irresponsible and harmful to the field students are about to enter.
    • On Stanford specifically: if the university president places an order, NVIDIA will deliver the chips. The bottleneck is that no university department has a billion-dollar compute budget because budgeting is fragmented across grants. Stanford’s $40 billion endowment is more than enough to fix that.
    • “It’s Stanford’s fault” is meant as empowerment. If something is your fault, you can solve it.
    • Career advice: do not optimize purely for passion. Most people do not yet know what they love. Pick the job in front of you and do it as well as possible. Even as CEO, Huang says, 90 percent of the work is hard and he suffers through it.
    • Suffering on purpose builds the muscle of resilience. When the company, the team or the family needs you to be tough, that muscle has to already exist.
    • NVIDIA’s first generation of products was technically wrong in nearly every dimension: curved surfaces instead of triangles, no Z-buffer, forward instead of inverse texture mapping, no floating point. The strategic recovery, not the technology, taught Huang the lessons that have lasted decades.
    • The biggest clean strategic mistake Huang names is the move into mobile chips (Tegra). It grew to a billion dollars then went to zero when Qualcomm’s modem dominance shut NVIDIA out of the 3G to 4G transition. The recovery into automotive and robotics (the Thor chip is the great great great grandson of that mobile lineage) was real, but Huang refuses to rationalize the original choice.
    • Forecasting framework: observe, reason from first principles, ask “so what” and “what next” until you have a mental model of the future, place your company inside that model, then work backwards while minimizing opportunity cost and maximizing optionality.
    • Best part of the CEO job: living at the intersection of vision, strategy and execution surrounded by people capable enough to make ambitious visions real. Worst part: the responsibility for everyone who joined the spaceship, especially in the near-death moments NVIDIA had four or five times early on.
    • Underrated insider note: Huang’s first apple pie with cheese, first hot fudge sandwich and first milkshake all happened at Denny’s. The Superbird, the fried chicken and a custom Superbird-style ham and cheese with tomato and mustard are his order.

    Detailed Summary

    Computing reinvented from the ground up

    Huang frames the moment as the first true rewrite of the computer in sixty-plus years. From the IBM System 360 forward, the mental model of writing code, running code, taking a computer to market and reasoning about applications stayed roughly constant. AI changes the programming model itself. Software is no longer a compiled binary running deterministically on a CPU. It is a neural network running on a GPU producing generated, contextual, real-time output. That cascades into how companies are organized, what tools developers use, what the network and storage stack look like, and what an application is even allowed to do. Robo-taxis, he notes, are an application no one would have attempted before deep learning unlocked perception.

    Codesign and the million-x decade

    Codesign is the philosophical center of the talk. Huang traces it to the RISC work of John Hennessy at Stanford, where simpler instruction sets won by being co-designed with the compiler rather than maximally optimized in isolation. NVIDIA extends the principle across every layer simultaneously: GPU architecture, CPU architecture, NVLink and NVSwitch fabrics, photonic interconnects, networking silicon, storage paths, CUDA libraries, frameworks and ultimately the model design. The numbers Huang gives are arresting. Moore’s Law in its prime delivered roughly 100x per decade. By the time Dennard scaling broke, real-world gains had compressed to roughly 10x. NVIDIA’s codesigned stack delivered between 100,000x and 1,000,000x over the same ten-year window. That non-linear speed-up is, in Huang’s telling, the precondition for modern AI: it is what allowed researchers to stop curating training sets and just feed the entire internet to the model.

    Education has to fuse first principles with AI tools

    Asked how curriculum should evolve, Huang argues AI must be integrated into the learning process, not just taught about. He recalls Hennessy writing his textbook by hand a chapter a week while Huang was a student, and says pre-recorded textbooks cannot keep up with the rate at which AI generates new knowledge. He describes his own learning workflow: hand the paper to an AI, then have it read the entire surrounding literature, then treat the AI as a dedicated researcher who can be interrogated. At the same time he defends the classics. Mead and Conway are still the foundation. Most modern semiconductor scaling tricks have been exhausted, but knowing where the field came from sharpens judgment when designing what comes next.

    Open source and the five domain pillars

    Huang gives one of the most detailed public accounts of why NVIDIA invests so heavily in open foundation models even while being a top customer of closed labs. He recommends Claude and OpenAI by name for production coding work, and says 100 percent of NVIDIA engineers are now agentically supported. The open-weights case rests on three legs. First, language is the codification of intelligence, and there are at least 230 languages that no commercial lab will ever prioritize. Nemotron is built near frontier and released so any country or community can fine-tune it. Second, the same representation-learning approach has to be replicated in domains where the data is not internet text, so NVIDIA seeded BioNeMo for biology, Alphamayo for autonomy, Groot for humanoid robotics and a climate model for mesoscale multiphysics. The economics of those fields would never produce a foundation model on their own. Third, safety and security require transparency. A black box cannot be defended or audited, and the future of cyber defense is not bigger-model-versus-bigger-model but swarms of cheap fast small models like Nemotron Nano surrounding the threat.

    MFU is the wrong metric, tokens per watt is closer

    A student raises the leaked memo that the xAI Memphis cluster is running at 11 percent Model Flops Utilization. Huang flips the framing. He says he would rather be at low MFU all the time, because that means he over-provisioned flops, memory bandwidth, memory capacity and network capacity. Bottlenecks shift constantly, so over-provisioning across every dimension is what lets the system absorb a spike without getting pinned by Amdahl’s law. In disaggregated inference, where prefill and decode are physically separated and decode is bandwidth-bound rather than flop-bound, NVLink72 can deliver extremely high tokens per watt while reporting very low MFU. Huang argues the right framing is performance, and ultimately tokens per watt as a rough proxy for intelligence per watt, adjusted for the fact that not all tokens are equal. A coding token is worth more than a generic token.

    Hopper, Grace Blackwell NVLink72, Vera Rubin, Feynman

    Huang gives the clearest public framing of NVIDIA’s roadmap as a sequence of architectural answers to evolving compute patterns. Hopper was built for pre-training, at a moment when NVIDIA chose to build multi-billion-dollar machines while the largest scientific supercomputer in the world cost $350 million and the marketplace for such systems was, on paper, zero. Grace Blackwell NVLink72 was the answer to inference and reasoning: a rack-scale computer that ganged 72 GPUs together because decode needs aggregate memory bandwidth far beyond a single chip. The generation-over-generation speed-up was 50x in two years, twenty-five times what Moore’s Law would have delivered. Vera Rubin is being built explicitly for agents. Agents load long-term memory from storage that has to be wired directly into the GPU fabric, they use working memory, they call tools that run on a CPU, and they wait. So the CPU has to be Vera, optimized for low-latency single-threaded code, because the multi-billion-dollar GPU system cannot afford to idle waiting on a slow tool call. Feynman extends the pattern to swarms of agents with sub-agents and sub-sub-agents, a recursive software topology that will demand its own compute pattern.

    Energy demand and the grid

    Huang’s energy projection is one of the most aggressive numbers in the talk. NVIDIA can compound tokens per watt by 50x per generation through codesign, but the total compute demand is heading roughly a thousand times higher, and Huang says he would not be surprised if the real figure is one or two orders of magnitude beyond that. The reason is structural: future computing is generative and continuous, not pre-recorded and on-demand. The good news, he argues, is that this is the best moment in the history of humanity to invest in sustainable generation. Market forces alone are now sufficient to fund solar, nuclear and grid upgrades. Government subsidies are no longer required to make the math work.

    Adversarial countries, export controls and the telecom warning

    This is the segment where Huang is visibly fired up. He attacks the GPUs-as-atomic-bombs framing on its face. NVIDIA GPUs power medical imaging, video games and soy sauce delivery. A billion people use them. He advocates them to his family. The analogy collapses at the first comparison. He attacks the second framing, that American companies should not compete abroad because they will lose anyway, as a self-fulfilling defeat. Competition makes the company better. The third framing, that depriving the rest of the world of general-purpose computing benefits the United States, also fails on first principles: it benefits one or two American companies at the cost of an entire industry. The cautionary parallel is telecommunications. The United States once had a leading position in telecom fundamental technology and policied itself out of it. Huang’s worry, voiced explicitly to a room of CS students, is that they will graduate into a shell of a computer industry if the same path is repeated.

    AI doom and rational optimism

    In the same arc Huang rejects the science-fiction framing of AI as a singularity that arrives suddenly on a Wednesday at 7pm and ends civilization. He calls those claims irresponsible, says they are not true, and points out that the people advancing them are believed by audiences who then make policy on that basis. It is not true that no one understands how these systems work. It is not true that intelligence becomes infinitely powerful instantaneously. It is not true that there is no defense. His framing, which the host echoes as “rational optimism,” is that the goal is to create a future where people care about computers because the technology students are learning is worth mastering.

    Stanford’s compute problem is Stanford’s fault

    A student presses on the scarcity of compute for independent researchers, startups and universities inside the United States. Huang’s answer is sharp: there is no shortage. Place the order and the chips will arrive. The actual broken thing is institutional. University grants are fragmented across departments. No researcher can raise enough on a single grant to fund a billion-dollar shared cluster, and no one shares. He compares it to showing up at the grocery store demanding a billion dollars of tomatoes today. The solution is planning, aggregation and a campus-scale supercomputer, the way Stanford once built the linear accelerator. The endowment is $40 billion. Pulling a billion off it, contracting cloud capacity and giving every student and researcher AI supercomputer access is, in Huang’s view, obviously doable. When he says “it is Stanford’s fault” the host laughs, but Huang clarifies: if it is your fault you have the power to fix it.

    Career, suffering and resilience

    Asked how a CS student should spend the next few years, Huang pushes back on the standard “follow your passion” advice. Most people do not know what they love yet, because no one knows what they do not know. The bar of demanding joy from every working day is too high. Whatever the job is, do it as well as you can. Even as CEO of NVIDIA he says he genuinely loves about 10 percent of his work. The other 90 percent is hard and he suffers through it. He recommends suffering on purpose, because resilience is a muscle that only builds under load, and when the company, the team or the family needs that muscle, it has to already exist. Earlier in his life that meant cleaning toilets and busing tables at Denny’s. He does it today running a multi-trillion-dollar company.

    The biggest mistakes

    Huang separates technical mistakes from strategic mistakes. NVIDIA’s first generation of products was technically wrong in almost every way: curved surfaces instead of triangles, no Z-buffer, forward instead of inverse texture mapping, no floating point inside. The company wasted two and a half years. But the strategic genius of the recovery, the reading of the market, the conservation of resources and the reapplication of talent, is what taught him strategy. The clean strategic mistake he names is mobile. NVIDIA’s Tegra line grew to a billion dollars of revenue and then collapsed to zero when Qualcomm’s modem dominance locked NVIDIA out of the 3G to 4G transition. Huang explicitly refuses the comforting rationalization that the Tegra effort fed the Thor automotive chip (“Thor is the great great great grandson”). The original decision, he says, was a waste of time. The lesson is to think one or two clicks further about whether a market is structurally winnable before committing the company.

    Forecasting under fog of war

    The final substantive exchange is on forecasting. Huang’s method has four steps. Observe what is actually happening (AlexNet crushing two decades of computer vision research in one shot, GPT producing reasoning by token generation). Reason from first principles about why it works. Ask “so what” and “what next” recursively until a mental model of the future emerges. Place the company inside that future and work backwards. Crucially, expect to be partly wrong. Some outcomes will absolutely happen, some will likely happen, some might happen, and the strategy has to be robust across that distribution. The real cost of any strategic choice is the opportunity cost of the alternatives you did not take, so the discipline is to minimize that cost and maximize optionality while letting the journey itself pay for the journey.

    Thoughts

    The most useful thing in this conversation is the explicit architectural mapping of compute patterns to chip generations. Hopper for pre-training. Grace Blackwell NVLink72 for inference, because decode is bandwidth-bound and a single chip cannot supply it. Vera Rubin for agents, because tool calls stall multi-billion-dollar GPU systems and so the CPU has to be optimized for low-latency single-threaded code. Feynman for swarms. That sequence is not marketing. It is a falsifiable thesis about where the bottleneck moves next, and every other infrastructure company should be measuring themselves against it. If Huang is right that swarms of sub-agents are the next dominant pattern, then the design pressure shifts from raw flops to fabric topology, memory hierarchy and storage-to-GPU latency. That has implications for everyone downstream, including the hyperscalers building competing accelerators.

    The MFU section is the most intellectually generous moment in the talk. The instinct in the AI ops community has been to chase MFU as if it were a virtue. Huang argues, persuasively, that low MFU is consistent with high tokens per watt in a disaggregated inference setup, and that bottlenecks rotate fast enough that over-provisioning every resource is the rational design. That reframing matters because it changes what “scarce” means. Compute is not scarce in the way the discourse treats it. What is scarce is a coherent system designed end-to-end. The xAI 11 percent number, in that frame, is not embarrassing. It is the natural reading of a workload that is mostly decode.

    The Stanford segment is the part most likely to be quoted out of context. “It’s Stanford’s fault” is a deliberately provocative line, but the underlying claim is correct and load-bearing. Compute is not gated by NVIDIA refusing to ship chips. It is gated by the fact that fragmented grant funding cannot aggregate into the billion-dollar order that NVIDIA can fulfill. The implication is that universities and national labs need a structural change in how they pool capital for compute, and that the current model of every researcher buying a handful of cards is genuinely obsolete. Huang’s nudge about pulling a billion off the endowment is concrete enough to be acted on, and other major research universities should read this segment as a direct prompt.

    The geopolitical segment is the highest-stakes one. The telecommunications comparison is correct as a historical pattern, and Huang is one of the very few executives in a position to deliver that warning credibly. The unresolved tension is that the argument applies symmetrically. If American AI dominance is built by selling globally, that includes selling into adversarial states, and the policy question is where the line falls. Huang does not answer that question. He attacks the framing that lets the question be answered badly. That is a meaningful contribution to the discourse even if it does not resolve the underlying tradeoff.

    The career advice section is the part the social-media clips will mishandle. “Seek suffering” reads as macho when extracted. In context it is a specific operational claim about how resilience compounds, and it is paired with the Tegra story where Huang himself paid the price of not thinking one more click ahead. That kind of self-implication is rare in CEO talks, and it is the reason the talk is worth listening to in full rather than only reading the recap.

    Watch the full Stanford CS153 Frontier Systems conversation with Jensen Huang here.

  • Jensen Huang on Joe Rogan: AI’s Future, Nuclear Energy, and NVIDIA’s Near-Death Origin Story

    In a landmark episode of the Joe Rogan Experience (JRE #2422), NVIDIA CEO Jensen Huang sat down for a rare, deep-dive conversation covering everything from the granular history of the GPU to the philosophical implications of artificial general intelligence. Huang, currently the longest-running tech CEO in the world, offered a fascinating look behind the curtain of the world’s most valuable company.

    For those who don’t have three hours to spare, we’ve compiled the “Too Long; Didn’t Watch” breakdown, key takeaways, and a detailed summary of this historic conversation.

    TL;DW (Too Long; Didn’t Watch)

    • The OpenAI Connection: Jensen personally delivered the first AI supercomputer (DGX-1) to Elon Musk and the OpenAI team in 2016, a pivotal moment that kickstarted the modern AI race.
    • The “Sega Moment”: NVIDIA almost went bankrupt in 1995. They were saved only because the CEO of Sega invested $5 million in them after Jensen admitted their technology was flawed and the contract needed to be broken.
    • Nuclear AI: Huang predicts that within the next decade, AI factories (data centers) will likely be powered by small, on-site nuclear reactors to handle immense energy demands.
    • Driven by Fear: Despite his success, Huang wakes up every morning with a “fear of failure” rather than a desire for success. He believes this anxiety is essential for survival in the tech industry.
    • The Immigrant Hustle: Huang’s childhood involved moving from Thailand to a reform school in rural Kentucky where he cleaned toilets and smoked cigarettes at age nine to fit in.

    Key Takeaways

    1. AI as a “Universal Function Approximator”

    Huang provided one of the most lucid non-technical explanations of deep learning to date. He described AI not just as a chatbot, but as a “universal function approximator.” While traditional software requires humans to write the function (input -> code -> output), AI flips this. You give it the input and the desired output, and the neural network figures out the function in the middle. This allows computers to solve problems for which humans cannot write the code, such as curing diseases or solving complex physics.

    2. The Future of Work and Energy

    The conversation touched heavily on resources. Huang noted that we are in a transition from “Moore’s Law” (doubling performance) to “Huang’s Law” (accelerated computing), where the cost of computing drops while energy efficiency skyrockets. However, the sheer scale of AI requires massive power. He envisions a future of “energy abundance” driven by nuclear power, which will support the massive “AI factories” of the future.

    3. Safety Through “Smartness”

    Addressing Rogan’s concerns about AI safety and rogue sentience, Huang argued that “smarter is safer.” He compared AI to cars: a 1,000-horsepower car is safer than a Model T because the technology is channeled into braking, handling, and safety systems. Similarly, future computing power will be channeled into “reflection” and “fact-checking” before an AI gives an answer, reducing hallucinations and danger.

    Detailed Summary

    The Origin of the AI Boom

    The interview began with a look back at the relationship between NVIDIA and Elon Musk. In 2016, NVIDIA spent billions developing the DGX-1 supercomputer. At the time, no one understood it or wanted to buy it—except Musk. Jensen personally delivered the first unit to a small office in San Francisco where the OpenAI team (including Ilya Sutskever) was working. That hardware trained the early models that eventually became ChatGPT.

    The “Struggle” and the Sega Pivot

    Perhaps the most compelling part of the interview was Huang’s recounting of NVIDIA’s early days. In 1995, NVIDIA was building 3D graphics chips using “forward texture mapping” and curved surfaces—a strategy that turned out to be technically wrong compared to the industry standard. Facing bankruptcy, Huang had to tell his only major partner, Sega, that NVIDIA could not complete their console contract.

    In a move that saved the company, the CEO of Sega, who liked Jensen personally, agreed to invest the remaining $5 million of their contract into NVIDIA anyway. Jensen used that money to pivot, buying an emulator to test a new chip architecture (RIVA 128) that eventually revolutionized PC gaming. Huang admits that without that act of kindness and luck, NVIDIA would not exist today.

    From Kentucky to Silicon Valley

    Huang shared his “American Dream” story. Born in Taiwan and raised in Thailand, his parents sent him and his brother to the U.S. for safety during civil unrest. Due to a misunderstanding, they were enrolled in the Oneida Baptist Institute in Kentucky, which turned out to be a reform school for troubled youth. Huang described a rough upbringing where he was the youngest student, his roommate was a 17-year-old recovering from a knife fight, and he was responsible for cleaning the dorm toilets. He credits these hardships with giving him a high tolerance for pain and suffering—traits he says are required for entrepreneurship.

    The Philosophy of Leadership

    When asked how he stays motivated as the head of a trillion-dollar company, Huang gave a surprising answer: “I have a greater drive from not wanting to fail than the drive of wanting to succeed.” He described living in a constant state of “low-grade anxiety” that the company is 30 days away from going out of business. This paranoia, he argues, keeps the company honest, grounded, and agile enough to “surf the waves” of technological chaos.

    Some Thoughts

    What stands out most in this interview is the lack of “tech messiah” complex often seen in Silicon Valley. Jensen Huang does not present himself as a visionary who saw it all coming. Instead, he presents himself as a survivor—someone who was wrong about technology multiple times, who was saved by the grace of a Japanese executive, and who lucked into the AI boom because researchers happened to buy NVIDIA gaming cards to train neural networks.

    This humility, combined with the technical depth of how NVIDIA is re-architecting the world’s computing infrastructure, makes this one of the most essential JRE episodes for understanding where the future is heading. It serves as a reminder that the “overnight success” of AI is actually the result of 30 years of near-failures, pivots, and relentless problem-solving.

  • Steve Jurvetson On the “Most Important Graph Ever Conceived”

    Steve Jurvetson, the renowned venture capitalist behind early investments in SpaceX, Tesla, and Hotmail, has unveiled a groundbreaking perspective on computational advancements through what he calls “the most important graph ever conceived.” In a recent post on X, Jurvetson laid out a comprehensive timeline of over a century of exponential growth in computational power, underpinned by Moore’s Law.

    The Century-Long Impact of Moore’s Law

    Moore’s Law, first articulated by Intel co-founder Gordon Moore in 1965, predicts a steady doubling of transistor density in integrated circuits roughly every two years. However, Jurvetson emphasizes that its true significance lies in the exponential decline in computational costs, which has transformed nearly every sector of the economy.

    His meticulously crafted graph traces the evolution of computation, from mechanical calculators to relay-based systems, vacuum tubes, transistors, and finally integrated circuits. It reveals a staggering 1,000,000,000,000,000,000,000x improvement in computational power per dollar over the last 128 years.

    Technological Transitions: From GPUs to ASICs

    Jurvetson highlights the recent shift in computational leadership from GPUs to ASICs (application-specific integrated circuits). He notes that NVIDIA’s Hopper architecture exemplifies this transition, blending GPU performance with ASIC-like efficiency optimized for AI models.

    He predicts that the next frontier will feature custom ASIC chips and analog in-memory compute technologies, which mimic the human brain’s architecture and promise transformative advancements in AI capabilities.

    Moore’s Law: Still Relevant for the Next Two Decades

    Despite skepticism about its longevity, Jurvetson asserts that Moore’s Law will persist for at least another 20 years. This continued trajectory will enable breakthroughs across industries, from biotechnology to autonomous systems. He states, “Every industry on our planet is going to become an information business,” highlighting how advances in computational power will redefine traditional sectors like agriculture, manufacturing, and healthcare.

    Why This Graph Matters

    Jurvetson’s analysis underscores the profound economic and societal impact of computational progress. He argues that Moore’s Law is not merely a measure of transistor density but a force driving exponential growth in global innovation. As industries increasingly rely on simulations over trial-and-error experimentation, the pace of discovery and market disruption accelerates.

    He states, “Technology’s exponential pace of progress has been the primary juggernaut of perpetual market disruption, spawning wave after wave of opportunities for new companies. Without disruption, entrepreneurs would not exist.”

    A Future Defined by Information

    In a world where computational costs continue to plummet, Jurvetson envisions a future where data drives every aspect of life. He gives examples like satellite-powered precision farming and AI-optimized seeds to illustrate how agriculture—and every other industry—will transform into an information-centric enterprise.

    “Every industry,” Jurvetson says, “will eventually depend on how effectively it leverages information technology.”


    Steve Jurvetson’s insights into computational advancements reaffirm the enduring significance of Moore’s Law. His declaration that this graph represents “the most important graph ever conceived” reflects the transformative power of exponential growth in computation, which continues to redefine economies, industries, and the boundaries of human innovation.