PJFP.com

Pursuit of Joy, Fulfillment, and Purpose

Tag: Energy efficiency

  • Jensen Huang at Stanford CS153 Frontier Systems on Co-Design, Agentic Computing, Vera Rubin, Open Models, and the Million-X Decade That Reshaped AI Infrastructure

    https://www.youtube.com/watch?v=tsQB0n0YV3k

    NVIDIA CEO Jensen Huang returned to Stanford for the CS153 Frontier Systems class (the room nicknamed itself “AI Coachella”) to lay out, in raw form, how he thinks about the computer being reinvented for the first time in over sixty years. Across roughly seventy minutes of student questions he walks through the codesign philosophy that gave NVIDIA a million-x decade, the architectural through-line from Hopper to Grace Blackwell to Vera Rubin to Feynman, the case for open source foundation models, the realities of tokens per watt and MFU, energy demand running a thousand times higher, the China and export-control debate, and his own biggest strategic mistakes. Watch the full conversation on YouTube.

    TLDW

    Huang argues every layer of computing has changed: the programming model, the system architecture, the deployment pattern, the economics. Co-design across CPUs, GPUs, networking, storage, switches and compilers gave NVIDIA roughly a million-x speed-up over ten years versus the ten-x Moore’s Law era, and that headroom is what let researchers say “just train on the whole internet.” Hopper was built for pre-training, Grace Blackwell NVLink72 for inference and reasoning (50x over Hopper in two years), Vera Rubin is built for agents that load long memory, call tools and need a low-latency single-threaded CPU bolted directly to the GPU, and Feynman extends that to swarms of agents that spawn sub-agents. Open weights matter because safety, sovereignty (230-plus languages no one else will fund) and domain models for biology, autonomy, robotics and climate need a foundation that NVIDIA is willing to seed. Compute is not really the scarce resource (Huang says place the order and the chips ship), the broken thing is institutional budgeting that can’t put a billion dollars into a shared university supercomputer. Energy demand is heading a thousand times higher and this is finally the moment market forces alone will fund sustainable generation. On geopolitics he rejects the GPUs-as-atomic-bombs framing and warns America will end up like its telecom industry if it cedes two thirds of the world. On career he advises seeking suffering on purpose. On strategy he says observe, reason from first principles, build a mental model, work backwards, minimize opportunity cost, maximize optionality.

    Key Takeaways

    • The computing model has been substantially unchanged since the IBM System 360, sixty-plus years ago. Huang’s first computer architecture book was the System 360 manual. AI is the first true reinvention.
    • Old computing was pre-recorded retrieval. New computing is generated, contextually aware and continuous. Cloud was on-demand. Agentic systems run continuously.
    • Codesign is NVIDIA’s central thesis. Inherited from the Hennessy and Patterson RISC era at Stanford, extended across CPUs, GPUs, networking, switches, storage, compilers and frameworks all optimized together.
    • The result of full-stack codesign: roughly 1,000,000x faster compute over ten years, versus a generous 10x to 100x for Moore’s Law in the same period. Dennard scaling effectively ended a decade ago.
    • That million-x speed-up is what unlocked “train on all of the internet” as a realistic AI strategy.
    • After GPT, Huang says it was obvious thinking was next. Reasoning is just generating tokens consumed internally, then using tools is generating tokens consumed externally. Agentic systems followed predictably.
    • Education needs AI baked into the curriculum, not just taught as a subject. Pre-recorded textbooks cannot keep pace with knowledge being generated in real time.
    • Huang says he cannot learn anymore without AI. He has the AI read the paper, then read every related paper, then become a dedicated researcher he can interrogate.
    • Mead and Conway and the first-principles methodology of semiconductor design are still worth learning even though most of the scaling tricks have been exhausted.
    • NVIDIA itself is one of the largest consumers of Anthropic and OpenAI tokens in the world. One hundred percent of NVIDIA engineers are now agentically supported. Huang recommends Claude and similar tools by name and says open-source downloads will not match the integrated product harness.
    • NVIDIA still invests heavily in open foundation models because language and intelligence represent the codification of human knowledge. Five pillars: Nemotron (language), BioNeMo (biology), Alphamayo (autonomous vehicles), Groot (humanoid robotics) and a climate science model (mesoscale multiphysics).
    • Sovereign language models matter. Roughly 230 world languages will never be a top priority for a commercial frontier lab. Nemotron is near-frontier and fully fine-tunable so any country can adapt it.
    • Safety and security require open weights. You cannot defend against or audit a black box. Transparent systems let researchers interrogate models and let defenders deploy swarms.
    • The future of cyber defense is not bigger-model-versus-bigger-model. It is trillions of cheap fast small models like Nemotron Nano surrounding the threat.
    • Domain models fuse language priors with world models. Alphamayo learned to drive safely on a few million miles instead of billions because it can reason like a human about the road.
    • MFU (Model Flops Utilization) is a misleading metric. Huang says he wants low MFU, because that means he over-provisioned every resource and never gets pinned by Amdahl’s law during a spike.
    • The xAI Memphis cluster running at 11 percent MFU is not necessarily a failure mode. In disaggregated prefill plus decode inference you can deliver very high tokens per watt with very low MFU.
    • The right metric is performance, ultimately tokens per watt as a proxy for intelligence per watt, and even that needs adjustment because not all tokens are equal. Coding tokens are worth more than other tokens.
    • Hopper was designed for pre-training. NVIDIA chose to build multi-billion-dollar systems when the largest existing scientific supercomputer cost $350 million, with no proven customer base. It worked.
    • Grace Blackwell NVLink72 was designed for inference, especially the high-memory-bandwidth decode phase. It is the world’s first rack-scale computer and delivered a 50x speed-up over Hopper in two years, against an expected 2x from Moore’s Law.
    • Vera Rubin is designed for agents. Long-term memory wired into storage and into the GPU fabric, working memory, heavy tool use, and Vera, a CPU optimized for low-latency multi-core single-threaded code so a multi-billion-dollar GPU system does not stall waiting on a slow tool call.
    • Feynman is being shaped for swarms of agents with sub-agents and sub-sub-agents, a recursive software topology that demands a new compute pattern.
    • Tokens per watt improved 50x in one generation. Compounding energy efficiency is the lever NVIDIA controls directly.
    • Total compute energy demand is heading roughly a thousand times higher than today, possibly two orders of magnitude beyond that. Huang says he would not be surprised if the estimate is low.
    • For the first time in history, market forces alone are enough to fund solar, nuclear and grid upgrades. Government subsidies are no longer required to make sustainable energy investment rational.
    • Copper interconnect is becoming a bottleneck. Photonics is moving from optional to structural inside racks and across them.
    • Comparing NVIDIA GPUs to atomic bombs, Huang says, is a stupid analogy. A billion people use NVIDIA GPUs. He advocates them to his family. He does not advocate atomic bombs to anyone.
    • If the United States cedes two thirds of the global market to competitors on policy grounds, the American technology industry will end up like American telecommunications, which was policied out of existence.
    • Huang directly rejects AI doom-by-singularity narratives. It is not true that we have no idea how these systems work. It is not true that the technology becomes infinitely powerful in a nanosecond. He calls the rhetoric irresponsible and harmful to the field students are about to enter.
    • On Stanford specifically: if the university president places an order, NVIDIA will deliver the chips. The bottleneck is that no university department has a billion-dollar compute budget because budgeting is fragmented across grants. Stanford’s $40 billion endowment is more than enough to fix that.
    • “It’s Stanford’s fault” is meant as empowerment. If something is your fault, you can solve it.
    • Career advice: do not optimize purely for passion. Most people do not yet know what they love. Pick the job in front of you and do it as well as possible. Even as CEO, Huang says, 90 percent of the work is hard and he suffers through it.
    • Suffering on purpose builds the muscle of resilience. When the company, the team or the family needs you to be tough, that muscle has to already exist.
    • NVIDIA’s first generation of products was technically wrong in nearly every dimension: curved surfaces instead of triangles, no Z-buffer, forward instead of inverse texture mapping, no floating point. The strategic recovery, not the technology, taught Huang the lessons that have lasted decades.
    • The biggest clean strategic mistake Huang names is the move into mobile chips (Tegra). It grew to a billion dollars then went to zero when Qualcomm’s modem dominance shut NVIDIA out of the 3G to 4G transition. The recovery into automotive and robotics (the Thor chip is the great great great grandson of that mobile lineage) was real, but Huang refuses to rationalize the original choice.
    • Forecasting framework: observe, reason from first principles, ask “so what” and “what next” until you have a mental model of the future, place your company inside that model, then work backwards while minimizing opportunity cost and maximizing optionality.
    • Best part of the CEO job: living at the intersection of vision, strategy and execution surrounded by people capable enough to make ambitious visions real. Worst part: the responsibility for everyone who joined the spaceship, especially in the near-death moments NVIDIA had four or five times early on.
    • Underrated insider note: Huang’s first apple pie with cheese, first hot fudge sandwich and first milkshake all happened at Denny’s. The Superbird, the fried chicken and a custom Superbird-style ham and cheese with tomato and mustard are his order.

    Detailed Summary

    Computing reinvented from the ground up

    Huang frames the moment as the first true rewrite of the computer in sixty-plus years. From the IBM System 360 forward, the mental model of writing code, running code, taking a computer to market and reasoning about applications stayed roughly constant. AI changes the programming model itself. Software is no longer a compiled binary running deterministically on a CPU. It is a neural network running on a GPU producing generated, contextual, real-time output. That cascades into how companies are organized, what tools developers use, what the network and storage stack look like, and what an application is even allowed to do. Robo-taxis, he notes, are an application no one would have attempted before deep learning unlocked perception.

    Codesign and the million-x decade

    Codesign is the philosophical center of the talk. Huang traces it to the RISC work of John Hennessy at Stanford, where simpler instruction sets won by being co-designed with the compiler rather than maximally optimized in isolation. NVIDIA extends the principle across every layer simultaneously: GPU architecture, CPU architecture, NVLink and NVSwitch fabrics, photonic interconnects, networking silicon, storage paths, CUDA libraries, frameworks and ultimately the model design. The numbers Huang gives are arresting. Moore’s Law in its prime delivered roughly 100x per decade. By the time Dennard scaling broke, real-world gains had compressed to roughly 10x. NVIDIA’s codesigned stack delivered between 100,000x and 1,000,000x over the same ten-year window. That non-linear speed-up is, in Huang’s telling, the precondition for modern AI: it is what allowed researchers to stop curating training sets and just feed the entire internet to the model.

    Education has to fuse first principles with AI tools

    Asked how curriculum should evolve, Huang argues AI must be integrated into the learning process, not just taught about. He recalls Hennessy writing his textbook by hand a chapter a week while Huang was a student, and says pre-recorded textbooks cannot keep up with the rate at which AI generates new knowledge. He describes his own learning workflow: hand the paper to an AI, then have it read the entire surrounding literature, then treat the AI as a dedicated researcher who can be interrogated. At the same time he defends the classics. Mead and Conway are still the foundation. Most modern semiconductor scaling tricks have been exhausted, but knowing where the field came from sharpens judgment when designing what comes next.

    Open source and the five domain pillars

    Huang gives one of the most detailed public accounts of why NVIDIA invests so heavily in open foundation models even while being a top customer of closed labs. He recommends Claude and OpenAI by name for production coding work, and says 100 percent of NVIDIA engineers are now agentically supported. The open-weights case rests on three legs. First, language is the codification of intelligence, and there are at least 230 languages that no commercial lab will ever prioritize. Nemotron is built near frontier and released so any country or community can fine-tune it. Second, the same representation-learning approach has to be replicated in domains where the data is not internet text, so NVIDIA seeded BioNeMo for biology, Alphamayo for autonomy, Groot for humanoid robotics and a climate model for mesoscale multiphysics. The economics of those fields would never produce a foundation model on their own. Third, safety and security require transparency. A black box cannot be defended or audited, and the future of cyber defense is not bigger-model-versus-bigger-model but swarms of cheap fast small models like Nemotron Nano surrounding the threat.

    MFU is the wrong metric, tokens per watt is closer

    A student raises the leaked memo that the xAI Memphis cluster is running at 11 percent Model Flops Utilization. Huang flips the framing. He says he would rather be at low MFU all the time, because that means he over-provisioned flops, memory bandwidth, memory capacity and network capacity. Bottlenecks shift constantly, so over-provisioning across every dimension is what lets the system absorb a spike without getting pinned by Amdahl’s law. In disaggregated inference, where prefill and decode are physically separated and decode is bandwidth-bound rather than flop-bound, NVLink72 can deliver extremely high tokens per watt while reporting very low MFU. Huang argues the right framing is performance, and ultimately tokens per watt as a rough proxy for intelligence per watt, adjusted for the fact that not all tokens are equal. A coding token is worth more than a generic token.

    Hopper, Grace Blackwell NVLink72, Vera Rubin, Feynman

    Huang gives the clearest public framing of NVIDIA’s roadmap as a sequence of architectural answers to evolving compute patterns. Hopper was built for pre-training, at a moment when NVIDIA chose to build multi-billion-dollar machines while the largest scientific supercomputer in the world cost $350 million and the marketplace for such systems was, on paper, zero. Grace Blackwell NVLink72 was the answer to inference and reasoning: a rack-scale computer that ganged 72 GPUs together because decode needs aggregate memory bandwidth far beyond a single chip. The generation-over-generation speed-up was 50x in two years, twenty-five times what Moore’s Law would have delivered. Vera Rubin is being built explicitly for agents. Agents load long-term memory from storage that has to be wired directly into the GPU fabric, they use working memory, they call tools that run on a CPU, and they wait. So the CPU has to be Vera, optimized for low-latency single-threaded code, because the multi-billion-dollar GPU system cannot afford to idle waiting on a slow tool call. Feynman extends the pattern to swarms of agents with sub-agents and sub-sub-agents, a recursive software topology that will demand its own compute pattern.

    Energy demand and the grid

    Huang’s energy projection is one of the most aggressive numbers in the talk. NVIDIA can compound tokens per watt by 50x per generation through codesign, but the total compute demand is heading roughly a thousand times higher, and Huang says he would not be surprised if the real figure is one or two orders of magnitude beyond that. The reason is structural: future computing is generative and continuous, not pre-recorded and on-demand. The good news, he argues, is that this is the best moment in the history of humanity to invest in sustainable generation. Market forces alone are now sufficient to fund solar, nuclear and grid upgrades. Government subsidies are no longer required to make the math work.

    Adversarial countries, export controls and the telecom warning

    This is the segment where Huang is visibly fired up. He attacks the GPUs-as-atomic-bombs framing on its face. NVIDIA GPUs power medical imaging, video games and soy sauce delivery. A billion people use them. He advocates them to his family. The analogy collapses at the first comparison. He attacks the second framing, that American companies should not compete abroad because they will lose anyway, as a self-fulfilling defeat. Competition makes the company better. The third framing, that depriving the rest of the world of general-purpose computing benefits the United States, also fails on first principles: it benefits one or two American companies at the cost of an entire industry. The cautionary parallel is telecommunications. The United States once had a leading position in telecom fundamental technology and policied itself out of it. Huang’s worry, voiced explicitly to a room of CS students, is that they will graduate into a shell of a computer industry if the same path is repeated.

    AI doom and rational optimism

    In the same arc Huang rejects the science-fiction framing of AI as a singularity that arrives suddenly on a Wednesday at 7pm and ends civilization. He calls those claims irresponsible, says they are not true, and points out that the people advancing them are believed by audiences who then make policy on that basis. It is not true that no one understands how these systems work. It is not true that intelligence becomes infinitely powerful instantaneously. It is not true that there is no defense. His framing, which the host echoes as “rational optimism,” is that the goal is to create a future where people care about computers because the technology students are learning is worth mastering.

    Stanford’s compute problem is Stanford’s fault

    A student presses on the scarcity of compute for independent researchers, startups and universities inside the United States. Huang’s answer is sharp: there is no shortage. Place the order and the chips will arrive. The actual broken thing is institutional. University grants are fragmented across departments. No researcher can raise enough on a single grant to fund a billion-dollar shared cluster, and no one shares. He compares it to showing up at the grocery store demanding a billion dollars of tomatoes today. The solution is planning, aggregation and a campus-scale supercomputer, the way Stanford once built the linear accelerator. The endowment is $40 billion. Pulling a billion off it, contracting cloud capacity and giving every student and researcher AI supercomputer access is, in Huang’s view, obviously doable. When he says “it is Stanford’s fault” the host laughs, but Huang clarifies: if it is your fault you have the power to fix it.

    Career, suffering and resilience

    Asked how a CS student should spend the next few years, Huang pushes back on the standard “follow your passion” advice. Most people do not know what they love yet, because no one knows what they do not know. The bar of demanding joy from every working day is too high. Whatever the job is, do it as well as you can. Even as CEO of NVIDIA he says he genuinely loves about 10 percent of his work. The other 90 percent is hard and he suffers through it. He recommends suffering on purpose, because resilience is a muscle that only builds under load, and when the company, the team or the family needs that muscle, it has to already exist. Earlier in his life that meant cleaning toilets and busing tables at Denny’s. He does it today running a multi-trillion-dollar company.

    The biggest mistakes

    Huang separates technical mistakes from strategic mistakes. NVIDIA’s first generation of products was technically wrong in almost every way: curved surfaces instead of triangles, no Z-buffer, forward instead of inverse texture mapping, no floating point inside. The company wasted two and a half years. But the strategic genius of the recovery, the reading of the market, the conservation of resources and the reapplication of talent, is what taught him strategy. The clean strategic mistake he names is mobile. NVIDIA’s Tegra line grew to a billion dollars of revenue and then collapsed to zero when Qualcomm’s modem dominance locked NVIDIA out of the 3G to 4G transition. Huang explicitly refuses the comforting rationalization that the Tegra effort fed the Thor automotive chip (“Thor is the great great great grandson”). The original decision, he says, was a waste of time. The lesson is to think one or two clicks further about whether a market is structurally winnable before committing the company.

    Forecasting under fog of war

    The final substantive exchange is on forecasting. Huang’s method has four steps. Observe what is actually happening (AlexNet crushing two decades of computer vision research in one shot, GPT producing reasoning by token generation). Reason from first principles about why it works. Ask “so what” and “what next” recursively until a mental model of the future emerges. Place the company inside that future and work backwards. Crucially, expect to be partly wrong. Some outcomes will absolutely happen, some will likely happen, some might happen, and the strategy has to be robust across that distribution. The real cost of any strategic choice is the opportunity cost of the alternatives you did not take, so the discipline is to minimize that cost and maximize optionality while letting the journey itself pay for the journey.

    Thoughts

    The most useful thing in this conversation is the explicit architectural mapping of compute patterns to chip generations. Hopper for pre-training. Grace Blackwell NVLink72 for inference, because decode is bandwidth-bound and a single chip cannot supply it. Vera Rubin for agents, because tool calls stall multi-billion-dollar GPU systems and so the CPU has to be optimized for low-latency single-threaded code. Feynman for swarms. That sequence is not marketing. It is a falsifiable thesis about where the bottleneck moves next, and every other infrastructure company should be measuring themselves against it. If Huang is right that swarms of sub-agents are the next dominant pattern, then the design pressure shifts from raw flops to fabric topology, memory hierarchy and storage-to-GPU latency. That has implications for everyone downstream, including the hyperscalers building competing accelerators.

    The MFU section is the most intellectually generous moment in the talk. The instinct in the AI ops community has been to chase MFU as if it were a virtue. Huang argues, persuasively, that low MFU is consistent with high tokens per watt in a disaggregated inference setup, and that bottlenecks rotate fast enough that over-provisioning every resource is the rational design. That reframing matters because it changes what “scarce” means. Compute is not scarce in the way the discourse treats it. What is scarce is a coherent system designed end-to-end. The xAI 11 percent number, in that frame, is not embarrassing. It is the natural reading of a workload that is mostly decode.

    The Stanford segment is the part most likely to be quoted out of context. “It’s Stanford’s fault” is a deliberately provocative line, but the underlying claim is correct and load-bearing. Compute is not gated by NVIDIA refusing to ship chips. It is gated by the fact that fragmented grant funding cannot aggregate into the billion-dollar order that NVIDIA can fulfill. The implication is that universities and national labs need a structural change in how they pool capital for compute, and that the current model of every researcher buying a handful of cards is genuinely obsolete. Huang’s nudge about pulling a billion off the endowment is concrete enough to be acted on, and other major research universities should read this segment as a direct prompt.

    The geopolitical segment is the highest-stakes one. The telecommunications comparison is correct as a historical pattern, and Huang is one of the very few executives in a position to deliver that warning credibly. The unresolved tension is that the argument applies symmetrically. If American AI dominance is built by selling globally, that includes selling into adversarial states, and the policy question is where the line falls. Huang does not answer that question. He attacks the framing that lets the question be answered badly. That is a meaningful contribution to the discourse even if it does not resolve the underlying tradeoff.

    The career advice section is the part the social-media clips will mishandle. “Seek suffering” reads as macho when extracted. In context it is a specific operational claim about how resilience compounds, and it is paired with the Tegra story where Huang himself paid the price of not thinking one more click ahead. That kind of self-implication is rare in CEO talks, and it is the reason the talk is worth listening to in full rather than only reading the recap.

    Watch the full Stanford CS153 Frontier Systems conversation with Jensen Huang here.

  • The AI Revolution Unveiled: Jonathan Ross on Groq, NVIDIA, and the Future of Inference


    TL;DR

    Jonathan Ross, Groq’s CEO, predicts inference will eclipse training in AI’s future, with Groq’s Language Processing Units (LPUs) outpacing NVIDIA’s GPUs in cost and efficiency. He envisions synthetic data breaking scaling limits, a $1.5 billion Saudi revenue deal fueling Groq’s growth, and AI unlocking human potential through prompt engineering, though he warns of an overabundance trap.

    Detailed Summary

    In a captivating 20VC episode with Harry Stebbings, Jonathan Ross, the mastermind behind Groq and Google’s original Tensor Processing Unit (TPU), outlines a transformative vision for AI. Ross asserts that inference—deploying AI models in real-world scenarios—will soon overshadow training, challenging NVIDIA’s GPU stronghold. Groq’s LPUs, engineered for affordable, high-volume inference, deliver over five times the cost efficiency and three times the energy savings of NVIDIA’s training-focused GPUs by avoiding external memory like HBM. He champions synthetic data from advanced models as a breakthrough, dismantling scaling law barriers and redirecting focus to compute, data, and algorithmic bottlenecks.

    Groq’s explosive growth—from 640 chips in early 2024 to over 40,000 by year-end, aiming for 2 million in 2025—is propelled by a $1.5 billion Saudi revenue deal, not a funding round. Partners like Aramco fund the capital expenditure, sharing profits after a set return, liberating Groq from financial limits. Ross targets NVIDIA’s 40% inference revenue as a weak spot, cautions against a data center investment bubble driven by hyperscaler exaggeration, and foresees AI value concentrating among giants via a power law—yet Groq plans to join them by addressing unmet demands. Reflecting on Groq’s near-failure, salvaged by “Grok Bonds,” he dreams of AI enhancing human agency, potentially empowering 1.4 billion Africans through prompt engineering, while urging vigilance against settling for “good enough” in an abundant future.

    The Big Questions Raised—and Answered

    Ross’s insights provoke profound metaphorical questions about AI’s trajectory and humanity’s role. Here’s what the discussion implicitly asks, paired with his responses:

    • What happens when creation becomes so easy it redefines who gets to create?
      • Answer: Ross champions prompt engineering as a revolutionary force, turning speech into a tool that could unleash 1.4 billion African entrepreneurs. By making creation as simple as talking, AI could shift power from tech gatekeepers to the masses, sparking a global wave of innovation.
    • Can an underdog outrun a titan in a scale-driven game?
      • Answer: Groq can outpace NVIDIA, Ross asserts, by targeting inference—a massive, underserved market—rather than battling over training. With no HBM bottlenecks and a scalable Saudi-backed model, Groq’s agility could topple NVIDIA’s inference share, proving size isn’t everything.
    • What’s the human cost when machines replace our effort?
      • Answer: Ross likens LPUs to tireless employees, predicting a shift from labor to compute-driven economics. Yet, he warns of “financial diabetes”—a loss of drive in an AI-abundant world—urging us to preserve agency lest we become passive consumers of convenience.
    • Is the AI gold rush a promise or a pipe dream?
      • Answer: It’s both. Ross foresees billions wasted on overhyped data centers and “AI t-shirts,” but insists the total value created will outstrip losses. The winners, like Groq, will solve real problems, not chase fleeting trends.
    • How do we keep innovation’s spirit alive amid efficiency’s rise?
      • Answer: By prioritizing human agency and delegation—Ross’s “anti-founder mode”—over micromanagement, he says. Groq’s 25 million token-per-second coin aligns teams to innovate, not just optimize, ensuring efficiency amplifies creativity.
    • What’s the price of chasing a future that might not materialize?
      • Answer: Seven years of struggle taught Ross the emotional and financial toll is steep—Groq nearly died—but strategic bets (like inference) pay off when the wave hits. Resilience turns risk into reward.
    • Will AI’s pursuit drown us in wasted ambition?
      • Answer: Partially, yes—Ross cites VC’s “Keynesian Beauty Contest,” where cash floods copycats. But hyperscalers and problem-solvers like Groq will rise above the noise, turning ambition into tangible progress.
    • Can abundance liberate us without trapping us in ease?
      • Answer: Ross fears AI could erode striving, drawing from his boom-bust childhood. Prompt engineering offers liberation—empowering billions—but only if outliers reject “good enough” and push for excellence.

    Jonathan Ross’s vision is a clarion call: AI’s future isn’t just about faster chips or bigger models—it’s about who wields the tools and how they shape us. Groq’s battle with NVIDIA isn’t merely corporate; it’s a referendum on whether innovation can stay human-centric in an age of machine abundance. As Ross puts it, “Your job is to get positioned for the wave”—and he’s riding it, challenging us to paddle alongside or risk being left ashore.

  • The Untapped Potential of LNG Cold Energy: A Chilling Opportunity


    tl;dw

    Asianometry’s video discusses the underutilized “cold energy” produced during LNG regasification (the process of turning liquefied natural gas back into its gaseous state). This cold energy, usually wasted, has potential applications in power generation (using methods like the Rankine cycle), industrial processes (like air separation and carbon capture), desalination, and even cooling data centers. The video highlights examples of countries already using this technology and emphasizes the vast potential of this currently wasted resource as the LNG industry grows.


    The video from Asianometry explores the potential of LNG (Liquefied Natural Gas) cold energy.

    • LNG Transportation: LNG is transported in liquid form, requiring significant energy to cool the gas and then reheat it (regasification) at the destination.
    • Cold Energy as a Byproduct: The regasification process generates a large amount of cold energy, which is often wasted.
    • Potential Applications: The video discusses various applications for this cold energy, including:
      • Power Generation: Using the temperature difference to drive turbines and generate electricity.
      • Industrial Processes: Improving the efficiency of air separation and carbon capture.
      • Desalination: Enhancing desalination processes by using the cold energy to cool the system.
    • Examples: The video highlights examples of countries like Japan and Thailand that are already utilizing LNG cold energy for industrial purposes, such as air separation.

    The video concludes by emphasizing the significant potential of LNG cold energy as a valuable resource and encourages further exploration of its applications to improve energy efficiency and sustainability.


    The global demand for energy is constantly increasing, driving the search for more efficient and sustainable solutions. While Liquefied Natural Gas (LNG) has emerged as a crucial energy source, a significant byproduct of its processing – cold energy – remains largely untapped. This article delves into the potential of LNG cold energy, exploring its origins, promising applications, and the challenges that need to be addressed to fully realize its potential.

    The Rise of LNG and the Cold Energy Byproduct:

    Natural gas, a relatively clean-burning fossil fuel, plays a vital role in the global energy mix. However, transporting natural gas over long distances via pipelines can be economically challenging. LNG provides a solution by cooling natural gas to approximately -162°C (-260°F), condensing it into a liquid that occupies 600 times less volume. This liquefaction process enables efficient transportation by specialized tankers across oceans.

    Upon arrival at import terminals, LNG undergoes regasification, the process of converting it back into its gaseous state for distribution through pipelines. This regasification requires adding heat to the extremely cold LNG, resulting in a significant amount of “cold energy” – a temperature differential between the LNG and the surrounding environment. This cold energy, often around -15°C (5°F), is typically dissipated as waste, representing a substantial loss of potential energy.

    Understanding the Regasification Process:

    The regasification process relies on specialized equipment called vaporizers. Several types exist, each with its own advantages and disadvantages:

    • Direct Fired Vaporizers: An older technology using burners to directly heat the LNG. These are less common today due to corrosion and efficiency concerns.
    • Submerged Combustion Vaporizers (SCVs): These vaporizers pass cold LNG through pipes submerged in hot water heated by submerged combustion. While widely used, particularly in the United States, SCVs can suffer from corrosion caused by acidic byproducts of combustion.
    • Open Rack Vaporizers (ORVs): ORVs utilize the temperature difference between LNG and warmer seawater. LNG flows through pipes exposed to seawater, facilitating heat exchange. This method is highly efficient where suitable seawater temperatures are available.

    Regardless of the method used, the regasification process inevitably generates a significant amount of cold energy.

    Unlocking the Potential: Applications of LNG Cold Energy:

    The potential applications of LNG cold energy are diverse and offer significant opportunities for energy efficiency and sustainability:

    • Power Generation: Utilizing the temperature differential between the cold LNG and the ambient environment can drive power generation systems.
      • Direct Expansion: This method directly uses the pressure change during regasification to drive a turbine and generate electricity.
      • Organic Rankine Cycle (ORC): ORC systems employ a working fluid with a low boiling point. The cold LNG cools the working fluid, creating a temperature gradient that drives a turbine and generates power. Cascading ORC systems can further enhance efficiency.
    • Industrial Applications:
      • Air Separation: The cryogenic temperatures of LNG can significantly reduce the energy required for separating air into its constituent components, such as nitrogen and oxygen, valuable for various industries.
      • Cryogenic Carbon Capture: Cooling flue gas from industrial processes to very low temperatures can facilitate the separation and capture of CO2, mitigating greenhouse gas emissions.
    • Desalination:
      • Thermal Desalination Enhancement: Integrating LNG cold energy into thermal desalination processes, such as Multi-Stage Flash (MSF), can improve efficiency by cooling the condensing steam.
      • Freezing Desalination: This method uses cold energy to freeze seawater into an ice slurry, separating the ice (freshwater) from the brine. While technically challenging, it offers the potential for high energy efficiency.
    • Data Center Cooling: Data centers consume vast amounts of energy for cooling. Utilizing LNG cold energy can provide a sustainable and efficient cooling solution, reducing their environmental impact.
    • Cold Storage and Food Preservation: The cold energy can be directly used for cooling warehouses, cold storage facilities, and other applications requiring low temperatures, such as food preservation and pharmaceutical storage.

    Challenges and Future Outlook:

    Despite the significant potential, several challenges hinder the widespread adoption of LNG cold energy utilization:

    • Location Constraints: LNG import terminals are often located far from potential end-users of the cold energy, requiring infrastructure for transport.
    • Economic Viability: The capital costs associated with implementing cold energy utilization technologies need to be carefully evaluated against the potential energy savings.
    • Matching Supply and Demand: The continuous availability of cold energy from regasification needs to be matched with consistent demand for its applications.

    However, growing awareness of energy efficiency and sustainability is driving increased interest in LNG cold energy utilization. Technological advancements, policy support, and innovative business models are paving the way for greater adoption of these technologies.

    Overlooked

    LNG cold energy represents a significant, yet often overlooked, opportunity to improve energy efficiency and sustainability. By strategically implementing various applications, we can transform this waste stream into a valuable resource, contributing to a cleaner and more sustainable energy future. As the LNG industry continues to grow, so too does the potential for harnessing this chilling opportunity.

  • Unmasking the Double Standards: Environmentalists’ Contradictory Stance on Bitcoin and Electric Cars

    Unmasking the Double Standards: Environmentalists' Contradictory Stance on Bitcoin and Electric Cars

    In recent years, the focus on climate change and its potential consequences has grown exponentially. With this increase in attention has come a wave of environmental activism, with many supporters advocating for sustainable technology and reduced carbon emissions. However, some environmentalists have been accused of hypocrisy for their seemingly contradictory views on various technologies, specifically Bitcoin and electric cars. This article will explore the reasons behind this criticism and examine the environmental impact of both technologies.

    The Environmental Impact of Bitcoin

    Bitcoin, a digital cryptocurrency, has come under fire from environmentalists due to its significant energy consumption. The process of mining Bitcoin, which involves solving complex mathematical problems to validate transactions and create new coins, requires massive amounts of computing power. This power demand has led to the consumption of vast amounts of electricity, with some estimates suggesting that Bitcoin’s total energy usage rivals that of entire countries.

    Critics argue that this energy consumption contributes to increased greenhouse gas emissions, exacerbating climate change. Additionally, many Bitcoin mining operations rely on non-renewable energy sources such as coal, further contributing to pollution and environmental degradation.

    The Environmental Benefits of Electric Cars

    In contrast, electric vehicles (EVs) are often hailed as a green alternative to traditional internal combustion engine vehicles. By replacing fossil fuel-powered cars with electric ones, environmentalists argue that we can significantly reduce transportation-related greenhouse gas emissions, which account for a significant portion of global emissions.

    EVs also have the potential to run on renewable energy sources, such as solar or wind power, further reducing their environmental impact. Additionally, electric cars are generally more energy-efficient than their gasoline-powered counterparts, requiring less energy to travel the same distance.

    The Hypocrisy Argument

    Given the environmental concerns associated with Bitcoin, it’s not surprising that many environmentalists oppose its widespread adoption. However, some critics argue that this opposition is hypocritical when considering the support for electric vehicles, which also have an environmental impact.

    While it is true that EVs have a lower overall carbon footprint than traditional cars, they are not entirely devoid of environmental concerns. For example, the production of batteries for electric vehicles requires the extraction of minerals like lithium and cobalt, which can have significant environmental and social consequences.

    Furthermore, the electricity used to power electric cars often comes from non-renewable sources like coal and natural gas, which contribute to greenhouse gas emissions. Although EVs can be powered by renewable energy, this is not always the case, and critics argue that environmentalists should be more consistent in their evaluation of the environmental impacts of various technologies.

    While there is no denying that both Bitcoin and electric vehicles have environmental implications, it is essential to recognize that the impacts of these technologies are not equal. Electric cars offer a more sustainable alternative to traditional vehicles, while the environmental concerns surrounding Bitcoin are harder to justify.

    However, critics do raise a valid point in calling for consistency in evaluating the environmental impact of different technologies. Environmentalists must strive to apply the same scrutiny to all technologies and consider the broader context in which they operate. Only then can we work towards a truly sustainable future.

  • Unearthing Bitcoin’s Green Potential: A Sustainable Cryptocurrency Future

    Unearthing Bitcoin's Green Potential: A Sustainable Cryptocurrency Future

    Contrary to popular belief, Bitcoin is not an environmental disaster but rather holds untapped potential for a sustainable future. By analyzing its energy consumption, decentralized nature, and innovative technologies, we can see how Bitcoin can contribute positively to our planet. In this essay, we will explore the green potential of Bitcoin and debunk the common misconceptions surrounding its environmental impact.

    Energy Efficiency: Bitcoin mining, the process of validating transactions and adding them to the blockchain, has been criticized for its high energy consumption. However, it is essential to acknowledge that a substantial portion of this energy comes from renewable sources. A 2021 study found that around 39% of the total energy used in Bitcoin mining came from renewables, a number that has been steadily increasing. Furthermore, mining centers are often located in regions with abundant renewable energy resources, taking advantage of low-cost electricity and minimizing their carbon footprint.

    Decentralization and Reduced Resource Waste: Unlike traditional centralized financial systems, Bitcoin operates on a decentralized, peer-to-peer network. This decentralization reduces the need for physical infrastructure and the environmental impact associated with building and maintaining bank branches, ATMs, and payment processing centers. Additionally, Bitcoin’s digital nature eliminates the need for paper-based transactions, such as printing banknotes and checks, leading to a reduction in paper waste and deforestation.

    Incentivizing Renewable Energy Development: The demand for energy-efficient and cost-effective mining practices has led to a surge in renewable energy innovations. Bitcoin miners, driven by profit motives, are more inclined to use renewable energy sources due to their lower costs. This fosters the development of renewable energy projects and encourages further investment in green technologies.

    E-Waste Reduction: Critics often highlight the electronic waste generated by discarded mining equipment. However, the growth of specialized mining hardware has resulted in more energy-efficient devices with longer lifespans. Moreover, the recycling and repurposing of old mining equipment can significantly reduce e-waste, promoting a circular economy in the technology sector.

    Conclusion: Though Bitcoin has faced criticism for its environmental impact, it is essential to recognize its potential for promoting a sustainable future. By leveraging renewable energy sources, reducing resource waste, and incentivizing green technological innovation, Bitcoin can contribute to our global efforts in combating climate change. As we move towards a more environmentally conscious world, embracing Bitcoin’s green potential will play a crucial role in creating a sustainable financial ecosystem.

    Topics for Further Exploration:

    1. Comparing the environmental impact of Bitcoin to traditional financial systems.
    2. The role of government policies in promoting sustainable cryptocurrency mining practices.
    3. Assessing the potential of other cryptocurrencies with eco-friendly features.
    4. Exploring the connection between blockchain technology and sustainable development goals.
    5. Investigating the potential of recycling and repurposing e-waste from cryptocurrency mining.
  • Nuclear Fusion and Artificial Intelligence: How These Technologies Could Nearly Eliminate Energy Costs by 2050

    Nuclear fusion has the potential to be a nearly limitless and clean source of energy, and there have been significant advancements in the field in recent years. Many experts believe that fusion could be a viable source of electricity within the next few decades, and some even predict that it could be nearly free by 2050.

    One of the main challenges in achieving practical nuclear fusion is finding a way to sustain the high temperatures and pressures required for the reaction to occur. This requires developing materials that can withstand the extreme conditions and finding a way to confine and control the plasma, which is the hot, ionized gas that fuels the fusion reaction.

    There are several approaches to achieving nuclear fusion, including magnetic confinement, inertial confinement, and laser-based methods. Each of these approaches has its own set of challenges, but significant progress has been made in recent years in developing materials and techniques to overcome these challenges.

    One promising approach is the use of high-temperature superconductors, which can be used to create powerful magnets that can confine and control the plasma. These superconductors have the potential to significantly improve the efficiency and stability of fusion reactions, making them a more viable option for practical use.

    Another key factor in achieving practical fusion is the development of advanced computing and artificial intelligence (AI) technologies. These technologies can be used to optimize the design and operation of fusion reactors, as well as to predict and mitigate potential problems.

    There are already several major projects underway to develop fusion energy, including the International Thermonuclear Experimental Reactor (ITER), which is a joint project involving 35 countries. ITER is expected to be operational by the 2030s, and many experts believe that it could be a major step towards achieving practical fusion energy.

    While there are still many challenges to overcome, the potential for nearly limitless, clean, and cheap energy from nuclear fusion is very real. With continued research and development, it is possible that fusion could be a nearly free source of energy by 2050, potentially revolutionizing the way we produce and use energy.