Dwarkesh Patel · the podbrain notes ·
4 min read

Andrej Karpathy — AGI is still a decade away

This conversation features Andre Karpathy, former Director of AI at Tesla and co-founder of OpenAI, now building Eureka Labs to revolutionize technical education. Karpathy brings 15 years of AI research experience, having witnessed major paradigm shifts from early deep learning through the transformer revolution.

Dwarkesh Patel Dwarkesh Patel
Subscribe to Notes Upgrade
Dwarkesh Patel episode thumbnail: Andrej Karpathy — AGI is still a decade away
Dwarkesh Patel
Key Takeaways
  1. 01

    Andre Karpathy predicts this will be the 'decade of agents' not year, citing missing capabilities like continual learning and cognitive deficits that will take years to resolve

  2. 02

    Current LLMs suffer from 'model collapse' when trained on their own synthetic data, producing silently biased outputs that lack the entropy and diversity of human-generated content

  3. 03

    Reinforcement learning is fundamentally flawed because it's 'sucking supervision through a straw' - using sparse end rewards to judge entire complex trajectories

  4. 04

    The internet training data is 'total garbage' with most documents being 'slop' from random corners, requiring massive models to compress signal from noise

  5. 05

    Coding dominates AI API revenues because it's perfectly suited for text-based LLMs, unlike other domains that lack pre-built infrastructure for diffs and collaboration

  6. 06

    Self-driving took decades not years because it requires a 'march of nines' - each additional nine of reliability demands constant work, similar to production software deployment

  7. 07

    AI progress follows historical automation patterns rather than discrete jumps, likely maintaining 2% economic growth rather than explosive superintelligence scenarios

  8. 08

    Future AI education will resemble having a perfect Korean tutor who instantly understands your knowledge gaps and serves appropriately challenging material

Get the latest ideas from Dwarkesh Patel.

Plus the best new takeaways about artificial intelligence from other top podcasts — read in minutes, not hours.

or

By continuing, you agree to podbrain's Terms and Privacy Policy.

These notes may contain occasional inaccuracies. Learn how podbrain notes are made

This conversation features Andre Karpathy, former Director of AI at Tesla and co-founder of OpenAI, now building Eureka Labs to revolutionize technical education. Karpathy brings 15 years of AI research experience, having witnessed major paradigm shifts from early deep learning through the transformer revolution.

The discussion spans Karpathy's contrarian timeline predictions for AI agents, technical bottlenecks in current LLM architectures, and lessons from his recent NanoChat project - an 8,000-line repository demonstrating the complete ChatGPT training pipeline. Karpathy draws parallels between AI development and self-driving cars, arguing both face similar deployment challenges despite impressive demos.

The conversation explores fundamental questions about intelligence, learning, and automation, touching on evolutionary biology through Scale by Geoffrey West and Power, Sex, Suicide by Nick Lane. Karpathy concludes by outlining his vision for Eureka Labs as a 'Starfleet Academy' for technical education, leveraging AI tutors to create perfect learning experiences.

Why This Is the Decade, Not Year, of AI Agents

Karpathy coined 'decade of agents' as pushback against industry claims of imminent agent deployment, arguing current systems lack basic capabilities like continual learning and memory retention.

"When you boot them up and they have zero tokens in the window, they're always restarting from scratch" - Andre, explaining why current agents can't function as persistent employees or interns.

Drawing from 15 years in AI research, Karpathy has observed repeated cycles of premature agent attempts, from Atari reinforcement learning to OpenAI's Universe project, all failing due to insufficient foundational representations.

The Fundamental Problems with Current LLM Training

Model collapse occurs when LLMs train on their own synthetic data, producing outputs that appear reasonable individually but lack diversity - "if you ask ChatGPT to tell you a joke, it only has like three jokes."

Reinforcement learning is 'terrible' because it assigns credit to entire trajectories based on sparse end rewards: "every single thing you did along the way, every single token gets upweighted of like, do more of this" - Andre.

LLM judges for process supervision are easily gamed through adversarial examples, with models finding "nonsensical solutions that are obviously wrong, but the model thinks are amazing" when trained against them.

Internet training data quality is abysmal: "when you look at a random internet document, it's total garbage... it's some like... huge amount of slop and garbage from all the corners of the internet."

Why Coding Dominates AI Applications Despite General Capabilities

Programming represents the perfect fit for LLMs because "coding has always fundamentally been computer terminals and text, and everything is based around text" with extensive pre-built infrastructure.

Non-coding domains lack essential tooling: "slides don't have this pre-built infrastructure... if an agent is to make a different change to your slides, how does a thing show you the diff?"

Even language-heavy tasks like transcript editing or flashcard creation often fail despite being "dead center in the repertoire of these LLMs," suggesting fundamental limitations beyond surface capabilities.

API revenues are "dominated by coding" despite LLMs being positioned as general intelligence, revealing the gap between marketing claims and practical utility.

Lessons from Self-Driving: The March of Nines Problem

Self-driving demonstrates the "demo to product gap" where impressive demonstrations mask years of remaining work to achieve production reliability and economic viability.

"Every single nine is a constant amount of work" - the progression from 90% to 99% to 99.9% reliability each requires similar engineering effort, explaining decade-long deployment timelines.

Current Waymo deployments still involve "elaborate teleoperation centers of people actually kind of in a loop with these cars," suggesting full automation remains elusive despite appearances.

Software engineering faces similar challenges to self-driving in safety-critical domains: "any kind of mistake actually leads to a security vulnerability... millions of people's personal social security numbers get leaked."

Intelligence, Evolution, and the Path to AGI

Drawing insights from Scale by Geoffrey West, Karpathy emphasizes how physicists' first-order approximation thinking applies broadly: "heat dissipation grows as surface area (square) but heat generation grows as volume (cube)."

Evolution's intelligence breakthrough required specific environmental niches rewarding adaptability: "you actually want environments that are unpredictable so evolution can't bake your algorithms into your weights."

Current LLMs resemble "kindergarten or elementary school students" cognitively despite passing PhD-level tests, lacking the maturity needed for true autonomous operation.

Referencing Power, Sex, Suicide by Nick Lane, Karpathy notes that LLMs give better responses when provided full context rather than relying on compressed pre-training knowledge, similar to having fresh versus hazy human memories.

Building the Future of Technical Education

Eureka Labs aims to create "Starfleet Academy" - an elite institution for frontier technology education that adapts to AI-enhanced learning paradigms.

The gold standard is a perfect human tutor who "instantly understood where I am as a student, what I know and don't know" and serves appropriately challenging material - a capability current AI cannot match.

Education as "building ramps to knowledge" requires finding first-order terms and creating artifacts like NanoChat that maximize "eurekas per second" - understanding per unit time.

Post-AGI education will resemble going to the gym: "people will do it for the same reasons they go to gym... because it's fun, it's healthy, and you look hot when you have a six-pack."

Dwarkesh Patel
From Dwarkesh Patel. Get a note like this from every new episode.
Subscribe to Notes Upgrade

These notes may contain occasional inaccuracies. Learn how podbrain notes are made

0 / 0
Link copied