Skip to main content
Dwarkesh Podcast

Some thoughts on the Sutton interview

11 min episode · 2 min read
·

Episode

11 min

Read time

2 min

AI-Generated Summary

Key Takeaways

  • Compute efficiency critique: LLMs spend most compute during deployment without learning anything, only learning during training on tens of thousands of years of human experience data inefficiently.
  • Imitation learning as foundation: Pretrained LLMs serve as essential priors for reinforcement learning, similar to how AlphaGo used human games before AlphaZero bootstrapped from scratch to superhuman performance.
  • Continual learning gap: Current LLMs learn approximately one bit per episode of tens of thousands of tokens during RL, while animals extract maximum signal continuously from environmental observations.

What It Covers

Dwarkesh reflects on Richard Sutton's perspective that current LLMs waste compute during deployment without learning, requiring new architectures for continual learning and true intelligence.

Key Questions Answered

  • Compute efficiency critique: LLMs spend most compute during deployment without learning anything, only learning during training on tens of thousands of years of human experience data inefficiently.
  • Imitation learning as foundation: Pretrained LLMs serve as essential priors for reinforcement learning, similar to how AlphaGo used human games before AlphaZero bootstrapped from scratch to superhuman performance.
  • Continual learning gap: Current LLMs learn approximately one bit per episode of tens of thousands of tokens during RL, while animals extract maximum signal continuously from environmental observations.

Notable Moment

Dwarkesh compares pretraining data to fossil fuels as non-renewable but essential intermediaries, arguing civilization needed them to reach solar panels despite not being the final solution.

Know someone who'd find this useful?

You just read a 3-minute summary of a 8-minute episode.

Get Dwarkesh Podcast summarized like this every Monday — plus up to 2 more podcasts, free.

Pick Your Podcasts — Free

Keep Reading

More from Dwarkesh Podcast

We summarize every new episode. Want them in your inbox?

Similar Episodes

Related episodes from other podcasts

You're clearly into Dwarkesh Podcast.

Every Monday, we deliver AI summaries of the latest episodes from Dwarkesh Podcast and 192+ other podcasts. Free for up to 3 shows.

Start My Monday Digest

No credit card · Unsubscribe anytime