#330 Sebastian Risi: Why AI Should Be Grown, Not Trained
Episode
60 min
Read time
3 min
Topics
Artificial Intelligence
AI-Generated Summary
Key Takeaways
- ✓Neuroevolution vs. Gradient Descent: Rather than following a single downhill slope toward a solution, neuroevolution deploys a population of candidates across the entire search landscape simultaneously, using variation and selection to find solutions. This approach navigates non-differentiable, jagged problem spaces where backpropagation fails — making it applicable to discrete actions, novel architectures, and hyperparameter search without requiring smooth mathematical functions.
- ✓Hebbian Plasticity for Damage Resilience: Networks trained with local Hebbian learning rules — where connection strength updates based on how often two neurons fire together — continuously rewire during operation rather than freezing weights after training. In robotics experiments, quadruped controllers using these plastic networks maintained locomotion after leg removal, a scenario never seen during training, because the network self-organized in real time using only evolved local rules.
- ✓Growing Networks from a Single Neuron: Risi's Neural Developmental Program embeds a small recurrent network inside every neuron, allowing the overall network to grow from one node to thousands by having neurons communicate locally and decide when to spawn new nodes or modify connections. Tested on robotic tasks and a small MNIST variant, networks reached several thousand nodes — orders of magnitude smaller than current models, but the architecture scales without predefined structure.
- ✓Evolutionary Model Merging for Capability Combination: Sakana AI's evolutionary model merging uses evolutionary search to identify which layers from separate pre-trained models to combine, producing a merged model that inherits capabilities from both parents. In published experiments, a Japanese-language model and a math-specialized model were merged to produce a single model proficient in both domains — without any additional gradient-based training on combined data.
- ✓LLMs as Mutation Operators for Scientific Search: Replacing hand-coded genetic operators with large language models creates a powerful hybrid search system. The LLM generates candidate solutions or code variants, evolutionary selection scores them by fitness, and the best candidates seed the next generation. Applied to circle-packing optimization and early AI Scientist experiments — including a workshop-accepted paper — this loop improves automatically as the underlying language model improves.
What It Covers
Sebastian Risi, researcher at Sakana AI and author of *Neuroevolution*, explains why evolutionary algorithms offer a fundamentally different path to AI than gradient descent — covering plastic neural networks that rewire during operation, networks that grow from a single neuron, and how combining large language models with evolutionary search could automate scientific discovery.
Key Questions Answered
- •Neuroevolution vs. Gradient Descent: Rather than following a single downhill slope toward a solution, neuroevolution deploys a population of candidates across the entire search landscape simultaneously, using variation and selection to find solutions. This approach navigates non-differentiable, jagged problem spaces where backpropagation fails — making it applicable to discrete actions, novel architectures, and hyperparameter search without requiring smooth mathematical functions.
- •Hebbian Plasticity for Damage Resilience: Networks trained with local Hebbian learning rules — where connection strength updates based on how often two neurons fire together — continuously rewire during operation rather than freezing weights after training. In robotics experiments, quadruped controllers using these plastic networks maintained locomotion after leg removal, a scenario never seen during training, because the network self-organized in real time using only evolved local rules.
- •Growing Networks from a Single Neuron: Risi's Neural Developmental Program embeds a small recurrent network inside every neuron, allowing the overall network to grow from one node to thousands by having neurons communicate locally and decide when to spawn new nodes or modify connections. Tested on robotic tasks and a small MNIST variant, networks reached several thousand nodes — orders of magnitude smaller than current models, but the architecture scales without predefined structure.
- •Evolutionary Model Merging for Capability Combination: Sakana AI's evolutionary model merging uses evolutionary search to identify which layers from separate pre-trained models to combine, producing a merged model that inherits capabilities from both parents. In published experiments, a Japanese-language model and a math-specialized model were merged to produce a single model proficient in both domains — without any additional gradient-based training on combined data.
- •LLMs as Mutation Operators for Scientific Search: Replacing hand-coded genetic operators with large language models creates a powerful hybrid search system. The LLM generates candidate solutions or code variants, evolutionary selection scores them by fitness, and the best candidates seed the next generation. Applied to circle-packing optimization and early AI Scientist experiments — including a workshop-accepted paper — this loop improves automatically as the underlying language model improves.
- •Co-evolving Agents and Environments via Curriculum: The POET algorithm and its successors evolve agent and environment simultaneously, starting with simple terrain and progressively increasing difficulty. Agents that would fail on complex environments from the start succeed when scaffolded through graduated challenges. Extending this with LLM-generated Unity environments or neural network world models could allow neuroevolution to tackle significantly more complex tasks than current fixed-environment benchmarks support.
Notable Moment
Risi describes a counterintuitive failure in plasticity research: a network trained in a T-maze consistently chose the smaller reward because it learned to track and follow rewards — technically closer to the correct behavior than a network that always turned right, yet scoring worse. Traditional fitness selection would eliminate the more capable network first.
You just read a 3-minute summary of a 57-minute episode.
Get Eye on AI summarized like this every Monday — plus up to 2 more podcasts, free.
Pick Your Podcasts — FreeKeep Reading
More from Eye on AI
The Single Biggest Barrier to AI Adoption Isn't the Technology — It's This | Errol Gardner of EY
May 22 · 54 min
Marketing School
The AI Search Strategy That Actually Works
May 25
More from Eye on AI
Oliver Dial of IBM: Quantum Advantage Is Happening This Year
May 19 · 50 min
a16z Podcast
Why AI Isn’t Killing SaaS Yet
May 25
More from Eye on AI
We summarize every new episode. Want them in your inbox?
The Single Biggest Barrier to AI Adoption Isn't the Technology — It's This | Errol Gardner of EY
Oliver Dial of IBM: Quantum Advantage Is Happening This Year
Why Agentic-First Startups Won't Disrupt Enterprises as Fast as Everyone Thinks | Kris Lovejoy
Loris Degioanni: Why AI Is Breaking Cybersecurity, and What Comes Next
#342 Andrew Thangaraj: The $5,000 IIT Degree: Can India Fix Its Broken Education System?
Similar Episodes
Related episodes from other podcasts
Marketing School
May 25
The AI Search Strategy That Actually Works
a16z Podcast
May 25
Why AI Isn’t Killing SaaS Yet
Animal Spirits
May 25
Talk Your Book: Investing in the Rise of the Robots
Capital Allocators
May 25
Fundraising Mastery: The Tao of Kimmer – John Kim (EP.503)
How I Built This
May 25
Justin’s Nut Butter: Justin Gold. He Was Waiting Tables, Then...He Reinvented Peanut Butter.
Explore Related Topics
This podcast is featured in Best AI Podcasts (2026) — ranked and reviewed with AI summaries.
Read this week's AI & Machine Learning Podcast Insights — cross-podcast analysis updated weekly.
You're clearly into Eye on AI.
Every Monday, we deliver AI summaries of the latest episodes from Eye on AI and 192+ other podcasts. Free for up to 3 shows.
Start My Monday DigestNo credit card · Unsubscribe anytime