Sebastian Risi

#330 Sebastian Risi: Why AI Should Be Grown, Not Trained

Apr 6, 202661 minAI Researcher

AI Summary

→ WHAT IT COVERS Sebastian Risi, researcher at Sakana AI and author of *Neuroevolution*, explains why evolutionary algorithms offer a fundamentally different path to AI than gradient descent — covering plastic neural networks that rewire during operation, networks that grow from a single neuron, and how combining large language models with evolutionary search could automate scientific discovery. → KEY INSIGHTS - **Neuroevolution vs. Gradient Descent:** Rather than following a single downhill slope toward a solution, neuroevolution deploys a population of candidates across the entire search landscape simultaneously, using variation and selection to find solutions. This approach navigates non-differentiable, jagged problem spaces where backpropagation fails — making it applicable to discrete actions, novel architectures, and hyperparameter search without requiring smooth mathematical functions. - **Hebbian Plasticity for Damage Resilience:** Networks trained with local Hebbian learning rules — where connection strength updates based on how often two neurons fire together — continuously rewire during operation rather than freezing weights after training. In robotics experiments, quadruped controllers using these plastic networks maintained locomotion after leg removal, a scenario never seen during training, because the network self-organized in real time using only evolved local rules. - **Growing Networks from a Single Neuron:** Risi's Neural Developmental Program embeds a small recurrent network inside every neuron, allowing the overall network to grow from one node to thousands by having neurons communicate locally and decide when to spawn new nodes or modify connections. Tested on robotic tasks and a small MNIST variant, networks reached several thousand nodes — orders of magnitude smaller than current models, but the architecture scales without predefined structure. - **Evolutionary Model Merging for Capability Combination:** Sakana AI's evolutionary model merging uses evolutionary search to identify which layers from separate pre-trained models to combine, producing a merged model that inherits capabilities from both parents. In published experiments, a Japanese-language model and a math-specialized model were merged to produce a single model proficient in both domains — without any additional gradient-based training on combined data. - **LLMs as Mutation Operators for Scientific Search:** Replacing hand-coded genetic operators with large language models creates a powerful hybrid search system. The LLM generates candidate solutions or code variants, evolutionary selection scores them by fitness, and the best candidates seed the next generation. Applied to circle-packing optimization and early AI Scientist experiments — including a workshop-accepted paper — this loop improves automatically as the underlying language model improves. - **Co-evolving Agents and Environments via Curriculum:** The POET algorithm and its successors evolve agent and environment simultaneously, starting with simple terrain and progressively increasing difficulty. Agents that would fail on complex environments from the start succeed when scaffolded through graduated challenges. Extending this with LLM-generated Unity environments or neural network world models could allow neuroevolution to tackle significantly more complex tasks than current fixed-environment benchmarks support. → NOTABLE MOMENT Risi describes a counterintuitive failure in plasticity research: a network trained in a T-maze consistently chose the smaller reward because it learned to track and follow rewards — technically closer to the correct behavior than a network that always turned right, yet scoring worse. Traditional fitness selection would eliminate the more capable network first. 💼 SPONSORS None detected 🏷️ Neuroevolution, Evolutionary Algorithms, Neural Plasticity, Artificial Life, AI Scientific Discovery, Sakana AI

Read Full Summary Listen

#330 Sebastian Risi: Why AI Should Be Grown, Not Trained

Eye on AI

Apr 2, 202659 minAI Researcher

AI Summary

→ WHAT IT COVERS Sebastian Risi, researcher at Sakana AI, explains neuroevolution — using evolutionary algorithms instead of gradient descent to optimize neural networks — and explores biologically inspired approaches including plastic networks, growing architectures, and combining large language models with evolutionary search to advance AI capabilities. → KEY INSIGHTS - **Hebbian Plasticity over Fixed Weights:** Networks trained with local Hebbian learning rules — where connection strength changes based on how often paired neurons fire together — demonstrate real-time adaptability that static networks lack. A quadrupedal robot controlled by such a network continues functioning after losing a leg, despite never encountering that scenario during training, because weights continuously update throughout operation. - **Evolutionary Model Merging:** Rather than training new models from scratch, evolution can identify which layers from existing pretrained models to combine. Sakana AI demonstrated this by merging a Japanese-language model with a math-specialized model, producing a single model competent in both domains — a scalable strategy for capability expansion without full retraining cycles. - **LLMs as Mutation Operators:** Evolutionary search becomes significantly more powerful when a language model replaces hand-coded mutation functions. In circle-packing optimization, an LLM generates solution variants, fitness scores rank them, and the process iterates — navigating solution spaces that gradient descent cannot traverse because no differentiable objective exists across discrete or code-based representations. - **Quality Diversity over Single-Objective Optimization:** Optimizing purely for fitness score causes evolutionary systems to get trapped — a network learning a T-maze reward task performs worse than random chance by consistently choosing the smaller reward. Researchers should apply quality diversity algorithms that simultaneously reward exploration breadth and solution quality, preventing premature convergence to locally decent but globally poor strategies. - **Co-evolving Agent and Environment:** Training agents against static environments produces brittle specialists. The POET algorithm and its successors evolve terrain difficulty alongside the agent, starting simple and progressively increasing complexity. This curriculum approach enables bipedal robots to eventually navigate obstacle courses they could never learn directly — a principle now extendable using LLMs to generate Unity environments via code. → NOTABLE MOMENT Risi describes a counterintuitive failure mode in plasticity research: a network trained to adapt in a T-maze consistently learned the worst possible strategy — always choosing the smaller reward — scoring below random chance, yet appearing closer to the correct solution than a network that ignored rewards entirely. 💼 SPONSORS None detected 🏷️ Neuroevolution, Evolutionary Algorithms, Neural Plasticity, AI Research, Open-Ended Learning

Read Full Summary Listen

Featured On 1 Podcast

Eye on AI

Top resources Sebastian Risi mentions

Sakana AI

Neuroevolution

All Appearances

#330 Sebastian Risi: Why AI Should Be Grown, Not Trained

AI Summary

#330 Sebastian Risi: Why AI Should Be Grown, Not Trained

AI Summary

Explore More

Never miss Sebastian Risi's insights