Why the Future of AI Isn't Just Bigger Models. It's Models That Evolve | Risto Miikkulainen of Cognizant
Episode: 64 min
Read time: 3 min
Topics: Artificial Intelligence
AI-Generated Summary
Key Takeaways
- ✓ Evolution Strategy for LLM Fine-Tuning: Cognizant AI Lab demonstrated that evolution strategies can optimize billions of parameters in pretrained models like LLaMA and Qwen without gradient descent. Instead of backpropagation, a cloud of candidate solutions explores the parameter space, and configurations that perform better on the target task are rewarded. Oxford and NVIDIA have since replicated this approach, validating it as a viable alternative to RLHF for fine-tuning specialized model behavior.
- ✓ Population-Based Search vs. Gradient Descent: Gradient descent optimizes a single solution incrementally, making it vulnerable to local minima in jagged loss landscapes. Evolutionary methods deploy 30 to 1,000 parallel agents spread across the solution space and recombine high-performing candidates to make large jumps. This produces solutions human designers would not anticipate, a result so reliable that it has a dedicated "human competitive results" competition at the Genetic and Evolutionary Computation Conference (GECCO).
- ✓ Quality Diversity as a Discovery Engine: Explicitly rewarding novelty, independent of performance, produces "stepping stone" solutions that unlock further evolutionary progress. Combining novelty rewards with performance rewards, called quality diversity, is now standard in advanced neuroevolution pipelines. Practitioners building creative AI systems should implement dual fitness functions: one scoring task performance, one scoring behavioral distance from all previously seen solutions in the population archive.
- ✓ LLMs as Evolutionary Operators: LLMs can replace traditional crossover and mutation operators by accepting two parent solutions as input and generating a recombined offspring in natural language or code. This means any domain representable in language (molecules, ML architectures, scientific hypotheses, trading strategies) becomes evolvable. Sakana AI applied this to automated research, producing a paper accepted at a major ML conference and demonstrating that end-to-end AI-driven scientific discovery is operationally feasible today.
- ✓ Neuroevolution for Metacognition and Continual Learning: Current transformer architectures lack mechanisms for self-knowledge and continual adaptation. Miikkulainen proposes using neuroevolution to discover novel neural architectures inspired by hippocampal circuitry, starting with navigation and spatial memory tasks, in which evolved networks must answer whether they actually know a fact or are confabulating. This neuroscience-grounded approach targets two unsolved AI problems simultaneously: catastrophic forgetting and calibrated uncertainty about internal knowledge states.
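The "cloud of candidate solutions" idea from the first takeaway can be illustrated with a minimal sketch of a simple Gaussian evolution strategy. This is not Cognizant's method, whose details the summary does not give. It is a toy version under stated assumptions: the function names (`evolution_strategy`, `toy_reward`), the three-parameter target, and all hyperparameters are hypothetical, and the "model" is just a short list of numbers rather than an LLM.

```python
import random

def evolution_strategy(reward_fn, theta, population=50, sigma=0.1,
                       learning_rate=0.02, generations=200, seed=0):
    """Toy Gaussian evolution strategy; reward_fn is only evaluated, never differentiated."""
    rng = random.Random(seed)
    theta = list(theta)
    dim = len(theta)
    for _ in range(generations):
        # Sample a cloud of random perturbations around the current parameters.
        noises = [[rng.gauss(0.0, 1.0) for _ in range(dim)] for _ in range(population)]
        rewards = [reward_fn([t + sigma * n for t, n in zip(theta, noise)])
                   for noise in noises]
        # Normalize rewards so the update does not depend on the reward scale.
        mean = sum(rewards) / population
        std = (sum((r - mean) ** 2 for r in rewards) / population) ** 0.5 or 1.0
        advantages = [(r - mean) / std for r in rewards]
        # Shift parameters toward perturbations that scored above average.
        for i in range(dim):
            step = sum(a * noise[i] for a, noise in zip(advantages, noises))
            theta[i] += learning_rate / (population * sigma) * step
    return theta

def toy_reward(params):
    # Hypothetical task: reward is highest when params match a fixed target.
    target = [1.0, -2.0, 0.5]
    return -sum((p - t) ** 2 for p, t in zip(params, target))

start = [0.0, 0.0, 0.0]
tuned = evolution_strategy(toy_reward, start)
```

The only signal the loop uses is the scalar reward of each perturbed candidate, which is why the same pattern can in principle be applied where gradients are unavailable or unreliable.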
What It Covers
Risto Miikkulainen, VP of AI Research at Cognizant AI Lab and UT Austin professor, explains how evolutionary computation — specifically population-based search, neuroevolution, and evolution strategies — solves problems that gradient descent cannot, enabling creative AI solutions across finance, medicine, scientific discovery, and multi-agent decision-making systems.
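The dual fitness idea from the quality diversity takeaway can be sketched as follows. This is one possible reading, not a method described in the episode: the weighted sum used to combine the two scores, the Euclidean behavior distance, and the names `novelty` and `dual_fitness` are all assumptions made for illustration, and the archive and candidates are made-up values.

```python
import math

def novelty(behavior, archive, k=None):
    """Mean Euclidean distance from a behavior to archived behaviors.

    With k=None every archived behavior counts; with an integer k only the
    k nearest ones do (a common variant, assumed here rather than sourced).
    """
    if not archive:
        return 0.0  # nothing to compare against yet
    dists = sorted(math.dist(behavior, seen) for seen in archive)
    nearest = dists if k is None else dists[:k]
    return sum(nearest) / len(nearest)

def dual_fitness(performance, behavior, archive, novelty_weight=0.5, k=None):
    """Blend a task-performance score with a novelty score (assumed weighted sum)."""
    return ((1.0 - novelty_weight) * performance
            + novelty_weight * novelty(behavior, archive, k))

# Hypothetical archive of previously seen behaviors (2-D descriptors).
archive = [(0.0, 0.0), (0.0, 1.0), (1.0, 0.0)]

# A strong but familiar candidate versus a weaker but novel one.
familiar_score = dual_fitness(performance=1.0, behavior=(0.1, 0.1), archive=archive)
novel_score = dual_fitness(performance=0.8, behavior=(3.0, 4.0), archive=archive)
```

With these made-up numbers the novel candidate outranks the familiar one despite its lower task score, which is the "stepping stone" effect the takeaway describes. How to weight the two scores is a design choice the summary leaves open.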
Key Questions Answered
- • Pandemic Decision-Making as a Deployable Template: Cognizant built a working system that ingested global case, death, and hospitalization data alongside government intervention records to generate next-day policy recommendations (school closures, masking, contact tracing) for any country with available data. Iceland used the system in fall 2021 to guide school reopening decisions, with recommendations reaching the health ministry. This surrogate-model-plus-evolutionary-search pipeline is replicable for any domain where outcome data and intervention diversity exist.
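The surrogate-model-plus-evolutionary-search pattern can be sketched in a few lines. This is a toy illustration, not the Cognizant system: the nearest-neighbor surrogate, the cost penalty, the function names (`make_surrogate`, `evolve_plans`), and the synthetic history are all assumptions, and the data is generated at random purely so the example runs.

```python
import random

def make_surrogate(history, k=3):
    """Build a predictor that averages outcomes of the k most similar past plans."""
    def predict(plan):
        ranked = sorted(history,
                        key=lambda rec: sum((a - b) ** 2 for a, b in zip(rec[0], plan)))
        nearest = ranked[:k]
        return sum(outcome for _, outcome in nearest) / len(nearest)
    return predict

def evolve_plans(predict, n_interventions, cost_weight=1.0,
                 pop_size=30, generations=40, seed=0):
    """Evolve intervention plans (intensities in [0, 1]) against the surrogate."""
    rng = random.Random(seed)

    def score(plan):
        # Lower is better: predicted cases plus a penalty for stringent plans.
        return predict(plan) + cost_weight * sum(plan)

    population = [[rng.random() for _ in range(n_interventions)]
                  for _ in range(pop_size)]
    for _ in range(generations):
        population.sort(key=score)
        parents = population[: pop_size // 2]  # keep the better half unchanged
        children = []
        while len(parents) + len(children) < pop_size:
            a, b = rng.sample(parents, 2)
            child = [rng.choice(pair) for pair in zip(a, b)]  # uniform crossover
            child = [min(1.0, max(0.0, g + rng.gauss(0.0, 0.1))) for g in child]  # mutation
            children.append(child)
        population = parents + children
    return min(population, key=score)

# Synthetic, made-up history of (plan, observed cases) records.
data_rng = random.Random(1)
history = []
for _ in range(200):
    plan = [data_rng.random() for _ in range(3)]
    cases = 10.0 - 6.0 * plan[0] - 3.0 * plan[1] - 0.5 * plan[2] + data_rng.gauss(0.0, 0.2)
    history.append((plan, cases))

predict = make_surrogate(history)
best_plan = evolve_plans(predict, n_interventions=3)
```

The search never touches the real world: every candidate plan is scored only by the surrogate, so the quality of the recommendations depends on how well the historical data covers the space of interventions.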
Notable Moment
A mystery model in the Alpha Arena stock trading competition outperformed all other entrants, and forensic analysis pointed toward neuroevolutionary methods. Miikkulainen explains why evolution specifically suits trading: generating strategies genuinely unlike what competitors use is the only structural edge, and diversity-maximizing population search is the mechanism that produces it.