Machine Learning Street Talk

We Invented Momentum Because Math is Hard [Dr. Jeff Beck]

76 min episode · 2 min read

AI-Generated Summary

Key Takeaways

  • Bayesian Brain Evidence: Humans perform optimal cue combination in sensory-motor tasks, adjusting for reliability on a trial-by-trial basis without knowing in advance which sensory input is more trustworthy. This efficiency demonstrates that the brain implements approximately Bayesian inference, not just generic information processing (a precision-weighted sketch of this kind of cue fusion follows this list).
  • AutoGrad Revolution: Automatic differentiation transformed AI from careful manual construction into an engineering problem, enabling rapid architecture experimentation. This shift made backpropagation practical by solving vanishing gradients through empirical tricks, leading to the current scaling era but losing focus on brain-like cognitive structure.
  • Object-Centered Architecture: Train thousands of small models for individual object classes rather than one massive model. A warehouse AI learns separate models for forklifts and boxes, then incorporates a cat model when needed, tracking surprise signals to flag unknown objects and requesting the relevant model from a central repository (see the second sketch after this list).
  • Macroscopic Causation: Choose causal variables at the scale of your affordances—momentum exists because it makes physics Markovian and computationally tractable, not necessarily because it reflects fundamental reality. AI systems need causal models matching human interaction scales, not microscopic particle simulations requiring impractical compute resources.
  • Alignment Through Belief Sharing: Reward functions alone create malevolent genie problems because action combines beliefs and values inseparably. Humans achieve alignment by explicitly discussing beliefs first, isolating value disagreements only after establishing shared world models. AI systems need legible belief structures, not just prediction engines optimizing opaque objectives.
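
To make the cue-combination claim concrete, here is a minimal sketch (our illustration, not code from the episode) of precision-weighted fusion of two independent Gaussian cues, the standard Bayesian-optimal combination rule:

```python
def fuse_cues(mu_a, var_a, mu_b, var_b):
    """Combine two independent Gaussian cue estimates.

    Each cue is weighted by its precision (1 / variance), so the less
    reliable cue contributes less -- the 'optimal cue combination'
    behaviour reported in human sensory-motor experiments.
    """
    w_a = (1.0 / var_a) / (1.0 / var_a + 1.0 / var_b)
    mu = w_a * mu_a + (1.0 - w_a) * mu_b
    var = 1.0 / (1.0 / var_a + 1.0 / var_b)  # fused estimate is more precise than either cue alone
    return mu, var

# Example: a reliable visual cue and a noisier haptic cue about an object's position
print(fuse_cues(mu_a=10.0, var_a=1.0, mu_b=14.0, var_b=4.0))  # -> (10.8, 0.8)
```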

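And a toy sketch of the object-centered idea, with hypothetical names such as `WarehouseAgent` and `ObjectModel` (nothing from the episode): the agent keeps one small Gaussian model per object class and pulls an extra model from a central repository whenever local surprise gets too high.

```python
import math

class ObjectModel:
    """A tiny per-class generative model: a Gaussian over one feature (e.g. apparent size)."""
    def __init__(self, mean, var):
        self.mean, self.var = mean, var

    def surprise(self, x):
        # Negative log-likelihood: high values mean the observation is poorly explained.
        return 0.5 * (math.log(2 * math.pi * self.var) + (x - self.mean) ** 2 / self.var)

class WarehouseAgent:
    def __init__(self, local_models, repository, threshold=2.0):
        self.models = dict(local_models)   # small models the agent already carries
        self.repository = repository       # central store of additional object models
        self.threshold = threshold         # surprise level that triggers a repository lookup

    def explain(self, observation):
        # Pick the local model that is least surprised by the observation.
        name, model = min(self.models.items(), key=lambda kv: kv[1].surprise(observation))
        if model.surprise(observation) <= self.threshold:
            return name
        # Nothing local fits: pull in any repository model that explains the observation.
        for extra_name, extra in self.repository.items():
            if extra_name not in self.models and extra.surprise(observation) <= self.threshold:
                self.models[extra_name] = extra
                return extra_name
        return "unknown"

repository = {"cat": ObjectModel(mean=0.4, var=0.01)}
agent = WarehouseAgent({"forklift": ObjectModel(2.5, 0.1), "box": ObjectModel(1.0, 0.05)}, repository)
print(agent.explain(1.05))  # -> "box"
print(agent.explain(0.42))  # local surprise too high, fetches and returns "cat"
```
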
What It Covers

Dr. Jeff Beck explains why scaling Bayesian inference with object-centered models represents the path to human-like AI, contrasting structured cognitive approaches with current transformer architectures that lack explicit world models and causal reasoning capabilities.

Notable Moment

Beck argues that momentum was invented as a hidden variable that makes the equations of motion Markovian and computationally convenient, and he questions whether such mathematical constructs reflect reality or are simply pragmatic modeling choices that happened to work well for human engineering purposes.
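
A minimal illustration of the Markov point (our example, assuming a unit-mass ball under constant gravity): position alone does not determine the next position, but the pair (position, momentum) does, so adding momentum as a state variable makes each update depend only on the current state.

```python
# Discrete-time dynamics of a unit-mass ball under gravity (dt = 1, g = 9.8).
# Predicting the next position from position alone would require the previous
# position (a history); with momentum in the state, the update is Markovian.

G, DT = 9.8, 1.0

def step(state):
    x, p = state                      # position, momentum (mass = 1, so p = velocity)
    return (x + p * DT, p - G * DT)   # next state is a function of the current state only

state = (100.0, 0.0)                  # dropped from height 100 at rest
for _ in range(3):
    state = step(state)
    print(state)                      # (100.0, -9.8) -> (90.2, -19.6) -> (70.6, -29.4)
```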
