Skip to main content
CH

Carina Hong

Carina Hong**verified Generation as Performance Gain**lean as Dual-purpose Infrastructure**formal Math Transfer Learning Parallels Coding**hardware Verification as Near-term Revenue Anchor
3episodes
3podcasts

Featured On 3 Podcasts

Top resources Carina Hong mentions

Books, tools, and gear cited across podcast appearances. Ranked by frequency.

SignalCast may earn commission on purchases via affiliate links on each resource page.

All Appearances

3 episodes
Latent Space

🔬Scaling Past Informal AI - Carina Hong, Axiom Math

Latent Space
93 minCEO and founder of Axiom Math

AI Summary

→ WHAT IT COVERS Carina Hong, CEO of Axiom Math, explains how formal verification using the Lean proof language enables verified AI reasoning rather than merely correcting hallucinations. Axiom scored 120/120 on the 2025 Putnam exam, raised $200M at a $1.6B valuation, and argues formal math provides transfer learning advantages that informal LLM scaling cannot replicate at superintelligence scale. → KEY INSIGHTS - **Verified Generation as Performance Gain:** Formal verification is not a quality-control tax but a direct performance multiplier. Axiom's system scored 120/120 on the 2025 Putnam exam, outperforming the best human score of 110 and DeepSeek's 103, using orders of magnitude less compute and data than frontier labs. This demonstrates that verified generation produces higher sample efficiency, allowing smaller teams to exceed frontier lab benchmarks on structured reasoning tasks. - **Lean as Dual-Purpose Infrastructure:** Lean functions simultaneously as a functional programming language and a formal proof checker via the Curry-Howard correspondence, which maps proofs to programs. Developers can write autograd in Lean, verify distributed systems components, or prove mathematical theorems within the same environment. Practitioners building AI reasoning pipelines should evaluate Lean not as a niche academic tool but as a Turing-complete substrate for co-generating code and correctness proofs together. - **Formal Math Transfer Learning Parallels Coding:** Anthropic's coding focus in 2023-2024 was underestimated because structured, formal data transfers horizontally across reasoning domains rather than staying vertical. Axiom applies the same thesis to formal math: Lean proof data provides structured, verifiable training signal that transfers to software verification, hardware verification, and general reasoning. Teams building reasoning systems should prioritize formally verifiable data sources over volume of informal chain-of-thought data. - **Hardware Verification as Near-Term Revenue Anchor:** Chip verification currently requires a 1:3 to 1:4 ratio of engineering time and team size dedicated purely to verification relative to design. There is no partial credit for a mostly-verified GPU. Axiom positions formal proof generation as a direct replacement for manual verification labor in ASIC projects, where a single unverified edge case invalidates the entire design. This represents a concrete, high-value enterprise deployment path beyond pure mathematics research. - **Axle Tooling Enables Claude-Based Lean Workflows:** Axiom released Axle (Axiom Lean Engine), a free suite of 14 Lean meta-programming tools including VerifyProof, which runs 100x faster than the prior standard tool Comparator. On the Verina code-verification benchmark of 189 problems, Axiom's system solved 187 with no benchmark-specific modifications. Developers can integrate Axle directly with Claude Code today to generate and verify Lean proofs without configuring a local Lean toolchain, lowering the barrier to formal verification in production workflows. - **Blueprint Authorship Remains the Human Bottleneck:** Large-scale formalization projects like sphere packing in 8 dimensions still rely on human-authored blueprints that decompose theorems into subtasks assignable across contributors. Auto-generated blueprints — high-level proof sketches that structure collaborative formalization — remain an unsolved technical problem that multiple groups are racing to crack. Mathematicians and AI researchers targeting collaborative theorem proving should focus engineering effort on blueprint generation as the current rate-limiting step, not proof search itself. - **Specification Gap Limits Enterprise Deployment:** Formal verification guarantees correctness only relative to a written specification, and humans consistently underspecify what they want from complex systems. A financial audit system or flight controller cannot be fully verified if the specification omits edge cases. Axiom's near-term mitigation combines mutation-based LLM unit-test generation to surface unspecified cases as conjecture proposals, then feeds confirmed specifications to the prover. Teams adopting formal verification should invest in specification tooling and conjecture generation before expecting end-to-end automated correctness guarantees. → NOTABLE MOMENT When discussing why informal LLM scaling cannot reach mathematical superintelligence, Hong points out that frontier math benchmarks required collaboration with EPFL because expert evaluators are genuinely scarce — there are not enough humans who understand results in the Langlands program to grade outputs at scale. Infinite compute budgets cannot solve a human-attention bottleneck. 💼 SPONSORS None detected 🏷️ Formal Verification, AI Reasoning, Lean Proof Language, Mathematical AI, Hardware Verification, Transfer Learning, Verified Code Generation

AI Summary

→ WHAT IT COVERS Carina Hong, CEO of Axiom Math, explains how her company builds AI mathematicians that combine generation and verification using formal languages like Lean. Axiom scored nine out of twelve on the 2026 Putnam exam, surpassing last year's top human performer, demonstrating breakthrough capabilities in formal mathematical reasoning and proof verification. → KEY INSIGHTS - **Formal Verification Architecture:** Axiom combines three core components: a prover system that generates proofs, a conjecture system that proposes theorems, and a knowledge base that stores proven results. Auto-formalization converts natural language mathematics into Lean code, enabling deterministic verification of probabilistic AI outputs. This architecture achieves higher sample efficiency than pure language model approaches by grounding generation in verifiable formal systems. - **Putnam Performance Breakthrough:** Axiom Prover solved nine of twelve problems on the 2026 Putnam exam within time limits, matching the previous year's top human score. The median Putnam score among thousands of top undergraduate math students is zero, making any correct solution significant. This performance demonstrates AI can now compete with elite human mathematicians on novel, unseen competition problems requiring creative problem-solving. - **Commercial Applications in Verification:** Hardware verification teams are one-third to one-fourth the size of design teams, with verification taking up to three years in chip development. Formal verification can prove code equivalence during migrations, verify database consistency in Byzantine fault tolerance scenarios, and ensure safety-critical code correctness. AWS took five years to manually formalize memory isolation in their hypervisor, a task AI could accelerate significantly. - **Auto-Formalization as Core Technology:** Converting natural language mathematical statements into formal Lean code is harder than proving theorems because no solution exists yet to verify correctness. In code verification, test cases with input-output pairs provide grounding signals for formal specifications. Axiom treats auto-formalization as fundamental infrastructure, not just data generation, with statement formalization being more challenging than proof formalization due to lack of verification signals. - **Future of Mathematical Research:** Mathematicians will operate at higher abstraction levels, using AI as diligent graduate students to verify intuitions and handle technical lemmas. The system constructs counterexamples for sanity checking and generates interesting patterns for conjecture formation. Top mathematicians like Terence Tao can focus on theory-building and intuition while AI handles computational verification, similar to how LaTeX replaced typewriters without eliminating mathematical research. → NOTABLE MOMENT Hong reveals her management philosophy stems from listening to underground Chinese rock bands at age five, whose best work came from periods of hunger and struggle before commercial success. She deliberately preserves this underdog mindset at Axiom despite raising sixty-four million dollars, maintaining acute awareness of larger incumbents to stay uncomfortable and driven. 💼 SPONSORS None detected 🏷️ Formal Verification, AI Mathematics, Lean Theorem Proving, Hardware Verification, Mathematical Reasoning

AI Summary

→ WHAT IT COVERS Carina Hong, founder of Acxiom, explains building AI mathematicians through formal verification using Lean programming language, combining auto-formalization, theorem proving, and self-play systems to achieve mathematical reasoning with provable guarantees. → KEY INSIGHTS - **Data Scarcity Challenge:** Formal math has only 10 million Lean tokens versus one trillion Python tokens, creating a 100,000x data gap that requires auto-formalization and synthetic generation to bridge for effective model training. - **Three Convergence Factors:** AI mathematicians become viable now through post-training reinforcement learning advances, Lean 4 adoption since September 2023, and code generation techniques crossing performance thresholds that transfer to mathematical proving. - **Self-Play Architecture:** Acxiom builds systems where provers and conjecturers interact, with provers providing reward signals for conjectures, creating self-improving loops that expand mathematical knowledge bases through verification and proposal cycles. - **Auto-Formalization Limitations:** Current models struggle to convert natural language proofs longer than five lines into Lean without human intervention, with no established benchmarks for measuring statement formalization accuracy beyond syntax checking. → NOTABLE MOMENT Hong reveals research mathematicians typically spend months stuck on single problems with nothing to report, contrasting sharply with Olympiad training's constant dopamine hits, explaining her motivation to build AI systems that accelerate mathematical intuition. 💼 SPONSORS [{"name": "Capital One", "url": null}, {"name": "Google DeepMind", "url": "https://ai.studio/build"}] 🏷️ Formal Verification, Mathematical Reasoning, Lean Programming, Auto-Formalization

Explore More

Frequently Asked Questions

What podcasts has Carina Hong appeared on?

Carina Hong has appeared on 3 podcasts we summarize, including Latent Space, Gradient Dissent, The TWIML AI Podcast — 3 episodes in total. Every appearance is listed below with an AI-generated summary.

Does Carina Hong appear as a guest speaker on podcasts?

Yes. Carina Hong has been a guest on 3 shows we track, across 3 episodes. Browse each appearance below to read the key takeaways and listen to the original.

Where can I find summaries of Carina Hong's interviews?

Read AI-generated summaries of all 3 of Carina Hong's podcast appearances on SignalCast — each with key insights and a link to the full episode.

Never miss Carina Hong's insights

Subscribe to get AI-powered summaries of Carina Hong's podcast appearances delivered to your inbox weekly.

Start Free Today

No credit card required • Free tier available