20VC: Scale, Surge, Turing, Mercor: Who Wins & Who Loses in Data Labelling | Is Revenue in Data Labelling Real or GMV? | Why 99% of Knowledge Work Will Go and What Happens Then? | Why SaaS is Dead in a World of AI with Jonathan Siddharth @ Turing

December 1, 2025

68 min episode · 2 min read

Jonathan Siddharth

Topics

Sales & Revenue, Artificial Intelligence, Science & Discovery

AI-Generated Summary

Published Dec 25, 2025

Key Takeaways

✓ Data Evolution: AI training data has shifted from simple labeling tasks, such as sorting numbers, to complex multi-step workflows that require expert humans across verticals. Models now need data showing how to operate computers, call APIs, and execute real business workflows, generated in reinforcement learning environments rather than through imitation learning alone.
✓ RL Environment Architecture: Turing builds mini world models: clones of business applications populated with synthetic data, in which AI agents try different trajectories to complete tasks. Curriculum difficulty must sit between too easy (nothing to learn) and too hard (no progress), similar to AlphaZero's self-play approach to mastering Go.
✓ Custom Model Economics: Enterprises need smaller fine-tuned models (500M to 10B parameters) for specific workflows such as insurance underwriting, trained on proprietary data that stays on-premises. On narrow tasks, these specialized models outperform trillion-parameter world models while keeping competitive data away from frontier labs and competitors.
✓ Enterprise Deployment Reality: Successful AI implementation requires first-mile schlep (consolidating fragmented data, scattered across spreadsheets and the heads of departed employees, into structured formats) and last-mile schlep (building Cursor-like interfaces for partial autonomy, training the humans who use them, and collecting feedback). Ninety-five percent of pilots fail because they skip these steps.
✓ SaaS Disruption Thesis: Traditional SaaS dies for three reasons: building AI applications on LLMs becomes trivially easy, foundation models move into the application layer with agentic capabilities, and software designed for humans navigating a GUI becomes obsolete. Companies will build custom solutions internally rather than subscribe to 80-100 third-party products.
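
The curriculum balance described in the RL Environment Architecture takeaway can be illustrated with a small sketch. It is not Turing's method; it only assumes that each task has a pass rate measured from agent rollouts, and that tasks the agent always solves or never solves are set aside. The names TaskStats and select_curriculum, the task IDs, and the 0.1 and 0.9 thresholds are all hypothetical.

```python
from dataclasses import dataclass


@dataclass
class TaskStats:
    """Rollout results for one task in a simulated environment (hypothetical)."""
    task_id: str
    attempts: int
    successes: int

    @property
    def pass_rate(self) -> float:
        # Treat a task with no attempts as unsolved rather than dividing by zero.
        return self.successes / self.attempts if self.attempts else 0.0


def select_curriculum(stats, low=0.1, high=0.9):
    """Keep tasks the agent completes sometimes, but not always.

    The thresholds are illustrative assumptions, not values from the episode.
    """
    return [s.task_id for s in stats if low <= s.pass_rate <= high]


tasks = [
    TaskStats("reset-password", attempts=20, successes=20),   # too easy
    TaskStats("file-claim", attempts=20, successes=7),        # learnable
    TaskStats("reconcile-ledger", attempts=20, successes=0),  # too hard
]

print(select_curriculum(tasks))  # ['file-claim']
```

In practice the pass rates would be re-measured as the agent improves, so tasks move in and out of the selected band over time.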

What It Covers

Jonathan Siddharth explains how Turing shifted from a talent marketplace to a research accelerator that generates complex data in reinforcement learning environments to train frontier AI models. Turing works with seven of the eight major labs and is at $350M ARR.

Notable Moment

Siddharth reveals he spends every weekend manually selecting 15-20 clips per podcast episode, taking three hours per show. He acknowledges this exact workflow could be automated with fine-tuned models trained on his past clip selections, demonstrating the model capability overhang he describes.
