Brendan Foody on Teaching AI and the Future of Knowledge Work
Episode
61 min
Read time
2 min
Topics
Artificial Intelligence
AI-Generated Summary
Key Takeaways
- ✓AI Performance Metrics: Frontier models improved 25-30% annually on economically valuable tasks, with GPT-5 scoring 64% on real-world professional work compared to GPT-4's baseline. This measures actual job automation potential rather than academic benchmarks, surveying experts on time allocation and creating corresponding evaluation rubrics.
- ✓Data Collection Strategy: The most valuable training data includes rubrics for grading model outputs and test question-answer pairs with scoring criteria, not just raw content. Models learn by attempting problems repeatedly and receiving scored feedback, similar to how professors grade essays with structured evaluation frameworks.
- ✓Labor Market Transformation: Within five years, majority of high-end knowledge workers will transition from performing tasks to training AI agents and building evaluation environments. This creates new job categories where investment bankers build testing frameworks instead of conducting analyses, applying fixed-cost knowledge investments across unlimited agent deployments.
- ✓Hiring Methodology: Effective talent assessment measures actual job skills through concrete projects rather than vibe-based conversations about background and cultural fit. Companies over-index on personal similarity instead of testing candidates' ability to perform specific deliverables like analyzing data rooms or drafting legal documents under realistic conditions.
- ✓Model Limitations Timeline: AI excels at tasks completable in chat windows but cannot yet draft emails or schedule meetings. Long-horizon tasks spanning 50-100 hours remain challenging. Foody predicts models will struggle to find mistakes when questioned by domain experts within six months, though automating the final 25% of expert capabilities will take significantly longer.
What It Covers
Brendan Foody, 22-year-old CEO of Mercor, discusses building the fastest-growing AI company by hiring experts to train frontier models, achieving 64% automation of economically valuable tasks and creating new job categories for knowledge workers.
Key Questions Answered
- •AI Performance Metrics: Frontier models improved 25-30% annually on economically valuable tasks, with GPT-5 scoring 64% on real-world professional work compared to GPT-4's baseline. This measures actual job automation potential rather than academic benchmarks, surveying experts on time allocation and creating corresponding evaluation rubrics.
- •Data Collection Strategy: The most valuable training data includes rubrics for grading model outputs and test question-answer pairs with scoring criteria, not just raw content. Models learn by attempting problems repeatedly and receiving scored feedback, similar to how professors grade essays with structured evaluation frameworks.
- •Labor Market Transformation: Within five years, majority of high-end knowledge workers will transition from performing tasks to training AI agents and building evaluation environments. This creates new job categories where investment bankers build testing frameworks instead of conducting analyses, applying fixed-cost knowledge investments across unlimited agent deployments.
- •Hiring Methodology: Effective talent assessment measures actual job skills through concrete projects rather than vibe-based conversations about background and cultural fit. Companies over-index on personal similarity instead of testing candidates' ability to perform specific deliverables like analyzing data rooms or drafting legal documents under realistic conditions.
- •Model Limitations Timeline: AI excels at tasks completable in chat windows but cannot yet draft emails or schedule meetings. Long-horizon tasks spanning 50-100 hours remain challenging. Foody predicts models will struggle to find mistakes when questioned by domain experts within six months, though automating the final 25% of expert capabilities will take significantly longer.
Notable Moment
Foody reveals his eighth-grade donut arbitrage business, buying Safeway donuts for five dollars per dozen and reselling them for two dollars each at school. After the principal shut him down, he moved operations 50 feet off campus and paid his mother 20 dollars weekly to transport inventory.
You just read a 3-minute summary of a 58-minute episode.
Get Conversations with Tyler summarized like this every Monday — plus up to 2 more podcasts, free.
Pick Your Podcasts — FreeKeep Reading
More from Conversations with Tyler
Kim Bowes on the Economic Lives of Rome's Ninety Percent
Apr 15 · 61 min
The Mel Robbins Podcast
Do THIS Every Day to Rewire Your Brain From Stress and Anxiety
Apr 27
More from Conversations with Tyler
Arthur Brooks on Reinvention, Religion, and the Science of Happiness
Apr 1 · 59 min
The Model Health Show
The Menopause Gut: Why Metabolism Changes & How to Reclaim Your Body - With Cynthia Thurlow
Apr 27
More from Conversations with Tyler
We summarize every new episode. Want them in your inbox?
Kim Bowes on the Economic Lives of Rome's Ninety Percent
Arthur Brooks on Reinvention, Religion, and the Science of Happiness
Paul Gillingham on Why Mexico Stays Together
Harvey Mansfield on Machiavelli, Straussianism, and the Character of Liberal Democracy
Henry Oliver on Measure for Measure, Late Bloomers, and the Smartest Writers in English
Similar Episodes
Related episodes from other podcasts
The Mel Robbins Podcast
Apr 27
Do THIS Every Day to Rewire Your Brain From Stress and Anxiety
The Model Health Show
Apr 27
The Menopause Gut: Why Metabolism Changes & How to Reclaim Your Body - With Cynthia Thurlow
The Rest is History
Apr 26
664. Britain in the 70s: Scandal in Downing Street (Part 3)
The Learning Leader Show
Apr 26
685: David Epstein - The Freedom Trap, Narrative Values, General Magic, The Nobel Prize Winner Who Simplified Everything, Wearing the Same Thing Everyday, and Why Constraints Are the Secret to Your Best Work
The AI Breakdown
Apr 26
Where the Economy Thrives After AI
Explore Related Topics
This podcast is featured in Best Finance Podcasts (2026) — ranked and reviewed with AI summaries.
Read this week's AI & Machine Learning Podcast Insights — cross-podcast analysis updated weekly.
You're clearly into Conversations with Tyler.
Every Monday, we deliver AI summaries of the latest episodes from Conversations with Tyler and 192+ other podcasts. Free for up to 3 shows.
Start My Monday DigestNo credit card · Unsubscribe anytime