Ep 386: Was 2025 a Great or Terrible Year for AI? (w/ Ed Zitron)
Episode
143 min
Read time
2 min
Topics
Career Growth, Productivity, Relationships
AI-Generated Summary
Key Takeaways
- ✓DeepSeek's efficiency challenge: Chinese startup DeepSeek trained its R1 model for $5.3 million versus American models costing $50-100 billion, demonstrating that frontier AI doesn't require massive data centers. This threatened the industry narrative justifying enormous capital raises, so companies memory-holed the story rather than optimize their own spending.
- ✓AI agents marketing shift: Companies pivoted from AGI superintelligence messaging to workplace agents in early 2025 because chatbot capabilities had plateaued. The agent narrative promised digital labor replacing workers, but required multi-step LLM queries that increased costs without delivering reliable autonomous task completion beyond simple prototypes.
- ✓GPT-4.5 router inefficiency: OpenAI's router model that automatically selects optimal models for tasks actually increased inference costs by eliminating system prompt caching. Each model switch required reprocessing the entire system prompt through GPUs, creating overhead that infrastructure teams internally questioned, contradicting public efficiency claims.
- ✓Anthropic's hidden burn rate: Despite positioning as more efficient than OpenAI, Anthropic spent $2.66 billion on AWS alone in three quarters of 2025, likely matching that on Google Cloud. The company raised $16.5 billion versus OpenAI's $18.3 billion, revealing nearly identical capital consumption rates despite public perception of fiscal discipline.
- ✓OpenAI's revenue-cost mismatch: Through September 2025, OpenAI generated approximately $4.5 billion in revenue while spending $8.67 billion solely on inference costs to run existing models. This inverse relationship where costs scale directly with revenue demonstrates the fundamental unprofitability of large language model deployment at scale.
What It Covers
Cal Newport and AI commentator Ed Zitron analyze twelve major AI stories from 2025, examining whether the year represented progress or failure for artificial intelligence through technical analysis, financial reporting, and industry insider information about OpenAI, Anthropic, and NVIDIA.
Key Questions Answered
- •DeepSeek's efficiency challenge: Chinese startup DeepSeek trained its R1 model for $5.3 million versus American models costing $50-100 billion, demonstrating that frontier AI doesn't require massive data centers. This threatened the industry narrative justifying enormous capital raises, so companies memory-holed the story rather than optimize their own spending.
- •AI agents marketing shift: Companies pivoted from AGI superintelligence messaging to workplace agents in early 2025 because chatbot capabilities had plateaued. The agent narrative promised digital labor replacing workers, but required multi-step LLM queries that increased costs without delivering reliable autonomous task completion beyond simple prototypes.
- •GPT-4.5 router inefficiency: OpenAI's router model that automatically selects optimal models for tasks actually increased inference costs by eliminating system prompt caching. Each model switch required reprocessing the entire system prompt through GPUs, creating overhead that infrastructure teams internally questioned, contradicting public efficiency claims.
- •Anthropic's hidden burn rate: Despite positioning as more efficient than OpenAI, Anthropic spent $2.66 billion on AWS alone in three quarters of 2025, likely matching that on Google Cloud. The company raised $16.5 billion versus OpenAI's $18.3 billion, revealing nearly identical capital consumption rates despite public perception of fiscal discipline.
- •OpenAI's revenue-cost mismatch: Through September 2025, OpenAI generated approximately $4.5 billion in revenue while spending $8.67 billion solely on inference costs to run existing models. This inverse relationship where costs scale directly with revenue demonstrates the fundamental unprofitability of large language model deployment at scale.
Notable Moment
Jensen Huang announced at March GTC that the AI industry had moved from the pre-training scaling era into post-training and inference, essentially telling shareholders that massive ongoing GPU purchases would be required just to run models, not improve them—benefiting NVIDIA while increasing operational costs for AI companies permanently.
You just read a 3-minute summary of a 140-minute episode.
Get Deep Questions with Cal Newport summarized like this every Monday — plus up to 2 more podcasts, free.
Pick Your Podcasts — FreeKeep Reading
More from Deep Questions with Cal Newport
Are We About to Lose Control of AI? | AI Reality Check
Jun 11 · 20 min
20VC (20 Minute VC)
20VC: Cursor Acquired for $60BN by xAI | Anthropic Hits $1TRN in Secondary Markets | Did Anthropic Just Kill Figma, Adobe and Canva | Rippling Hits $1BN in ARR | Salesforce Goes Headless: Smart or Stupid | Cerebras IPO 2.0
Apr 23
More from Deep Questions with Cal Newport
Should I Press Pause? | Monday Advice
Jun 8 · 33 min
20VC (20 Minute VC)
20VC: Anthropic vs The Pentagon: Who Wins | OpenAI's $110BN Mega Round | Cursor Hits $2BN in ARR | Block's 40% Headcount Reduction: AI or Overhiring
Mar 5
More from Deep Questions with Cal Newport
We summarize every new episode. Want them in your inbox?
Are We About to Lose Control of AI? | AI Reality Check
Should I Press Pause? | Monday Advice
How Do I Escape the “Busyness Singularity”? | Monday Advice
Did AI Just “Solve” Math? (Let’s Take a Closer Look) | AI Reality Check
How Do I Reclaim My Schedule? (w/ Laura Vanderkam) | Monday Advice
Similar Episodes
Related episodes from other podcasts
20VC (20 Minute VC)
Apr 23
20VC: Cursor Acquired for $60BN by xAI | Anthropic Hits $1TRN in Secondary Markets | Did Anthropic Just Kill Figma, Adobe and Canva | Rippling Hits $1BN in ARR | Salesforce Goes Headless: Smart or Stupid | Cerebras IPO 2.0
20VC (20 Minute VC)
Mar 5
20VC: Anthropic vs The Pentagon: Who Wins | OpenAI's $110BN Mega Round | Cursor Hits $2BN in ARR | Block's 40% Headcount Reduction: AI or Overhiring
20VC (20 Minute VC)
Jun 11
20VC: SpaceX Launches Largest Ever IPO | OpenAI Files to Go Public | Uber Cuts 23% of HR | Lovable Hits $500M ARR | Founders Revolt Against VCs: The Fundraising Horror Stories Going Viral
20VC (20 Minute VC)
May 28
20VC: OpenAI & SpaceX S1 Drops | NVIDIA's $81BN Revenue Quarter | Cloudlfare and ClickUp Do Controversial Layoffs | Exa, OpenRouter and Polsia Raise Mega Rounds | Uber and Microsoft Declare AI ROI for Developers is Questionable
20VC (20 Minute VC)
Apr 30
20VC: Anthropic Raises $45BN but Falls Short on Compute | OpenAI Crushes with GPT5.5 and Codex: Back in the Game? | China Blocks Manus $2BN Deal to Meta | Thoma Bravo Hand Back Medallia Keys to Creditors | Why Google is a Bigger Buy Than Ever Before
Explore Related Topics
This podcast is featured in Best Mindset Podcasts (2026) — ranked and reviewed with AI summaries.
You're clearly into Deep Questions with Cal Newport.
Every Monday, we deliver AI summaries of the latest episodes from Deep Questions with Cal Newport and 192+ other podcasts. Free for up to 3 shows.
Start My Monday DigestNo credit card · Unsubscribe anytime