Four CEOs on the Future of AI: CoreWeave, Perplexity, Mistral, and IREN
Episode
97 min
Read time
3 min
Topics
Productivity, Fundraising & VC, Leadership
AI-Generated Summary
Key Takeaways
- ✓GPU Depreciation Reality: CoreWeave uses a six-year depreciation schedule for GPUs, and average customer contracts run five years. The claim that GPUs become obsolete in 16–18 months is driven by short-sellers, not operational data. Older Ampere A100s have actually appreciated in price through the year as new companies with smaller models enter the market and absorb previously unavailable capacity at lower cost points.
- ✓Project Finance "Box" Structure: CoreWeave finances GPU infrastructure by isolating each client deal into a discrete special-purpose vehicle containing the customer contract, GPU assets, and data center lease. Cash flows in a waterfall — paying power, interest, and principal first — with surplus returning to CoreWeave. Within 2.5 years of a 5-year deal, all principal and interest is repaid. This structure enabled $35B raised in 18 months and reduced cost of capital by 600 basis points.
- ✓AI Orchestration as Competitive Moat: Perplexity's strategy centers on model-agnostic orchestration — routing queries across GPT, Claude, Gemini, Kimi, Qwen, and others based on task specialization. A "Model Council" feature runs the same prompt across multiple models simultaneously and surfaces where they agree, disagree, and diverge in nuance. This positions Perplexity as infrastructure-layer neutral, capturing value regardless of which frontier model wins.
- ✓Hybrid Local-Server AI Architecture: Perplexity's "Personal Computer" product synchronizes server-side agent orchestration with a local Mac Mini, running privacy-sensitive tasks — tax data, personal notes, local files — on-device while delegating complex long-running tasks to server infrastructure. Users install one executable with no API key management. This hybrid model addresses enterprise and consumer privacy concerns while maintaining access to frontier model capabilities on demand.
- ✓Enterprise AI Requires Data Governance Primitives: Deploying AI agents across enterprise data requires a "context engine" — a semantic map of data sources tagged with access permissions — to prevent compensation data, HR records, or confidential IP from flowing to unauthorized employees or external systems. Mistral deploys forward engineers on-site at client facilities, keeping all training data within the customer's own infrastructure with zero data flowing back to Mistral's servers.
What It Covers
Four AI infrastructure and software CEOs — CoreWeave's Michael Intrator, Perplexity's Aravind Srinivas, Mistral's Arthur Mensch, and IREN's Daniel Roberts — speak at NVIDIA's GTC conference about GPU financing structures, agentic computing, enterprise AI deployment, open-source model specialization, and the physical infrastructure constraints shaping the next decade of AI development.
Key Questions Answered
- •GPU Depreciation Reality: CoreWeave uses a six-year depreciation schedule for GPUs, and average customer contracts run five years. The claim that GPUs become obsolete in 16–18 months is driven by short-sellers, not operational data. Older Ampere A100s have actually appreciated in price through the year as new companies with smaller models enter the market and absorb previously unavailable capacity at lower cost points.
- •Project Finance "Box" Structure: CoreWeave finances GPU infrastructure by isolating each client deal into a discrete special-purpose vehicle containing the customer contract, GPU assets, and data center lease. Cash flows in a waterfall — paying power, interest, and principal first — with surplus returning to CoreWeave. Within 2.5 years of a 5-year deal, all principal and interest is repaid. This structure enabled $35B raised in 18 months and reduced cost of capital by 600 basis points.
- •AI Orchestration as Competitive Moat: Perplexity's strategy centers on model-agnostic orchestration — routing queries across GPT, Claude, Gemini, Kimi, Qwen, and others based on task specialization. A "Model Council" feature runs the same prompt across multiple models simultaneously and surfaces where they agree, disagree, and diverge in nuance. This positions Perplexity as infrastructure-layer neutral, capturing value regardless of which frontier model wins.
- •Hybrid Local-Server AI Architecture: Perplexity's "Personal Computer" product synchronizes server-side agent orchestration with a local Mac Mini, running privacy-sensitive tasks — tax data, personal notes, local files — on-device while delegating complex long-running tasks to server infrastructure. Users install one executable with no API key management. This hybrid model addresses enterprise and consumer privacy concerns while maintaining access to frontier model capabilities on demand.
- •Enterprise AI Requires Data Governance Primitives: Deploying AI agents across enterprise data requires a "context engine" — a semantic map of data sources tagged with access permissions — to prevent compensation data, HR records, or confidential IP from flowing to unauthorized employees or external systems. Mistral deploys forward engineers on-site at client facilities, keeping all training data within the customer's own infrastructure with zero data flowing back to Mistral's servers.
- •Power Arbitrage as Data Center Strategy: IREN secures renewable energy by co-locating data centers at the generation source — wind and solar in West Texas, hydro in British Columbia — rather than near population centers. West Texas has 45–50 gigawatts of wind and solar but only 12 gigawatts of transmission capacity to load centers. Data centers monetize stranded renewable energy locally and export compute output at the speed of light, eliminating transmission infrastructure costs entirely.
- •Jevons Paradox Drives Compute Demand: Faster, cheaper inference does not reduce total compute consumption — it expands it. As image generation drops from two minutes to five seconds with 10x more available compute, users generate exponentially more images. Every software efficiency gain lowers the cost per token, which induces new use cases, new users, and new applications that consume more aggregate compute than existed before, creating a self-reinforcing demand cycle with no visible ceiling.
Notable Moment
Perplexity's Aravind Srinivas revealed that enterprise customers on the $400-per-month maximum tier have collectively saved over $100 million, and that the enterprise segment is now the company's fastest-growing revenue line — outpacing consumer growth. Despite having only around 400 employees, every dollar of Perplexity's revenue carries positive gross margins due to multi-model routing efficiency.
You just read a 3-minute summary of a 94-minute episode.
Get All-In with Chamath, Jason, Sacks & Friedberg summarized like this every Monday — plus up to 2 more podcasts, free.
Pick Your Podcasts — FreeKeep Reading
More from All-In with Chamath, Jason, Sacks & Friedberg
World's First Trillionaire, Anthropic Fable Banned, The New Oligarchs, Iran Peace Deal
Jun 19 · 84 min
Practical AI
The mythos of Mythos and Allbirds takes flight to the neocloud
Apr 23
More from All-In with Chamath, Jason, Sacks & Friedberg
Anthropic's Fable Backlash, Nationalizing AI, Inflation Heats Up & California's Broken Elections
Jun 13 · 102 min
a16z Podcast
Ben Horowitz on AI Infrastructure, Economics and The New Laws of Software
Apr 14
Books, tools, and gear mentioned in this episode
SignalCast may earn commission on purchases via these links. As an Amazon Associate, SignalCast earns from qualifying purchases.
Tools
by OpenAI
“Perplexity's strategy centers on model-agnostic orchestration — routing queries across GPT, Claude, Gemini, Kimi, Qwen, and others based on task specialization.”
by Anthropic
“Perplexity's strategy centers on model-agnostic orchestration — routing queries across GPT, Claude, Gemini, Kimi, Qwen, and others based on task specialization.”
by Google
“Perplexity's strategy centers on model-agnostic orchestration — routing queries across GPT, Claude, Gemini, Kimi, Qwen, and others based on task specialization.”
“Perplexity's strategy centers on model-agnostic orchestration — routing queries across GPT, Claude, Gemini, Kimi, Qwen, and others based on task specialization.”
by Alibaba
“Perplexity's strategy centers on model-agnostic orchestration — routing queries across GPT, Claude, Gemini, Kimi, Qwen, and others based on task specialization.”
Gear
by NVIDIA
“Older Ampere A100s have actually appreciated in price through the year as new companies with smaller models enter the market.”
Products
by Perplexity
“Perplexity's "Personal Computer" product synchronizes server-side agent orchestration with a local Mac Mini, running privacy-sensitive tasks on-device.”
company
“CoreWeave uses a six-year depreciation schedule for GPUs, and average customer contracts run five years.”
“Perplexity's strategy centers on model-agnostic orchestration — routing queries across GPT, Claude, Gemini, Kimi, Qwen, and others based on task specialization.”
“Mistral deploys forward engineers on-site at client facilities, keeping all training data within the customer's own infrastructure with zero data flowing back to Mistral's servers.”
“IREN secures renewable energy by co-locating data centers at the generation source — wind and solar in West Texas, hydro in British Columbia.”
“Four AI infrastructure and software CEOs speak at NVIDIA's GTC conference about GPU financing structures, agentic computing, enterprise AI deployment.”
More from All-In with Chamath, Jason, Sacks & Friedberg
We summarize every new episode. Want them in your inbox?
World's First Trillionaire, Anthropic Fable Banned, The New Oligarchs, Iran Peace Deal
Anthropic's Fable Backlash, Nationalizing AI, Inflation Heats Up & California's Broken Elections
All-In's Best Ideas Pitch Competition: 4 Investors Present Their Top Trades Live
Senators John Fetterman and Dave McCormick: Bipartisanship, Money in DC, Datacenters, Graham Platner
Dan Dreyfus: America's Critical Minerals Crisis is Here
Similar Episodes
Related episodes from other podcasts
Practical AI
Apr 23
The mythos of Mythos and Allbirds takes flight to the neocloud
a16z Podcast
Apr 14
Ben Horowitz on AI Infrastructure, Economics and The New Laws of Software
Latent Space
Apr 11
SF Compute: Commoditizing Compute to solve the GPU Bubble forever
20VC (20 Minute VC)
Jun 15
20VC: Micron Will Be More Valuable Than Meta | How Export Controls Helped Not Hurt China | Power is the Bottleneck to AI | Why Dario Has Done a Disservice to AI with his Labour Replacement Messaging with Aravind Srinivas, Founder @ Perplexity
Odd Lots
Jun 13
Anjney Midha's Plan to Radically Lower the Price of Compute
Explore Related Topics
This podcast is featured in Best Tech Podcasts (2026) — ranked and reviewed with AI summaries.
You're clearly into All-In with Chamath, Jason, Sacks & Friedberg.
Every Monday, we deliver AI summaries of the latest episodes from All-In with Chamath, Jason, Sacks & Friedberg and 192+ other podcasts. Free for one show.
Start My Monday DigestNo credit card · Unsubscribe anytime