Skip to main content
No Priors: Artificial Intelligence | Technology | Startups

The Future of Voice AI: Agents, Dubbing, and Real-Time Translation with ElevenLabs Co-Founder Mati Staniszewski

41 min episode · 2 min read
·

Episode

41 min

Read time

2 min

Topics

Startups, Artificial Intelligence

AI-Generated Summary

Key Takeaways

  • Multi-segment business model: ElevenLabs splits revenue 50-50 between self-serve creator subscriptions and enterprise sales-led agents platform, serving both individual audiobook narrators and Fortune 500 customer support systems with the same foundational voice models underneath both product lines.
  • Voice quality differentiation: Audio AI requires architectural breakthroughs over compute scale. ElevenLabs employs roughly 10 of the estimated 50-100 top audio researchers globally, focusing on controllability and emotional expression rather than just transcription accuracy to beat larger labs on benchmarks.
  • Research-product sequencing: Teams get three months to ship product solutions before waiting for research breakthroughs. This parallel approach prevents product roadmaps from stalling on uncertain research timelines while allowing engineering to build integrations and workflows that capture value when models improve.
  • Cascaded versus fused architectures: Enterprise conversational agents require cascaded speech-to-text plus text-to-speech systems for reliability and tool-calling capabilities today. Fused speech-to-speech models enable more expressive consumer experiences but lack the structured control enterprises need for production deployment over the next year.

What It Covers

ElevenLabs co-founder Mati Staniszewski discusses building a voice AI company from zero to $300M ARR in three years, spanning creative dubbing tools and conversational agent platforms serving 5M monthly users across enterprise and self-serve segments.

Key Questions Answered

  • Multi-segment business model: ElevenLabs splits revenue 50-50 between self-serve creator subscriptions and enterprise sales-led agents platform, serving both individual audiobook narrators and Fortune 500 customer support systems with the same foundational voice models underneath both product lines.
  • Voice quality differentiation: Audio AI requires architectural breakthroughs over compute scale. ElevenLabs employs roughly 10 of the estimated 50-100 top audio researchers globally, focusing on controllability and emotional expression rather than just transcription accuracy to beat larger labs on benchmarks.
  • Research-product sequencing: Teams get three months to ship product solutions before waiting for research breakthroughs. This parallel approach prevents product roadmaps from stalling on uncertain research timelines while allowing engineering to build integrations and workflows that capture value when models improve.
  • Cascaded versus fused architectures: Enterprise conversational agents require cascaded speech-to-text plus text-to-speech systems for reliability and tool-calling capabilities today. Fused speech-to-speech models enable more expressive consumer experiences but lack the structured control enterprises need for production deployment over the next year.

Notable Moment

Staniszewski describes Ukraine building the first agentic government, deploying voice AI across all ministries for citizen services, benefits inquiries, and personalized education tutoring, with engineering leaders embedded in each department coordinating the digital transformation despite ongoing conflict.

Know someone who'd find this useful?

You just read a 3-minute summary of a 38-minute episode.

Get No Priors: Artificial Intelligence | Technology | Startups summarized like this every Monday — plus up to 2 more podcasts, free.

Pick Your Podcasts — Free

Keep Reading

More from No Priors: Artificial Intelligence | Technology | Startups

We summarize every new episode. Want them in your inbox?

Similar Episodes

Related episodes from other podcasts

Explore Related Topics

This podcast is featured in Best AI Podcasts (2026) — ranked and reviewed with AI summaries.

Read this week's Startups & Product Podcast Insights — cross-podcast analysis updated weekly.

You're clearly into No Priors: Artificial Intelligence | Technology | Startups.

Every Monday, we deliver AI summaries of the latest episodes from No Priors: Artificial Intelligence | Technology | Startups and 192+ other podcasts. Free for up to 3 shows.

Start My Monday Digest

No credit card · Unsubscribe anytime