Tuhin Srivastava

1 episode
1 podcast

We have 1 summarized appearance for Tuhin Srivastava so far. Browse all podcasts to discover more episodes.

Featured On 1 Podcast

All Appearances

1 episode

AI Summary

→ WHAT IT COVERS
Baseten CEO Tuhin Srivastava explains how his AI inference company pivoted from serving data scientists with small models to becoming the fastest-growing inference provider for production applications.

→ KEY INSIGHTS
- **Company pivoting:** Stay lean during market shifts. Baseten remained at 18 people from 2019 to 2023, which let it pivot quickly, without organizational weight, when ChatGPT and Stable Diffusion created new opportunities.
- **Inference differentiation:** Focus on dedicated capacity over shared endpoints. 99% of Baseten's business serves custom models on dedicated infrastructure, avoiding the commoditized market for shared model serving.
- **Technical optimization:** Modern LLM inference requires both infrastructure that scales across thousands of GPUs and runtime optimization with frameworks such as vLLM, TensorRT-LLM, and SGLang.
- **Market positioning:** Open source adoption follows a predictable pattern. Companies start with Anthropic or OpenAI, then switch to open source models for cost control, reliability, and data privacy.

→ NOTABLE MOMENT
Srivastava reveals that Baseten killed three of its four products in 2022, including an application builder that consumed two dozen employees for 2.5 years of development work.

💼 SPONSORS
None detected

🏷️ AI Inference, Machine Learning Infrastructure, Startup Pivoting, LLM Deployment
