How AI Data Platforms Are Shaping the Future of Enterprise Storage - Ep. 281
Episode: 35 min
Read time: 2 min
Topics: Artificial Intelligence, Science & Discovery
AI-Generated Summary
Key Takeaways
- ✓ AI-Ready Data Pipeline: Making unstructured enterprise data usable for AI means finding and gathering it, extracting the text, chunking it into uniform sizes, enriching the chunks with metadata, embedding them as numeric vectors, and indexing those vectors in a vector database for retrieval-augmented generation (RAG) systems.
- ✓ Data Velocity Challenge: Enterprises face pressure from two directions: new data keeps arriving, and existing documents keep changing. Without tracking which files changed, organizations must reindex entire datasets repeatedly, which wastes compute in the same way as rewashing every dish when only one is dirty.
- ✓ Security Through In-Place Processing: Traditional AI pipelines create seven to thirteen copies of a dataset across different systems, and those copies are disconnected from the source permissions. When access rights change at the source, the copies remain accessible, a major security vulnerability that GPU-in-storage architecture eliminates by processing data where it lives.
- ✓ Agent Deployment in Storage: Storage vendors deploy AI agents directly on GPUs inside storage systems. Example tasks include flagging unclassified documents that should be classified and monitoring system telemetry to recommend optimizations, all without unnecessary data movement or copying.
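The pipeline stages named in the first takeaway (chunk, enrich, embed, index, retrieve) can be illustrated with a minimal sketch. Everything here is an assumption made for illustration and not NVIDIA's reference design: `chunk_words`, `toy_embed`, `VectorIndex`, and `ingest` are hypothetical names, the hashed bag-of-words embedding stands in for a real embedding model, and the in-memory list stands in for a vector database. Text extraction is assumed to have already happened.

```python
import hashlib
import math

DIM = 64  # width of the toy embedding (assumption; real models are much wider)


def chunk_words(text: str, size: int = 8, overlap: int = 2) -> list[str]:
    """Split text into windows of `size` words that overlap by `overlap` words."""
    if size <= overlap:
        raise ValueError("size must be larger than overlap")
    words = text.split()
    if not words:
        return []
    step = size - overlap
    starts = range(0, max(len(words) - overlap, 1), step)
    return [" ".join(words[i:i + size]) for i in starts]


def toy_embed(text: str) -> list[float]:
    """Stand-in embedding: hash each word into a bucket, then L2-normalize."""
    vec = [0.0] * DIM
    for token in text.lower().split():
        digest = hashlib.sha256(token.encode("utf-8")).digest()
        vec[int.from_bytes(digest[:4], "big") % DIM] += 1.0
    norm = math.sqrt(sum(v * v for v in vec)) or 1.0
    return [v / norm for v in vec]


class VectorIndex:
    """In-memory stand-in for a vector database (hypothetical)."""

    def __init__(self) -> None:
        self.rows: list[tuple[list[float], dict]] = []

    def add(self, vector: list[float], metadata: dict) -> None:
        self.rows.append((vector, metadata))

    def search(self, query: str, k: int = 3) -> list[tuple[float, dict]]:
        """Return the k rows with the highest cosine similarity to the query."""
        q = toy_embed(query)
        scored = [(sum(a * b for a, b in zip(q, vec)), meta)
                  for vec, meta in self.rows]
        scored.sort(key=lambda pair: pair[0], reverse=True)
        return scored[:k]


def ingest(index: VectorIndex, source: str, text: str) -> int:
    """Chunk, enrich, embed, and index one document; returns the chunk count."""
    chunks = chunk_words(text)
    for position, chunk in enumerate(chunks):
        # Enrichment: keep the source path and position alongside each vector.
        metadata = {"source": source, "position": position, "text": chunk}
        index.add(toy_embed(chunk), metadata)
    return len(chunks)


if __name__ == "__main__":
    index = VectorIndex()
    ingest(index, "policies/storage.txt",
           "Snapshots are retained for thirty days before they are pruned "
           "and archived copies are kept for one year")
    for score, meta in index.search("snapshots retained"):
        print(round(score, 3), meta["source"], meta["position"])
```

Keeping the source path in each chunk's metadata is a deliberate choice in this sketch: it lets a retrieval layer trace every hit back to the original file instead of treating the index as a detached copy.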
What It Covers
Jacob Lieberman explains how NVIDIA's AI data platform reference design enables GPU-accelerated storage systems that prepare enterprise data for AI agents continuously in place, eliminating security risks from data copying and movement.
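One way to prepare data continuously without reindexing everything is to track a fingerprint per file and reprocess only what changed. The sketch below is a minimal illustration under stated assumptions: the episode summary does not describe the actual change-tracking mechanism, and a storage system would more plausibly rely on its own change notifications or snapshots than on rehashing content. The names `fingerprint`, `plan_reindex`, and `apply_plan` are hypothetical, and files are modeled as an in-memory mapping of path to bytes.

```python
import hashlib


def fingerprint(content: bytes) -> str:
    """Content hash used to detect that a file has changed."""
    return hashlib.sha256(content).hexdigest()


def plan_reindex(files: dict[str, bytes],
                 manifest: dict[str, str]) -> tuple[list[str], list[str]]:
    """Compare current files with the last indexed state.

    Returns (paths to (re)index, paths to drop from the index).
    """
    to_index = sorted(path for path, data in files.items()
                      if manifest.get(path) != fingerprint(data))
    to_delete = sorted(path for path in manifest if path not in files)
    return to_index, to_delete


def apply_plan(files: dict[str, bytes], manifest: dict[str, str],
               to_index: list[str], to_delete: list[str]) -> None:
    """Record the new state once the index has been updated."""
    for path in to_index:
        manifest[path] = fingerprint(files[path])
    for path in to_delete:
        manifest.pop(path, None)


if __name__ == "__main__":
    manifest: dict[str, str] = {}
    files = {"a.txt": b"one", "b.txt": b"two"}
    changed, removed = plan_reindex(files, manifest)
    apply_plan(files, manifest, changed, removed)

    files = {"b.txt": b"two, revised", "c.txt": b"three"}
    print(plan_reindex(files, manifest))
```

In the second pass only the edited and new files are scheduled for indexing, which is the "wash only the dirty dish" behavior the summary describes.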
Notable Moment
Lieberman compares AI agents working inside storage systems to remote workers who are more productive at home: keeping compute close to the data avoids the "commute" of moving massive datasets to distant processing centers for transformation and analysis.