Skip to main content
BS

Brandon Shibley

Brandon Shibley**cascade Model Architecture**edge Constraint Hierarchy**knowledge Distillation for Small Models**mlops for Distributed Edge Deployments
1episode
1podcast

We have 1 summarized appearance for Brandon Shibley so far. Browse all podcasts to discover more episodes.

Featured On 1 Podcast

Top resources Brandon Shibley mentions

Books, tools, and gear cited across podcast appearances. Ranked by frequency.

SignalCast may earn commission on purchases via affiliate links on each resource page.

All Appearances

1 episode
Practical AI

AI at the Edge is a different operating environment

Practical AI
47 minEdge AI Solutions Engineering Lead at Edge Impulse

AI Summary

→ WHAT IT COVERS Brandon Shibley, Edge AI Solutions Engineering Lead at Edge Impulse (a Qualcomm company), explains how AI deployment at the edge differs fundamentally from cloud environments in 2026, covering hardware constraints, model cascades, MLOps challenges, and the expanding capability of small models on battery-powered devices. → KEY INSIGHTS - **Cascade model architecture:** Rather than running a single large model continuously, deploy a pipeline where a lightweight object detector (such as YOLO) filters out 99% of incoming frames, then passes only relevant detections to a vision-language model for deeper analysis. This approach dramatically reduces power consumption on constrained edge hardware. - **Edge constraint hierarchy:** Design edge AI systems by prioritizing five constraints in order: size, power, connectivity reliability, cost, and latency. Latency requirements vary by application — microseconds for manufacturing lines and autonomous vehicles, acceptable seconds for conversational agents — and this requirement determines where computation must physically live. - **Knowledge distillation for small models:** Compress large frontier models into specialized edge-deployable models by generating extensive query-response pairs from the large model, then training a smaller model on that output. The resulting model retains only domain-relevant knowledge, enabling single-digit to tens-of-billions parameter models to run on devices with under 128GB memory. - **MLOps for distributed edge deployments:** Implement over-the-air update frameworks with version control to manage model drift on deployed devices. Because edge environments change over time, continuously collect new field data, retrain updated model versions centrally using aggregated data from all devices, then roll out updates in controlled stages rather than per-device retraining. - **Low-cost prototyping path:** Start edge AI experimentation using Arduino maker hardware combined with a free Edge Impulse account at edgeimpulse.com. This combination supports data collection, model training, target-aware optimization, and deployment without enterprise hardware. Proof-of-concept builds on commodity hardware translate directly into enterprise-scale production pipelines using the same platform. → NOTABLE MOMENT Shibley reframes biological intelligence as the ultimate edge AI model — organisms have processed sensor data locally for millions of years without cloud connectivity. He argues this biological architecture, where intelligence lives directly at the sensor, is the long-term trajectory for embedded AI systems. 💼 SPONSORS [{"name": "Prediction Guard", "url": "https://predictionguard.com"}] 🏷️ Edge AI, Small Language Models, MLOps, Neural Processing Units, Embedded Systems

Explore More

Never miss Brandon Shibley's insights

Subscribe to get AI-powered summaries of Brandon Shibley's podcast appearances delivered to your inbox weekly.

Start Free Today

No credit card required • Free tier available