Dealing with increasingly complicated agents
Episode
54 min
Read time
2 min
Topics
Productivity, Design & UX, Artificial Intelligence
AI-Generated Summary
Key Takeaways
- ✓Agent Security Model: Any tool exposed to an LLM becomes accessible to anyone controlling LLM input through prompt injection, requiring deterministic authorization controls outside the model itself.
- ✓Password Attack Analogy: Jailbreaking resembles password cracking - focus on limiting attempt frequency rather than perfect blocking, using guardrails as detection signals to suspend suspicious users after multiple triggers.
- ✓Code-Then-Execute Pattern: Generate execution plans before untrusted data enters context, using data flow analysis to enforce tool policies based on input source trustworthiness - most promising security design pattern.
- ✓Complexity Explosion: Modern agent workflows mix multiple untrusted data sources in single LLM contexts, where any malicious component can compromise the entire system through cross-contamination attacks.
What It Covers
Donato Capitella from Reversec explains how AI agents accessing external tools create massive security vulnerabilities, requiring new design patterns beyond traditional LLM red teaming approaches.
Key Questions Answered
- •Agent Security Model: Any tool exposed to an LLM becomes accessible to anyone controlling LLM input through prompt injection, requiring deterministic authorization controls outside the model itself.
- •Password Attack Analogy: Jailbreaking resembles password cracking - focus on limiting attempt frequency rather than perfect blocking, using guardrails as detection signals to suspend suspicious users after multiple triggers.
- •Code-Then-Execute Pattern: Generate execution plans before untrusted data enters context, using data flow analysis to enforce tool policies based on input source trustworthiness - most promising security design pattern.
- •Complexity Explosion: Modern agent workflows mix multiple untrusted data sources in single LLM contexts, where any malicious component can compromise the entire system through cross-contamination attacks.
Notable Moment
Capitella demonstrates how attackers can inject malicious emails into support ticket databases, later triggering phishing responses when legitimate customers submit related queries through the system.
You just read a 3-minute summary of a 51-minute episode.
Get Practical AI summarized like this every Monday — plus up to 2 more podcasts, free.
Pick Your Podcasts — FreeKeep Reading
More from Practical AI
Breaking down the 2026 Stanford AI Index Report
Jun 4 · 47 min
Eye on AI
Every Enterprise Is About to Have a 100,000 Agent Problem | Oren Michaels of Barndoor AI
Jun 6
More from Practical AI
Rebooting Enterprise AI with MCP and Kubernetes
May 28 · 48 min
No Priors: Artificial Intelligence | Technology | Startups
Building an AI Guardian for Enterprise with Onyx Security CEO Maxim Bar Kogan
May 28
More from Practical AI
We summarize every new episode. Want them in your inbox?
Breaking down the 2026 Stanford AI Index Report
Rebooting Enterprise AI with MCP and Kubernetes
Hermes Agent: Agents that grow with you
U.S. Congressman Beyer on AI challenges facing America and the World
The Myth of Model Wars: Open vs Closed AI in 2026
Similar Episodes
Related episodes from other podcasts
Eye on AI
Jun 6
Every Enterprise Is About to Have a 100,000 Agent Problem | Oren Michaels of Barndoor AI
No Priors: Artificial Intelligence | Technology | Startups
May 28
Building an AI Guardian for Enterprise with Onyx Security CEO Maxim Bar Kogan
NVIDIA AI Podcast
May 6
Harrison Chase of LangChain on Deep Agents, LangSmith, and Earning Trust | NVIDIA AI Podcast Ep. 297
Cognitive Revolution
Mar 22
Your Agent's Self-Improving Swiss Army Knife: Composio CTO Karan Vaidya on Building Smart Tools
We Study Billionaires
Feb 18
TECH015: OpenClaw and Self Sovereign AI w/ Alex Gladstein and Justin Moon (Tech Podcast)
Explore Related Topics
This podcast is featured in Best AI Podcasts (2026) — ranked and reviewed with AI summaries.
Read this week's AI & Machine Learning Podcast Insights — cross-podcast analysis updated weekly.
You're clearly into Practical AI.
Every Monday, we deliver AI summaries of the latest episodes from Practical AI and 192+ other podcasts. Free for up to 3 shows.
Start My Monday DigestNo credit card · Unsubscribe anytime