Securing AI Agents via Specialized SLM Ensemble Firewalls

BSidesSF 2026 video 2026-05-20T00:00:00

Abstract

This talk addresses the non-deterministic risks associated with AI agents, specifically "Goal Hijacking" (via prompt injection) and "Rogue Agent" behavior (autonomous errors). The speaker proposes a runtime security architecture that functions as a "firewall" between an agent's reasoning process and its tool execution. By utilizing an ensemble of specialized, fine-tuned Small Language Models (SLMs) and a self-improving Retrieval-Augmented Generation (RAG) memory system, the proposed solution enables real-time, contextual security gating with low latency and reduced false positives.

Loading executive summary...
Loading full markdown...
Match Rate: 9.00/10 (Relevance to core cybersecurity goals)

LINK COPIED TO CLIPBOARD