Security for AI Agents Using an Ensemble of Fine-tuned Mo...

Securing AI Agents via Specialized SLM Ensemble Firewalls

            
                BSidesSF 2026
            
                video
            
            2026-05-20T00:00:00

Abstract

This talk addresses the non-deterministic risks associated with AI agents, specifically "Goal Hijacking" (via prompt injection) and "Rogue Agent" behavior (autonomous errors). The speaker proposes a runtime security architecture that functions as a "firewall" between an agent's reasoning process and its tool execution. By utilizing an ensemble of specialized, fine-tuned Small Language Models (SLMs) and a self-improving Retrieval-Augmented Generation (RAG) memory system, the proposed solution enables real-time, contextual security gating with low latency and reduced false positives.

Loading executive summary...

Loading full markdown...

Match Rate: 9.00/10 (Relevance to core cybersecurity goals)

Securing AI Agents via Specialized SLM Ensemble Firewalls

Abstract

SHARE INTELLIGENCE WIRE