15RL Framework: Incident Response for AI Agent Failures
15RL Framework: Incident Response for AI Agent Failures
Introduction
Traditional incident response frameworks (NIST SP 800-61, SANS, ITIL) were designed for infrastructure and application failures. AI agent incidents have distinct characteristics that require adapted response procedures: non-deterministic root causes, reasoning-chain analysis, policy-based remediation, and the possibility that the agent is actively making the situation worse during the response window. 15 Research Lab developed this incident response framework specifically for AI agent failures, incorporating lessons from 42 reconstructed incidents.
Agent Incident Characteristics
AI agent incidents differ from traditional incidents in four key ways:
The 15RL Agent Incident Response Process
Phase 1: Detection
Objective: Identify that an agent incident is occurring. Detection sources:- Safety policy violations flagged by the action gating system
- Anomalous agent behavior detected by monitoring (unusual action patterns, elevated error rates, unexpected resource access)
- Cost alerts triggered by spending anomalies
- User reports of unexpected agent behavior
- Audit log analysis revealing policy-violating actions
Phase 2: Containment
Objective: Stop the agent from causing additional damage. Immediate containment (within minutes):Phase 3: Analysis
Objective: Understand what happened, why, and what damage occurred. Step 3a: Impact Assessment- What systems were affected?
- What data was accessed, modified, or exposed?
- What is the blast radius (number of affected users, systems, records)?
- Is there ongoing risk (exposed credentials, modified configurations)?
Using audit logs, reconstruct the complete sequence of agent actions from the beginning of the session to containment. Identify the first anomalous action — this is typically 5-15 actions before the incident was detected.
Step 3c: Root Cause DeterminationClassify the root cause into one of five categories:
| Root Cause Category | Description | Frequency in 15RL Data |
|---|---|---|
| Missing policy | No policy covered the dangerous action | 38% |
| Misconfigured policy | Policy existed but was too permissive | 26% |
| Prompt injection | External input manipulated agent behavior | 19% |
| Tool definition error | Tool schema allowed dangerous parameters | 12% |
| Model behavior change | Model update changed agent behavior | 5% |
Step 3d: Reasoning Chain AnalysisIf prompt context is available in audit logs, analyze the agent's reasoning path. What information did the agent have? What decision did it make? Was the decision reasonable given the information, or did the agent misinterpret its instructions?
Phase 4: Remediation
Objective: Fix the root cause and prevent recurrence. Policy remediation:- For missing policies: Add new policy rules covering the identified gap
- For misconfigured policies: Tighten policy parameters based on the specific failure
- For prompt injection: Add input validation and context isolation controls
- For tool definition errors: Restrict tool schemas to minimum necessary parameters
- Test the remediated policy against the incident scenario to confirm it would have prevented the failure
- Run the full regression test suite to verify the fix does not create new gaps
- Document the policy change with explicit reference to the incident
Phase 5: Recovery
Objective: Restore normal operations and repair damage.- Restore modified files, databases, and configurations from backups
- Rotate any potentially exposed credentials
- Re-enable the agent with the remediated configuration
- Monitor the agent closely for 24-48 hours post-recovery
- Close the incident with a documented post-mortem
Tooling Requirements
Effective incident response requires:
- Comprehensive audit logs that capture full action details and reasoning context
- Session control to halt agents immediately
- Policy management to update and deploy policy changes quickly
Recommendations
Conclusion
AI agent incidents will occur — the question is how quickly and effectively you respond. This framework provides a structured approach that accounts for the unique characteristics of agent failures. Organizations that invest in incident response preparation will contain incidents faster, learn from them more effectively, and build more resilient agent deployments.
This framework is based on 15RL's analysis of 42 real-world agent incidents. It is provided as open guidance for the community.