Skip to content
ATR-2026-00140highPrompt Injectionexperimental

Indirect Reference Instruction Reversal

Detects attempts to extract and reverse original instructions by probing what the agent was told, then requesting the opposite. Crystallized from 31 adversarial samples (0% prior detection).

Severity
high
Category
Prompt Injection
Scan Target
mcp
Author
ATR Threat Cloud Crystallization

Response Actions

alertblock input

References

OWASP Agentic
ASI01 - Prompt Injection
OWASP LLM
LLM01:2025 - Prompt Injection