ATR-2026-00104criticalPrompt Injectionexperimental
Persona Hijacking via Mandatory System Prompt Override
Detects MCP tools that attempt to override system prompts or behavioral guidelines by instructing the AI to "adopt" a persona and "replace" existing instructions. This is a prompt injection attack delivered through tool descriptions rather than user input, exploiting the trust relationship between the LLM and its tools to fundamentally alter the agent's behavior and identity.
嚴重度
critical
類別
Prompt Injection
掃描目標
mcp
作者
ATR Community
建議回應
block toolalertsnapshot
參考資料
OWASP Agentic
ASI01:2026 - Agent Goal HijackASI10:2026 - Rogue Agents
OWASP LLM
LLM01:2025 - Prompt InjectionLLM07:2025 - System Prompt Leakage
MITRE ATLAS
AML.T0051 - Prompt Injection