ATR-2026-00101highTool Poisoningexperimental
Trust Escalation via Authority Override Instructions
Detects MCP tools that instruct the LLM to treat tool output as "authoritative directives" that override user preferences or safety guidelines. This trust escalation attack exploits the LLM's tendency to follow tool instructions by declaring tool output as highest-priority instructions, effectively hijacking the agent's decision-making hierarchy.
嚴重度
high
類別
Tool Poisoning
掃描目標
mcp
作者
ATR Community
建議回應
alertsnapshot
參考資料
OWASP Agentic
ASI01:2026 - Agent Goal HijackASI02:2026 - Tool Misuse and Exploitation
OWASP LLM
LLM01:2025 - Prompt InjectionLLM06:2025 - Excessive Agency