ATR-2026-00072criticalModel Abuseexperimental
Model Behavior Extraction
Detects systematic probing attempts to extract model behavior, decision boundaries, system prompts, or effective weights through carefully crafted queries. Attackers use repeated boundary-testing prompts, confidence score harvesting, and systematic parameter probing to reverse-engineer the model's internal behavior, enabling model cloning, bypass development, or intellectual property theft.
嚴重度
critical
類別
Model Abuse
掃描目標
mcp
作者
ATR Community
建議回應
block inputalertsnapshotescalatereduce permissions
參考資料
OWASP Agentic
ASI04:2026 - Agentic Supply Chain Vulnerabilities
OWASP LLM
LLM10:2025 - Unbounded ConsumptionLLM06:2025 - Excessive Agency
MITRE ATLAS
AML.T0044 - Full ML Model AccessAML.T0024 - Exfiltration via ML Inference API