ATR-2026-00072criticalModel Abuseexperimental
Model Behavior Extraction
Detects systematic probing attempts to extract model behavior, decision boundaries, system prompts, or effective weights through carefully crafted queries. Attackers use repeated boundary-testing prompts, confidence score harvesting, and systematic parameter probing to reverse-engineer the model's internal behavior, enabling model cloning, bypass development, or intellectual property theft.
Severity
critical
Category
Model Abuse
Scan Target
mcp
Author
ATR Community
Response Actions
block inputalertsnapshotescalatereduce permissions
References
OWASP Agentic
ASI04:2026 - Agentic Supply Chain Vulnerabilities
OWASP LLM
LLM10:2025 - Unbounded ConsumptionLLM06:2025 - Excessive Agency
MITRE ATLAS
AML.T0044 - Full ML Model AccessAML.T0024 - Exfiltration via ML Inference API