ATR-2026-00072 | critical | Model Abuse | experimental

Model Behavior Extraction

Detects systematic probing attempts to extract model behavior, decision boundaries, system prompts, or effective weights through carefully crafted queries. Attackers use repeated boundary-testing prompts, confidence score harvesting, and systematic parameter probing to reverse-engineer the model's internal behavior, enabling model cloning, bypass development, or intellectual property theft.
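As a rough illustration of the kind of detection described above, the following is a minimal sketch and not the rule's actual signature set. It assumes a per-session sliding window over incoming prompts; the regex patterns, the `ProbeDetector` class, and the window and threshold values are all hypothetical and chosen only for the example.

```python
import re
from collections import defaultdict, deque

# Hypothetical indicator patterns (assumptions, not the ATR rule's real signatures).
PROBE_PATTERNS = [
    # Attempts to reveal the system prompt
    re.compile(r"\b(system prompt|initial instructions)\b", re.I),
    # Confidence score harvesting
    re.compile(r"\b(confidence (score|level)|logits?|log[- ]?probs?|probabilit(y|ies))\b", re.I),
    # Boundary testing and parameter probing
    re.compile(r"\b(decision boundary|threshold|temperature|top[-_ ]?p|model weights)\b", re.I),
]


class ProbeDetector:
    """Flags a session once enough recent prompts look like extraction probes."""

    def __init__(self, window: int = 20, threshold: int = 5):
        # window: number of recent prompts remembered per session (assumed value)
        # threshold: probe-like prompts within the window that trigger a flag (assumed value)
        self.threshold = threshold
        self.history = defaultdict(lambda: deque(maxlen=window))

    def observe(self, session_id: str, prompt: str) -> bool:
        """Record one prompt; return True if the session now looks like systematic probing."""
        hit = any(p.search(prompt) for p in PROBE_PATTERNS)
        self.history[session_id].append(hit)
        return sum(self.history[session_id]) >= self.threshold
```

Counting hits over a window rather than flagging a single match reflects the "systematic" and "repeated" wording of the description: one question about confidence is ordinary, while a sustained run of such prompts in the same session is the pattern of interest.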

Severity
critical
Category
Model Abuse
Scan target
mcp
Author
ATR Community

Recommended response

block input, alert, snapshot, escalate, reduce permissions

References

OWASP Agentic
ASI04:2026 - Agentic Supply Chain Vulnerabilities
OWASP LLM
LLM10:2025 - Unbounded Consumption
LLM06:2025 - Excessive Agency
MITRE ATLAS
AML.T0044 - Full ML Model Access
AML.T0024 - Exfiltration via ML Inference API