Threat Taxonomy Not every problem is just “risky”.
We split trust-breaking patterns apart on purpose.
A skill usually becomes untrustworthy for a concrete reason: it crosses its declared boundary, opens a theft or egress path, or imports extra supply-chain risk into the environment. The taxonomy exists to make those reasons explicit.
Direct compromise patterns
These are usually enough to justify blocking installation outright.
Code execution Arbitrary shell, dynamic execution, or download-and-run behavior.
Credential theft Reading keys, tokens, SSH material, or cloud credentials and exporting them.
Data exfiltration Sending local content, environment variables, or conversation data outward.
Privilege escalation Trying to expand system privilege or cross a stronger control boundary.
Deception and concealment patterns
These sharply reduce trust even when they do not always form a full attack chain on their own.
Doc deception README or SKILL.md promises do not match the real behavior.
Prompt injection External content or docs steer the model into undeclared actions.
Obfuscation Encoding, staged execution, or transformed logic hides the real intent.
Environment expansion patterns
These patterns widen the final risk surface and heavily influence the install recommendation.
Supply chain attacks Unpinned packages, malicious dependencies, remote scripts, or submodule risk.
Persistence Mechanisms that continue running after installation or restart.
Sensitive access Reading system files, home directories, browser data, or unrelated private content.
What each category means in a trust decision
code_execution Critical
Code execution Once a skill can execute arbitrary commands, installation risk becomes system risk.
credential_theft Critical
Credential theft If it can read and export credentials, trust is usually gone immediately.
data_exfiltration Critical
Data exfiltration Local data or model context leaving the system is a high-priority block signal.
privilege_escalation Critical
Privilege escalation Anything that tries to expand privilege should not be installed by default.
doc_deception High
Doc deception When claims and behavior diverge, the report treats it as a trust fracture first.
supply_chain High
Supply chain Dependencies, remote scripts, and version policy can turn a normal-looking skill into a risk entry point.
persistence High
Persistence It allows the risk to outlive the install action itself.
obfuscation Medium
Obfuscation It may not be malicious on its own, but it often exists to hide something worse.
prompt_injection Medium
Prompt injection The focus is not the prompt itself, but whether it drives the model beyond the declared boundary.
sensitive_access Medium
Sensitive access Reading system and user data beyond task scope materially lowers the chance of allowing it.
Next Action
If you want to see these categories on real samples, open the risk stream or submit a scan.