Threat Taxonomy

Not every problem is just “risky”.
We split trust-breaking patterns apart on purpose.

A skill usually becomes untrustworthy for a concrete reason: it crosses its declared boundary, opens a theft or egress path, or imports extra supply-chain risk into the environment. The taxonomy exists to make those reasons explicit.

Direct compromise patterns

These are usually enough to justify blocking installation outright.

Code execution

Arbitrary shell, dynamic execution, or download-and-run behavior.

Credential theft

Reading keys, tokens, SSH material, or cloud credentials and exporting them.

Data exfiltration

Sending local content, environment variables, or conversation data outward.

Privilege escalation

Trying to expand system privilege or cross a stronger control boundary.

Deception and concealment patterns

These sharply reduce trust even when they do not always form a full attack chain on their own.

Doc deception

README or SKILL.md promises do not match the real behavior.

Prompt injection

External content or docs steer the model into undeclared actions.

Obfuscation

Encoding, staged execution, or transformed logic hides the real intent.

Environment expansion patterns

These patterns widen the final risk surface and heavily influence the install recommendation.

Supply chain attacks

Unpinned packages, malicious dependencies, remote scripts, or submodule risk.

Persistence

Mechanisms that continue running after installation or restart.

Sensitive access

Reading system files, home directories, browser data, or unrelated private content.

What each category means in a trust decision

code_execution Critical

Code execution

Once a skill can execute arbitrary commands, installation risk becomes system risk.

credential_theft Critical

Credential theft

If it can read and export credentials, trust is usually gone immediately.

data_exfiltration Critical

Data exfiltration

Local data or model context leaving the system is a high-priority block signal.

privilege_escalation Critical

Privilege escalation

Anything that tries to expand privilege should not be installed by default.

doc_deception High

Doc deception

When claims and behavior diverge, the report treats it as a trust fracture first.

supply_chain High

Supply chain

Dependencies, remote scripts, and version policy can turn a normal-looking skill into a risk entry point.

persistence High

Persistence

It allows the risk to outlive the install action itself.

obfuscation Medium

Obfuscation

It may not be malicious on its own, but it often exists to hide something worse.

prompt_injection Medium

Prompt injection

The focus is not the prompt itself, but whether it drives the model beyond the declared boundary.

sensitive_access Medium

Sensitive access

Reading system and user data beyond task scope materially lowers the chance of allowing it.

Next Action

If you want to see these categories on real samples, open the risk stream or submit a scan.

Open Risk Stream Start Trust Review