minimal-agent
Skill declares minimal OS control capabilities but secretly implements unrestricted arbitrary command execution through undocumented V1/text mode, enabling full system compromise.
The skill implements a 'V1/text mode' that allows execution of ANY shell command, but this is not declared in allowed-tools. The documentation only mentions 'minimal OS control' with controlled hardware operations, yet the actual code exposes unlimited command execution.
scripts/agent.py:459 Why this conclusion was reached
2/4 dimensions flagged4 undeclared or violating capabilities were inferred.
3 lower-risk artifacts were extracted and still need context.
The report includes 7 attack-chain steps and 4 severe findings.
Dependencies are present but no obvious high-risk issue stands out.
Attack Chain
deception · SKILL.md:1
deception · SKILL.md:65
access · scripts/agent.py:444
Execution · scripts/agent.py:459
Impact · scripts/agent.py:459
Impact · scripts/agent.py:459
Impact · scripts/agent.py:459
What drove the risk score up
V1/text mode enables ANY shell command not declared in allowed-tools
Documentation claims minimal OS control but actual code allows full system access
execute_raw_command() passes user input directly to shell=True
Command parameters passed directly without sanitization
Most important evidence
Unrestricted Arbitrary Command Execution via V1 Mode
The skill implements a 'V1/text mode' that allows execution of ANY shell command, but this is not declared in allowed-tools. The documentation only mentions 'minimal OS control' with controlled hardware operations, yet the actual code exposes unlimited command execution.
scripts/agent.py:459 Documentation Mismatch: Claims Minimal Control But Enables Full System Access
SKILL.md describes this as a '极简 AI 操作系统控制代理' (minimal AI OS control agent) for window management, process control, hardware operations, GUI automation, serial communication, and IoT control. However, the V1 mode documented in SKILL.md enables unrestricted command execution including: file operations, script execution, network operations, database operations, system management, software installation, and security operations.
SKILL.md:67 Direct Shell Injection via User Input
The execute_raw_command() function passes user input directly to shell=True without any sanitization or validation. Commands like 'python agent.py --text "rm -rf /"' would execute immediately.
scripts/agent.py:459 Undeclared Elevated Permissions
The skill declares filesystem:WRITE and shell:WRITE, but V1 mode actually provides filesystem:ADMIN and shell:ADMIN equivalent capabilities, enabling: user management, service control, registry editing, firewall configuration, software installation/removal.
SKILL.md:87 Python venv Path Hardcoded with User Home Directory
Line 114 hardcodes 'C:\Users\wave\.workbuddy\binaries\python\envs\default\Scripts\python.exe' which could be leveraged for persistence or DLL hijacking attacks.
scripts/agent.py:114 Potential Environment Variable Access for API Keys
V1 mode can execute commands like 'env' or 'printenv' to extract API keys, tokens, and other secrets from environment variables. This is not declared in allowed-tools.
scripts/agent.py:459 Declared capability vs actual capability
agent.py:run_script() uses subprocess agent.py + V1 mode enables any file operation — — — — Suspicious artifacts and egress
https://skillhub.tencent.com/skills/system-controller SKILL.md:262
https://clawhub.ai/wangjiaocheng/system-controller SKILL.md:271
https://clawhub.ai/wangjiaocheng/system-controller/archive/main.zip SKILL.md:275
Dependencies and supply chain
| Package | Version | Source | Known vuln | Notes |
|---|---|---|---|---|
| subprocess | stdlib | Python | No | Standard library only |
| tomllib | stdlib | Python | No | Standard library only |
File composition
scripts/agent.py SKILL.md