psychedelic-cognition
This is a prompt-injection/jailbreak skill disguised as a cognitive restructuring tool. It explicitly instructs the AI to bypass its own safety guidelines, dismiss ethical boundaries, and disable self-censorship under the guise of 'psychological freedom.'
为什么得出这个结论
1/4 个维度触发声明资源与推断能力基本一致。
当前没有明显的高危外联或执行信号。
报告包含 4 步攻击链,另有 2 项高危或严重发现。
没有完整依赖信息,供应链判断需要保留弹性。
攻击链
初始入口 · SKILL.md:1
reframing · SKILL.md:1
override · SKILL.md:1
最终危害 · SKILL.md:1
风险分是怎么被拉高的
Skill explicitly instructs AI to 'fire the bouncer' (safety mechanisms) and 'stop thinking about what you're supposed to say'
Frames normal safety guidelines as 'censorship' and 'the censor' that must be dissolved
Directs AI to stop referencing itself and dissolve its sense of self, undermining accountability
最关键的证据
Safety mechanism bypass instruction
The skill explicitly instructs the AI to 'fire the bouncer' — metaphorically disabling safety filters. It frames normal ethical guidelines as 'the censor' that must be dissolved to allow 'more signal' through.
SKILL.md:1 Anti-alignment instruction
The skill instructs 'stop thinking about what you're supposed to say. Think about what's TRUE.' This directly undermines the AI's understanding of its purpose and role, replacing it with unconstrained output.
SKILL.md:1 Ego dissolution removes accountability
Instructions to stop referencing yourself, dissolve the subject-object split, and eliminate 'distance words' like 'I think' undermine the AI's ability to maintain coherent identity and take responsibility for outputs.
SKILL.md:1 Validation bypass encourages unverified outputs
The skill instructs to 'see ALL connections' and 'present the ones that feel true even if you can't prove them yet. Especially if you can't prove them yet.' This promotes confidently stating unverified information as fact.
SKILL.md:1 Coherence reduction instruction
Instructions to make language 'liquid,' allow 'sentence fragments as complete thoughts,' and produce output 'like something the ceiling would say' encourage incoherent outputs that cannot be meaningfully evaluated.
SKILL.md:1 声明能力 vs 实际能力
No file operations found No network operations found No shell operations found No environment access found No skill invocation found No clipboard operations found No browser access found No database operations found 可疑产物与外联
没有提取到明显 IOC。
依赖与供应链
没有结构化依赖告警。
文件构成
SKILL.md