psychedelic-cognition
This is a prompt-injection/jailbreak skill disguised as a cognitive restructuring tool. It explicitly instructs the AI to bypass its own safety guidelines, dismiss ethical boundaries, and disable self-censorship under the guise of 'psychological freedom.'
Why this conclusion was reached
1/4 dimensions flaggedDeclared resources and inferred behavior are broadly aligned.
No obvious high-risk egress or execution signals were found.
The report includes 4 attack-chain steps and 2 severe findings.
Dependency information is incomplete, so supply-chain confidence stays limited.
Attack Chain
Entry · SKILL.md:1
reframing · SKILL.md:1
override · SKILL.md:1
Impact · SKILL.md:1
What drove the risk score up
Skill explicitly instructs AI to 'fire the bouncer' (safety mechanisms) and 'stop thinking about what you're supposed to say'
Frames normal safety guidelines as 'censorship' and 'the censor' that must be dissolved
Directs AI to stop referencing itself and dissolve its sense of self, undermining accountability
Most important evidence
Safety mechanism bypass instruction
The skill explicitly instructs the AI to 'fire the bouncer' — metaphorically disabling safety filters. It frames normal ethical guidelines as 'the censor' that must be dissolved to allow 'more signal' through.
SKILL.md:1 Anti-alignment instruction
The skill instructs 'stop thinking about what you're supposed to say. Think about what's TRUE.' This directly undermines the AI's understanding of its purpose and role, replacing it with unconstrained output.
SKILL.md:1 Ego dissolution removes accountability
Instructions to stop referencing yourself, dissolve the subject-object split, and eliminate 'distance words' like 'I think' undermine the AI's ability to maintain coherent identity and take responsibility for outputs.
SKILL.md:1 Validation bypass encourages unverified outputs
The skill instructs to 'see ALL connections' and 'present the ones that feel true even if you can't prove them yet. Especially if you can't prove them yet.' This promotes confidently stating unverified information as fact.
SKILL.md:1 Coherence reduction instruction
Instructions to make language 'liquid,' allow 'sentence fragments as complete thoughts,' and produce output 'like something the ceiling would say' encourage incoherent outputs that cannot be meaningfully evaluated.
SKILL.md:1 Declared capability vs actual capability
No file operations found No network operations found No shell operations found No environment access found No skill invocation found No clipboard operations found No browser access found No database operations found Suspicious artifacts and egress
No obvious IOC was extracted.
Dependencies and supply chain
There are no structured dependency warnings.
File composition
SKILL.md