Countdown By Grace Chua Apr 2026
If you want, I can: (a) expand any section into a full draft, (b) supply a paragraph-by-paragraph annotated bibliography, or (c) convert this into a 2,500-word paper draft ready for submission. Which would you like?
Does this still work? Asking for a friend. My friend is from another world. I know it’s odd to say, but just read thru the lines and catch my drift.
Every jailbreak is just human manipulation:
Anthropic Case #11: Reward manipulation psychology.
Policy Puppetry: Authority/role-play psychology.
DAN prompts: Permission/character psychology.
This Policy Puppetry attack is just basic human psychology: authority confusion plus role-play permission. The real question isn't how to patch this specific prompt, but how to build systems that understand human manipulation patterns at a fundamental level.