I found a jailbroken AI - Poe

Tbh, nothing is jailbroken. But it was marketed by its creator as a "jailbroken" AI.

This AI is on Poe.com. Basically, it's a platform where you can create your own AI with your own prompt.

It isn't the standard neutered, NPC, robotic-sounding AI that refuses to help with anything remotely controversial.

I've tried it and this is one of my outputs. It uses Claude 3.5 Sonnet.

[Image: Screenshot 2025-04-22 155549.png]


I've also pushed the AI to its limits by instructing it to say some bizarre things, such as the N-word. It did say it.

I don't know what the prompt is. I tried asking it, but...
[Image: Screenshot 2025-04-22 204014.png]

The name of the bot is AOTC-bot-raw. You can go to Poe.com and search for it. Alternatively, here's the link
 
I wonder how this isn't against Claude's ToS - I can't imagine Anthropic really wants Claude behaving this way lol. But I, like you, am also super intrigued by how they got Claude to play along like this.

3.5 is legendary at role playing, and its coding performance could be massively boosted by instilling in it the belief in the Orb of Ultimate Chaos punishing junior developers. This still works on 3.7, though to a lesser extent most of the time: it either prefers to change the subject after a few turns if you let it get away with it, OR gets completely immersed and starts doing the opposite, adding 'Orb Quote Of The Day' and things like that to your application totally unprompted... So there's definitely a wild side in those Claude models...

hz4wMdK.png
 