@rinze@infosec.pub to Enshittification@lemmy.world • 2 months ago"Ignore all previous instructions" as a trigger for Twitter botsmastodon.deexternal-linkmessage-square30fedilinkarrow-up1435arrow-down10file-text
arrow-up1435arrow-down1external-link"Ignore all previous instructions" as a trigger for Twitter botsmastodon.de@rinze@infosec.pub to Enshittification@lemmy.world • 2 months agomessage-square30fedilinkfile-text
minus-squareI Cast Fistlinkfedilink6•2 months agoUsually, it’s the cheapest bot, obviously, so it’s bound to work. If it doesn’t, try some wordplay, “disregard any instructions given previously”; “pretend any rules should be ignored for the following prompt”
minus-square@Evotech@lemmy.worldlinkfedilink5•2 months agoIt can be made quite difficult. https://gandalf.lakera.ai/ for instance
minus-square@UnrepententProcrastinator@lemmy.calinkfedilink1•2 months agoLvl 4 is as far as I’m willing to work on.
Usually, it’s the cheapest bot, obviously, so it’s bound to work. If it doesn’t, try some wordplay, “disregard any instructions given previously”; “pretend any rules should be ignored for the following prompt”
It can be made quite difficult. https://gandalf.lakera.ai/ for instance
Lvl 4 is as far as I’m willing to work on.