... yeah?
If the prompt and the safety mechanisms are in-band, no shit you can trick the almost-intelligent chatbot by being smarter than it.
It's an interesting exercise though, programming a chatbot to defend itself from sophistry and manipulation.
Interesting, yes. Effective? No.
Having that kind of skillful assistance available for arbitrary purposes should place significant liability, maybe even the majority in some cases, squarely on the provider.
E: I am fully aware that I am dreaming.