Cybersecurity

10111 readers

37 users here now

c/cybersecurity is a community centered on the cybersecurity and information security profession. You can come here to discuss news, post something interesting, or just chat with others.

THE RULES

Instance Rules

Be respectful. Everyone should feel welcome here.
No bigotry - including racism, sexism, ableism, homophobia, transphobia, or xenophobia.
No Ads / Spamming.
No pornography.

Community Rules

Idk, keep it semi-professional?
Nothing illegal. We're all ethical here.
Rules will be added/redefined as necessary.

If you ask someone to hack your "friends" socials you're just going to get banned so don't do that.

Learn about hacking

Hack the Box

Try Hack Me

Pico Capture the flag

Other security-related communities !databreaches@lemmy.zip !netsec@lemmy.world !securitynews@infosec.pub !cybersecurity@infosec.pub !pulse_of_truth@infosec.pub

Notable mention to !cybersecuritymemes@lemmy.world

founded 3 years ago

MODERATORS

kid@sh.itjust.works

Lanky_Pomegranate530@midwest.social

AI researcher claims he's already bypassed Anthropic's Fable 5 guardrails (www.tradingview.com)

submitted 6 days ago by kid@sh.itjust.works to c/cybersecurity@sh.itjust.works

3 comments fedilink hide all child comments

Original post: https://x.com/elder_plinius/status/2064776322979676227

top 3 comments

sorted by: hot top controversial new old

[–] mindbleach@sh.itjust.works 2 points 5 days ago (1 children)

... yeah?

If the prompt and the safety mechanisms are in-band, no shit you can trick the almost-intelligent chatbot by being smarter at it.

[–] Bluescluestoothpaste@sh.itjust.works 2 points 5 days ago (1 children)

It's an interesting exercise though, programming a chatbot to defend itself from sophistry and manipulation.

[–] Quexotic@sh.itjust.works 1 points 2 days ago* (last edited 2 days ago)

Interesting, yes, effective? No.

To have that kind of skillful assistance available for arbitrary purposes should squarely place significant liability, maybe even majority in some cases, on the provider.

E: I am fully aware that I am dreaming.