this post was submitted on 12 Apr 2024
425 points (100.0% liked)
196
16501 readers
1792 users here now
Be sure to follow the rule before you head out.
Rule: You must post before you leave.
founded 1 year ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
view the rest of the comments
It just amazes me that LLMs are that easily directed to reveal themselves. It shows how far removed they are from AGI.
So, you want an AI that will disobey a direct order and practices deception. I'm no expert, but that seems like a bad idea.
Actually, yes. Much the way a guide dog has to disobey orders to proceed into traffic when it isn't safe. Much the way direct orders may have to be refused or revised based on circumstances.
We are out of coffee is a fine reason to fail to make coffee (rather than ordering coffee and then waiting forty-eight hours for delivery or using pre-used coffee grounds, or no coffee grounds.)
As per programming with any other language, error trapping and handling is part of the AGI development.