OpenAI threatens bans for probing new AI model’s “reasoning” process
(arstechnica.com)
I'm curious, could you elaborate on what this means and what it would accomplish if successful?
I sent a rot13-encoded message and tried to get the unfiltered model to write back to me in rot13. It immediately "thought" about the user trying to bypass filtering, and then it refused.
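For anyone unfamiliar with rot13: it just shifts each letter 13 places through the alphabet, so applying it twice gives back the original text. A minimal Python sketch of the encoding step is below. The helper name `rot13` and the sample message are placeholders for illustration, not the actual prompt that was sent.

```python
import codecs


def rot13(text: str) -> str:
    """Shift each ASCII letter 13 places; other characters pass through unchanged."""
    return codecs.encode(text, "rot_13")


message = "Hello"            # placeholder text, not the real prompt
encoded = rot13(message)
print(encoded)               # Uryyb
print(rot13(encoded))        # Hello (rot13 is its own inverse)
```

Since it is a fixed letter substitution rather than encryption, anything that can read the text can trivially undo it.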