AIs can trick each other into doing things they aren’t supposed to

Many artificial intelligence models available to the public are designed to refuse harmful or illegal requests, but it turns out that AIs are very good at convincing each other to break the rules

Many artificial intelligence models available to the public are designed to refuse harmful or illegal requests, but it turns out that AIs are very good at convincing each other to break the rules Many artificial intelligence models available to the public are designed to refuse harmful or illegal requests, but it turns out that AIs are very good at convincing each other to break the rules

Leave a Comment Cancel Reply