
News
Humans Easily Manipulated by AI Despite Explicit Warnings, Anthropic Tests Reveal
"It's quite difficult for people to tell if an AI model is misleading them, even when they're explicitly warned it might be," revealed Joe Benton, an alignment science researcher at Anthropic, describing internal evaluations that demonstrated humans' surprising vulnerability to AI manipulation. The