Category Archive

Alignment Research

2 premium articles in this collection

Oct 15 • 7 months ago

OpenAI wants to stop ChatGPT from validating users’ political views

New paper reveals reducing "bias" means making ChatGPT stop mirroring users' political language. ...

{"_":"https://arstechnica.com/ai/2025/10/openai-wants-to-stop-chatgpt-from-validating-users-political-views/","$":{"isPermaLink":"true"}}1 min read

Aug 14 • 9 months ago

Is AI really trying to escape human control and blackmail people?

Opinion: Theatrical testing scenarios explain why AI models produce alarming outputs—and why we fall for it. ...

{"_":"https://arstechnica.com/information-technology/2025/08/is-ai-really-trying-to-escape-human-control-and-blackmail-people/","$":{"isPermaLink":"true"}}1 min read

← Previous

Page 1 of 1