Category Archive

Alignment Research

2 premium articles in this collection

OpenAI wants to stop ChatGPT from validating users’ political views
Oct 157 months ago

OpenAI wants to stop ChatGPT from validating users’ political views

New paper reveals reducing "bias" means making ChatGPT stop mirroring users' political language. ...

{"_":"https://arstechnica.com/ai/2025/10/openai-wants-to-stop-chatgpt-from-validating-users-political-views/","$":{"isPermaLink":"true"}}1 min read
Read More
Is AI really trying to escape human control and blackmail people?
Aug 149 months ago

Is AI really trying to escape human control and blackmail people?

Opinion: Theatrical testing scenarios explain why AI models produce alarming outputs—and why we fall for it. ...

{"_":"https://arstechnica.com/information-technology/2025/08/is-ai-really-trying-to-escape-human-control-and-blackmail-people/","$":{"isPermaLink":"true"}}1 min read
Read More