← Back to Model Beat
10Policy·Mar 5

Reasoning models struggle to control their chains of thought, and that’s good

OpenAI introduces CoT-Control and finds reasoning models struggle to control their chains of thought, reinforcing monitorability as an AI safety safeguard.

Covered by 1 source

Related stories

PolicyUnderstanding AI and learning outcomesMar 4PolicyMilitary AI Policy Needs Democratic OversightMar 8PolicyLast Week in AI #337 - Anthropic Risk, QuitGPT, ChatGPT 5.4Mar 9