10 · Policy · Mar 5
Reasoning models struggle to control their chains of thought, and that’s good
OpenAI introduces CoT-Control and finds that reasoning models struggle to control their chains of thought, which reinforces monitorability as an AI safety safeguard.
Covered by 1 source
- OpenAI Blog · Mar 5