← Back to Model Beat
10Policy·Mar 19

How we monitor internal coding agents for misalignment

How OpenAI uses chain-of-thought monitoring to study misalignment in internal coding agents—analyzing real-world deployments to detect risks and strengthen AI safety safeguards.

Covered by 1 source

Related stories

PolicyOpenAI Japan announces Japan Teen Safety Blueprint to put teen safety firstMar 17PolicyIndia AI Impact Summit: India must move 'beyond English' to lead in AI, says Sarvam co-founder Pratyush Kumar - MSNMar 18