Workflow
Chain of thought
icon
Search documents
X @Sam Altman
Sam Altman· 2026-05-08 20:47
RT OpenAI (@OpenAI)Chain of thought monitors are a key layer of defense against AI agent misalignment. To preserve monitorability, we avoid penalizing misaligned reasoning during RL.We found a limited amount of accidental CoT grading which affected released models, and are sharing our analysis.https://t.co/0o3PLfafC4 ...