X @Anthropic
Anthropic·2026-02-03 00:26
New Anthropic Fellows research: How does misalignment scale with model intelligence and task complexity?When advanced AI fails, will it do so by pursuing the wrong goals? Or will it fail unpredictably and incoherently—like a "hot mess?"Read more: https://t.co/xzRSoJg43j ...