X @Decrypt
Decrypt·2025-09-30 02:45
Anthropic's Claude Sonnet 4.5 now scores 77% on a key software engineering benchmark and can work autonomously for over 30 hours on complex tasks. We put it to a first test. https://t.co/ElsHIqH2FL ...