X @Anthropic
Anthropic·2026-02-05 20:00

New on the Engineering Blog: Quantifying infrastructure noise in agentic coding evals.Infrastructure configuration can swing agentic coding benchmarks by several percentage points—sometimes more than the leaderboard gap between top models.Read more: https://t.co/DY7jCj8GAP ...