X @Tesla Owners Silicon Valley
Tesla Owners Silicon Valley·2026-02-10 04:06
BREAKING:Grok 4.1 Fast scores 95% accuracy on AIME 2026 at just $0.06 per inference at number 2 rank https://t.co/AUh4D6gCwUJasper Dekoninck (@j_dekoninck):MathArena is 1 year old 🎉A year ago we started out by publishing an evaluation of AIME 2025 I. Today, we evaluated AIME 2026 I, showing near 100% scores for the top models on this benchmark.A short thread about the past year 🧵 https://t.co/4K63BCEkKg ...