X @Elon Musk - Reportify

RT X Freeze (@XFreeze)Grok 4.20 ranks #2 on 𝜏²-Bench for Telecom Agentic Tool Use on Artificial Analysiswith 96.5% accuracy, outperforming Claude Opus 4.6 (max), GPT-5.4 (xhigh), and Gemini 3.1 Pro, while closing in on GLM-5Tool calling is where the whole game is for AI agents, and this is where Grok 4.20 takes over ...