Musk's Grok 4.1 Takes a Double Crown and Surges to No. 1 as the AI Throne Changes Hands Overnight
36Kr · 2025-11-18 01:09

Core Insights
- With the launch of Grok 4.1, Elon Musk's xAI has taken the top spot among AI models, surpassing competitors such as Gemini 2.5 Pro in performance and capabilities [1][3][9]

Performance Metrics
- Grok 4.1 Thinking scored 1483 Elo, making it the top-ranked model globally and putting it 31 points ahead of Gemini 2.5 Pro [3][13]
- The non-reasoning version of Grok 4.1 scored 1465 Elo, placing it second overall [3][14]
- Grok 4.1's writing ability improved significantly, with an Elo gain of 600 points over its predecessor [8][60]

Emotional Intelligence
- Grok 4.1 scored 1586 Elo on EQ-Bench3, which measures emotional understanding and interpersonal skills [15]
- The model's ability to hold natural, empathetic conversations has been highlighted as evidence of its improved interaction capabilities [7][11]

Technological Advancements
- The xAI team scaled up reinforcement learning (RL) during the post-training phase, contributing to Grok 4.1's rapid evolution [8][11]
- Grok 4.1's hallucination rate has fallen by a factor of three compared with the previous model, indicating improved factual accuracy [8][60]

User Preference
- In blind tests, users preferred Grok 4.1 over its predecessor in 64.78% of cases, reflecting its improved performance and user experience [11][60]
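
To put the Elo figures above in perspective, the following is a minimal sketch that converts a rating gap into an expected head-to-head win rate, and a win rate back into a gap. It assumes the standard logistic Elo formula with a 400-point scale; the article does not say how the leaderboard computes its ratings, so the actual method may differ. The function names `expected_score` and `implied_gap` are hypothetical, and the Gemini 2.5 Pro rating is derived from the reported 31-point lead rather than quoted directly.

```python
import math


def expected_score(rating_a: float, rating_b: float, scale: float = 400.0) -> float:
    """Expected win rate of A over B under the standard Elo logistic model (assumed)."""
    return 1.0 / (1.0 + 10.0 ** ((rating_b - rating_a) / scale))


def implied_gap(win_rate: float, scale: float = 400.0) -> float:
    """Elo gap that would produce the given win rate under the same assumed model."""
    if not 0.0 < win_rate < 1.0:
        raise ValueError("win_rate must be strictly between 0 and 1")
    return scale * math.log10(win_rate / (1.0 - win_rate))


# Figures taken from the summary above.
grok_thinking = 1483
gemini_25_pro = grok_thinking - 31  # derived from the reported 31-point lead

p = expected_score(grok_thinking, gemini_25_pro)
gap = implied_gap(0.6478)  # blind-test preference over the predecessor

print(f"Expected win rate for a 31-point lead: {p:.3f}")
print(f"Elo gap implied by a 64.78% preference rate: {gap:.1f}")
```

Under these assumptions, a 31-point lead corresponds to winning about 54% of head-to-head comparisons, while a 64.78% preference rate would correspond to a gap of roughly 106 points. The blind-test figure is reported as a preference rate, not as an Elo rating, so the second conversion is illustrative only.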