Core Insights
- Grok 4.1 has taken first and second place in the latest large-model arena evaluations, with its thinking and non-thinking modes both ranking ahead of every competing model [1][2][5].

Performance Rankings
- In thinking mode, Grok 4.1 scored 1483 Elo points, 31 points ahead of the highest-ranked non-xAI model [2].
- In non-thinking mode, Grok 4.1 scored 1465, surpassing every other model, including those running in full reasoning mode [3].
- The previous version of Grok ranked 33rd, so the climb to the top took place within six months [4].

Expert and Professional Rankings
- Grok 4.1 also topped the expert and professional rankings, scoring 1510 in the expert category and narrowly beating Claude Sonnet [6].
- In the literary category, Grok 4.1 lost only to Gemini 2.5; it ranked first in six other categories [6].

Emotional Intelligence and User Preference
- Grok 4.1 outperformed the recently released Kimi K2 on the EQ-Bench emotional intelligence test [9][10].
- In a user survey, 64.78% of respondents preferred the new version of Grok over its predecessor [13].

Technological Improvements
- The model uses reinforcement learning techniques to improve its style, personality, and alignment [19][20].
- Grok 4.1 cut its output token count in non-reasoning mode from approximately 2300 to 850 tokens [23].
- Hallucination was reduced, with a notable decrease in factual inaccuracies during information retrieval [25].

Availability
- Grok 4.1 is now available to all users on grok.com, in the mobile applications, and on other platforms, with automatic mode as the default setting [27].
Musk quietly releases Grok 4.1, topping every leaderboard in the large-model arena
QbitAI (量子位) · 2025-11-18 00:59