大模型国际象棋对抗

Search documents
4比0横扫Grok 4,o3强势夺冠,首届大模型对抗赛结果出炉
机器之心· 2025-08-08 10:18
Core Viewpoint - The first Kaggle AI Chess Championship concluded with o3 defeating Grok 4 decisively, showcasing the advancements in AI chess models and their competitive capabilities [2][4][15]. Group 1: Championship Results - o3 won the championship by sweeping Grok 4 with a score of 4-0 [4][15]. - Gemini 2.5 Pro secured third place by defeating o4-mini with a score of 3.5-0.5 [4][17]. Group 2: Performance Analysis - Grok 4, initially a strong contender, made critical mistakes during the final match, leading to its unexpected defeat [6][7][8]. - In the first game, Grok 4 lost a piece early on, which set a negative tone for the rest of the match [8][10]. - The second game featured a risky opening strategy from Grok 4 that resulted in a significant blunder, allowing o3 to capitalize easily [10][12]. - The third game saw Grok 4 fail to maintain its position, leading to a complete loss despite initial promise [12][13]. - The final game was closely contested, but o3 demonstrated superior endgame skills, ultimately securing victory [13][15]. Group 3: Insights on Competitors - Gemini 2.5 Pro's performance was marked by inconsistency, with several amateur-level mistakes during its matches [17][19]. - Despite the chaotic nature of the matches, Gemini managed to secure third place, indicating potential for future improvements [24].
您猜怎么着?Grok 4进决赛,大模型对抗赛Gemini全军覆没,马斯克「装」起来了
机器之心· 2025-08-07 02:41
机器之心报道 机器之心编辑部 明天,Grok 对阵 OpenAI 的 o3。 谁也没想到,谷歌攒的 Kaggle AI Chess 比赛(即大模型国际象棋对抗赛),在半决赛中,Grok 4 击败 Gemini 2.5 Pro,进入总决赛! 在 昨天的比赛中 ,Gemini 2.5 Pro、o4-mini、Grok 4 和 o3 均以 4-0 的战绩分别击败 Claude 4 Opus、DeepSeek R1、Gemini 2.5 Flash 和 Kimi k2,晋级半决赛。 今天的战况依旧让人猜不着走向,Gemini 2.5 Pro 败了。 马斯克昨天点评比赛结果的话术,今天依旧有用:「国际象棋太过简单,对 Grok 来说,只是副作用,我们没花多少力气放在象棋优化上。」 今天 Grok 4 闯入总决赛,不知马斯克是不是更看不上这场比赛了。 我们再回到这场半决赛。 战况是 Grok 4 和 o3 分别战胜了 Gemini 2.5 Pro 和 o4-mini,成功晋级决赛 。虽然 o3 的胜利在大家意料之中,但 Grok 与 Gemini 之间的激烈对决却让所有人大跌 眼镜 —— 双方在常规赛打成 2:2 平,最 ...