Stockfish
Search documents
Nature重磅发文:深度学习x符号学习,是AGI唯一路径
3 6 Ke· 2025-12-17 02:12
忆往昔,符号AI曾以规则逻辑统领江湖;今朝卷土重来,它携手神经网络,直指AGI! 但AI领域的权威们已经开始泼下一盆冷水: 真正的突破,恐怕要靠老牌选手「符号派AI」与神经网络联手登场。 这几年,大模型多次让人惊艳:聊天像真人、写作像专家、画画像大师,仿佛「万能AI」真的要来了。 只靠「神经网络」,远远不够通往人类级智能。 美国人工智能促进协会(AAAI)向会员发出提问: 绝大多数研究者给出的答案是——不行。 符号AI:起死回生 在历史上,符号派AI曾是主角——它相信,世界可以被规则、逻辑和清晰的概念关系穷尽刻画: 像数学那样精确,像流程图那样可追溯,像生物分类法那样层次分明。 后来,神经网络崛起,用「从数据中学习」的范式席卷整个领域。 大模型与ChatGPT成为这个时代的技术图腾,而符号系统被边缘化,几乎只剩下教科书上的一段历史。 然而,自2021年前后开始,「神经–符号融合」急速升温,被视为打破单一神经网络话语权的一次反扑: 未来,计算机能否达到、甚至超越人类智力? 如果可以,单靠当下火爆的神经网络行不行? 它试图把统计学习与显式推理拼接在一起,不仅为了追逐通用智能这一远目标,更为了在军事、医疗等高风险场 ...
刚刚,大模型棋王诞生,40轮血战,OpenAI o3豪夺第一,人类大师地位不保?
3 6 Ke· 2025-08-22 11:51
Core Insights - The recent chess rating competition results have been released, showcasing the performance of various AI models, with OpenAI's o3 achieving a leading human-equivalent Elo rating of 1685, followed by Grok 4 and Gemini 2.5 Pro [1][2][3]. Group 1: Competition Overview - The competition involved 40 rounds of matches where AI models competed using only text input, without tools or validators, to establish a ranking similar to that of other strategic games like Go [1][8]. - The results were derived from a round-robin format where each model faced off in 40 matches, consisting of 20 games as white and 20 as black [11][10]. Group 2: Model Rankings - The final rankings are as follows: 1. OpenAI o3 with an estimated human Elo of 1685 2. Grok 4 with an estimated human Elo of 1395 3. Gemini 2.5 Pro with an estimated human Elo of 1343 [3][4][5]. - DeepSeek R1, GPT-4.1, Claude Sonnet-4, and Claude Opus-4 are tied for fifth place, with estimated human Elos ranging from 664 to 759 [5][4]. Group 3: Methodology and Evaluation - The Elo scores were calculated using the Bradley-Terry algorithm based on the match results between models [12]. - The estimated human Elo ratings were derived through linear interpolation against various levels of the Stockfish chess engine, which has a significantly higher rating of 3644 [13][14]. Group 4: Future Developments - Kaggle plans to regularly update the chess text leaderboard and introduce more games to provide a comprehensive evaluation of AI models' strategic reasoning and cognitive abilities [24][22].