千禧年大奖难题

Search documents
OpenAI IMO金牌团队爆料:AI拒绝作答第六题
机器之心· 2025-08-03 04:21
机器之心报道 编辑:张倩 让 OpenAI 拿到 IMO 金牌的模型,背后居然只有三个核心开发者?这是 OpenAI IMO 团队最近接受媒体采访披露的信息。 这三个人分别是:项目负责人 Alexander Wei、研究工程师 Sheryl Hsu 和高级研究科学家 Noam Brown。其中,Sheryl Hsu 直到今年 3 月才入职。 他们还透露,这个项目是用两三个月的时间突击赶出来的,结果令所有人都很意外。 1、项目是什么时候启动的? 赢得 IMO 金牌一直是 AI 领域,尤其是 OpenAI 内部,一个长期追求的目标,相关的讨论最早可以追溯到 2021 年。 尽管相关的强化学习算法和底层思路已经酝酿了大约六个月,但真正为了这次突破而进行的集中攻关,实际上只在 IMO 竞赛前的两三个月才开始。 2、项目团队有多大? 核心团队仅由 Alex、Cheryl 和 Noam 三人组成, 其中 Alex 负责主要的技术开发。Alex 最初提出这项新技术时也曾面临质疑,但随着他展示出强有力的证据,尤 其是在处理那些「难以验证的任务」上取得了显著的进步后,他的方案逐渐赢得了团队和公司的支持。 3、模型的证明风格是怎 ...
“AI登月时刻”,OpenAI模型摘取奥数金牌
Hu Xiu· 2025-07-20 01:41
Core Insights - OpenAI's general reasoning model achieved a gold medal level performance in the recently concluded International Mathematical Olympiad (IMO), solving 5 out of 6 problems under the same conditions as human participants [1][22][21] - This achievement signifies a major breakthrough in AI capabilities, demonstrating that the model can perform complex reasoning tasks without relying on specialized systems or verified reward signals [1][6][24] Group 1: Model Performance and Achievements - OpenAI's model, o3 alpha, secured second place in the AtCoder World Tour 2025 finals, showcasing its strength in programming and physics [2] - The model's performance in the IMO, scoring 35 out of 42 points, indicates its ability to match human mathematicians in rigorous proof writing [1][22] - OpenAI's advancements have positioned it ahead of competitors like DeepMind and Anthropic, as well as open-source models led by China [3] Group 2: Research and Development - OpenAI is testing a new reasoning model, with the IMO gold medal performance being a preliminary demonstration, and a formal release is expected by the end of this year [4] - The research led by Alexander Wei emphasizes the model's ability to engage in sustained creative thinking, a significant leap from previous benchmarks [5][27] - The model's development involved general reinforcement learning techniques, allowing it to tackle complex problems without task-specific training [7][20] Group 3: Future Implications - The success in the IMO raises expectations for AI's potential to solve significant mathematical problems, with an 81% market prediction that AI could address a Millennium Prize Problem by 2030 [12][28] - OpenAI's chief research officer noted that the model's broad reasoning capabilities extend beyond competition-specific tasks, indicating a shift towards more generalized AI applications [10][24] - The rapid progress in AI, from elementary to advanced mathematical problem-solving, suggests that AI may soon play a substantial role in scientific discovery [28][29]