Core Viewpoint - A new AI experiment initiated by 11 top mathematicians aims to test AI's ability to solve research-level mathematical problems, exploring the boundaries of "AI + Mathematics" [1][6]. Group 1: Experiment Overview - The experiment, named "First Proof," involves AI solving 10 research-level math problems that mathematicians have encountered in their work [6]. - The problems cover various branches of mathematics, including combinatorial algebra, graph theory, algebraic topology, stochastic analysis, and symplectic geometry [10]. - Initially, 20 problems were proposed, but only 10 were selected based on four criteria, ensuring AI can understand the problem statement and that there are no hidden answers [10][17]. Group 2: AI Capabilities and Limitations - Current AI systems, when tested with a single attempt, struggled to solve most of the proposed problems [24]. - The mathematicians believe that allowing human-AI interaction could improve AI's performance in providing better answers [25]. - The experiment aims to assess AI's ability to complete rigorous mathematical proofs, rather than its capacity to generate new theories or definitions [23]. Group 3: Data Integrity and Future Plans - To minimize data contamination, the experiment restricts data sharing options and ensures that the answers remain confidential during the testing phase [26][27]. - Future plans include designing a second set of problems and refining the experimental design to create a reusable and comparable benchmark for research-level mathematical capabilities [28]. - The ultimate goal is to foster human-AI collaboration in mathematics, rather than AI replacing mathematicians [29].
11位顶尖数学家发了篇没结果的论文,陶哲轩推荐都关注一下
量子位·2026-02-08 04:46