自然语言处理技术

Search documents
大模型模型取得国际奥数竞赛金牌级成绩
Ke Ji Ri Bao· 2025-07-24 00:07
Core Insights - Google's DeepMind and OpenAI have both announced that their AI models achieved gold medal-level results in the recent International Mathematical Olympiad (IMO), marking a significant milestone in AI's mathematical reasoning capabilities [1] - Last year, DeepMind's AI models "AlphaProof" and "AlphaGeometry" achieved silver medal-level results, indicating a progression in AI performance [1] - OpenAI's new AI system solved 5 out of 6 IMO problems in 4.5 hours, while DeepMind's "Gemini DeepMind" system achieved the same result shortly after [1] Group 1 - The IMO is considered a benchmark for evaluating AI systems' mathematical reasoning abilities [1] - Both teams utilized natural language processing techniques for their models, differing from previous systems that were specifically designed for IMO and used a programming language called "Lean" [1] - DeepMind's developers explained that reinforcement learning, a branch of machine learning, is key to their success in AI applications, similar to their previous achievements with "AlphaZero" [1] Group 2 - Mathematician Terence Tao expressed excitement about the progress but emphasized the need for reproducible research data to support these claims [2] - IMO gold medalist Joseph Meyer noted that while natural language proofs have readability advantages, lengthy arguments may complicate verification [2]