IMO Gold Medal Model
OpenAI Wins IOI Gold, Trailing Only the Top Five Human Contestants! The Competing Reasoning Model Had Only Just Won IMO Gold
创业邦 (Cyzone) · 2025-08-12 03:33
Core Viewpoint - OpenAI's reasoning model achieved a gold medal score at the 2025 International Olympiad in Informatics (IOI), ranking first among AI participants and demonstrating significant advancements in general reasoning capabilities [2][9][16].

Group 1: Competition Performance
- OpenAI participated in the online AI track of IOI 2025, scoring just behind five human competitors among 330 participants, securing the top position among AI competitors [6][8].
- The model used by OpenAI was not specifically trained for IOI but was based on a general reasoning model that performed exceptionally well [8][14].
- Compared to last year's performance, OpenAI's score improved dramatically from the 49th percentile to the 98th percentile, showcasing a leap in capabilities [9].

Group 2: Model and Strategy
- OpenAI utilized the same model that won gold at the International Mathematical Olympiad (IMO) 2025 without any modifications for the IOI competition [14][15].
- The strategy involved sampling answers from different models and using a heuristic method to select submissions, which contributed to the successful outcome [14].

Group 3: Community Reaction and Future Implications
- The achievement has sparked excitement in the community, highlighting the growing strength of general reasoning abilities without specialized training [16].
- There is anticipation for OpenAI to release a public version of the technology that led to the gold medal performance, indicating potential for further advancements in AI capabilities [18].
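
The sample-and-select strategy mentioned under Model and Strategy can be illustrated with a minimal sketch. The summary does not say which models were sampled or what the heuristic was, so everything here is an assumption made for illustration: the `Candidate` type, the `select_submissions` helper, the `max_submissions` budget, and the "local pass rate" score are all hypothetical and are not OpenAI's actual method.

```python
from dataclasses import dataclass
from typing import Callable, Iterable, List


@dataclass(frozen=True)
class Candidate:
    model_name: str  # hypothetical label for the model that produced the answer
    answer: str      # the candidate solution text


def select_submissions(
    candidates: Iterable[Candidate],
    score: Callable[[Candidate], float],
    max_submissions: int,
) -> List[Candidate]:
    """Rank candidates by a heuristic score and keep the best few."""
    # Drop exact duplicate answers so the budget is not spent on repeats;
    # the first model to produce an answer is kept as its source.
    unique = {}
    for cand in candidates:
        unique.setdefault(cand.answer, cand)
    # Highest score first; sorted() is stable, so ties keep sampling order.
    ranked = sorted(unique.values(), key=score, reverse=True)
    return ranked[:max_submissions]


# Toy pool standing in for answers sampled from different models.
pool = [
    Candidate("model_a", "solution-1"),
    Candidate("model_b", "solution-2"),
    Candidate("model_a", "solution-2"),  # same answer, sampled again
    Candidate("model_c", "solution-3"),
]

# Assumed heuristic: fraction of local test cases each answer passes.
local_pass_rate = {"solution-1": 0.4, "solution-2": 0.9, "solution-3": 0.7}

chosen = select_submissions(
    pool, lambda c: local_pass_rate[c.answer], max_submissions=2
)
print([c.answer for c in chosen])  # ['solution-2', 'solution-3']
```

Separating the scoring function from the selection step means the heuristic can be swapped out without changing how the submission budget is enforced.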
AI Admits "I Don't Know" on a Hard IMO Problem; OpenAI: This Is Self-Awareness
36Kr · 2025-08-01 12:06
Core Insights - The latest advancements in OpenAI's model demonstrate a significant shift towards "self-awareness," allowing the model to admit when it does not know the answer, contrasting with previous models that provided convincing but incorrect responses [1][3][11].

Group 1: Model Performance
- OpenAI's model received a zero score on IMO Problem 6, yet it showcased "high IQ honesty" by stating "I don't know" when lacking sufficient evidence, which reduces hidden errors [1][10].
- The model's ability to acknowledge its limitations marks a transition from generating hallucinated answers to providing more reliable responses [3][11].

Group 2: Team Insights
- The core team behind the IMO gold medal consists of three researchers: Alex Wei, Sheryl Hsu, and Noam Brown, who have backgrounds from prestigious institutions and prior experience in leading tech companies [12][14][15][17].
- The team achieved the goal of winning the IMO gold medal in just two months, a remarkable feat considering initial skepticism about the timeline [11][12].

Group 3: Research Philosophy
- OpenAI emphasizes the autonomy of its researchers to pursue impactful studies, focusing on general technology rather than systems tailored specifically for math competitions [11][12].
AI Admits "I Don't Know" on a Hard IMO Problem; OpenAI: This Is Self-Awareness
量子位 (QbitAI) · 2025-08-01 09:05
Core Viewpoint - The article highlights a significant advancement in AI models, particularly OpenAI's model, which has demonstrated the ability to acknowledge its limitations rather than providing incorrect answers, marking a shift towards more reliable and self-aware AI systems [2][6][7].

Group 1: Model Performance and Characteristics
- OpenAI's model received a zero score on the IMO's sixth problem but showcased "high IQ honesty" by admitting uncertainty when lacking evidence, which reduces hidden errors [2][3].
- The new generation of AI models is moving away from generating seemingly perfect but incorrect answers, learning instead to admit when they do not know the answer [6][7].
- This self-awareness allows the model to recognize its limitations in difficult problems, avoiding the generation of plausible yet incorrect solutions [17][16].

Group 2: Team Insights and Achievements
- The core team behind the IMO gold medal consists of three researchers: Alex Wei, Sheryl Hsu, and Noam Brown, who shared insights during a conversation organized by Sequoia Capital [5][23].
- Alex Wei expressed that witnessing the model avoid hallucinations was a positive outcome, despite the disappointment of receiving an "I cannot answer" response after extensive computational effort [15].
- The team achieved their goal of winning the IMO gold medal in just two months, a remarkable feat considering initial skepticism about achieving this by 2025 [20][18].

Group 3: Team Backgrounds
- Alex Wei holds degrees from Harvard University and a PhD from UC Berkeley, with prior experience at Google, Microsoft, and Meta before joining OpenAI in January 2024 [25].
- Sheryl Hsu graduated from Stanford University and was a researcher at the Stanford AI Lab before joining OpenAI in March 2025 [27].
- Noam Brown completed his undergraduate studies at Rutgers University and obtained his master's and PhD from Carnegie Mellon University, having previously worked at DeepMind and Meta before joining OpenAI in June 2023 [29].