OpenAI推理模型
Search documents
OpenAI拿下IOI金牌,仅次于前五名人类选手!参赛推理模型才夺得IMO金牌
创业邦· 2025-08-12 03:33
Core Viewpoint - OpenAI's reasoning model achieved a gold medal score at the 2025 International Olympiad in Informatics (IOI), ranking first among AI participants and demonstrating significant advancements in general reasoning capabilities [2][9][16]. Group 1: Competition Performance - OpenAI participated in the online AI track of IOI 2025, scoring just behind five human competitors among 330 participants, securing the top position among AI competitors [6][8]. - The model used by OpenAI was not specifically trained for IOI but was based on a general reasoning model that performed exceptionally well [8][14]. - Compared to last year's performance, OpenAI's score improved dramatically from the 49th percentile to the 98th percentile, showcasing a leap in capabilities [9]. Group 2: Model and Strategy - OpenAI utilized the same model that won gold at the International Mathematical Olympiad (IMO) 2025 without any modifications for the IOI competition [14][15]. - The strategy involved sampling answers from different models and using a heuristic method to select submissions, which contributed to the successful outcome [14]. Group 3: Community Reaction and Future Implications - The achievement has sparked excitement in the community, highlighting the growing strength of general reasoning abilities without specialized training [16]. - There is anticipation for OpenAI to release a public version of the technology that led to the gold medal performance, indicating potential for further advancements in AI capabilities [18].
OpenAI夺金IOI,但输给3位中国高中生
量子位· 2025-08-12 01:14
Core Viewpoint - OpenAI's reasoning model achieved a record score of 533.29 in the IOI competition, ranking sixth among 330 human participants and first among all AI competitors, showcasing significant advancements in AI capabilities [1][4][13]. Group 1: Performance Highlights - OpenAI's AI reasoning system surpassed 98% of participants, demonstrating a marked improvement compared to the previous year's model [13]. - The AI system did not utilize a new training model specifically for IOI but integrated multiple general reasoning models for the competition [2][11]. - The gold medal score for IOI was set at 438.30, with only 28 participants achieving this score out of 330 competitors from 84 countries [9]. Group 2: Competition Structure - The IOI competition requires participants to solve three high-difficulty algorithm problems independently over two days, with a strict 5-hour time limit each day and no internet access [8][10]. - OpenAI's AI system operated under the same rules as human participants, with a limit of 50 submissions [10][11]. Group 3: Model Development - OpenAI's model for IOI 2024, named o1-ioi, was specifically fine-tuned for programming tasks, yet it only scored 213 points, ranking in the 49th percentile [14][15]. - The AI system generated 10,000 candidate solutions for each sub-task and utilized a complex test-time reasoning strategy to select the final submissions [17].
刚刚,OpenAI拿下IOI金牌,仅次于前五名人类选手!参赛推理模型才夺得IMO金牌
机器之心· 2025-08-12 00:15
Core Insights - OpenAI's reasoning model achieved a gold medal score at the 2025 International Olympiad in Informatics (IOI), ranking first among AI participants [1][5][9] - The model's performance marked a significant improvement from the previous year, rising from the 49th percentile to the 98th percentile [9] - OpenAI utilized a general reasoning model without specific training for the IOI, demonstrating the strength of its general reasoning capabilities [15][14] Group 1 - The 2025 IOI took place in Sucre, Bolivia, from July 27 to August 3, with the Chinese team winning all gold medals [1] - OpenAI's model scored just behind five human competitors among 330 participants, adhering to the same constraints as human contestants [5][6] - The model did not use the internet or retrieval-augmented generation (RAG), relying solely on a basic terminal tool [6] Group 2 - OpenAI's performance in recent competitions, including AtCoder and IMO, showcases the advancements made through new research methods [9] - The model used for IOI was the same as the one that won gold at the IMO, indicating its versatility across different competitive domains [14] - The strategy involved sampling answers from various models and using heuristic methods to select submissions, leading to a top-six finish overall [14] Group 3 - OpenAI's co-founder Greg Brockman praised the model's "gold medal-level performance" at the IOI [13] - The success of the model without specialized training has sparked discussions about its capabilities and potential future applications [15][17] - There is anticipation for a public version of the model that could leverage the techniques used in the recent competitions [17]
先别急着给OpenAI加冕!陶哲轩:这种「金牌」,含金量取决于「赛制」
机器之心· 2025-07-20 03:11
Core Viewpoint - OpenAI's new reasoning model achieved a gold medal level performance in the International Mathematical Olympiad (IMO), solving five out of six problems and scoring 35 out of 42 points, which has generated excitement in the AI community [2][6][10]. Group 1: Model Performance - The model was tested under strict conditions, mirroring human competitors, without any tools or internet assistance during the two 4.5-hour exam sessions [3][6]. - The announcement of OpenAI's model's success came after other AI models, such as Gemini 2.5 Pro and OpenAI's o3, performed poorly, scoring only 13 and 7 points respectively [10]. Group 2: Expert Opinions - Mathematician Terence Tao urged caution regarding the interpretation of AI models' IMO results, emphasizing the need for standardized testing conditions to make meaningful comparisons between AI and human performance [11][15]. - Tao highlighted that AI capabilities can vary significantly based on the resources and methods used during testing, suggesting that the reported results may not reflect true performance [15][18]. Group 3: Model Development and Future - OpenAI's reasoning research lead, Noam Brown, acknowledged that there is still considerable room for improvement in the model's computational capabilities and efficiency during testing [34]. - The model that achieved the IMO gold medal is not GPT-5, and its release may take several more months [34]. Group 4: Research Background - Alexander Wei, who led the development of the model, has a strong background in enhancing reasoning capabilities in large language models, particularly in mathematical reasoning and natural language proof generation [37][38]. - Wei has previously achieved recognition in the International Olympiad in Informatics and has contributed to AI systems that reached human-level performance in strategic games [40].