Core Insights - ByteDance's latest mathematical reasoning model, Seed Prover 1.5, achieved a gold medal score at the IMO 2025 by solving five problems in 16.5 hours, scoring 35 points, which meets the gold medal threshold for this year [1][3] - This performance matches that of Google's Gemini, which was certified as an IMO gold medalist in July [3] - The model has not been open-sourced yet, but a technical report has been released, highlighting the performance improvements brought by large-scale reinforcement learning [5][19] Model Performance - Seed Prover 1.5 significantly outperformed its predecessor, which took three days to solve four out of six problems and achieved a silver medal [3] - The model also set new state-of-the-art (SOTA) records in the North American undergraduate mathematics competition, Putnam [4] Technical Innovations - The model features a new architecture called Agentic Prover, which allows it to use formal mathematical reasoning instead of natural language, ensuring more reliable results [10][12] - It incorporates a Sketch Model that simulates how human mathematicians draft proofs, breaking down complex problems into manageable sub-goals [22][23] - The model employs a multi-agent collaborative system that enhances efficiency and success rates by recursively calling the Sketch Model for difficult lemmas [25][28] Reinforcement Learning and Efficiency - The model's proof success rate improved from 50% to nearly 90% with increased reinforcement learning training steps [19] - In comparative tests, Seed Prover 1.5 required significantly less computational resources while outperforming previous models on high-difficulty datasets [19][20] Conclusion - The research is part of ByteDance's Seed AI4Math team, showcasing advancements in mathematical reasoning through innovative model architectures and training methodologies [30]
字节Seed发布最强数学模型:一招“打草稿”,IMO银牌变金牌
量子位·2025-12-25 06:08