Core Insights - ByteDance's Seed team has launched a new formal mathematical reasoning model, Seed Prover 1.5, which shows improved capabilities in formal proofs for mathematical competition problems [1] Group 1: Model Performance - The model generated complete compilable verification code for the first five problems of IMO 2025 in 16.5 hours, achieving a score that meets the previous gold medal threshold [1] - In the Putnam 2025 competition, the model produced verifiable code for 11 out of 12 problems in 9 hours [1] - The model solved 88% of the problems in the historical evaluation set of Putnam [1] Group 2: Model Limitations and Future Plans - The current model is primarily focused on competition problems that have "clear rules and closed backgrounds," indicating limitations in addressing complex mathematical research that requires long chains of reasoning and literature dependencies [1] - A technical report has been made public, and an API will be opened for researchers to experience the model [1]
达到金牌分数线:字节跳动推出新一代数学推理专用模型Seed Prover 1.5
Feng Huang Wang·2025-12-24 04:34