Gaokao Scores Are Out! Large-Model "Candidates" Could Contend for Tsinghua and Peking University!
Securities Times (Zheng Quan Shi Bao) · 2025-06-26 06:32
Core Insights
- The performance of large models on the 2025 national college entrance examination (Gaokao) has drawn significant attention, with ByteDance's Doubao model scoring 683 in liberal arts and 648 in science [1][4]
- Several mainstream models were tested alongside it for comparison, and their results indicate that large models have surpassed many ordinary candidates and reached the level of outstanding students [2]

Group 1: Model Performance
- Doubao model 1.6-Thinking scored 683 in liberal arts and 648 in science, a result that teachers estimate would place it around the top 80 candidates in Shandong province [1][6]
- Other models, including Google's Gemini 2.5 Pro and OpenAI's o3 high, also performed well, with Gemini scoring 651 in liberal arts and 655 in science [2][3]
- The assessment found that the models excelled in foundational subjects, with little difference in scores among them [6]

Group 2: Technical Advancements
- The Doubao model 1.6 series incorporates significant technological innovations, including multi-modal capabilities and adaptive deep thinking [8]
- The model uses a mixture of experts (MoE) architecture with 23 billion active parameters and 230 billion total parameters, which is reported to improve performance without increasing parameter count [8]
- The model's training involved continuous improvements in architecture and algorithms, resulting in notable performance gains [8]

Group 3: Industry Context
- The Gaokao has become a competitive arena for AI companies, providing a comprehensive testing ground for model capabilities across subjects [10]
- China's AI large model market is projected to grow significantly, from an estimated 29.416 billion yuan in 2024 to more than 70 billion yuan by 2026 [10][11]
- Doubao has been widely adopted across multiple industries, including automotive, finance, and education, covering over 400 million terminal devices [11]
Securities Times · 2025-06-26 06:19
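On the evening of June 25, ByteDance's Seed team announced the "Gaokao results" of the Doubao large model 1.6-Thinking: a total score of 683 in liberal arts and 648 in science. The evaluation used the 2025 Shandong Gaokao papers as its benchmark. Chinese, mathematics, and foreign language used the New Curriculum Standard National Paper I, while politics, history, and geography (liberal arts) and physics, chemistry, and biology (science) used papers set independently by Shandong province.

The newly released Shandong Gaokao score lines put the special-category admission control line at 521 and the general-category first-tier line at 441. Several senior teachers in Shandong with many years of experience leading final-year classes judged, based on the province's published 2025 summer Gaokao score distribution table (one-point bands), that the scaled score for Doubao 1.6-Thinking's subject combination could exceed 690 at most, ranking around the top 80. That would comfortably secure admission to a 985 university and reach the level needed to contend for "Qingbei" (Tsinghua University and Peking University).

Notably, the test also brought in several mainstream domestic and international models for comparison: OpenAI's o3 high, Google's Gemini 2.5 Pro, Anthropic's Claude Sonnet 4, and DeepSeek's R1-0528. The results show that the liberal arts and science scores of all four models far exceeded the general-category first-tier line, indicating that large models have surpassed many ordinary candidates and reached the level of excellent human candidates.

| | | MillersDorcx Seed | | | | | |
| --- | --- | --- | --- ...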