Doubao 1.5 Deep Thinking Model Released: Parameter Count Slashed, Can Think with Images, Beats DeepSeek-R1 in Math and Coding
36Kr · 2025-04-17 08:54
Core Insights
- The Volcano Engine has officially launched the Doubao 1.5 Deep Thinking Model, which utilizes the MoE architecture with a total parameter count of 200 billion and an active parameter count of 20 billion, achieving top-tier performance in multiple benchmark tests [1][3][8]

Model Capabilities
- Doubao 1.5 features practical capabilities such as "thinking while searching" and "visual understanding," available for enterprise users on the Volcano Ark platform [3][4]
- The model can achieve a low latency of 20 milliseconds in high-concurrency scenarios, allowing it to perform searches and reasoning simultaneously [4][6]
- It demonstrates visual understanding by analyzing text and image information, providing tailored recommendations based on user preferences [6][20]

Performance Metrics
- In various authoritative benchmark tests, Doubao 1.5's scores are comparable to OpenAI's models, particularly in mathematical tests like AIME 2024 and AIME 2025, while showing significant advantages in the ARC-AGI test [8][10]
- The model scored 77.3 in the GPQA Diamond reasoning challenge, closely trailing OpenAI's models, and has shown strong performance in programming benchmarks [10]

Market Position
- As of March 2025, Doubao's daily token usage has exceeded 12.7 trillion, marking a threefold increase from December 2024 and a 106-fold increase from its initial launch [3]
- Volcano Engine holds a 46.4% market share in China's public cloud model usage, positioning it as the market leader [3]

Additional Model Upgrades
- The upgraded Doubao Text-to-Image Model 3.0 can generate high-quality 2K images and is applicable in various fields such as marketing and design [11][15]
- The new Doubao Visual Understanding Model enhances visual localization capabilities and supports semantic video search, making it suitable for commercial applications like security and home care [17][20]

Industry Context
- The competition among domestic reasoning models is intensifying, with Doubao 1.5's advancements in reasoning costs and visual understanding potentially setting the stage for the next wave of upgrades in the industry [21]