Workflow
豆包2.0 Code
icon
Search documents
字节跳动豆包大模型2.0发布,多数基准达SOTA水平
Sou Hu Cai Jing· 2026-02-14 15:57
Core Insights - ByteDance announced the official launch of Doubao 2.0, which has undergone systematic optimization for large-scale production environments, enhancing its capabilities in efficient reasoning, multimodal understanding, and complex instruction execution [1] Model Features - Doubao 2.0 includes three general agent models: Pro, Lite, and Mini, as well as a Code model, designed to adapt flexibly to various business scenarios [1] - Doubao 2.0 Pro is now available on the Doubao App, desktop, and web versions, allowing users to experience the "expert" mode for interactive dialogue [1] Performance Enhancements - Doubao 2.0 has significantly upgraded its multimodal capabilities, achieving state-of-the-art (SOTA) levels in various visual understanding tasks, with Doubao 2.0 Pro scoring highest in most relevant benchmark tests [2] - The model has improved its understanding of time series and motion perception, leading in key assessments like TVBench and surpassing human scores in the EgoTempo benchmark [4] Long-Range Task Execution - Doubao 2.0 Pro has enhanced long-range task execution capabilities, outperforming GPT 5.2 in SuperGPQA and achieving first place in HealthBench, with overall performance comparable to Gemini 3 Pro and GPT 5.2 in scientific domains [5] - In reasoning and agent capability evaluations, Doubao 2.0 Pro achieved gold medal results in IMO, CMO math competitions, and ICPC programming contests, demonstrating strong mathematical and reasoning skills [5] Cost Efficiency - Doubao 2.0 has reduced inference costs significantly, with model performance comparable to top industry models while lowering token pricing by approximately an order of magnitude [8] Code Model Features - The Doubao 2.0 Code model is optimized for programming scenarios, enhancing code library interpretation and application generation capabilities, and has been integrated into TRAE for improved functionality [9] - An example project, "TRAE Spring Festival Town · Year of the Horse Temple Fair," illustrates the model's ability to construct complex applications efficiently with minimal prompts [9]
豆包大模型2.0重磅登场:多场景适配能力升级,成本降低助力复杂任务新突破
Sou Hu Cai Jing· 2026-02-14 14:33
Core Insights - ByteDance's Doubao model has officially launched version 2.0, marking a significant step towards real-world application of its technology capabilities [1] - The update focuses on three main areas: multimodal understanding, long-range task execution, and improved development efficiency [1] Multimodal Capabilities - Doubao 2.0 has achieved comprehensive breakthroughs in multimodal capabilities, excelling in visual reasoning, spatial perception, and dynamic scene understanding [3] - The model demonstrates significant advantages in processing time-series data, surpassing similar models in TVBench evaluations and even exceeding human average levels in EgoTempo benchmark tests [3] - It supports real-time Q&A and environmental perception for long video scenarios, enabling proactive service such as fitness guidance and outfit suggestions [3] Complex Task Handling - The new version features a differentiated model system, with the flagship Doubao 2.0 Pro optimizing the reasoning engine, scoring higher than GPT 5.2 in SuperGPQA knowledge tests and topping HealthBench medical benchmarks [3] - The model has won gold medals in prestigious evaluations like the IMO math Olympiad and ICPC programming competition, with a 40% improvement in tool invocation accuracy compared to its predecessor [3] - The Lite version reduces reasoning costs to one-tenth of the industry average while maintaining superior performance over version 1.8, making it suitable for large-scale deployments [3] - The Mini version is optimized for low-latency demands, capable of processing thousands of concurrent requests per second [3] Development Efficiency - Doubao 2.0 Code has been deeply integrated with the TRAE development platform, enhancing codebase parsing capabilities and enabling automatic project architecture recognition [4] - In the "TRAE Spring Festival Town" interactive project, developers completed complex scene setups in just five prompts, achieving an 80% efficiency improvement over traditional development processes [4] - The built-in error correction mechanism can detect logical flaws in real-time, reducing debugging time by 65% within the Agent workflow [4] Technical Architecture - Doubao 2.0 employs knowledge distillation and reinforcement learning techniques, increasing real-world data coverage to 92% [6] - Its innovative dynamic attention mechanism automatically adjusts resource allocation, maintaining contextual coherence when processing long texts [6] - The Volcano Engine has opened API services, allowing enterprise developers to flexibly utilize different model capabilities for full-scene deployment from mobile to cloud services [6] - Internal tests indicate a 35% improvement in task completion rates in vertical fields such as logistics path planning and financial risk control compared to previous versions [6]
字节豆包2.0发布:推理成本降一个数量级,正面对标GPT-5和Gemini 3
硬AI· 2026-02-14 11:37
Core Viewpoint - ByteDance's Doubao 2.0 has officially entered a new phase, launching a systematic upgrade version aimed at the Agent era, significantly reducing reasoning costs while maintaining performance comparable to GPT-5.2 and Gemini 3 Pro [3][12]. Group 1: Model Features - Doubao 2.0 includes three models: Pro, Lite, and Mini, along with a specialized Code model, with the flagship Doubao 2.0 Pro directly competing with GPT-5.2 and Gemini 3 Pro [3]. - The model has achieved top-tier performance in visual understanding benchmarks and has won gold medals in mathematics and programming competitions [3][10]. - Doubao 2.0 has enhanced multimodal capabilities, excelling in visual reasoning, perception, spatial reasoning, and long-context understanding tasks [6]. Group 2: Cost Efficiency - The reasoning cost of Doubao 2.0 has been reduced by approximately an order of magnitude, which is crucial for large-scale reasoning and long-chain generation scenarios [4][12]. - This cost advantage is expected to become a key competitive edge in the commercial application of large models [4]. Group 3: Performance Metrics - Doubao 2.0 Pro outperformed GPT-5.2 in the SuperGPQA benchmark and ranked first in HealthBench, demonstrating strong performance in scientific fields [10]. - The model achieved a score of 54.2 in the HLE-text evaluation and excelled in tool invocation and instruction-following tests [10]. Group 4: Application and Integration - Doubao 2.0 Pro has been integrated into the Doubao App, desktop, and web versions, featuring an "Expert" mode for end-users [17]. - The Code model has been optimized for programming scenarios, enhancing code library interpretation and application generation capabilities, and is now available in the TRAE product [15][17]. - An intelligent customer service agent has been built on the Doubao 2.0 Pro model, capable of handling customer interactions and proactively seeking human assistance when needed [13].