Doubao Speech Synthesis Model 2.0
Volcano Engine Upgrades the Doubao Model Series
Keji Ribao (Science and Technology Daily) · 2025-10-20 23:28
Core Insights
- Volcano Engine has released a series of updates to the Doubao large model family, including Doubao 1.6, which natively supports multiple thinking lengths, and new models such as Doubao Speech Synthesis Model 2.0 and Doubao Voice Replication Model 2.0 [1]
Group 1: Model Updates
- Doubao 1.6 introduces four thinking lengths (minimal, low, medium, high) so that enterprises can balance model performance, latency, and cost, making it the first model in China to natively support "tiered thinking length adjustment" [2]
- Doubao 1.6 lite is a lighter version of the flagship model, offering faster inference and a 53.3% reduction in overall usage cost compared with Doubao 1.5 pro in the most commonly used input range of 0-32k [2]
- The Smart Model Router, an intelligent model selection solution, has been launched; it automatically selects the most suitable model for each task request, optimizing both performance and cost [2]
Group 2: Market Performance
- As of the end of September, daily token usage for Doubao exceeded 30 trillion, an increase of more than 80% since the end of May [1]
- According to IDC, Volcano Engine holds a 49.2% share of China's public cloud large model service market, ranking first [1]
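The tiered thinking lengths described above can be pictured with a small sketch. Only the four tier names come from the summary; the `ThinkingLength` enum, the `pick_thinking_length` function, the difficulty score, and the step-down rule are all hypothetical illustrations and do not represent Volcano Engine's API or its selection logic.

```python
from enum import Enum


class ThinkingLength(Enum):
    """The four tiers named in the summary; the integer values are illustrative."""
    MINIMAL = 0
    LOW = 1
    MEDIUM = 2
    HIGH = 3


def pick_thinking_length(task_difficulty: float, latency_sensitive: bool) -> ThinkingLength:
    """Hypothetical policy: harder tasks get a longer thinking tier, and
    latency-sensitive callers step down one tier to save time and tokens."""
    if not 0.0 <= task_difficulty <= 1.0:
        raise ValueError("task_difficulty must be in [0, 1]")
    # Split [0, 1] into four equal bands, one per tier.
    tier = min(int(task_difficulty * 4), 3)
    if latency_sensitive:
        tier = max(tier - 1, 0)  # never drop below MINIMAL
    return ThinkingLength(tier)


# A hard task normally gets HIGH, but a latency-sensitive caller gets MEDIUM.
print(pick_thinking_length(0.9, latency_sensitive=False).name)  # HIGH
print(pick_thinking_length(0.9, latency_sensitive=True).name)   # MEDIUM
```

The point of the sketch is the trade-off the summary mentions: a lower tier costs less and responds faster, so a caller picks the lowest tier that still meets its quality needs.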
Volcano Engine: Daily Average Tokens Exceed 30 Trillion
Beijing Shangbao (Beijing Business Today) · 2025-10-16 13:48
Core Insights
- Volcano Engine has released a series of updates to the Doubao large model family, including Doubao model 1.6, which natively supports multiple thinking lengths, and new models such as Doubao model 1.6 lite, Doubao speech synthesis model 2.0, and Doubao voice replication model 2.0 [1]
Summary by Categories
- **Product Updates**
  - Doubao model 1.6 introduces native support for multiple thinking lengths [1]
  - New models launched include Doubao model 1.6 lite, Doubao speech synthesis model 2.0, and Doubao voice replication model 2.0 [1]
- **Performance Metrics**
  - As of September 30, 2025, daily average token usage for the Doubao large model exceeded 30 trillion, an increase of more than 80% compared with the end of May [1]
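Volcano Engine Releases Doubao Model Series Upgrades, Discloses Daily Average Tokens Exceeding 30 Trillion
21 Shiji Jingji Baodao (21st Century Business Herald) · 2025-10-16 10:01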
Core Insights
- Volcano Engine has released a series of updates to the Doubao large model family, including Doubao 1.6, which natively supports multiple thinking lengths, and has introduced Doubao 1.6 lite, Doubao Speech Synthesis Model 2.0, and Doubao Voice Replication Model 2.0 [1][2]
Model Updates
- Doubao 1.6 is the first large model in China to support "tiered adjustment of thinking length," offering four options (Minimal, Low, Medium, and High) that balance model performance, latency, and cost [3]
- At the low thinking length, the upgraded Doubao 1.6 shows a 77.5% reduction in total output tokens and an 84.6% reduction in thinking time while maintaining model effectiveness [3]
- Doubao 1.6 lite is lighter and faster than the flagship version, outperforming Doubao 1.5 pro by 14% in enterprise-level assessments and reducing overall usage cost by 53.3% in the most commonly used input range of 0-32k [3]
Speech Models
- The newly released Doubao Speech Synthesis Model 2.0 and Doubao Voice Replication Model 2.0 feature enhanced emotional expressiveness and precise instruction adherence, and can accurately read complex formulas aloud [8]
- These models reach 90% accuracy in reading complex formulas for subjects from elementary through high school, addressing a long-standing challenge in the industry [8]
Intelligent Model Routing
- Volcano Engine has introduced the Smart Model Router, the first intelligent model selection solution in China, which lets users choose among "Balanced Mode," "Effect Priority Mode," and "Cost Priority Mode" so that each task request is matched to the most suitable model [10]
- In tests, the Smart Model Router improved the effectiveness of the DeepSeek model by 14% in Effect Priority Mode, and reduced overall cost by more than 70% in Cost Priority Mode while achieving similar results [10]
Market Position
- As of the end of September 2025, daily token usage of the Doubao large model exceeded 30 trillion, an increase of more than 80% since the end of May [1]
- Volcano Engine holds a 49.2% share of China's public cloud large model service market, ranking first according to IDC [1]
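The three routing modes described above can be illustrated with a minimal sketch. The mode names follow the summary, but everything else is an assumption made for illustration: the `Candidate` fields, the quality and cost numbers, the `quality_tolerance` and `cost_weight` parameters, and the scoring rules. Nothing here is the Smart Model Router's actual algorithm or API.

```python
from dataclasses import dataclass
from enum import Enum


class RoutingMode(Enum):
    BALANCED = "balanced"
    EFFECT_PRIORITY = "effect_priority"
    COST_PRIORITY = "cost_priority"


@dataclass(frozen=True)
class Candidate:
    name: str        # hypothetical model identifier
    quality: float   # hypothetical 0-1 quality estimate for the task
    cost: float      # hypothetical cost per request, must be positive


def route(candidates, mode, quality_tolerance=0.1, cost_weight=0.1):
    """Pick one candidate model according to the routing mode (illustrative rules)."""
    if not candidates:
        raise ValueError("need at least one candidate model")
    if mode is RoutingMode.EFFECT_PRIORITY:
        # Best expected quality, regardless of cost.
        return max(candidates, key=lambda c: c.quality)
    if mode is RoutingMode.COST_PRIORITY:
        # Cheapest model whose quality is close to the best available.
        best_quality = max(c.quality for c in candidates)
        close_enough = [c for c in candidates
                        if c.quality >= best_quality - quality_tolerance]
        return min(close_enough, key=lambda c: c.cost)
    # BALANCED: quality minus a small penalty for relative cost.
    max_cost = max(c.cost for c in candidates)
    return max(candidates,
               key=lambda c: c.quality - cost_weight * c.cost / max_cost)


models = [
    Candidate("model-a", quality=0.90, cost=10.0),
    Candidate("model-b", quality=0.87, cost=4.0),
    Candidate("model-c", quality=0.84, cost=2.0),
    Candidate("model-d", quality=0.60, cost=1.0),
]
print(route(models, RoutingMode.EFFECT_PRIORITY).name)  # model-a
print(route(models, RoutingMode.BALANCED).name)         # model-b
print(route(models, RoutingMode.COST_PRIORITY).name)    # model-c
```

In this toy setup the cost-priority pick costs 80% less than the effect-priority pick while staying within the quality tolerance, which mirrors the kind of trade-off the summary reports ("similar results" at lower cost), though the numbers are invented.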
New Doubao Model Has Guo Degang Shouting "Going-Crazy Literature": (This Job,) I Quit! I Quit! I Quit!!!
QbitAI (量子位) · 2025-10-16 06:11
Core Viewpoint
- The article discusses Volcano Engine's advances in AI voice technology, focusing on upgrades to the Doubao speech synthesis and voice replication models that enhance emotional expression and contextual understanding in AI-generated speech [5][11][41].
Group 1: AI Voice Technology Upgrades
- Volcano Engine has upgraded its Doubao speech synthesis model to version 2.0, which expresses emotion and understands dialogue better [7][11].
- The upgrade comprises two main models, Doubao speech synthesis model 2.0 and Doubao voice replication model 2.0, which enable AI to replicate voices and capture emotional nuance [7][8].
- The new models can interpret user instructions about emotion, dialect, tone, and speech rate, significantly improving the quality of AI-generated speech [12][21].
Group 2: Contextual Understanding and Emotional Expression
- The models can now incorporate context from earlier dialogue, improving the coherence and emotional depth of the generated speech [12][23].
- Reading complex formulas aloud has improved: the Doubao model reaches around 90% accuracy on complex formulas for school subjects, compared with less than 50% for similar models [24][25].
- These advances allow more human-like interaction, moving from merely sounding human to understanding human emotion and context [11][41].
Group 3: Technological Innovations and Applications
- The Doubao large model 1.6 has been upgraded to support adjustable thinking lengths, letting users balance effectiveness, latency, and cost [30][33].
- Volcano Engine has introduced a Smart Model Router, which matches user tasks with the most suitable models and reduces costs by up to 71% in its cost-priority mode [39][41].
- The technology has been applied in a range of commercial scenarios, enhancing user experience in products from companies such as Xiaomi and OPPO and improving responses to complex requests on platforms such as Dongchedi [45][46].
Group 4: Growth and Infrastructure
- Daily token usage of the Doubao large model has surged from 120 billion to over 30 trillion, a 253-fold increase in just over a year [47][48].
- This growth is supported by Volcano Engine's AI cloud infrastructure, which provides the computing power and high-quality data needed for model training and inference [48].
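The growth figures quoted across these summaries can be cross-checked with a few lines of arithmetic. The sketch below uses only the reported numbers (120 billion, 30 trillion, 253-fold, and the 80% increase since the end of May); the helper names are hypothetical. Because the sources say "over 30 trillion" and "over 80%," the results are rough bounds, not exact values.

```python
BILLION = 10**9
TRILLION = 10**12


def fold_increase(start: float, end: float) -> float:
    """How many times larger `end` is than `start`."""
    return end / start


def baseline_before_growth(end: float, pct_increase: float) -> float:
    """Value before a given percentage increase led to `end`."""
    return end / (1 + pct_increase / 100)


start_tokens = 120 * BILLION      # reported starting daily usage
floor_tokens = 30 * TRILLION      # "over 30 trillion" is a lower bound

# 30 trillion / 120 billion is exactly 250x, so the reported 253-fold
# figure implies a current value slightly above 30 trillion.
print(fold_increase(start_tokens, floor_tokens))  # 250.0
print(253 * start_tokens / TRILLION)              # 30.36

# An 80% increase to 30 trillion implies roughly 16.67 trillion
# daily tokens at the end of May.
may_tokens = baseline_before_growth(floor_tokens, 80)
print(round(may_tokens / TRILLION, 2))            # 16.67
```

The 253-fold figure and the "over 30 trillion" figure are therefore consistent with each other if the actual daily usage is about 30.36 trillion tokens.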