混合推理架构 - filings, earnings calls, financial reports, news

DeepSeek-V3.1正式发布，叫板OpenAI，适配下一代国产芯片

DeepSeek R1

DeepSeek R2

DeepSeek V3.1

Feng Huang Wang· 2025-08-21 09:18

Core Insights - The release of DeepSeek V3.1 is positioned as a significant step towards the "Agent Era," featuring a hybrid reasoning architecture that allows the model to switch between fast responses and longer reasoning processes [1] - The new model reduces token generation by 20% to 50% compared to its predecessor, enhancing response speed and lowering usage costs [1] - V3.1 improves throughput efficiency and energy performance, laying the groundwork for large-scale applications [1] - The model demonstrates enhanced capabilities in programming tasks, showing improved execution and stability in real-world environments [1] - In complex search tasks, V3.1 exhibits advanced retrieval and integration abilities, outperforming previous models in multi-disciplinary challenges [1] Business and Ecosystem Strategy - DeepSeek adopts a "dual-track" strategy, continuing to offer API services while adjusting pricing and eliminating night discounts starting September 6 [2] - The base model and post-training versions of V3.1 have been open-sourced on Hugging Face and MoDa [2] Technical Specifications - V3.1 utilizes UE8M0 FP8 Scale parameter precision, aligning with the upcoming generation of domestic chips, which may require specific software adaptations for optimal performance [4] - The release appears to be a direct competitor to GPT-5, with both models supporting long contexts and complex task handling, while offering flexible base model calls and cost structures [4]

Seek .(US:SKLTY)

大模型

DeepSeek-V3.1正式发布：混合推理，Agent能力大幅提高！概念股直线拉升

GPT5

API服务

Mei Ri Jing Ji Xin Wen· 2025-08-21 08:27

Core Insights - DeepSeek officially released DeepSeek-V3.1 on August 21, featuring significant upgrades in its architecture and capabilities [1] Group 1: Product Upgrades - The new hybrid reasoning architecture allows a single model to support both thinking and non-thinking modes [1] - DeepSeek-V3.1-Think demonstrates improved efficiency, providing answers in a shorter time compared to DeepSeek-R1-0528 [1] - Enhanced agent capabilities have been achieved through post-training optimization, resulting in better performance in tool usage and agent tasks [1] Group 2: API and Pricing Changes - The DeepSeek API has been upgraded to include deepseek-chat for non-thinking mode and deepseek-reasoner for thinking mode, with context expanded to 128K [1] - A new pricing structure for API calls will be implemented starting September 6, 2025, with the cancellation of night-time discounts [2] - Until September 6, all API services will continue to be billed under the original pricing policy [4] Group 3: Technical Specifications - DeepSeek-V3.1 utilizes UE8M0 FP8 Scale parameter precision, designed for the upcoming generation of domestic chips [4] - The integration of DeepSeek-V3.1 capabilities into the Claude Code framework is facilitated by support for Anthropic API format [1]

DeepSeek-V3.1发布：更高效思考、更强Agent能力、更长上下文

生物世界· 2025-08-21 08:00

Core Insights - DeepSeek has officially released DeepSeek-V3.1, introducing a hybrid reasoning architecture that allows users to switch between "Deep Thinking" mode and "Non-Thinking" mode for enhanced interaction [2][3]. Group 1: Hybrid Reasoning Architecture - The "Deep Thinking" mode (DeepSeek-Reasoner) is designed for tasks requiring deep reasoning, such as mathematical calculations and complex logic analysis, providing higher reasoning efficiency [3]. - The "Non-Thinking" mode (DeepSeek-Chat) is tailored for everyday conversations and information queries, offering quicker responses [4]. - Users can easily switch modes via a "Deep Thinking" button on the official app and web interface, enhancing the user experience [5]. Group 2: Enhanced Agent Capabilities - DeepSeek-V3.1 has significantly improved tool usage and agent task performance through Post-Training optimization, resulting in fewer required iterations and higher efficiency in code repair and command line tasks [6]. - Benchmark results show that DeepSeek-V3.1 outperforms its predecessor, DeepSeek-R1-0528, in various tasks, including SWE-bench and Terminal-Bench, with scores of 66.0 and 31.3 respectively [7][8]. Group 3: Efficiency Improvements - The new version employs a thought chain compression training method, reducing output tokens by 20%-50% while maintaining performance levels comparable to DeepSeek-R1-0528, leading to faster response times and lower API call costs [9]. Group 4: API Upgrades and Model Availability - The DeepSeek API has been upgraded to support a context length of 128K, facilitating easier handling of long documents [10][12]. - The base and post-training models of DeepSeek-V3.1 are now open-sourced on platforms like Hugging Face and ModelScope, with a price adjustment for the API set to take effect on September 6, 2025 [11].

第一财经· 2025-08-21 07:53

Core Viewpoint - DeepSeek has officially released version V3.1, featuring significant upgrades in reasoning architecture, efficiency, and agent capabilities [3][4]. Group 1: Key Features of DeepSeek-V3.1 - The new hybrid reasoning architecture allows the model to support both thinking and non-thinking modes simultaneously [3]. - Enhanced thinking efficiency enables DeepSeek-V3.1-Think to provide answers in a shorter time compared to its predecessor, DeepSeek-R1-0528 [3]. - Improved agent capabilities through post-training optimization have led to better performance in tool usage and intelligent tasks [3]. Group 2: API and Pricing Changes - The official app and web model have been upgraded to DeepSeek-V3.1, allowing users to switch between thinking and non-thinking modes via a "deep thinking" button [3]. - The DeepSeek API has also been upgraded, with deepseek-chat corresponding to non-thinking mode and deepseek-reasoner to thinking mode, expanding context to 128K [3]. - Starting from September 6, 2025, the pricing for API calls will be adjusted, with the cancellation of night-time discounts [4][6].

思考效率

Agent能力

官宣！DeepSeek-V3.1 发布，API调用价格低至0.5元/百万Tokens

Xin Lang Ke Ji· 2025-08-21 07:05

Core Insights - DeepSeek announced the release of DeepSeek-V3.1 and will adjust the API pricing effective September 6, 2025 [1][3] - The new pricing structure includes input prices of 0.5 CNY per million tokens for cache hits and 4 CNY per million tokens for cache misses, with output prices set at 12 CNY per million tokens [1] Group 1: Upgrade Features - The V3.1 upgrade introduces a hybrid reasoning architecture that supports both thinking and non-thinking modes within a single model [3] - Enhanced thinking efficiency allows DeepSeek-V3.1-Think to provide answers in a shorter time compared to its predecessor, DeepSeek-R1-0528 [3] - Improved agent capabilities through post-training optimization significantly enhance the model's performance in tool usage and agent tasks [3] Group 2: User Experience - The official app and web model have been upgraded to DeepSeek-V3.1, allowing users to switch freely between thinking and non-thinking modes via a "deep thinking" button [3]

DeepSeek开放平台API接口

DeepSeek-V3.1发布

Zheng Quan Shi Bao Wang· 2025-08-21 07:01

Core Insights - DeepSeek has officially released DeepSeek-V3.1, which includes significant upgrades in its architecture and performance [1] Group 1: Key Features of DeepSeek-V3.1 - Hybrid reasoning architecture: The model supports both thinking and non-thinking modes simultaneously [1] - Enhanced thinking efficiency: DeepSeek-V3.1-Think can provide answers in a shorter time compared to DeepSeek-R1-0528 [1] - Improved agent capabilities: The new model shows significant improvements in tool usage and agent tasks through post-training optimization [1]

Seek .(US:SKLTY)