Workflow
混合推理架构
icon
Search documents
「AI新世代」R2没等来先发V3.1!DeepSeek深陷大模型“包围圈”热度炙烤
Hua Xia Shi Bao· 2025-08-22 06:49
Core Viewpoint - DeepSeek's recent update to version V3.1 has disappointed many users who were eagerly awaiting the release of the R2 model, which has been delayed for several months, leading to a decline in the company's popularity and user engagement [2][3][10] Group 1: Product Updates - DeepSeek released V3.1 on August 21, which ranked third on HuggingFace's trend list, but many users expressed dissatisfaction and called for the return of the previous R1 model [2][3] - The V3.1 update features a hybrid reasoning architecture that combines thinking and non-thinking modes, enhancing efficiency and aligning with trends seen in other major models like GPT-5 [4] - V3.1 offers faster response times and improved agent capabilities, with an expanded context of 128K after the API upgrade [5] Group 2: Pricing Changes - Starting September 6, DeepSeek will adjust its API pricing to 0.5 RMB per million tokens for cache hits, 4 RMB for cache misses, and 12 RMB for output, representing a middle ground between previous versions [5] Group 3: Competitive Landscape - Other domestic AI models, such as those from Zhiyu and Alibaba, are rapidly updating and releasing new features, creating a competitive environment that DeepSeek is struggling to keep up with [7][8] - The overall market for large models is intensifying, with significant advancements from both domestic and international competitors, including OpenAI's GPT-5 and Google's Genie 3 [9] Group 4: User Engagement and Market Position - DeepSeek's website traffic has been declining for four consecutive months, with a 9.63% average monthly decrease, and its app's monthly active users fell to 82.93 million in July, marking a significant drop [10]
DeepSeek-V3.1正式发布,叫板OpenAI,适配下一代国产芯片
Feng Huang Wang· 2025-08-21 09:18
Core Insights - The release of DeepSeek V3.1 is positioned as a significant step towards the "Agent Era," featuring a hybrid reasoning architecture that allows the model to switch between fast responses and longer reasoning processes [1] - The new model reduces token generation by 20% to 50% compared to its predecessor, enhancing response speed and lowering usage costs [1] - V3.1 improves throughput efficiency and energy performance, laying the groundwork for large-scale applications [1] - The model demonstrates enhanced capabilities in programming tasks, showing improved execution and stability in real-world environments [1] - In complex search tasks, V3.1 exhibits advanced retrieval and integration abilities, outperforming previous models in multi-disciplinary challenges [1] Business and Ecosystem Strategy - DeepSeek adopts a "dual-track" strategy, continuing to offer API services while adjusting pricing and eliminating night discounts starting September 6 [2] - The base model and post-training versions of V3.1 have been open-sourced on Hugging Face and MoDa [2] Technical Specifications - V3.1 utilizes UE8M0 FP8 Scale parameter precision, aligning with the upcoming generation of domestic chips, which may require specific software adaptations for optimal performance [4] - The release appears to be a direct competitor to GPT-5, with both models supporting long contexts and complex task handling, while offering flexible base model calls and cost structures [4]
DeepSeek-V3.1正式发布:混合推理,Agent能力大幅提高!概念股直线拉升
Mei Ri Jing Ji Xin Wen· 2025-08-21 08:27
Core Insights - DeepSeek officially released DeepSeek-V3.1 on August 21, featuring significant upgrades in its architecture and capabilities [1] Group 1: Product Upgrades - The new hybrid reasoning architecture allows a single model to support both thinking and non-thinking modes [1] - DeepSeek-V3.1-Think demonstrates improved efficiency, providing answers in a shorter time compared to DeepSeek-R1-0528 [1] - Enhanced agent capabilities have been achieved through post-training optimization, resulting in better performance in tool usage and agent tasks [1] Group 2: API and Pricing Changes - The DeepSeek API has been upgraded to include deepseek-chat for non-thinking mode and deepseek-reasoner for thinking mode, with context expanded to 128K [1] - A new pricing structure for API calls will be implemented starting September 6, 2025, with the cancellation of night-time discounts [2] - Until September 6, all API services will continue to be billed under the original pricing policy [4] Group 3: Technical Specifications - DeepSeek-V3.1 utilizes UE8M0 FP8 Scale parameter precision, designed for the upcoming generation of domestic chips [4] - The integration of DeepSeek-V3.1 capabilities into the Claude Code framework is facilitated by support for Anthropic API format [1]
DeepSeek-V3.1发布:更高效思考、更强Agent能力、更长上下文
生物世界· 2025-08-21 08:00
Core Insights - DeepSeek has officially released DeepSeek-V3.1, introducing a hybrid reasoning architecture that allows users to switch between "Deep Thinking" mode and "Non-Thinking" mode for enhanced interaction [2][3]. Group 1: Hybrid Reasoning Architecture - The "Deep Thinking" mode (DeepSeek-Reasoner) is designed for tasks requiring deep reasoning, such as mathematical calculations and complex logic analysis, providing higher reasoning efficiency [3]. - The "Non-Thinking" mode (DeepSeek-Chat) is tailored for everyday conversations and information queries, offering quicker responses [4]. - Users can easily switch modes via a "Deep Thinking" button on the official app and web interface, enhancing the user experience [5]. Group 2: Enhanced Agent Capabilities - DeepSeek-V3.1 has significantly improved tool usage and agent task performance through Post-Training optimization, resulting in fewer required iterations and higher efficiency in code repair and command line tasks [6]. - Benchmark results show that DeepSeek-V3.1 outperforms its predecessor, DeepSeek-R1-0528, in various tasks, including SWE-bench and Terminal-Bench, with scores of 66.0 and 31.3 respectively [7][8]. Group 3: Efficiency Improvements - The new version employs a thought chain compression training method, reducing output tokens by 20%-50% while maintaining performance levels comparable to DeepSeek-R1-0528, leading to faster response times and lower API call costs [9]. Group 4: API Upgrades and Model Availability - The DeepSeek API has been upgraded to support a context length of 128K, facilitating easier handling of long documents [10][12]. - The base and post-training models of DeepSeek-V3.1 are now open-sourced on platforms like Hugging Face and ModelScope, with a price adjustment for the API set to take effect on September 6, 2025 [11].
DeepSeek-V3.1正式发布
第一财经· 2025-08-21 07:53
Core Viewpoint - DeepSeek has officially released version V3.1, featuring significant upgrades in reasoning architecture, efficiency, and agent capabilities [3][4]. Group 1: Key Features of DeepSeek-V3.1 - The new hybrid reasoning architecture allows the model to support both thinking and non-thinking modes simultaneously [3]. - Enhanced thinking efficiency enables DeepSeek-V3.1-Think to provide answers in a shorter time compared to its predecessor, DeepSeek-R1-0528 [3]. - Improved agent capabilities through post-training optimization have led to better performance in tool usage and intelligent tasks [3]. Group 2: API and Pricing Changes - The official app and web model have been upgraded to DeepSeek-V3.1, allowing users to switch between thinking and non-thinking modes via a "deep thinking" button [3]. - The DeepSeek API has also been upgraded, with deepseek-chat corresponding to non-thinking mode and deepseek-reasoner to thinking mode, expanding context to 128K [3]. - Starting from September 6, 2025, the pricing for API calls will be adjusted, with the cancellation of night-time discounts [4][6].
官宣!DeepSeek-V3.1 发布,API调用价格低至0.5元/百万Tokens
Xin Lang Ke Ji· 2025-08-21 07:05
Core Insights - DeepSeek announced the release of DeepSeek-V3.1 and will adjust the API pricing effective September 6, 2025 [1][3] - The new pricing structure includes input prices of 0.5 CNY per million tokens for cache hits and 4 CNY per million tokens for cache misses, with output prices set at 12 CNY per million tokens [1] Group 1: Upgrade Features - The V3.1 upgrade introduces a hybrid reasoning architecture that supports both thinking and non-thinking modes within a single model [3] - Enhanced thinking efficiency allows DeepSeek-V3.1-Think to provide answers in a shorter time compared to its predecessor, DeepSeek-R1-0528 [3] - Improved agent capabilities through post-training optimization significantly enhance the model's performance in tool usage and agent tasks [3] Group 2: User Experience - The official app and web model have been upgraded to DeepSeek-V3.1, allowing users to switch freely between thinking and non-thinking modes via a "deep thinking" button [3]
DeepSeek-V3.1发布
Core Insights - DeepSeek has officially released DeepSeek-V3.1, which includes significant upgrades in its architecture and performance [1] Group 1: Key Features of DeepSeek-V3.1 - Hybrid reasoning architecture: The model supports both thinking and non-thinking modes simultaneously [1] - Enhanced thinking efficiency: DeepSeek-V3.1-Think can provide answers in a shorter time compared to DeepSeek-R1-0528 [1] - Improved agent capabilities: The new model shows significant improvements in tool usage and agent tasks through post-training optimization [1]