Workflow
DeepSeek
icon
Search documents
DeepSeek-V3.1首搭UE8M0 FP8精度技术 适配下一代国产芯片
Feng Huang Wang· 2025-08-21 08:18
凤凰网科技讯 8月21日,DeepSeek在其官宣"正式发布DeepSeek-V3.1"的文章里面提到,DeepSeek-V3.1 使用了UE8M0 FP8 Scale的参数精度。另外,V3.1对分词器及chat template进行了较大调整,与 DeepSeek-V3 存在明显差异。DeepSeek官微在置顶留言里说,UE8M0 FP8是针对即将发布的下一代国产 芯片设计。 此外,针对网友提问DeepSeek版本信息不是V3.1的问题,官方回复表示,当前官方网页端、App、小程 序及 API 开放平台所调用模型均已同步更新,新模型自我认知为DeepSeek-V3。 ...
DeepSeek发布新模型V3.1,价格涨了但Agent能力提升了
Di Yi Cai Jing· 2025-08-21 08:11
"迈向智能体时代的第一步"。 8月21日,业界千呼万唤的R2模型没来,但DeepSeek官方正式发布了新模型V3.1。从命名来看这或许不是一次大的版本更新,更像是前一代DeepSeek-V3模 型的小版本迭代。 在X上,DeepSeek将V3.1称为"我们迈向智能体时代的第一步"(our first step toward the agent era)。本次升级主要有三大亮点,其中包括更强的 Agent能力、 混合思考模式和更高的思考效率。 官方表示,通过后训练优化,新模型在工具使用与智能体任务中的表现有较大提升。在编程智能体、搜索智能体测评中, V3.1 相比之前的 DeepSeek 系列 模型都有明显提高。 | Benchmarks | DeepSeek-V3.1 | | --- | --- | | SWE-bench | 66.0 | | Verified | | | SWE-bench | 54.5 | | Multilingual | | | Terminal-Bench | 31.3 | DeepSeek-V3.1 是混合推理架构,一个模型同时支持思考模式和非思考模式。目前用户可在官方 App与网 ...
DeepSeek-V3.1,正式发布
财联社· 2025-08-21 08:00
据DeepSeek官方公众号消息,DeepSeek-V3.1正式发布。 本次升级包含以下主要变化: 官方App与网页端模型已同步升级为DeepSeek-V3.1。用户可以通过"深度思考"按钮,实现思考模式与非思考模式的自由切换。 价格调整 将于北京时间2025年9月6日凌晨起,对DeepSeek开放平台API接口调用价格进行如下调整: 执行新版价格表(如下图所示,详见定价页面); 取消夜间时段优惠。 在9月6日前,所有API服务仍按原价格政策计费,可继续享受当前优惠。 混合推理架构: 一个模型同时支持思考模式与非思考模式; 更高的思考效率: 相比DeepSeek-R1-0528,DeepSeek-V3.1-Think能在更短时间内给出答案; 更强的Agent能力: 通过Post-Training优化,新模型在工具使用与智能体任务中的表现有较大提升。 下载财联社APP获取更多资讯 准确 快速 权威 专业 7x24h电报 头条新闻 VIP资讯 实时盯盘 ...
DeepSeek-V3.1发布:更高效思考、更强Agent能力、更长上下文
生物世界· 2025-08-21 08:00
Core Insights - DeepSeek has officially released DeepSeek-V3.1, introducing a hybrid reasoning architecture that allows users to switch between "Deep Thinking" mode and "Non-Thinking" mode for enhanced interaction [2][3]. Group 1: Hybrid Reasoning Architecture - The "Deep Thinking" mode (DeepSeek-Reasoner) is designed for tasks requiring deep reasoning, such as mathematical calculations and complex logic analysis, providing higher reasoning efficiency [3]. - The "Non-Thinking" mode (DeepSeek-Chat) is tailored for everyday conversations and information queries, offering quicker responses [4]. - Users can easily switch modes via a "Deep Thinking" button on the official app and web interface, enhancing the user experience [5]. Group 2: Enhanced Agent Capabilities - DeepSeek-V3.1 has significantly improved tool usage and agent task performance through Post-Training optimization, resulting in fewer required iterations and higher efficiency in code repair and command line tasks [6]. - Benchmark results show that DeepSeek-V3.1 outperforms its predecessor, DeepSeek-R1-0528, in various tasks, including SWE-bench and Terminal-Bench, with scores of 66.0 and 31.3 respectively [7][8]. Group 3: Efficiency Improvements - The new version employs a thought chain compression training method, reducing output tokens by 20%-50% while maintaining performance levels comparable to DeepSeek-R1-0528, leading to faster response times and lower API call costs [9]. Group 4: API Upgrades and Model Availability - The DeepSeek API has been upgraded to support a context length of 128K, facilitating easier handling of long documents [10][12]. - The base and post-training models of DeepSeek-V3.1 are now open-sourced on platforms like Hugging Face and ModelScope, with a price adjustment for the API set to take effect on September 6, 2025 [11].
DeepSeek-V3.1正式发布
第一财经· 2025-08-21 07:53
Core Viewpoint - DeepSeek has officially released version V3.1, featuring significant upgrades in reasoning architecture, efficiency, and agent capabilities [3][4]. Group 1: Key Features of DeepSeek-V3.1 - The new hybrid reasoning architecture allows the model to support both thinking and non-thinking modes simultaneously [3]. - Enhanced thinking efficiency enables DeepSeek-V3.1-Think to provide answers in a shorter time compared to its predecessor, DeepSeek-R1-0528 [3]. - Improved agent capabilities through post-training optimization have led to better performance in tool usage and intelligent tasks [3]. Group 2: API and Pricing Changes - The official app and web model have been upgraded to DeepSeek-V3.1, allowing users to switch between thinking and non-thinking modes via a "deep thinking" button [3]. - The DeepSeek API has also been upgraded, with deepseek-chat corresponding to non-thinking mode and deepseek-reasoner to thinking mode, expanding context to 128K [3]. - Starting from September 6, 2025, the pricing for API calls will be adjusted, with the cancellation of night-time discounts [4][6].
DeepSeek-V3.1发布 具备更高的思考效率以及更强的Agent能力
智通财经网· 2025-08-21 07:49
智通财经APP获悉,8月21日,DeepSeek正式发布 DeepSeek-V3.1。本次升级包含主要变化有:混合推理架构(一个模型同时支持思考模式与非思考模式); 更高的思考效率(相比 DeepSeek-R1-0528,DeepSeek-V3.1-Think 能在更短时间内给出答案);更强的 Agent 能力(通过 Post-Training 优化,新模型在工具使 用与智能体任务中的表现有较大提升)。 表 2:搜索智能体测评(测试结果调用商用搜索引擎 API+网页过滤+128K context window;R1-0528 使用内部 workflow 模式测试;HLE 测试同时使用 python 与 search 工具) | Benchmarks | DeepSeek-V3.1 | DeepSeek- | | --- | --- | --- | | | | R1-0528 | | Browsecomp | 30.0 | 8.9 | | Browsecomp_zh | 49.2 | 35.7 | | HLE | 29.8 | 24.8 | | xbench-DeepSearch | 71.2 | 55.0 | ...
DeepSeek-V3.1正式发布,上下文均扩展为128K
Di Yi Cai Jing· 2025-08-21 07:19
Core Insights - DeepSeek has officially released the upgraded model DeepSeek-V3.1, which features a hybrid reasoning architecture that supports both thinking and non-thinking modes [1] - The new model, DeepSeek-V3.1-Think, demonstrates improved thinking efficiency, providing answers in a shorter time compared to its predecessor DeepSeek-R1-0528 [1] - Enhanced agent capabilities have been achieved through post-training optimization, significantly improving performance in tool usage and agent tasks [1] Pricing Adjustments - Starting from September 6, 2025, the pricing for API calls on the DeepSeek open platform will be adjusted according to a new price list, with the cancellation of night-time discounts [2] - Until September 6, 2025, all API services will continue to be billed under the original pricing policy [4]
官宣!DeepSeek-V3.1 发布,API调用价格低至0.5元/百万Tokens
Xin Lang Ke Ji· 2025-08-21 07:05
Core Insights - DeepSeek announced the release of DeepSeek-V3.1 and will adjust the API pricing effective September 6, 2025 [1][3] - The new pricing structure includes input prices of 0.5 CNY per million tokens for cache hits and 4 CNY per million tokens for cache misses, with output prices set at 12 CNY per million tokens [1] Group 1: Upgrade Features - The V3.1 upgrade introduces a hybrid reasoning architecture that supports both thinking and non-thinking modes within a single model [3] - Enhanced thinking efficiency allows DeepSeek-V3.1-Think to provide answers in a shorter time compared to its predecessor, DeepSeek-R1-0528 [3] - Improved agent capabilities through post-training optimization significantly enhance the model's performance in tool usage and agent tasks [3] Group 2: User Experience - The official app and web model have been upgraded to DeepSeek-V3.1, allowing users to switch freely between thinking and non-thinking modes via a "deep thinking" button [3]
DeepSeek-V3.1发布
Core Insights - DeepSeek has officially released DeepSeek-V3.1, which includes significant upgrades in its architecture and performance [1] Group 1: Key Features of DeepSeek-V3.1 - Hybrid reasoning architecture: The model supports both thinking and non-thinking modes simultaneously [1] - Enhanced thinking efficiency: DeepSeek-V3.1-Think can provide answers in a shorter time compared to DeepSeek-R1-0528 [1] - Improved agent capabilities: The new model shows significant improvements in tool usage and agent tasks through post-training optimization [1]
X @外汇交易员
外汇交易员· 2025-08-21 06:57
根据DeepSeek最新发布的V3.1 API定价,输入/输出价格从V3的2元和8元/百万 token分别上调至4元和12元/百万 token,同时取消夜间时段优惠。调整将在9月6日起生效。 https://t.co/BvDHFZKQms外汇交易员 (@myfxtrader):DeepSeek刚刚官宣V3.1模型。模型同时支持思考模式与非思考模式;思考效率提升,相比DeepSeek-R1-0528,DeepSeek-V3.1-Think能在更短时间内给出答案;Agent能力更强,通过Post-Training优化,新模型在工具使用与智能体任务中的表现有较大提升。 https://t.co/ajVjLgC4ri ...