Workflow
DeepSeek
icon
Search documents
DeepSeek-V3.1正式发布,叫板OpenAI,适配下一代国产芯片
Feng Huang Wang· 2025-08-21 09:18
Core Insights - The release of DeepSeek V3.1 is positioned as a significant step towards the "Agent Era," featuring a hybrid reasoning architecture that allows the model to switch between fast responses and longer reasoning processes [1] - The new model reduces token generation by 20% to 50% compared to its predecessor, enhancing response speed and lowering usage costs [1] - V3.1 improves throughput efficiency and energy performance, laying the groundwork for large-scale applications [1] - The model demonstrates enhanced capabilities in programming tasks, showing improved execution and stability in real-world environments [1] - In complex search tasks, V3.1 exhibits advanced retrieval and integration abilities, outperforming previous models in multi-disciplinary challenges [1] Business and Ecosystem Strategy - DeepSeek adopts a "dual-track" strategy, continuing to offer API services while adjusting pricing and eliminating night discounts starting September 6 [2] - The base model and post-training versions of V3.1 have been open-sourced on Hugging Face and MoDa [2] Technical Specifications - V3.1 utilizes UE8M0 FP8 Scale parameter precision, aligning with the upcoming generation of domestic chips, which may require specific software adaptations for optimal performance [4] - The release appears to be a direct competitor to GPT-5, with both models supporting long contexts and complex task handling, while offering flexible base model calls and cost structures [4]
X @Bloomberg
Bloomberg· 2025-08-21 09:18
DeepSeek unveiled an update to an older model that it says surpasses the seminal R1 on key benchmarks, keeping the Chinese startup in the game while the industry awaits its next flagship offering https://t.co/sOrQY9ug5n ...
DeepSeek V3.1:价格涨了但Agent能力提升了
第一财经· 2025-08-21 09:09
2025.08. 21 本文字数:1056,阅读时长大约2分钟 作者 | 第一财 经 刘晓洁 8月21日,业界千呼万唤的R2模型没来,但DeepSeek官方正式发布了新模型V3.1。从命名来看这 或许不是一次大的版本更新,更像是前一代DeepSeek-V3模型的小版本迭代。 在X上,DeepSeek将V3.1称为"我们迈向智能体时代的第一步"(our first step toward the agent era)。本次升级主要有三大亮点,其中包括更强的 Agent能力、混合思考模式和更高的思考 效率。 官方表示,通过后训练优化,新模型在工具使用与智能体任务中的表现有较大提升。在编程智能体、 搜索智能体测评中, V3.1 相比之前的 DeepSeek 系列模型都有明显提高。 | Benchmarks | DeepSeek-V3.1 | DeepSeek- | DeepSeek- | | --- | --- | --- | --- | | | | V3-0324 | R1-0528 | | SWE-bench | 66.0 | 45.4 | 44.6 | | Verified | | | | | SWE-ben ...
X @外汇交易员
外汇交易员· 2025-08-21 08:45
DeepSeek在其官宣发布DeepSeek-V3.1的文章中提到,DeepSeek-V3.1使用了UE8M0 FP8 Scale的参数精度。另外,V3.1对分词器及chat template进行了较大调整,与DeepSeek-V3存在明显差异。DeepSeekg官方在置顶留言里表示,UE8M0 FP8是针对即将发布的下一代国产芯片设计。 https://t.co/ydxMxF53VL外汇交易员 (@myfxtrader):DeepSeek刚刚官宣V3.1模型。模型同时支持思考模式与非思考模式;思考效率提升,相比DeepSeek-R1-0528,DeepSeek-V3.1-Think能在更短时间内给出答案;Agent能力更强,通过Post-Training优化,新模型在工具使用与智能体任务中的表现有较大提升。 https://t.co/ajVjLgC4ri ...
华为登顶2025年《财富》中国科技50强 DeepSeek第二
Sou Hu Cai Jing· 2025-08-21 08:27
Core Insights - The 2025 Fortune China Tech 50 list was released, with Huawei Investment Holding Co., Ltd. ranking first due to its comprehensive leadership in communication, chips, operating systems, and artificial intelligence [1] - DeepSeek, focused on AI large model development, ranked second, showcasing significant advancements in the AI sector [1] Huawei - Huawei is recognized as a representative of Chinese tech companies, driving global communication and smart technology development [3] - In the 5G communication sector, Huawei holds a 15% share of essential patents, ranking first globally and providing key technological support for 5G network construction in multiple countries [3] - The Kirin 9020 chip has over 90% localization rate, with significant performance improvements; the Ascend AI processor has set global performance records in supporting large model inference [3] - The Harmony operating system has surpassed 10 million devices, with its ecosystem continuously expanding [3] - Huawei has made breakthroughs in optical technology, network architecture, and AI algorithms, facilitating technological upgrades across various industries [3] DeepSeek - DeepSeek is a leading enterprise in China's AI field, gaining global attention with its self-developed large model DeepSeek-R1, which scored 88.5 in the MMLU benchmark test, significantly outperforming international models like Llama 3 and Claude 2 [9] - The influence of DeepSeek in the open-source community is growing, with its model's global download volume ranking in the top ten [9] - As of June 2025, DeepSeek has reached 163 million monthly active users, becoming the largest AIGC application globally, reflecting China's strong market vitality and technological implementation capabilities in AI [9] Other Notable Companies - Other companies such as CATL, Alibaba, Tencent, and BYD have also made it to the top ranks due to their leading technological capabilities and market performance [10] - CATL continues to lead globally in power batteries and energy storage systems, achieving a 37.6% market share in global power battery installations in 2024, marking its seventh consecutive year in the top position [10]
DeepSeek-V3.1正式发布:混合推理,Agent能力大幅提高!概念股直线拉升
Mei Ri Jing Ji Xin Wen· 2025-08-21 08:27
Core Insights - DeepSeek officially released DeepSeek-V3.1 on August 21, featuring significant upgrades in its architecture and capabilities [1] Group 1: Product Upgrades - The new hybrid reasoning architecture allows a single model to support both thinking and non-thinking modes [1] - DeepSeek-V3.1-Think demonstrates improved efficiency, providing answers in a shorter time compared to DeepSeek-R1-0528 [1] - Enhanced agent capabilities have been achieved through post-training optimization, resulting in better performance in tool usage and agent tasks [1] Group 2: API and Pricing Changes - The DeepSeek API has been upgraded to include deepseek-chat for non-thinking mode and deepseek-reasoner for thinking mode, with context expanded to 128K [1] - A new pricing structure for API calls will be implemented starting September 6, 2025, with the cancellation of night-time discounts [2] - Until September 6, all API services will continue to be billed under the original pricing policy [4] Group 3: Technical Specifications - DeepSeek-V3.1 utilizes UE8M0 FP8 Scale parameter precision, designed for the upcoming generation of domestic chips [4] - The integration of DeepSeek-V3.1 capabilities into the Claude Code framework is facilitated by support for Anthropic API format [1]
DeepSeek:UE8M0 FP8是针对即将发布的下一代国产芯片设计
智通财经网· 2025-08-21 08:23
Core Insights - DeepSeek has released version 3.1, marking a significant step towards the "Agent Era" [1] - The new version utilizes UE8M0 FP8 Scale parameter precision, indicating advancements in technology [1] - There are notable adjustments in the tokenizer and chat template compared to the previous version, DeepSeek-V3 [1] - The UE8M0 FP8 is specifically designed for an upcoming next-generation domestic chip [1][2] Company Developments - The official webpage, app, mini-program, and API platform have all been updated to incorporate the new model [2] - Users have expressed anticipation for additional features, such as image and video functionality [2]
DeepSeek-V3.1首搭UE8M0 FP8精度技术 适配下一代国产芯片
Feng Huang Wang· 2025-08-21 08:18
Core Insights - DeepSeek officially announced the release of DeepSeek-V3.1, which utilizes UE8M0 FP8 Scale parameter precision [1] - The V3.1 version features significant adjustments to the tokenizer and chat template, showing clear differences from DeepSeek-V3 [1] - DeepSeek's official WeChat account indicated that UE8M0 FP8 is designed for the upcoming next-generation domestic chip [1] - In response to user inquiries regarding version information not being V3.1, the company confirmed that the current official web, app, mini-program, and API platform have all been updated to the new model, which identifies itself as DeepSeek-V3 [1]
DeepSeek发布新模型V3.1,价格涨了但Agent能力提升了
Di Yi Cai Jing· 2025-08-21 08:11
"迈向智能体时代的第一步"。 8月21日,业界千呼万唤的R2模型没来,但DeepSeek官方正式发布了新模型V3.1。从命名来看这或许不是一次大的版本更新,更像是前一代DeepSeek-V3模 型的小版本迭代。 在X上,DeepSeek将V3.1称为"我们迈向智能体时代的第一步"(our first step toward the agent era)。本次升级主要有三大亮点,其中包括更强的 Agent能力、 混合思考模式和更高的思考效率。 官方表示,通过后训练优化,新模型在工具使用与智能体任务中的表现有较大提升。在编程智能体、搜索智能体测评中, V3.1 相比之前的 DeepSeek 系列 模型都有明显提高。 | Benchmarks | DeepSeek-V3.1 | | --- | --- | | SWE-bench | 66.0 | | Verified | | | SWE-bench | 54.5 | | Multilingual | | | Terminal-Bench | 31.3 | DeepSeek-V3.1 是混合推理架构,一个模型同时支持思考模式和非思考模式。目前用户可在官方 App与网 ...
DeepSeek-V3.1,正式发布
财联社· 2025-08-21 08:00
据DeepSeek官方公众号消息,DeepSeek-V3.1正式发布。 本次升级包含以下主要变化: 官方App与网页端模型已同步升级为DeepSeek-V3.1。用户可以通过"深度思考"按钮,实现思考模式与非思考模式的自由切换。 价格调整 将于北京时间2025年9月6日凌晨起,对DeepSeek开放平台API接口调用价格进行如下调整: 执行新版价格表(如下图所示,详见定价页面); 取消夜间时段优惠。 在9月6日前,所有API服务仍按原价格政策计费,可继续享受当前优惠。 混合推理架构: 一个模型同时支持思考模式与非思考模式; 更高的思考效率: 相比DeepSeek-R1-0528,DeepSeek-V3.1-Think能在更短时间内给出答案; 更强的Agent能力: 通过Post-Training优化,新模型在工具使用与智能体任务中的表现有较大提升。 下载财联社APP获取更多资讯 准确 快速 权威 专业 7x24h电报 头条新闻 VIP资讯 实时盯盘 ...