DeepSeek

Search documents
DeepSeek最新透露:是针对即将发布的下一代国产芯片设计
财联社· 2025-08-21 10:00
Core Insights - DeepSeek has announced the release of DeepSeek-V3.1, which utilizes UE8M0 FP8 Scale parameter precision [1] - The V3.1 version has made significant adjustments to the tokenizer and chat template, showing clear differences from DeepSeek-V3 [2] - DeepSeek's official WeChat account indicates that UE8M0 FP8 is designed for the upcoming next-generation domestic chip [3]
DeepSeek宣布涨价!适配下一代国产芯片 概念股飙升
2 1 Shi Ji Jing Ji Bao Dao· 2025-08-21 09:59
Core Insights - DeepSeek officially announced the release of version 3.1 on August 21, featuring significant upgrades including a hybrid reasoning architecture and improved response efficiency [1] - The pricing for API calls has increased, with new rates effective from September 6, including a removal of night-time discounts [2] - The market reacted positively to the news, with shares of Daily Interactive (300766.SZ) rising by 13.62% to 47.98 CNY per share [4] Group 1: Product Updates - DeepSeek V3.1 introduces a hybrid reasoning architecture that allows for flexible switching between thinking and non-thinking modes, enhancing agent capabilities [1] - The model utilizes UE8M0 FP8 Scale parameter precision and has undergone significant adjustments to the tokenizer and chat template, marking a clear distinction from DeepSeek V3 [1] - The foundational model of V3.1 has been retrained with an additional 840 billion tokens, and both the foundational and post-training models are available on Huggingface and MoDa [4] Group 2: Pricing Changes - The API call pricing has been adjusted, with input prices set at 0.5 CNY per million tokens for cache hits and 4 CNY per million tokens for cache misses, up from 2 CNY for V3 [2] - The output price has increased to 12 CNY per million tokens, compared to 8 CNY for V3 [2] Group 3: Market Reaction - Following the announcement, shares of Daily Interactive surged, attributed to its perceived connection with DeepSeek, as it was rumored to hold a 14.50% stake in the DeepSeek development team [4] - Daily Interactive clarified that it does not hold any equity in DeepSeek or its associated companies, despite previous speculation [4]
DeepSeek宣布涨价!适配下一代国产芯片,概念股飙升
2 1 Shi Ji Jing Ji Bao Dao· 2025-08-21 09:36
Group 1 - DeepSeek officially announced the release of version 3.1 on August 21, featuring significant upgrades including a hybrid reasoning architecture and improved response efficiency [1] - The new version utilizes UE8M0FP8Scale parameter precision and has made substantial adjustments to the tokenizer and chat template, showing clear differences from version 3 [1] - DeepSeek has adjusted its pricing for API interface calls, with input prices increasing to 0.5 yuan per million tokens for cache hits and 4 yuan for cache misses, while output prices rose to 12 yuan per million tokens [2] Group 2 - The foundational model of DeepSeek V3.1 underwent extensive retraining, adding a total of 840 billion tokens, and both the foundational and post-training models are available on Huggingface and Modao [4] - Following the announcement, shares of Daily Interactive (300766) surged, closing at 47.98 yuan per share with a daily increase of 13.62% [4] - Daily Interactive, established in 2010, provides data intelligence products and services, and there were rumors of its ownership stake in DeepSeek through its subsidiary, although the company later clarified it does not hold any equity in DeepSeek or its associated companies [7]
DeepSeek-V3.1正式发布,叫板OpenAI,适配下一代国产芯片
Feng Huang Wang· 2025-08-21 09:18
Core Insights - The release of DeepSeek V3.1 is positioned as a significant step towards the "Agent Era," featuring a hybrid reasoning architecture that allows the model to switch between fast responses and longer reasoning processes [1] - The new model reduces token generation by 20% to 50% compared to its predecessor, enhancing response speed and lowering usage costs [1] - V3.1 improves throughput efficiency and energy performance, laying the groundwork for large-scale applications [1] - The model demonstrates enhanced capabilities in programming tasks, showing improved execution and stability in real-world environments [1] - In complex search tasks, V3.1 exhibits advanced retrieval and integration abilities, outperforming previous models in multi-disciplinary challenges [1] Business and Ecosystem Strategy - DeepSeek adopts a "dual-track" strategy, continuing to offer API services while adjusting pricing and eliminating night discounts starting September 6 [2] - The base model and post-training versions of V3.1 have been open-sourced on Hugging Face and MoDa [2] Technical Specifications - V3.1 utilizes UE8M0 FP8 Scale parameter precision, aligning with the upcoming generation of domestic chips, which may require specific software adaptations for optimal performance [4] - The release appears to be a direct competitor to GPT-5, with both models supporting long contexts and complex task handling, while offering flexible base model calls and cost structures [4]
X @Bloomberg
Bloomberg· 2025-08-21 09:18
DeepSeek unveiled an update to an older model that it says surpasses the seminal R1 on key benchmarks, keeping the Chinese startup in the game while the industry awaits its next flagship offering https://t.co/sOrQY9ug5n ...
DeepSeek V3.1:价格涨了但Agent能力提升了
第一财经· 2025-08-21 09:09
2025.08. 21 本文字数:1056,阅读时长大约2分钟 作者 | 第一财 经 刘晓洁 8月21日,业界千呼万唤的R2模型没来,但DeepSeek官方正式发布了新模型V3.1。从命名来看这 或许不是一次大的版本更新,更像是前一代DeepSeek-V3模型的小版本迭代。 在X上,DeepSeek将V3.1称为"我们迈向智能体时代的第一步"(our first step toward the agent era)。本次升级主要有三大亮点,其中包括更强的 Agent能力、混合思考模式和更高的思考 效率。 官方表示,通过后训练优化,新模型在工具使用与智能体任务中的表现有较大提升。在编程智能体、 搜索智能体测评中, V3.1 相比之前的 DeepSeek 系列模型都有明显提高。 | Benchmarks | DeepSeek-V3.1 | DeepSeek- | DeepSeek- | | --- | --- | --- | --- | | | | V3-0324 | R1-0528 | | SWE-bench | 66.0 | 45.4 | 44.6 | | Verified | | | | | SWE-ben ...
X @外汇交易员
外汇交易员· 2025-08-21 08:45
DeepSeek在其官宣发布DeepSeek-V3.1的文章中提到,DeepSeek-V3.1使用了UE8M0 FP8 Scale的参数精度。另外,V3.1对分词器及chat template进行了较大调整,与DeepSeek-V3存在明显差异。DeepSeekg官方在置顶留言里表示,UE8M0 FP8是针对即将发布的下一代国产芯片设计。 https://t.co/ydxMxF53VL外汇交易员 (@myfxtrader):DeepSeek刚刚官宣V3.1模型。模型同时支持思考模式与非思考模式;思考效率提升,相比DeepSeek-R1-0528,DeepSeek-V3.1-Think能在更短时间内给出答案;Agent能力更强,通过Post-Training优化,新模型在工具使用与智能体任务中的表现有较大提升。 https://t.co/ajVjLgC4ri ...
华为登顶2025年《财富》中国科技50强 DeepSeek第二
Sou Hu Cai Jing· 2025-08-21 08:27
Core Insights - The 2025 Fortune China Tech 50 list was released, with Huawei Investment Holding Co., Ltd. ranking first due to its comprehensive leadership in communication, chips, operating systems, and artificial intelligence [1] - DeepSeek, focused on AI large model development, ranked second, showcasing significant advancements in the AI sector [1] Huawei - Huawei is recognized as a representative of Chinese tech companies, driving global communication and smart technology development [3] - In the 5G communication sector, Huawei holds a 15% share of essential patents, ranking first globally and providing key technological support for 5G network construction in multiple countries [3] - The Kirin 9020 chip has over 90% localization rate, with significant performance improvements; the Ascend AI processor has set global performance records in supporting large model inference [3] - The Harmony operating system has surpassed 10 million devices, with its ecosystem continuously expanding [3] - Huawei has made breakthroughs in optical technology, network architecture, and AI algorithms, facilitating technological upgrades across various industries [3] DeepSeek - DeepSeek is a leading enterprise in China's AI field, gaining global attention with its self-developed large model DeepSeek-R1, which scored 88.5 in the MMLU benchmark test, significantly outperforming international models like Llama 3 and Claude 2 [9] - The influence of DeepSeek in the open-source community is growing, with its model's global download volume ranking in the top ten [9] - As of June 2025, DeepSeek has reached 163 million monthly active users, becoming the largest AIGC application globally, reflecting China's strong market vitality and technological implementation capabilities in AI [9] Other Notable Companies - Other companies such as CATL, Alibaba, Tencent, and BYD have also made it to the top ranks due to their leading technological capabilities and market performance [10] - CATL continues to lead globally in power batteries and energy storage systems, achieving a 37.6% market share in global power battery installations in 2024, marking its seventh consecutive year in the top position [10]
DeepSeek-V3.1正式发布:混合推理,Agent能力大幅提高!概念股直线拉升
Mei Ri Jing Ji Xin Wen· 2025-08-21 08:27
Core Insights - DeepSeek officially released DeepSeek-V3.1 on August 21, featuring significant upgrades in its architecture and capabilities [1] Group 1: Product Upgrades - The new hybrid reasoning architecture allows a single model to support both thinking and non-thinking modes [1] - DeepSeek-V3.1-Think demonstrates improved efficiency, providing answers in a shorter time compared to DeepSeek-R1-0528 [1] - Enhanced agent capabilities have been achieved through post-training optimization, resulting in better performance in tool usage and agent tasks [1] Group 2: API and Pricing Changes - The DeepSeek API has been upgraded to include deepseek-chat for non-thinking mode and deepseek-reasoner for thinking mode, with context expanded to 128K [1] - A new pricing structure for API calls will be implemented starting September 6, 2025, with the cancellation of night-time discounts [2] - Until September 6, all API services will continue to be billed under the original pricing policy [4] Group 3: Technical Specifications - DeepSeek-V3.1 utilizes UE8M0 FP8 Scale parameter precision, designed for the upcoming generation of domestic chips [4] - The integration of DeepSeek-V3.1 capabilities into the Claude Code framework is facilitated by support for Anthropic API format [1]
DeepSeek:UE8M0 FP8是针对即将发布的下一代国产芯片设计
智通财经网· 2025-08-21 08:23
Core Insights - DeepSeek has released version 3.1, marking a significant step towards the "Agent Era" [1] - The new version utilizes UE8M0 FP8 Scale parameter precision, indicating advancements in technology [1] - There are notable adjustments in the tokenizer and chat template compared to the previous version, DeepSeek-V3 [1] - The UE8M0 FP8 is specifically designed for an upcoming next-generation domestic chip [1][2] Company Developments - The official webpage, app, mini-program, and API platform have all been updated to incorporate the new model [2] - Users have expressed anticipation for additional features, such as image and video functionality [2]