DeepSeek
DeepSeek-V3.1 officially released, with context length extended to 128K across the board
Di Yi Cai Jing· 2025-08-21 07:19
Core Insights
- DeepSeek has officially released the upgraded model DeepSeek-V3.1, which features a hybrid reasoning architecture that supports both thinking and non-thinking modes [1]
- The new model, DeepSeek-V3.1-Think, demonstrates improved thinking efficiency, providing answers in a shorter time compared to its predecessor DeepSeek-R1-0528 [1]
- Enhanced agent capabilities have been achieved through post-training optimization, significantly improving performance in tool usage and agent tasks [1]
Pricing Adjustments
- Starting from September 6, 2025, the pricing for API calls on the DeepSeek open platform will be adjusted according to a new price list, with the cancellation of night-time discounts [2]
- Until September 6, 2025, all API services will continue to be billed under the original pricing policy [4]
Official: DeepSeek-V3.1 released, with API prices as low as 0.5 yuan per million tokens
Xin Lang Ke Ji· 2025-08-21 07:05
Core Insights
- DeepSeek announced the release of DeepSeek-V3.1 and will adjust its API pricing effective September 6, 2025 [1][3]
- The new pricing structure sets input prices at 0.5 CNY per million tokens for cache hits and 4 CNY per million tokens for cache misses, with output priced at 12 CNY per million tokens [1]
Group 1: Upgrade Features
- The V3.1 upgrade introduces a hybrid reasoning architecture that supports both thinking and non-thinking modes within a single model [3]
- Enhanced thinking efficiency allows DeepSeek-V3.1-Think to provide answers in a shorter time compared to its predecessor, DeepSeek-R1-0528 [3]
- Improved agent capabilities through post-training optimization significantly enhance the model's performance in tool usage and agent tasks [3]
Group 2: User Experience
- The model behind the official app and web client has been upgraded to DeepSeek-V3.1, and users can switch freely between thinking and non-thinking modes via a "deep thinking" button [3]
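To make the price list above concrete, here is a minimal Python sketch that estimates the cost of a workload under the three per-million-token rates quoted in this summary (0.5, 4 and 12 CNY). The names `PriceList`, `V3_1_PRICES` and `estimate_cost_cny` are hypothetical and chosen for illustration; they are not part of any DeepSeek SDK, and the sketch assumes billing is a simple linear function of token counts.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class PriceList:
    """Prices in CNY per million tokens (hypothetical helper type)."""
    input_cache_hit: float
    input_cache_miss: float
    output: float


# Rates as quoted in the summary above, effective September 6, 2025.
V3_1_PRICES = PriceList(input_cache_hit=0.5, input_cache_miss=4.0, output=12.0)

TOKENS_PER_MILLION = 1_000_000


def estimate_cost_cny(hit_tokens: int, miss_tokens: int, output_tokens: int,
                      prices: PriceList = V3_1_PRICES) -> float:
    """Estimate total cost in CNY, assuming purely linear per-token billing."""
    total = (hit_tokens * prices.input_cache_hit
             + miss_tokens * prices.input_cache_miss
             + output_tokens * prices.output)
    return total / TOKENS_PER_MILLION


if __name__ == "__main__":
    # 2M cache-hit input tokens, 1M cache-miss input tokens, 0.5M output tokens.
    print(estimate_cost_cny(2_000_000, 1_000_000, 500_000))
```

In this example the three components contribute 1, 4 and 6 CNY respectively, which shows how output tokens can dominate the bill even when they are the smallest share of traffic.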
DeepSeek-V3.1 released
Core Insights
- DeepSeek has officially released DeepSeek-V3.1, which includes significant upgrades in its architecture and performance [1]
Group 1: Key Features of DeepSeek-V3.1
- Hybrid reasoning architecture: A single model supports both thinking and non-thinking modes [1]
- Enhanced thinking efficiency: DeepSeek-V3.1-Think can provide answers in a shorter time compared to DeepSeek-R1-0528 [1]
- Improved agent capabilities: The new model shows significant improvements in tool usage and agent tasks through post-training optimization [1]
X @外汇交易员
外汇交易员· 2025-08-21 06:57
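According to DeepSeek's newly published V3.1 API pricing, input and output prices will rise from V3's 2 yuan and 8 yuan per million tokens to 4 yuan and 12 yuan per million tokens respectively, and the night-time discount will be cancelled. The adjustment takes effect on September 6. https://t.co/BvDHFZKQms
外汇交易员 (@myfxtrader): DeepSeek has just officially announced the V3.1 model. The model supports both thinking and non-thinking modes. Thinking efficiency is improved: compared with DeepSeek-R1-0528, DeepSeek-V3.1-Think gives answers in less time. Agent capabilities are stronger: through post-training optimization, the new model performs considerably better in tool use and agent tasks. https://t.co/ajVjLgC4ri ...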
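The size of the adjustment quoted in the post can be worked out directly. The short Python sketch below computes the relative increase for the input and output prices cited there (2 to 4 yuan and 8 to 12 yuan per million tokens); `percent_increase` is a hypothetical helper written for this illustration.

```python
def percent_increase(old: float, new: float) -> float:
    """Relative change from old to new, expressed as a percentage."""
    return (new - old) / old * 100.0


# Prices in yuan per million tokens, as quoted in the post above.
input_change = percent_increase(2.0, 4.0)    # input price: 2 -> 4
output_change = percent_increase(8.0, 12.0)  # output price: 8 -> 12

if __name__ == "__main__":
    print(f"input: +{input_change:.0f}%, output: +{output_change:.0f}%")
```

On these figures the input price doubles while the output price rises by half, before accounting for the removal of the night-time discount.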
X @外汇交易员
外汇交易员· 2025-08-21 06:51
DeepSeek has just officially announced the V3.1 model. The model supports both thinking and non-thinking modes. Thinking efficiency is improved: compared with DeepSeek-R1-0528, DeepSeek-V3.1-Think gives answers in less time. Agent capabilities are stronger: through post-training optimization, the new model performs considerably better in tool use and agent tasks. https://t.co/ajVjLgC4ri
DeepSeek (@deepseek_ai): Introducing DeepSeek-V3.1: our first step toward the agent era! 🚀
🧠 Hybrid inference: Think & Non-Think — one model, two modes
⚡️ Faster thinking: DeepSeek-V3.1-Think reaches answers in less time vs. DeepSeek-R1-0528
🛠️ Stronger agent skills: Post-training boosts tool use and ...
DeepSeek-V3.1 officially released: the first step toward the agent era
Hua Er Jie Jian Wen· 2025-08-21 06:39
Group 1
- DeepSeek officially released DeepSeek-V3.1, featuring a hybrid reasoning architecture that supports both thinking and non-thinking modes [1]
- The new version, DeepSeek-V3.1-Think, offers higher thinking efficiency, providing answers in a shorter time compared to DeepSeek-R1-0528 [1]
- Enhanced agent capabilities have been achieved through post-training optimization, significantly improving performance in tool usage and agent tasks [1]
Group 2
- Starting from September 6, 2025, the pricing for API calls on the DeepSeek open platform will be adjusted, with input costs set at 0.5 yuan per million tokens (cache hit) and 4 yuan per million tokens (cache miss), while output costs will be 12 yuan per million tokens [1]
DeepSeek-V3.1 officially released
Di Yi Cai Jing· 2025-08-21 06:37
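According to DeepSeek's official WeChat account, DeepSeek-V3.1 has been officially released.
The upgrade includes the following main changes:
- Hybrid reasoning architecture: one model supports both thinking and non-thinking modes
- Higher thinking efficiency: compared with DeepSeek-R1-0528, DeepSeek-V3.1-Think gives answers in less time
- Stronger agent capabilities: through post-training optimization, the new model performs considerably better in tool use and agent tasks
The models in the official app and on the web have been upgraded to DeepSeek-V3.1 at the same time. Users can switch freely between thinking and non-thinking modes with the "Deep Thinking" button.
(Source: Di Yi Cai Jing) ...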
DeepSeek and Unitree Robotics named to the 2025 Fortune China Tech 50 list
Feng Huang Wang· 2025-08-21 05:21
Core Insights
- The "Fortune China Top 50 Technology Companies" list was released, featuring companies like Huawei, DeepSeek, and Unitree Robotics (Yushu Technology) [1]
Group 1: DeepSeek
- DeepSeek is recognized as a leading AI large model product in China, with its DeepSeek-R1 model scoring 88.5 on the MMLU benchmark test, which is lower than OpenAI's GPT-4 (92.0) and Google's Gemini Pro (90.0), but higher than Meta's Llama 3 (82.0) and Anthropic's Claude 2 (85.1) [1]
- DeepSeek ranks among the top 10 globally in terms of open-source large model downloads, indicating strong market presence [1]
- As of June 2025, DeepSeek is projected to have 163 million monthly active users, making it the leading application in AI-generated content globally [1]
Group 2: Unitree Robotics
- In 2024, Unitree Robotics achieved global sales of 18,000 quadruped robots, capturing a 23% market share, ranking second only to Boston Dynamics [1]
- Unitree Robotics was awarded the WIPO 2025 Global Award, distinguishing it as the only representative from China among 780 applicants from 95 countries and regions [1]
- The company's success is attributed to innovations in robotic motion control, high-performance joint motors, and real-time systems, along with a comprehensive global intellectual property strategy [1]
DeepSeek has updated again, and all eyes are on Liang Wenfeng for a showstopper
Hu Xiu· 2025-08-21 02:28
Core Insights
- DeepSeek has released an updated version of its model, V3.1, which shows significant improvements in context length and user interaction, although it is not the highly anticipated R2 model [2][4][14]
- The model now supports a context length of 128K, enhancing its ability to handle longer texts and improving its programming capabilities [5][10]
- The update merges the functionalities of V3 and R1, leading to reduced deployment costs and improved efficiency [13][25]
Group 1: Model Improvements
- The new V3.1 model has a parameter count of 685 billion, showing only a slight increase from the previous version, V3, which had 671 billion parameters [7]
- User experience has been enhanced with more natural language responses and the use of tables for information presentation [8][10]
- The programming capabilities of V3.1 have been validated through tests, achieving a score of 71.6% in multi-language programming, outperforming Claude 4 Opus [10]
Group 2: Market Context
- The release of V3.1 comes seven months after the launch of R1, during which time other major companies have also released new models, using R1 as a benchmark [3][16]
- Despite the improvements in V3.1, the industry is still eagerly awaiting the release of the R2 model, which has not been announced [4][20]
- The competitive landscape includes companies like Alibaba and ByteDance, which have launched models that claim to surpass DeepSeek R1 in various metrics [17][19]
Group 3: Future Outlook
- There are indications that the merging of V3 and R1 may be a preparatory step for the release of a multi-modal model [25]
- Industry insiders suggest that the focus will shift towards innovations in economic viability and usability for future models [24]
- The absence of the R2 model in the current update has heightened expectations for its eventual release, with speculation that it may not arrive until later [21][22]
Foreign media: Chinese companies still have to rely on NVIDIA
半导体行业观察· 2025-08-21 01:12
Core Viewpoint
- The article discusses the implications of the U.S. allowing NVIDIA's key AI chips to return to China, highlighting the complex dynamics between U.S.-China trade negotiations and China's AI ambitions [1][2].
Group 1: U.S.-China Relations and NVIDIA
- The U.S. has permitted the sale of H20 chips to China, which is crucial for China's AI development, while China is leveraging this in trade negotiations [1].
- Despite the U.S. announcement, there are concerns in China regarding potential security risks associated with NVIDIA's chips, leading to warnings from state media [1][2].
- The U.S. Treasury Secretary indicated that China's reaction reflects concerns about NVIDIA chips becoming a standard in China, suggesting a deeper anxiety about technological dominance [1][2].
Group 2: China's AI Industry and Domestic Alternatives
- Chinese companies are still eager to purchase H20 chips despite warnings about potential backdoors, indicating a strong reliance on NVIDIA's technology [2].
- Domestic alternatives to NVIDIA's products are not yet capable of matching the performance or production levels required for AI development, as evidenced by delays in projects like DeepSeek's new model [2].
- The Chinese government is aware of the need for domestic chips but faces challenges in achieving the desired technological capabilities [2].
Group 3: Financial Implications and Security Concerns
- President Trump's announcement that NVIDIA would pay 15% of its AI chip sales revenue in China raises questions about the transactional nature of national security concerns [3].
- This payment structure could provoke strong reactions globally, emphasizing the intertwining of trade and security in the semiconductor industry [3].