Seek .(SKLTY)
DeepSeek Releases the "Ultimate Edition" of V3.1
Mei Ri Jing Ji Xin Wen· 2025-09-23 01:22
Core Insights
- DeepSeek announced that its model has been updated to DeepSeek-V3.1-Terminus, which preserves the model's existing capabilities while addressing user feedback [1]
Improvements
- The update focuses on two main areas:
  - Language consistency: fewer cases of mixed Chinese and English text and of occasional abnormal characters [1]
  - Agent performance: improved Code Agent and Search Agent behavior [1]
Just Now, DeepSeek Released Its "Ultimate Edition"
36Kr · 2025-09-23 00:54
Core Insights
- DeepSeek has released an updated model, DeepSeek-V3.1-Terminus, which improves on the previous version by enhancing language consistency and fixing bugs that caused unexpected character output [1][7][20]
- The model has been open-sourced, allowing broader access and potential community contributions [1][7]
Performance Improvements
- Benchmark tests show relative gains of 0.2% to 36.5% for DeepSeek-V3.1-Terminus over DeepSeek-V3.1, with the largest gain on Humanity's Last Exam (HLE), a test of high-level knowledge and reasoning [3][5]
- In non-Agent evaluations, the MMLU-Pro score rose from 84.8 to 85.0, and the HLE score rose from 15.9 to 21.7 [5]
Bug Fixes
- The previous version had a significant bug in which the model would output random characters; the new version resolves it [7][8]
- Problems with multilingual output, particularly when translating less widely used languages, have also been addressed, resulting in more coherent translations [9][10]
Enhanced Capabilities
- The model shows improved programming and search capabilities: it successfully simulated physical effects in programming tasks and gave comprehensive plant-care recommendations based on specific criteria [13][17]
- Its ability to cross-verify information and present it in a readable format is also highlighted as a significant improvement [17]
Future Outlook
- The name "Terminus" suggests that this version may be the culmination of DeepSeek's current technological path, although further updates, including an Agent model, are anticipated by the end of the year [20][21]
DeepSeek Online Model Upgraded to V3.1-Terminus! Compute and Application Sectors May See a Revaluation (Concept Stocks Attached)
Zhi Tong Cai Jing· 2025-09-22 23:37
Core Insights
- DeepSeek has officially upgraded its model to DeepSeek-V3.1-Terminus, enhancing performance based on user feedback, particularly in language consistency and agent capabilities [1][2]
- The new model shows improved stability in output, with benchmark results indicating significant performance gains across various assessments [1]
- The release of DeepSeek-V3.1 is seen as a breakthrough for domestic large models and chip ecosystems, addressing compatibility issues with NVIDIA's FP8 standard [2][3]
Model Performance
- The benchmark results for DeepSeek-V3.1-Terminus compared to its predecessor are as follows:
  - MMLU-Pro: 85.0 (up from 84.8)
  - GPQA-Diamond: 80.7 (up from 80.1)
  - Humanity's Last Exam: 21.7 (up from 15.9)
  - BrowseComp: 38.5 (up from 30.0)
  - SimpleQA: 96.8 (up from 93.4)
  - SWE Verified: 68.4 (up from 66.0) [1]
Industry Impact
- The launch of DeepSeek-V3.1 has significantly boosted the domestic computing industry, with expectations for increased applications of domestic AI chips in training and inference [3][4]
- The success of DeepSeek is viewed as a victory for open-source models, prompting other Chinese companies to adopt similar open-source strategies [3]
- The AI computing demand is projected to grow, benefiting various segments of the computing supply chain, including AI chips and servers [4]
Related Developments
- DeepSeek's research paper on the R1 reasoning model has been featured on the cover of the prestigious journal Nature, marking a significant achievement in the field [2]
- Other companies in the industry, such as Baidu and Alibaba, are also advancing their models, with Baidu's Wenxin model showing a 34.8% improvement in factual accuracy [6] and Alibaba launching its Qwen3-Max-Preview model [6]
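To put the benchmark figures listed above on a common footing, the short Python sketch below converts each pair of scores into an absolute point gain and a relative gain in percent. The scores are copied from the list; the names `SCORES`, `relative_gain_pct`, and `gains` are illustrative and do not come from DeepSeek or from the cited article.

```python
# Scores copied from the benchmark list above:
# (DeepSeek-V3.1, DeepSeek-V3.1-Terminus).
SCORES = {
    "MMLU-Pro": (84.8, 85.0),
    "GPQA-Diamond": (80.1, 80.7),
    "Humanity's Last Exam": (15.9, 21.7),
    "BrowseComp": (30.0, 38.5),
    "SimpleQA": (93.4, 96.8),
    "SWE Verified": (66.0, 68.4),
}


def relative_gain_pct(old: float, new: float) -> float:
    """Relative improvement of `new` over `old`, in percent."""
    return (new - old) / old * 100.0


def gains(scores: dict) -> dict:
    """Map each benchmark name to its relative gain in percent."""
    return {name: relative_gain_pct(old, new) for name, (old, new) in scores.items()}


if __name__ == "__main__":
    # Print benchmarks from smallest to largest relative gain.
    for name, pct in sorted(gains(SCORES).items(), key=lambda kv: kv[1]):
        old, new = SCORES[name]
        print(f"{name:22s} {old:5.1f} -> {new:5.1f}  "
              f"(+{new - old:.1f} points, +{pct:.1f}% relative)")
```

Under this definition the smallest relative gain is about 0.2% (MMLU-Pro) and the largest about 36.5% (Humanity's Last Exam), which suggests that the "0.2% to 36.5%" range quoted in one of the summaries above refers to relative rather than absolute improvement.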
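Shanghai Securities News Morning Brief | Central Bank Acts Again; DeepSeek's Latest Upgrade; Two Ministries Issue Guidance on Industrial Park Development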
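Today's Highlights
- The Ministry of Industry and Information Technology and the National Development and Reform Commission have issued the "Guidelines for the High-Quality Development of Industrial Parks," which call for faster construction of green facilities in industrial parks and greater development and use of new-energy infrastructure such as rooftop photovoltaics, distributed wind power, diversified energy storage, and charging piles.
Shanghai Securities News Selections
- The General Administration of Sport recently issued the "Guiding Opinions on Promoting the High-Quality Development of Exercise for Health," which aim to improve the public service system for national fitness, promote the high-quality development of exercise for health, accelerate the deep integration of national fitness and national health, and better meet the public's demand for exercise that promotes health.
- The central bank announced on September 22 that it had conducted 240.5 billion yuan of 7-day reverse repo operations through fixed-rate, quantity-based bidding, and 300 billion yuan of 14-day reverse repo operations through fixed-quantity, rate-based bidding with multiple winning prices. This was the central bank's first 14-day reverse repo operation in 8 months, and its first since adjusting the operation's format.
- On the evening of September 22, the DeepSeek assistant said in the official community that the DeepSeek online model has been upgraded and that the current version is DeepSeek-V3.1-Terminus. The update preserves the model's existing capabilities while addressing issues raised in user feedback, mainly including the following ...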
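Hong Kong Stock Concept Tracker | DeepSeek Online Model Upgraded to V3.1-Terminus! Compute and Application Sectors May See a Revaluation (Concept Stocks Attached)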
Zhi Tong Cai Jing Wang· 2025-09-22 23:27
Core Insights
- DeepSeek has officially upgraded its model to DeepSeek-V3.1-Terminus, enhancing performance based on user feedback and improving language consistency and agent capabilities [1][2]
- The new model shows improved stability in output, with benchmark results indicating performance increases in various assessments compared to the previous version [1]
- The release of DeepSeek-V3.1 is seen as a significant breakthrough for domestic large models and chip ecosystems, reducing reliance on NVIDIA standards and promoting domestic computing power autonomy [2][3]
Model Performance
- The benchmark results for DeepSeek-V3.1-Terminus show improvements in several areas, including:
  - MMLU-Pro: 84.8 to 85.0
  - Humanity's Last Exam: 15.9 to 21.7
  - SimpleQA: 93.4 to 96.8
  - BrowseComp: 30.0 to 38.5 [1]
- The model's agent capabilities have significantly improved, which is expected to enhance commercial applications of AI agents [3]
Industry Impact
- The launch of DeepSeek-V3.1 has led to a surge in the domestic computing industry, with increased demand for AI chips and related infrastructure [3][4]
- The success of DeepSeek is viewed as a victory for open-source models, prompting other Chinese companies to adopt similar open-source strategies [3]
- The AI computing demand is projected to grow, benefiting various segments of the computing industry, including AI chips, servers, and related technologies [4]
Related Companies
- Baidu has released its Wenxin model X1.1, showing significant improvements in performance metrics compared to previous versions and competing models [6]
- Alibaba's Tongyi Qianwen has launched the Qwen3-Max-Preview model, marking advancements in the domestic large model sector [6]
- SenseTime's new interactive platform integrates with Xiaomi AI glasses, showcasing the application of AI in real-world scenarios [7]
- ZTE has introduced several products focused on AI and intelligent computing, facilitating the deployment of DeepSeek models across various industries [7]
DeepSeek-V3.1 Version Update
Di Yi Cai Jing· 2025-09-22 13:45
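DeepSeek-V3.1 has now been updated to DeepSeek-V3.1-Terminus. According to the official public account, the update preserves the model's existing capabilities while addressing issues raised in user feedback, including: language consistency, with fewer cases of mixed Chinese and English text and occasional abnormal characters; and Agent capabilities, with further optimization of Code Agent and Search Agent performance. ...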
DeepSeek Officially Announces Online Model Upgrade, Version DeepSeek-V3.1-Terminus
Xin Lang Ke Ji· 2025-09-22 12:06
Core Insights
- DeepSeek has announced an upgrade to its online model, now at version DeepSeek-V3.1-Terminus, which includes both thinking and non-thinking modes [1]
- The model supports a context length of 128k, enhancing user experience by allowing for more extensive interactions [1]
- Users can now experience the upgraded model online, indicating a focus on accessibility and user engagement [1]
DeepSeek Officially Announces Online Model Upgrade, Version DeepSeek-V3.1-Terminus
Xin Lang Ke Ji· 2025-09-22 11:59
Core Insights
- DeepSeek has announced the upgrade of its online model to version DeepSeek-V3.1-Terminus, which includes both a thinking mode and a non-thinking mode [2]
Group 1: Model Features
- The context length for both modes is 128k [2]
- The non-thinking mode has a default output length of 4K and a maximum of 8K, while the thinking mode has a default output length of 32K and a maximum of 64K [2]
Group 2: Pricing Structure
- Input costs 0.5 yuan per million tokens on a cache hit and 4 yuan per million tokens on a cache miss [2]
- Output costs 12 yuan per million tokens [2]
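As a worked example of the pricing above, the Python sketch below estimates the cost of a request from its token counts. It assumes billing is strictly linear in tokens, with no rounding, minimum charge, or discount, which the article does not state. The function name `estimate_cost_yuan` and the sample token counts are hypothetical.

```python
from decimal import Decimal

# Prices in yuan per one million tokens, as quoted above.
PRICE_INPUT_CACHE_HIT = Decimal("0.5")
PRICE_INPUT_CACHE_MISS = Decimal("4")
PRICE_OUTPUT = Decimal("12")
TOKENS_PER_PRICE_UNIT = Decimal(1_000_000)


def estimate_cost_yuan(cache_hit_tokens: int,
                       cache_miss_tokens: int,
                       output_tokens: int) -> Decimal:
    """Estimate the cost in yuan, assuming strictly linear per-token billing.

    Assumption (not from the article): no rounding, minimum fee, or discount.
    """
    if min(cache_hit_tokens, cache_miss_tokens, output_tokens) < 0:
        raise ValueError("token counts must be non-negative")
    weighted_tokens = (
        cache_hit_tokens * PRICE_INPUT_CACHE_HIT
        + cache_miss_tokens * PRICE_INPUT_CACHE_MISS
        + output_tokens * PRICE_OUTPUT
    )
    return weighted_tokens / TOKENS_PER_PRICE_UNIT


if __name__ == "__main__":
    # Hypothetical workload: 200k cached input, 800k uncached input, 100k output.
    total = estimate_cost_yuan(200_000, 800_000, 100_000)
    print(f"Estimated cost: {total} yuan")
```

For the hypothetical workload in the example, the three components are 0.1, 3.2, and 1.2 yuan, for a total of 4.5 yuan. `Decimal` is used instead of `float` so that currency amounts are not affected by binary rounding.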
DeepSeek Has Finally Filled This Gap
Xin Lang Cai Jing· 2025-09-21 06:26
Core Insights
- DeepSeek has reached a significant milestone: its research paper on the DeepSeek-R1 reasoning model was published in the prestigious journal Nature, a breakthrough in independent peer review of large models [1]
- The paper details the model's training methods and data sources, emphasizing transparency and reproducibility in an AI industry that has been criticized for its "black box" nature since the rise of ChatGPT [1]
- DeepSeek's commitment to open-source technology has contributed to its success, with the model downloaded over 10.9 million times on the HuggingFace platform since its release [1]
Industry Impact
- DeepSeek is actively applying its technology in verticals such as medical consultation and industrial quality inspection, showcasing the potential of AI to enhance production and daily life [1]
- The company exemplifies China's path to innovation, demonstrating that technological advancement thrives in an open and inclusive ecosystem [1]
- Amid rising protectionism and unilateralism globally, China is pursuing its own path in scientific innovation while advocating open collaboration to keep pace with technological development [1]
Jinsha River Venture Capital's Zhu Xiaohu: Everyone Has Underestimated DeepSeek's Influence
Xin Lang Ke Ji· 2025-09-20 02:26
Core Insights
- DeepSeek's influence is underestimated, according to Zhu Xiaohu, a managing partner at Jinsha River Venture Capital [1]
- The future of AI development will not be controlled by a few private companies or proprietary models; it will instead be characterized by an open-source, open AI ecosystem, which is crucial for humanity [3]
Group 1
- DeepSeek's impact on the AI landscape is significant and should not be overlooked [1]
- The evolution of AI will lead to a more democratized and accessible ecosystem, moving away from privatization [3]