DeepSeek
Another Boost for Domestic Chips! Zhipu Releases New-Generation Large Model, Fully Adapted to Cambricon and Moore Threads Chips!
Zheng Quan Shi Bao· 2025-09-30 09:24
Core Insights
- The release of the new-generation large model GLM-4.6 by the domestic unicorn company Zhipu marks a significant advancement in programming capabilities, surpassing the latest model DeepSeek-V3.2-Exp and aligning with global leader Claude Sonnet 4 in various benchmarks [2][3][5]
Model Performance
- GLM-4.6 has achieved substantial improvements in core capabilities such as Agentic Coding, long-context processing, reasoning ability, information retrieval, text generation, and intelligent agent applications [3][4]
- The context window has been increased from 128K to 200K, allowing for better handling of longer code and agent tasks [3]
- The model's reasoning ability has been enhanced, and it now supports tool invocation during reasoning [3]
- In practical programming tasks, GLM-4.6 outperformed Claude Sonnet 4 in 74 real-world scenarios [3]
Token Efficiency and User Experience
- GLM-4.6 has improved token efficiency, consuming over 30% fewer tokens than GLM-4.5 for similar tasks [4]
- The model improves the usability of generated presentations and the aesthetic quality of front-end code, with better layout design [4]
Open Source and Ecosystem Development
- GLM-4.6 is set to be open-sourced on platforms like Hugging Face and ModelScope under a permissive MIT license, positioning it as one of the strongest general-purpose models in the global open-source ecosystem [5]
- The model has been adapted for use on domestic AI chips from companies like Cambricon and Moore Threads, facilitating a collaborative ecosystem between domestic large models and chips [5][6]
Industry Collaboration and Future Prospects
- The rapid adaptation of GLM-4.6 by domestic chip manufacturers indicates a deepening collaboration within China's AI industry, moving towards a unified ecosystem of software and hardware [6]
- Zhipu has initiated A-share listing guidance, aiming to become the first publicly listed company focused on domestic AI large models, signaling a shift from technological competition to commercialization and capital operation [6]
Big Move Before National Day! DeepSeek's New Model Triples Speed, API Prices Cut in Half! Netizens Joke: So Much for the Holiday
程序员的那些事· 2025-09-30 08:45
Core Viewpoint
- DeepSeek has released its experimental version DeepSeek-V3.2-Exp, which significantly improves long-text training and inference efficiency while maintaining output quality compared to its predecessor V3.1-Terminus [5][6].
Group 1: Model Performance
- DeepSeek-V3.2-Exp introduces DeepSeek Sparse Attention (DSA), achieving a 2-3 times increase in long-text inference speed and a 30%-40% reduction in memory usage, along with a 50% improvement in training efficiency [5].
- In benchmark tests, DeepSeek-V3.2-Exp performs comparably to V3.1-Terminus, with scores of 85.0 in MMLU-Pro and a slight improvement in AIME 2025, scoring 89.3 compared to 88.4 [5][6].
Group 2: Pricing Adjustments
- Due to the reduced service costs associated with the new model, DeepSeek has lowered its API pricing by over 50%, with input prices dropping from 0.5 yuan to 0.2 yuan per million tokens for cache hits, and from 4 yuan to 2 yuan for cache misses. Output prices have decreased from 12 yuan to 3 yuan per million tokens [7].
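The pricing figures quoted above can be checked with a little arithmetic. The sketch below is illustrative only: the per-million-token prices come from the summary, while the helper names (`reduction_pct`, `workload_cost`) and the sample workload are assumptions made for the example, not part of any DeepSeek API.

```python
# Prices in yuan per million tokens, as quoted in the summary above.
OLD_PRICES = {"input_cache_hit": 0.5, "input_cache_miss": 4.0, "output": 12.0}
NEW_PRICES = {"input_cache_hit": 0.2, "input_cache_miss": 2.0, "output": 3.0}


def reduction_pct(old: float, new: float) -> float:
    """Percentage price reduction when moving from `old` to `new`."""
    return round((old - new) / old * 100, 1)


def workload_cost(prices: dict, hit_m: float, miss_m: float, out_m: float) -> float:
    """Cost in yuan for a workload expressed in millions of tokens."""
    return round(
        prices["input_cache_hit"] * hit_m
        + prices["input_cache_miss"] * miss_m
        + prices["output"] * out_m,
        2,
    )


# Per-category reductions implied by the quoted prices.
reductions = {k: reduction_pct(OLD_PRICES[k], NEW_PRICES[k]) for k in OLD_PRICES}

# Hypothetical workload (an assumption for illustration):
# 10M cached input tokens, 5M uncached input tokens, 2M output tokens.
old_cost = workload_cost(OLD_PRICES, 10, 5, 2)
new_cost = workload_cost(NEW_PRICES, 10, 5, 2)

print(reductions)
print(old_cost, new_cost)
```

On these figures the reductions work out to 60% for cache-hit input, 50% for cache-miss input, and 75% for output. This is consistent with both the "over 50%" headline figure and the 75% figure that some reports cite for output pricing.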
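Late-Night Bombshell: Claude Sonnet 4.5 Goes Live, Codes Autonomously for 30 Hours; User Test: One Call Refactored a Codebase and Added 3,000 Lines of Code, but It Failed to Run
36Kr· 2025-09-30 08:43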
Core Insights
- Anthropic has launched Claude Sonnet 4.5, claiming it to be the "best coding model in the world", with significant improvements over earlier models such as Opus 4 [1][2].
Performance Enhancements
- Claude Sonnet 4.5 can run autonomously for over 30 hours on complex multi-step tasks, a substantial increase from the 7 hours of Opus 4 [2].
- In the OSWorld evaluation, Sonnet 4.5 achieved a score of 61.4%, up from Sonnet 4's 42.2%, indicating a marked improvement in computer operation capabilities [4].
- The model outperformed competitors like GPT-5 and Gemini 2.5 Pro in various tests, including Agentic Coding and Agentic Tool Use [6][7].
Safety and Alignment
- Claude Sonnet 4.5 is touted as the most "aligned" model to date, having undergone extensive safety training to mitigate issues like "hallucination" and "deception" [9][10].
- It has been released under AI Safety Level 3 (ASL-3), with protective measures against dangerous inputs and outputs, particularly in sensitive areas like CBRN [12].
Developer Tools and Features
- The update includes a native VS Code plugin for Claude Code, allowing real-time code modification tracking and inline diffs [13].
- A new checkpoint feature enables developers to save code states automatically, facilitating easier exploration and iteration during complex tasks [18].
- The Claude API has been enhanced with context editing and memory tools, enabling the handling of longer and more complex tasks [20].
Market Response and Competition
- Developers have expressed surprise at the capabilities of Claude Sonnet 4.5, with reports of it autonomously generating complete projects [21][22].
- The competitive landscape is intensifying, with other companies like DeepSeek also releasing new models that significantly reduce inference costs [29][32].
Sci-Tech Innovation AI ETF (588730) Rises 3.14% as DeepSeek and Cambricon Simultaneously Announce Major Updates
Ge Long Hui· 2025-09-30 07:39
Core Insights
- The semiconductor and AI sectors are experiencing significant growth, with the Sci-Tech Innovation AI ETF rising by 3.14% and reaching a historical net asset value high, driven by strong performances from key stocks like Cambricon and Lattice Power [1]
Group 1: Market Performance
- On the last trading day before the holiday, the chip and AI sectors led the market, with Lattice Technology increasing over 7% [1]
- The Sci-Tech Innovation AI ETF, which tracks the Shanghai Stock Exchange Sci-Tech Innovation Board AI Index, has a semiconductor weight of 54.1%, with the top three holdings being Cambricon (16.62%), Lattice Technology (10%), and Chip Original [1]
Group 2: Fund Inflows
- There has been a significant inflow of funds into the Sci-Tech Innovation AI ETF, with a net inflow of 114 million yuan over the past five days, bringing the total fund size to 1.747 billion yuan [1]
Group 3: Industry Developments
- DeepSeek announced updates to its official app and services, reducing API costs by over 50%, which is expected to enhance developer engagement [1]
- Several domestic chip manufacturers have completed adaptations for DeepSeek-V3.2-Exp, with Cambricon announcing simultaneous adaptation of the new model and the open-sourcing of its large model inference engine [2]
- Tencent has launched and open-sourced its native multimodal image generation model, HunyuanImage 3.0, which has a parameter scale of 80 billion, marking a significant advancement in the industry [2]
- Huaxin Securities has expressed optimism about the domestic AI chip industry, highlighting the complete integration of the AI industry chain from advanced processes to model acceleration by major companies like ByteDance, Alibaba, and Tencent [2]
PPIO First to Launch DeepSeek-V3.2-Exp
Zheng Quan Ri Bao Wang· 2025-09-30 06:17
Group 1
- DeepSeek has launched a new experimental model version, DeepSeek-V3.2-Exp, which incorporates the "DeepSeek Sparse Attention" mechanism to enhance training and inference efficiency in long-context scenarios [1]
- The new architecture of DeepSeek-V3.2-Exp has allowed a significant reduction in API pricing, with costs dropping by as much as 75%, making the DeepSeek API more affordable for developers [1]
- The PPIO platform offers high-performance API services and features a variety of open-source models, achieving the top rank in throughput tests for DeepSeek-R1-0528 according to the "2025 Large Model Service Performance Ranking" [2]
Group 2
- PPIO achieved a more than tenfold reduction in large-model inference costs through its practices in 2024, dynamically balancing inference efficiency and resource usage [2]
X @Bloomberg
Bloomberg· 2025-09-30 05:30
RT Saritha Rai (@SarithaRai) DeepSeek debuts "DeepSeek Sparse Attention" next-gen architecture in experimental version of model (Native Sparse Attention paper by DeepSeek founder Liang Wenfeng & others won the ACL 2025 Best Paper award) Gift link (Free to read until Oct 7) https://t.co/EeZFpsm8bA #AI https://t.co/7ye5aImbcg ...
Huahong Semiconductor Surges Over 15%; Sci-Tech Chip ETF Index and Sci-Tech Chip ETFs Rise Over 2%
Ge Long Hui APP· 2025-09-30 05:10
Group 1: Semiconductor Stocks Performance
- Semiconductor stocks continue to rise strongly, with Huahong Semiconductor increasing over 15% and reaching a new historical high, while leading company SMIC rose by 2.88%, also hitting a historical high [1]
- Various semiconductor ETFs, including the Fortune and Guotai ETFs, saw gains of over 2% [1]
Group 2: ETF Performance Details
- The Fortune Sci-Tech Chip ETF (588810) rose by 2.96% with a 5-day increase of 8.32% and an estimated scale of 577 million [2]
- The Guotai Sci-Tech Chip ETF (589100) increased by 2.87% with a 5-day increase of 8.34% and an estimated scale of 641 million [2]
- The top ten weighted stocks in the Sci-Tech Chip ETF include Cambricon, Haiguang Information, SMIC, and others, focusing on semiconductor materials, equipment, design, manufacturing, packaging, and testing [2]
Group 3: AI Chip Industry Developments
- DeepSeek announced a significant update to its services, reducing API costs by over 50%; several domestic chip manufacturers have adapted their hardware to the new model [3]
- Analysts from Huaxin Securities express optimism about the domestic AI chip industry, highlighting a complete industry chain from advanced processes to model acceleration by major companies [3]
- Zhongyin Securities notes that the commercialization of AI applications is accelerating, leading to increased demand for computing power in the domestic market [3]
Group 4: New AI Models and Market Trends
- Anthropic launched a new large model, Claude Sonnet 4.5, capable of running autonomously for 30 hours and excelling in cybersecurity and financial services [4]
- Tencent released and open-sourced its native multimodal image model, HunyuanImage 3.0, with a parameter scale of 80 billion [4]
- TrendForce predicts a shift in AI infrastructure focus towards efficient inference services, with increasing demand for Nearline SSDs due to shortages in traditional HDDs [4]
Two Chip Leaders Worth Over 100 Billion Yuan Hit All-Time Highs
Group 1: Market Performance
- The non-ferrous metals, storage chips, and AI application sectors led the market gains today, with significant increases in key stocks such as Luoyang Molybdenum, Huayou Cobalt, Jiangxi Copper, and Northern Rare Earth [1][2]
- The Shanghai Composite Index rose by 0.4%, the Shenzhen Component Index increased by 0.31%, and the ChiNext Index saw a slight rise of 0.06% [1]
Group 2: Non-Ferrous Metals Sector
- The non-ferrous metals sector experienced a strong rally, driven by energy metals, industrial metals, and minor metals [2]
- Key catalysts for this sector include a policy plan from the Ministry of Industry and Information Technology aiming for an average annual growth of 5% in the non-ferrous metals industry from 2025 to 2026, and a projected annual growth of 1.5% in the production of ten non-ferrous metals [4]
- The Federal Reserve's recent interest rate cut has led to expectations of a new round of easing, further supporting the sector [4]
- Supply-side disruptions have also contributed to the sector's performance, with a focus on copper and aluminum as key investment opportunities [5]
Group 3: AI Applications Sector
- The AI applications sector showed strong performance, with stocks like Danghong Technology and Yidian Tianxia experiencing significant gains [7]
- The Sora concept and AI corpus sectors also saw increases, reflecting growing interest and investment in AI technologies [8]
- Recent developments in AI models, such as the release of DeepSeek-V3.2-Exp and Anthropic's Claude Sonnet 4.5, indicate ongoing advancements in the field [9][10]
Sci-Tech 50 ETF (588000) Opens Higher and Climbs 1.75% in Morning Trading, Drawing 1.116 Billion Yuan Over the Past Three Days
Mei Ri Jing Ji Xin Wen· 2025-09-30 04:31
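In early trading on September 30, the three major A-share indices rose together, and the Sci-Tech 50 ETF opened higher and kept climbing. As of 9:56, the Sci-Tech 50 ETF (588000) was up 1.75%. On fund flows, the Sci-Tech 50 ETF (588000) has recorded net inflows for three consecutive trading days: 202 million yuan on September 25, 277 million yuan on September 26, and 637 million yuan on September 29, for a cumulative 1.116 billion yuan. As of September 29, the size of the Sci-Tech 50 ETF (588000) had reached 74.417 billion yuan.
Mei Ri Jing Ji Xin Wen (Editor in charge: Dong Pingping)
The Sci-Tech 50 ETF (588000) tracks the Sci-Tech 50 Index. The index holds 68.77% in the electronics industry and 4.99% in the computer industry, 73.76% in total, which aligns closely with the development direction of frontier industries such as artificial intelligence and robotics. It also covers multiple sub-sectors, including semiconductors, medical devices, software development, and photovoltaic equipment, giving it a high hard-tech content. Investors who are optimistic about the long-term prospects of China's hard technology are advised to keep watching.
Related ETF: Sci-Tech 50 ETF (588000)
On the news front, on the evening of September 29, DeepSeek announced that its official app, web version, and mini-program have all been updated to DeepSeek-V3.2-Exp. DeepSeek said that, thanks to the sharply lower service cost of the new model, official API prices have been lowered accordingly, with the new prices effective immediately. Under the new pricing policy, developers calling the DeepSeek API ...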
AI Application Stocks Active: Danghong Technology Hits 20% Limit-Up, Pinming Technology Reaches Another Record High
Core Viewpoint
- The AI application sector is experiencing significant market activity, with notable stock price increases for several companies following the release of the new DeepSeek-V3.2-Exp model, which enhances training and inference efficiency for long texts and reduces API costs by over 50% [1]
Group 1: Market Activity
- Companies such as 当虹科技 (Danghong Technology) and 品茗科技 (Pinming Technology) have seen stock price increases of 20% and approximately 15% respectively, with other firms like 网达软件 (Wanda Software) and 中电鑫龙 (Zhongdian Xinlong) also hitting the daily limit up [1]
- 海天瑞声 (Haitian Ruisheng) has experienced a stock price increase of over 8% [1]
Group 2: Product Development
- DeepSeek has officially launched the DeepSeek-V3.2-Exp model, which introduces a sparse attention mechanism aimed at optimizing training and inference efficiency for long texts [1]
- The official app, web version, and mini-program have all been updated to the new DeepSeek-V3.2-Exp model [1]
Group 3: Industry Outlook
- Institutions predict that, as model capabilities continue to improve at companies such as OpenAI and DeepSeek and as Agent and multimodal products emerge, the rollout of AI applications will accelerate [1]
- Starting from the fourth quarter, companies in various application sectors are expected to further iterate their AI products and solutions, leading to increased order fulfillment and performance [1]