Youth in Full Flight, Daring to Fight and Win
申万宏源研究· 2025-06-01 10:32
Core Viewpoint
- The article emphasizes the resilience and growth potential of the Chinese capital market amidst global uncertainties, highlighting the importance of strategic preparation and the rise of new economic forces in technology [3][4][6].

Group 1: Market Resilience and Growth
- The Chinese capital market is expected to experience a long-term bull market, driven by improved corporate governance, increased shareholder returns, and a focus on both investment and financing functions [4][6].
- The rise of Chinese technology companies, such as Huawei and ByteDance, is creating opportunities for growth in the new economy, with indices like the Hang Seng Tech Index and domestic innovation boards entering a new valuation era [4][6].
- External uncertainties may enhance China's international influence, with Chinese goods becoming symbols of quality and strength, which is significant for boosting domestic demand [5][6].

Group 2: Research and Development Focus
- The company aims to strengthen the concept of "research products," emphasizing a customer-centric approach and the integration of policy and commercial logic in research recommendations [7][8].
- There is a focus on enhancing data capabilities and intelligent research methodologies, leveraging big data, algorithms, and computational power to improve research efficiency and accuracy [9][10].
- The company recognizes the need for foresight in research, especially in the context of artificial intelligence's growing role in quantitative investment, advocating for strategic and long-term thinking [8][10].

Group 3: Leadership and Management Philosophy
- The role of leadership in the research sector is multifaceted, requiring a balance of innovation, strategic oversight, and effective management to drive the research agenda [11].
- The company emphasizes a collaborative approach to management, focusing on building a strong research ecosystem and fostering a culture of continuous improvement [11].
No One Talks About the "AI Six Little Dragons" Anymore
虎嗅APP· 2025-06-01 08:55
The following article comes from the WeChat public account 字母榜; author: Ma Shuye, editor: Zhao Jinjie; header image: AI-generated.

With 2025 nearly half over, the once-buzzing "AI Six Little Dragons" have all but disappeared from public discourse: no one makes a point of bringing up the label anymore.

The shock of DeepSeek is only part of the story. More importantly, some of the companies once crowned with the "Six Little Dragons" title have visibly fallen behind: 零一万物 (01.AI) has handed the training of ultra-large models over to Alibaba, explicitly given up the pursuit of AGI, and pivoted from pre-training to applications. "Everyone can see it clearly: only the big players can afford to burn money on ultra-large models," Kai-Fu Lee (李开复) said in an interview with 《智能涌现》.

百川智能 (Baichuan), for its part, has narrowed its focus to the medical vertical. While ByteDance, Alibaba, Tencent and other giants race to release new foundation models, its founder Wang Xiaochuan (王小川) once pledged that Baichuan's underlying model would benchmark against OpenAI; today its foundation model has gone quiet, with no further updates.

The remaining four, 智谱AI (Zhipu AI), MiniMax, 月之暗面 (Moonshot AI) and 阶跃星辰 (StepFun), have also lost the capital and technical confidence to challenge, let alone confront, the big players. The former "AI Six Little Dragons" have slipped, in the new round of the large-model race, into a new group of "AI四小强" (the "Four Little Strong Ones").

They have become, on the one hand, the last line of defense holding the AI startup track, and on the other, like cockroaches that refuse to die, they are trying to find a new position and a way forward in the latest round of the large-model race set off by DeepSeek. ...
Witnessing History: DeepSeek Jumps to the World's No. 2 AI Lab, R1 Claims the Open-Source Crown, and the Internet Clamors for R2
程序员的那些事· 2025-06-01 02:04
Core Viewpoint
- DeepSeek has officially announced the completion of the R1-0528 upgrade, which significantly enhances model performance, making R1-0528 a leading open-source AI model and lifting DeepSeek to the position of the world's second-ranked AI lab [1][9][46].

Performance Enhancements
- The upgraded DeepSeek-R1-0528 model exhibits performance comparable to top models like o3 and Gemini 2.5 Pro in various benchmark tests, particularly in mathematics, programming, and general logic [2][15].
- The model's accuracy in complex reasoning tasks has improved significantly, with AIME 2025 test accuracy rising from 70% to 87.5% [16].
- In benchmark tests, DeepSeek-R1-0528 achieved notable scores, such as 91.4% in AIME 2024 and 87.5% in AIME 2025 [17].

Reduction in Hallucination Rate
- The hallucination rate of DeepSeek-R1-0528 has been reduced by 45%-50% compared to its predecessor, addressing previous concerns about high hallucination rates [20][24].
- This improvement allows the model to provide more accurate and reliable results in tasks such as summarization and reading comprehension [25][26].

Enhanced Functionality
- DeepSeek-R1-0528 supports tool calls, enabling it to summarize articles by fetching content from links, and achieves competitive scores on Tau-Bench [31].
- The model's front-end code generation capabilities have been enhanced, allowing for the rapid creation of applications with comprehensive features [33].

Distillation of Qwen3-8B
- Alongside the R1 upgrade, DeepSeek has distilled the R1-0528 model's reasoning chains into a new version, DeepSeek-R1-0528-Qwen3-8B, which shows strong performance in mathematical tests, surpassing the original Qwen3-8B [6][37].
- The distilled 8B model, despite having significantly fewer parameters, demonstrates competitive performance, indicating the effectiveness of the distillation process [38]. (A hedged sketch of such a distillation data pipeline follows this summary.)

Industry Positioning
- Following the R1 upgrade, DeepSeek has been recognized as the world's second-ranked AI lab, surpassing competitors like xAI, Meta, and Anthropic [44][46].
- The model's intelligence index score has increased from 60 to 68, reflecting a significant advancement comparable to OpenAI's improvements [46][47].
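The distillation described above, using R1-0528's reasoning chains to train the much smaller Qwen3-8B, can be illustrated with a minimal data-collection sketch. This is a hedged illustration rather than DeepSeek's actual pipeline: the base URL and model name follow DeepSeek's publicly documented OpenAI-compatible API, while the prompt list, output path, and the `reasoning_content` field access are assumptions made for the example.

```python
# Minimal sketch of a chain-of-thought distillation data pipeline (assumptions noted above).
# The teacher (R1-0528) is queried through an OpenAI-compatible endpoint, and its reasoning
# traces are written to JSONL for a later, standard SFT run on a small student such as Qwen3-8B.
import json
import os

from openai import OpenAI  # pip install openai

client = OpenAI(
    api_key=os.environ["DEEPSEEK_API_KEY"],   # assumed environment variable
    base_url="https://api.deepseek.com",      # DeepSeek's OpenAI-compatible endpoint
)

prompts = [
    "Prove that the sum of the first n odd numbers is n^2.",  # toy prompts for illustration
    "How many positive divisors does 360 have?",
]

with open("r1_0528_distill.jsonl", "w", encoding="utf-8") as f:  # hypothetical output path
    for prompt in prompts:
        resp = client.chat.completions.create(
            model="deepseek-reasoner",  # assumed to resolve to the R1-0528 reasoning model
            messages=[{"role": "user", "content": prompt}],
        )
        msg = resp.choices[0].message
        record = {
            "prompt": prompt,
            # reasoning_content is a DeepSeek-specific field; fall back to "" if absent.
            "reasoning": getattr(msg, "reasoning_content", ""),
            "answer": msg.content,
        }
        f.write(json.dumps(record, ensure_ascii=False) + "\n")
```

The student model is then fine-tuned on these (prompt, reasoning, answer) records with ordinary supervised training; no reinforcement learning is required for this distillation stage.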
AI Weekly | DeepSeek Updates the R1 Model; Nvidia Says H20 Export Restrictions Will Cost $8 Billion in Revenue in Q2
Di Yi Cai Jing· 2025-06-01 01:06
Group 1: DeepSeek Model Updates
- The new DeepSeek R1 model has optimized its performance to reduce "hallucination" rates by 45%-50% in tasks such as rewriting, summarization, and reading comprehension [1][3].
- The updated R1 model has achieved top scores in various benchmark tests, indicating significant improvements in deep thinking capabilities and creative writing [3].

Group 2: Nvidia Financial Performance
- Nvidia reported fiscal Q1 2026 revenue of $44.1 billion, a 69% year-over-year increase, with net profit reaching $18.8 billion, up 26% [2].
- The data center business generated $39.1 billion in revenue, reflecting 73% year-over-year growth [2].
- Nvidia anticipates an $8 billion revenue loss in fiscal Q2 due to U.S. export restrictions on H20 chips to China [2].

Group 3: AI in Academia
- Intology's AI scientist Zochi had a paper accepted at the prestigious ACL conference, marking the first time an AI-generated paper passed peer review at this level [4].
- Zochi's paper scored 4 points, placing it in the top 8.2% of all submissions, highlighting the rapid advancement of AI in academic research [4].

Group 4: Kuaishou's AI Revenue Growth
- Kuaishou reported Q1 2025 revenue of 32.6 billion yuan, a 10.9% year-over-year increase, with adjusted net profit of 4.58 billion yuan [5].
- Kuaishou's Keling AI generated over 150 million yuan in revenue during the quarter, with a user base exceeding 22 million [5].

Group 5: Quark Health Model Achievement
- Quark's health model successfully passed the national deputy chief physician qualification exam, becoming the first AI model to achieve this milestone in a serious medical context [8].
- The model utilizes extensive high-quality data and multi-stage training strategies to enhance its clinical reasoning capabilities [8].

Group 6: Storage Product Price Increases
- Recent data indicates significant price increases for various DDR4 products, with some experiencing up to a 50% rise within a month [11][12].
- The price hikes are attributed to manufacturers halting production and market speculation, as they shift focus to more advanced memory technologies [12].

Group 7: Investment in Intelligent Robotics
- Shanghai State-owned Capital Investment Company led a new round of investment in Zhiyuan Robotics, marking a significant milestone in the embodied intelligence sector [12][13].
- Zhiyuan Robotics has achieved rapid production of humanoid robots, with a factory capable of producing over a thousand units [13].
Cursor's Technical Lead Explains Three Major Challenges in AI Programming: Reward Signals, Process Optimization, and Experience Accumulation | Jinqiu Select
锦秋集· 2025-05-31 02:37
Core Insights
- The article emphasizes that AI programming is not merely about generating syntactically correct code but involves a complex cognitive process that requires understanding problems, selecting appropriate tools, and iterating through multiple debugging cycles [1][3][6].

Group 1: Challenges in AI Programming
- AI programming faces unique challenges due to its vast "action space" compared to fields like mathematics; in programming, the reasoning is embedded in the code itself [7][8].
- The iterative process of "writing code → calling tools → receiving feedback → adjusting code" complicates the optimization of reinforcement learning [7][8].
- Designing effective reward signals for programming tasks is a core challenge, as models may find shortcuts that bypass the core logic of a problem [8][9].

Group 2: Reward Signal Design
- Using "passing tests" as a reward can lead to models generating unrelated solutions that merely pass the tests without solving the actual problem [8][9].
- Researchers are exploring more refined reward designs, including code quality and learning from expert solutions, to guide models effectively [8][9].
- The issue of sparse rewards persists, necessitating the breakdown of complex tasks into smaller components to facilitate more frequent feedback [9].

Group 3: Evolution of Reinforcement Learning Algorithms
- The shift from process reward models (PRMs) to outcome-based reward mechanisms is noted, as the latter provides more reliable guidance for models [10].
- The GRPO algorithm demonstrates success by evaluating multiple candidate solutions against one another rather than relying on inaccurate value functions [10] (a toy illustration of this group-relative scoring follows this summary).
- Modern reinforcement learning systems require optimized infrastructure for high throughput, including various engineering strategies [11].

Group 4: Tool Selection in Programming
- The choice of tools significantly impacts the performance of reinforcement learning models, with terminal operations being favored for their simplicity [12].
- Static analysis tools can provide valuable feedback but face deployment complexities [12].
- The introduction of "thinking tools" allows models to explicitly call reasoning tools, enhancing control over their thought processes [13].

Group 5: Memory Mechanisms and Challenges
- Implementing memory functions in reinforcement learning models presents challenges, particularly with delayed credit assignment [17].
- A practical solution involves rule-based optimization methods rather than end-to-end training for memory mechanisms [17].

Group 6: User Feedback and Model Evaluation
- Real user behavior provides critical feedback signals, with implicit behaviors being more valuable than explicit ratings [18][20].
- Observing user modifications to model outputs can serve as a "ground truth" for retraining models to better align with user expectations [20].

Group 7: Future Trends in Programming Agents
- The future of programming agents lies in their ability to accumulate experience and knowledge, allowing them to avoid starting from scratch for each task [23].
- This knowledge reuse will fundamentally change how programming agents operate, making them more efficient and better aligned with project requirements [23].
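The group-relative scoring that the summary attributes to GRPO can be shown in a few lines: sample several candidate solutions for the same task, score each with an outcome reward, and normalize the rewards within the group instead of training a separate value function (critic). Below is a toy sketch under explicit assumptions: the candidate patches and test checks are invented stand-ins, and a real system would execute an actual test harness rather than string checks.

```python
# Toy sketch of GRPO-style group-relative advantages for code generation (see assumptions above).
import numpy as np


def outcome_reward(candidate: str, tests) -> float:
    """Fraction of checks passed: a stand-in for running a real unit-test harness."""
    return sum(bool(t(candidate)) for t in tests) / len(tests)


def grpo_advantages(rewards, eps: float = 1e-6) -> np.ndarray:
    """Normalize each candidate's reward against its own sampling group (no value function)."""
    r = np.asarray(rewards, dtype=np.float64)
    return (r - r.mean()) / (r.std() + eps)


# Hypothetical checks and candidate patches sampled from the policy for one task.
tests = [lambda c: "sort" in c, lambda c: "return" in c]
candidates = [
    "def f(x): return sorted(x)",
    "def f(x): pass",
    "def f(x): return x",
    "def f(x):\n    x.sort()\n    return x",
]

rewards = [outcome_reward(c, tests) for c in candidates]
advantages = grpo_advantages(rewards)
print(list(zip(rewards, advantages.round(2))))
# Each candidate's tokens are then reinforced in proportion to its group-relative advantage.
```

A candidate that merely games the checks would still be rewarded here, which is exactly the reward-hacking risk raised in Group 2; the richer reward terms the article mentions (code quality, similarity to expert solutions) are the usual mitigations.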
Leadership Change at Shenwan Hongyuan Research: Post-1980s Wang Sheng Takes Over as General Manager, with a Focus on Intelligent Investment Research
Mei Ri Jing Ji Xin Wen· 2025-05-30 14:49
Group 1
- The core viewpoint is that the Chinese capital market is expected to enter a long bull market, driven by improved ROE and the increasing influence of leading brands, even if GDP growth slows to a medium-high rate [3][4].
- Wang Sheng has been appointed as the new General Manager of Shenwan Hongyuan Research, succeeding Zhou Haichen, and aims to explore a more flexible and agile organizational structure to empower analysts [1][5].
- The research institute will focus on intelligent investment research, leveraging big data, algorithms, and computing power to enhance its research methodologies and frameworks [6].

Group 2
- The Chinese capital market is characterized by a well-designed top-level structure, improved corporate governance, and a rising awareness of shareholder returns, with dividends and buybacks exceeding financing for three consecutive years [3][4].
- The emergence of Chinese technology companies, such as Huawei and ByteDance, is creating a unique opportunity for growth in the new economy sector, coinciding with the global advancement of artificial intelligence [4].
- Wang Sheng emphasizes the importance of stable teams, solid research styles, and systematic frameworks in building client trust within the sell-side research sector [5].
The Dust Settles: Wang Sheng Appointed General Manager of Shenwan Hongyuan Research
券商中国· 2025-05-30 13:05
Core Viewpoint
- The Chinese capital market is expected to enter a long-term bull market, driven by external challenges that strengthen the economy and enhance market resilience [4][5].

Group 1: Leadership Changes
- Wang Sheng has been appointed as the new General Manager of Shenwan Hongyuan Research, succeeding Zhou Haichen, who will continue to oversee research and institutional business [1].
- Wang Sheng holds a PhD in management from Tongji University and has extensive experience in strategy research and analysis [2].

Group 2: Market Outlook
- The Chinese capital market is anticipated to grow stronger, with improved corporate governance and increased shareholder returns, as evidenced by dividends and buybacks exceeding financing for three consecutive years [4].
- The rise of Chinese technology companies, such as Huawei and ByteDance, alongside advancements in artificial intelligence, presents a unique opportunity for the market [4].

Group 3: Research Development
- Wang Sheng emphasizes the need to enhance the "research product" concept, focusing on quality and customization to better serve clients [6].
- The research team aims to integrate artificial intelligence into their methodologies, improving data processing and analysis capabilities [7].
International Travel Agents Gather in Hangzhou as a "Sense of Technology" Becomes the City's New Calling Card for Global Promotion
Zhong Guo Xin Wen Wang· 2025-05-30 12:15
Core Insights
- The 2025 ITB CHINA International Travel Business "Hangzhou Tour" event commenced in Hangzhou, attracting over 60 international travel representatives from 25 countries and regions to explore global tourism markets [1].

Group 1: Technology and Tourism
- Hangzhou showcased its technological innovations, including smart tracking flying cameras, brain-machine interface sleep devices, and real-time translation glasses, which became focal points of the event [1].
- The presence of technology products impressed international attendees, with a French travel merchant expressing that the technological advancements in Hangzhou surpassed his previous impressions of the city [2].

Group 2: Changing Travel Trends
- Young travelers are reshaping tourism consumption patterns through short video strategies and influencer recommendations, leading to exponential growth in visitor numbers for popular attractions [2].
- Korean travel agencies are adapting by promoting diversified itineraries that encourage young tourists to travel from Shanghai to Hangzhou via high-speed rail, influenced by direct flight availability and ticket prices [2].

Group 3: International Exposure and Promotion
- Suggestions were made to enhance Hangzhou's international visibility through events like the Liangzhu Forum, which could attract global scholars and industry leaders, thereby increasing the city's global profile [2].
- The Hangzhou Cultural, Radio, Television, and Tourism Bureau plans to optimize inbound services and develop themed travel routes, aiming to showcase both the cultural heritage and the digital economy of the city [4].
Blockbuster: Huawei Releases a Near-Trillion-Parameter Large Model
Mei Ri Jing Ji Xin Wen· 2025-05-30 11:41
Core Insights
- Huawei has launched a new model called Pangu Ultra MoE, which has a parameter scale of 718 billion, marking a significant advancement in MoE model training on the Ascend AI computing platform [1][3][6].
- The release of Pangu Ultra MoE and the Pangu Pro MoE series demonstrates Huawei's capability in achieving a fully controllable training process for domestic computing power and models, validating the innovation capacity of China's AI infrastructure [3][6].

Model Architecture and Training Innovations
- The Pangu team has introduced innovative designs in model architecture and training methods to address the challenges of training ultra-large-scale and highly sparse MoE models, achieving stable training on the Ascend platform [1][4].
- Key innovations include the Depth-Scaled Sandwich-Norm (DSSN) architecture and the TinyInit initialization method, which have enabled long-term stable training on over 18TB of data [4].
- The introduction of the EP loss load-balancing method ensures better load balancing among experts and enhances their specialization capabilities [4] (a generic load-balancing sketch follows this summary).

Performance and Efficiency Improvements
- The training methods disclosed by Huawei have enabled efficient integration of large sparse MoE reinforcement learning (RL) post-training frameworks on Ascend CloudMatrix 384 supernodes [5].
- Recent upgrades have improved the pre-training system's performance, increasing model FLOPs utilization (MFU) from 30% to 41% [5].
- The Pangu Pro MoE model, with 72 billion total parameters and 16 billion active parameters, has demonstrated performance comparable to larger models, ranking first among domestic models under 100 billion parameters on the SuperCLUE leaderboard [5].

Industry Implications
- The successful training and optimization of ultra-large-scale sparse models on domestic AI platforms signify a closed loop of "full-stack domestication" and "fully controllable processes," from hardware to software and from research to engineering [6].
- This advancement provides a strong foundation for the development of China's AI industry, reinforcing confidence in domestic AI capabilities [3][6].
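The summary names Huawei's EP loss for expert load balancing but does not define it, so the sketch below shows the generic Switch-Transformer-style auxiliary loss that MoE routers commonly use for the same purpose. It is an illustration of the load-balancing idea only, not Huawei's disclosed formulation; the tensor shapes and the top-k routing choice are assumptions.

```python
# Generic MoE load-balancing auxiliary loss (Switch/GShard style), for illustration only.
import torch


def load_balancing_loss(router_logits: torch.Tensor, top_k: int = 1) -> torch.Tensor:
    """router_logits: [num_tokens, num_experts] raw routing scores for one batch.

    Encourages f_i (fraction of tokens dispatched to expert i) and p_i (mean routing
    probability assigned to expert i) to both stay close to 1 / num_experts.
    """
    num_tokens, num_experts = router_logits.shape
    probs = torch.softmax(router_logits, dim=-1)           # [tokens, experts]
    top_idx = probs.topk(top_k, dim=-1).indices            # experts chosen per token
    dispatch = torch.zeros_like(probs).scatter_(-1, top_idx, 1.0)
    f = dispatch.mean(dim=0)                               # token share per expert
    p = probs.mean(dim=0)                                  # probability mass per expert
    return num_experts * torch.sum(f * p)


# Example: 8 tokens routed over 4 experts; a perfectly balanced router gives a loss near 1.0,
# while a router that collapses onto one expert pushes the loss toward num_experts.
logits = torch.randn(8, 4)
print(load_balancing_loss(logits).item())
```

In practice a small multiple of this term is added to the language-modeling loss so that tokens spread across experts, which is the same goal the EP loss is described as serving: balanced load while preserving room for expert specialization.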