SiliconFlow launches DeepSeek-V3.1, context extended to 160K
Di Yi Cai Jing· 2025-08-25 13:09
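According to SiliconFlow, its large-model service platform has launched DeepSeek-V3.1, the latest open-source model from the DeepSeek team, with support for a 160K ultra-long context. (Source: Di Yi Cai Jing) ...
SiliconFlow: DeepSeek-V3.1 launched, context extended to 160K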
Xin Lang Cai Jing· 2025-08-25 12:32
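According to SiliconFlow, on August 25 its large-model service platform launched DeepSeek-V3.1, the latest open-source model from the DeepSeek team. DeepSeek-V3.1 has 671B total parameters and 37B activated parameters, and uses a hybrid reasoning architecture (supporting both a thinking mode and a non-thinking mode). In addition, DeepSeek-V3.1 takes the lead in supporting a 160K ultra-long context, letting developers efficiently handle long documents, multi-turn conversations, coding, agents, and other complex scenarios. ...
How big tech companies view DeepSeek-V3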
2025-08-25 09:13
Summary of DeepSeek and the AI Chip Industry Conference Call

Industry and Company Overview
- The conference call focuses on the AI chip industry, specifically DeepSeek's new UE8M0 FP8 format and its implications for domestic AI chip development and training efficiency.

Key Points and Arguments

Introduction of the UE8M0 FP8 Format
- DeepSeek has defined the UE8M0 FP8 format to establish a new standard for domestic chips, aiming to reduce training memory usage by 20%-30% and improve training efficiency by 30%-40% [1][2]
- This new format is expected to guide the design of the next generation of domestic chips and may expand into an FP8 protocol standard through OCP [1][2]

Training and Inference Efficiency
- The UE8M0 FP8 format reduces memory usage and computational overhead by splitting weight data into smaller blocks, improving training and inference efficiency while maintaining high precision [4]
- The FP8 data format is expected to significantly improve the training efficiency of domestic large models, helping to close the gap with international leaders [6][7]

Current Challenges in Domestic AI Chips
- Domestic AI chips face challenges such as insufficient operator coverage (approximately 50%), gradient quantization errors, and immature tensor expansion [8][9]
- Full-scale application of these technologies is expected to take until Q2 or Q3 of the following year [8]

Future Developments and Market Impact
- The introduction of the FP8 format in inference will lower costs and is expected to be implemented rapidly in domestic chips within the next six months to a year [8]
- However, no domestic manufacturer can independently complete training tasks yet, and significant technical hurdles remain [8][10]

Mixed Precision Strategy
- DeepSeek employs a mixed precision strategy to balance performance and precision, retaining high precision for sensitive parameters while using the new UE8M0 FP8 format for less sensitive ones [5]

Competitive Landscape
- DeepSeek V3.1 introduces hybrid reasoning capabilities and enhances agent abilities, with a significant increase in the dataset size to 840 billion tokens, improving understanding of long texts and code [3][25]
- Compared to international models like GPT-5 and Claude 4, DeepSeek V3.1 ranks among the top six globally, indicating strong competitiveness [26][27]

Multi-Modal Transition
- By Q1 2026, leading domestic AI models are expected to transition into the multi-modal era, requiring high-performance computing resources [30]
- The integration of different modalities will necessitate re-training and will increase the demand for training equipment [30]

Long-Term Outlook
- The adoption of new data formats and standards is a gradual process, with significant changes expected over the next year, particularly in hardware support for FP8 [10][11]
- The industry is moving towards a more standardized approach to avoid fragmentation, with major manufacturers leading the charge [10]

Additional Important Insights
- The current strategy involves maximizing the potential of existing hardware while preparing for the transition to new formats [19]
- The impact of new formats on model training methods will require substantial adjustments and a phased approach to implementation [15][16]
- The FP8 format has limitations in high-precision fields such as finance and medicine, indicating a need for careful application [23][24]

This summary covers the main points of the conference call: the advances and challenges within the domestic AI chip industry and the strategic direction of DeepSeek.
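The block-splitting idea in the call summary can be illustrated with a minimal Python sketch. It assumes a block length of 32 and an FP8 E4M3 element range of ±448, neither of which the summary states; the function names are hypothetical, and the final rounding of each element to FP8 is omitted. It is an illustration of per-block power-of-two scaling in general, not DeepSeek's implementation.

```python
import math

FP8_E4M3_MAX = 448.0  # largest finite FP8 E4M3 magnitude (assumed element format)
BLOCK_SIZE = 32       # assumed block length; the summary does not give one


def power_of_two_scale(block):
    """Smallest power-of-two scale that brings the block's peak within FP8 range."""
    peak = max(abs(x) for x in block)
    if peak == 0.0:
        return 1.0
    return 2.0 ** math.ceil(math.log2(peak / FP8_E4M3_MAX))


def scale_blocks(values, size=BLOCK_SIZE):
    """Split values into blocks and return a list of (scale, scaled_block) pairs."""
    pairs = []
    for start in range(0, len(values), size):
        block = values[start:start + size]
        scale = power_of_two_scale(block)
        # Dividing by a power of two only shifts the exponent, so no precision is lost here.
        pairs.append((scale, [x / scale for x in block]))
    return pairs


if __name__ == "__main__":
    # One block of tiny weights followed by one block of large weights.
    weights = [0.001] * 32 + [1000.0] * 32
    for scale, block in scale_blocks(weights):
        print(f"scale=2^{int(math.log2(scale))}, peak after scaling={max(map(abs, block)):.3f}")
```

With a single shared scale chosen for the large block, the small weights would become 0.00025, below the smallest nonzero E4M3 magnitude (2^-9), and would be lost. Giving each block its own scale keeps both blocks inside the representable range.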
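DeepSeek and Alibaba Cloud advance their AI coding capabilities as global tech giants invest heavily: why is AI coding one of the most certain high-growth tracks in AI?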
Mei Ri Jing Ji Xin Wen· 2025-08-25 07:16
Core Insights
- The launch of DeepSeek-V3.1 marks a significant step towards the era of AI agents, with developers now able to build their own intelligent agents [1]
- Alibaba's introduction of the Qoder programming platform highlights the competitive landscape in AI programming, with major players like ByteDance and Tencent also entering the market [2]
- The AI programming sector is rapidly growing, with at least seven unicorns valued over $1 billion and total funding exceeding 240 billion RMB [2][3]

Group 1: Product Developments
- DeepSeek-V3.1 achieved a score of 76.3% in Aider coding tests, outperforming competitors like Claude 4 Opus and Gemini 2.5 Pro [1]
- Qoder integrates top programming models and can search through 100,000 code files at once, significantly enhancing software development efficiency [1]
- Anysphere's Cursor has gained approximately 30,000 enterprise clients and reached an annual recurring revenue (ARR) of over $500 million, showcasing its rapid growth in the AI programming space [3]

Group 2: Market Dynamics
- The AI programming race has intensified, with major tech companies vying for control over the ecosystem rather than just competing on product features [2]
- The potential market for personalized software development could reach up to $15 billion by 2030, driven by reduced costs and barriers to entry in software development [6]
- The rise of open-source strategies among domestic companies, such as Qwen3-Coder and DeepSeek-V3.1, is attracting global developers and fostering ecosystem growth [5][6]

Group 3: Competitive Landscape
- The AI programming sector is characterized by a unique advantage for domestic tech firms, which includes performance catch-up and ecosystem collaboration [4]
- The market share of domestic models like Tongyi Qianwen has increased from 5% to 22% in the AI programming field within a month [6]
- The competition is not only about faster coding but also about establishing a stronghold in the next wave of AI and computational power [5]
Yingbo Shuke (英博数科) observation: DeepSeek V3.1 released, a key leap for AI engineering
Zhong Jin Zai Xian· 2025-08-25 06:54
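DeepSeek recently released V3.1, completing a comprehensive upgrade centered on "engineering pragmatism". As a provider of AI computing power and intelligent computing solutions, Yingbo Shuke has been following how this iteration optimizes tool calling, chains of thought, and system integration, delivering more robust, efficient, and lower-cost deployment without sacrificing existing performance.
After several rounds of large-scale pre-training and reinforcement optimization, DeepSeek has released V3.1 with a very clear positioning: without sacrificing quality on mainstream tasks, make tool calling, thought organization, and system integration more stable, faster, and more "economical".
Overview: a "usage-first" incremental leap
Unlike earlier releases that emphasized raw large-model capability, DeepSeek V3.1 is more like a release driven by "engineering features":
· More complete thinking-mode support: the tokenizer adds 4 special tokens related to reasoning/retrieval which, combined with post-training policy constraints, make the "think, retrieve, call tools, answer" chain more controllable.
· More stable tool and agent capabilities: in function calling, retrieval augmentation, and agent scenarios, call intent is clearer, parameters are better formed, and failure retries are more restrained.
· Improved efficiency of the "Think" variant: the overall answer quality of DeepSeek-V3.1-Think is roughly on par with DeepSeek-R1-0528, but it responds faster, with friendlier throughput and latency.
· Training closer to the hardware ...
DeepSeek's new version ignites domestic computing power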
Hu Xiu· 2025-08-25 06:06
Core Viewpoint
- DeepSeek has launched its V3.1 version, which supports the next generation of domestic chips, signaling a significant moment for China's artificial intelligence and a turning point for domestic computing power [1]

Group 1
- The release of DeepSeek V3.1 indicates advancements in domestic AI capabilities [1]
- Nvidia has notified its suppliers to halt the production of the China-specific H20 chips, reflecting a shift in the competitive landscape [1]
- Both events suggest a convergence towards strengthening China's domestic computing power in the AI sector [1]
AI localization? Tesla to integrate DeepSeek and Doubao
Guan Cha Zhe Wang· 2025-08-25 05:54
Core Insights
- Tesla has partnered with ByteDance's Volcano Engine to enhance its in-car voice assistant capabilities with large language models [2][3]
- The integration includes the Doubao model for voice command functions and DeepSeek Chat for AI interaction [3][4]

Group 1: Partnership and Technology
- Tesla's voice assistant will use the Doubao model for functions like navigation, media playback, and temperature control, as well as querying the owner's manual [3][4]
- DeepSeek will provide AI interaction capabilities, allowing users to chat with the voice assistant for information like weather and news [4][6]

Group 2: Market Strategy and Product Development
- Tesla's voice assistant update in China is seen as overdue: the company entered the market in 2013, and previous versions had limited functionality [7]
- The company is implementing localization strategies to attract consumers, including the launch of a new 6-seat Model Y priced at 339,000 yuan [7][9]
- Tesla plans to introduce a low-cost Model Y by 2026, aimed at reducing costs by 20% to capture more market share in China [9]

Group 3: Sales Performance
- In the first half of 2025, Tesla's cumulative sales in China were approximately 263,400 units, a decline of about 5.4% compared to the same period in 2024 [9]
- In July 2025, Tesla's Shanghai factory reported sales (including exports) of 67,900 units, reflecting a year-on-year decline of 8.4% and a month-on-month decline of 5.2% [9]
Semiconductor Morning Brief | Domestic chip sector has its "DeepSeek" moment as A-share and US semiconductor stocks rally together!
Mei Ri Jing Ji Xin Wen· 2025-08-25 01:32
Market Performance
- As of August 22, 2025, the Shanghai Composite Index rose by 1.45% to close at 3825.76 points, the Shenzhen Component Index increased by 2.07% to 12166.06 points, and the ChiNext Index surged by 3.36% to 2682.55 points [1]
- In overnight U.S. trading, the Dow Jones Industrial Average rose by 1.89%, the S&P 500 by 1.52%, and the Nasdaq Composite by 1.88% [1]
- The Philadelphia Semiconductor Index rose by 2.70%, with notable increases in stocks such as Micron Technology (up 1.63%), ARM (up 3.48%), NXP Semiconductors (up 4.87%), Microchip Technology (up 5.32%), and Applied Materials (up 1.66%) [1]

Semiconductor Sector Insights
- DeepSeek's comment triggered a significant rally in A-share semiconductor and computing stocks on August 22, with leading stocks like Cambricon, Haiguang Information, and Zhongke Shuguang hitting the daily limit [2]
- Cambricon's stock price broke through the 1100 yuan and 1200 yuan thresholds, closing at 1243.20 yuan, with a total market capitalization exceeding 520 billion yuan [2]
- Yuchip Technology reported a 60.12% year-on-year increase in revenue for the first half of 2025, reaching 449 million yuan, and a 123.19% increase in net profit to 91 million yuan [2]
- Yuchip's AI audio chip products have entered the project stage with several leading brands, and sales of its low-latency wireless audio products grew significantly [2]

Company Performance
- Shengke Communication reported revenue of 508 million yuan for the first half of 2025, a decrease of 4.56% year-on-year, but narrowed its net loss to 24 million yuan from 57 million yuan in the same period last year [3]
- The company's Ethernet switch chip revenue accounted for 71.46% of total revenue, highlighting its core product's importance in various network applications [3]

Industry Developments
- The 2025 China Computing Power Conference opened on August 23, with a report indicating that as of June 2025, computing power centers in operation in China totaled 10.85 million racks, with an intelligent computing power scale of 788 EFLOPS [3]
- The Ministry of Industry and Information Technology emphasized the need to optimize the national computing power layout and guide the approval of new projects in areas with low overall computing power utilization [3]

Investment Opportunities
- Zhongyuan Securities noted that domestic semiconductor equipment and components still have a relatively low localization rate, indicating potential benefits for companies capable of breakthroughs in advanced processes [4]
- Advanced packaging technology is highlighted as a key to enhancing chip performance, particularly for advanced AI computing chips, suggesting a favorable environment for domestic AI computing chip manufacturers [4]
- Relevant ETFs, such as the Sci-Tech Semiconductor ETF (588170), focus on semiconductor equipment and materials, indicating a strong investment opportunity in the semiconductor sector driven by domestic substitution and AI demand expansion [4]
Domestic chip sector has its "DeepSeek" moment as A-share and US semiconductor stocks rally together!
Mei Ri Jing Ji Xin Wen· 2025-08-25 01:31
Market Performance
- As of August 22, 2025, the Shanghai Composite Index rose by 1.45% to close at 3825.76 points, while the Shenzhen Component Index increased by 2.07% to 12166.06 points, and the ChiNext Index surged by 3.36% to 2682.55 points [1]
- In overnight U.S. trading, the Dow Jones Industrial Average rose by 1.89%, the S&P 500 by 1.52%, and the Nasdaq Composite by 1.88%. The Philadelphia Semiconductor Index rose by 2.70% [1]

Semiconductor Sector Highlights
- DeepSeek's comment triggered a significant rally in A-share semiconductor and computing stocks on August 22, with leading stocks like Cambricon, Haiguang Information, and Zhongke Shuguang hitting the daily limit. Cambricon's stock price broke through the 1100 and 1200 yuan thresholds, closing at 1243.20 yuan, with a total market value exceeding 520 billion yuan [2]
- Yuchip Technology reported a 60.12% year-on-year increase in revenue for the first half of 2025, reaching 449 million yuan, and a 123.19% increase in net profit to 91 million yuan. The company attributed this growth to its AI transformation in edge products, with significant sales increases in low-latency wireless audio products [2]
- Shengke Communication's revenue for the first half of 2025 was 508 million yuan, a decrease of 4.56% year-on-year, but its net loss narrowed to 24 million yuan from 57 million yuan in the previous year. The company's Ethernet switch chips accounted for 71.46% of its revenue [3]

Industry Developments
- The 2025 China Computing Power Conference opened on August 23 in Datong, Shanxi. As of June 30, 2025, computing centers in operation in China totaled 10.85 million racks, with an intelligent computing scale of 788 EFLOPS. The Ministry of Industry and Information Technology plans to optimize the national computing layout [3]
- According to Zhongyuan Securities, domestic semiconductor equipment and components still have a relatively low localization rate. Companies capable of breakthroughs in advanced processes are expected to benefit significantly. Advanced packaging is highlighted as a key technology for enhancing chip performance, particularly for advanced AI computing chips [4]
- The Sci-Tech Innovation Semiconductor ETF and its linked funds focus on semiconductor equipment and materials, which are crucial areas for domestic substitution, benefiting from the expansion of semiconductor demand driven by the AI revolution [4]
DeepSeek update: one sentence sends domestic chip stocks soaring
36Kr· 2025-08-24 23:36
Core Viewpoint
- The launch of DeepSeek V3.1 has generated significant excitement in the AI community due to its innovative architecture and its stated support for a new generation of domestic chips, which may reduce reliance on foreign computing power [1][2].

Group 1: Product Innovation
- The most revolutionary feature of DeepSeek V3.1 is its Hybrid Reasoning Architecture, which allows users to switch between thinking and non-thinking modes, enhancing flexibility and efficiency in usage [6].
- The new model integrates core functions such as general dialogue, complex reasoning, and professional programming into a single model, improving user experience and operational efficiency [9].
- The reasoning efficiency of V3.1 has significantly improved, with a reported reduction in output token count of 20% to 50% in thinking mode compared to the previous top model [9][10].

Group 2: Cost Efficiency
- The "thinking chain compression" technique allows the model to generate more concise and efficient reasoning paths, reducing computational costs and API call expenses, making it more viable for large-scale commercial applications [10].
- Community tests indicate that DeepSeek V3.1 outperformed Claude 4 Opus in multi-language programming tests while being more cost-effective [10].

Group 3: Technical Specifications
- DeepSeek V3.1 uses UE8M0 FP8 Scale parameter precision, which compresses standard floating-point numbers into 8 bits, saving storage space and computing power [13][15].
- The MXFP8 block scaling approach allows for efficient data processing without significant information loss, making it suitable for next-generation domestic chips [15][16].
- The compatibility of UE8M0 FP8 with new domestic chips like the Moore Threads MUSA 3.1 GPU and the VeriSilicon VIP9000 NPU enhances performance while maintaining precision [16].

Group 4: Market Reaction
- Following the announcement of DeepSeek V3.1, domestic chip concept stocks surged, with Daily Interaction closing up 13.62% [2][3].
- The overall market index rose to 3800 points, reflecting strong investor sentiment towards the advancements in domestic AI technology [3].
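The "UE8M0" scale named under Technical Specifications can be sketched as one byte that stores only a power-of-two exponent: unsigned, 8 exponent bits, 0 mantissa bits. In the sketch below, the bias of 127 and the reserved code 255 follow the E8M0 scale type as it is commonly described for the OCP Microscaling (MX) formats. Rounding the scale up to the next power of two, and the function names, are assumptions made for illustration and are not taken from the article.

```python
import math

E8M0_BIAS = 127      # exponent bias of the E8M0 scale type
E8M0_MAX_CODE = 254  # code 255 is reserved (NaN), so it is never produced here


def encode_ue8m0(scale: float) -> int:
    """Encode a positive scale as one byte holding a biased power-of-two exponent."""
    if not (scale > 0.0) or math.isinf(scale):
        raise ValueError("scale must be a positive, finite number")
    # Assumption: round up, so values divided by the decoded scale cannot overflow.
    exponent = math.ceil(math.log2(scale))
    return min(max(exponent + E8M0_BIAS, 0), E8M0_MAX_CODE)


def decode_ue8m0(code: int) -> float:
    """Decode the byte back into the power-of-two scale it represents."""
    if not 0 <= code <= E8M0_MAX_CODE:
        raise ValueError("code must be in 0..254")
    return 2.0 ** (code - E8M0_BIAS)


if __name__ == "__main__":
    for scale in (1.0, 0.3, 4.0):
        code = encode_ue8m0(scale)
        print(f"scale={scale} -> code={code} -> decoded={decode_ue8m0(code)}")
```

Because the byte has no mantissa bits, applying the scale amounts to an exponent shift rather than a full multiplication, which is one plausible reason such a format is attractive for simpler hardware.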