DeepSeek V4
Breaking: DeepSeek Is "Down" Again!
Securities Times (证券时报) · 2026-03-31 12:45
Core Viewpoint - DeepSeek suffered service disruptions on three consecutive days, raising concerns about its infrastructure and its upcoming model releases [1][5][6].
Group 1: Service Disruptions
- DeepSeek's services were interrupted on each day from March 29 to March 31, with outages lasting approximately 1 hour 48 minutes, 10 hours 13 minutes, and 1 hour 3 minutes respectively [1].
- The disruptions are speculated to be related to ongoing grayscale (staged-rollout) testing of the upcoming V4 model as the company prepares for its official release [5][6].
- After the issues were resolved, the API documentation still contained no reference to the V4 model, which may indicate challenges in the transition to a higher-performance architecture [7].
Group 2: Model Development and Innovations
- DeepSeek-OCR 2 was released on January 27, 2026. It introduces the DeepEncoder V2 method, which lets the model process images in a more human-like way by dynamically reordering image segments according to their logical sequence [8].
- The new model scored 91.09% on the OmniDocBench v1.5 benchmark, a 3.73% improvement over its predecessor, while keeping computational cost low with a visual token budget of 256 to 1120 [8].
- DeepSeek-OCR 2's architecture demonstrates the potential of using language-model frameworks for visual encoding, paving the way for a unified multimodal encoder capable of extracting features from images, audio, and text [10].
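As a quick check on the outage figures reported above for March 29 to March 31, the three durations can be totaled with a few lines of Python. This is only an illustrative sketch: the dictionary keys assume the year 2026 from the article's dateline, and the helper names `total_downtime` and `format_hm` are hypothetical, not part of any DeepSeek tooling.

```python
from datetime import timedelta

# Outage durations as reported in the summary (one per day, in order).
# The 2026 dates are an assumption taken from the article's dateline.
OUTAGES = {
    "2026-03-29": timedelta(hours=1, minutes=48),
    "2026-03-30": timedelta(hours=10, minutes=13),
    "2026-03-31": timedelta(hours=1, minutes=3),
}


def total_downtime(outages):
    """Sum all outage durations into a single timedelta."""
    return sum(outages.values(), timedelta())


def format_hm(delta):
    """Render a timedelta as whole hours and minutes."""
    minutes = int(delta.total_seconds() // 60)
    return f"{minutes // 60} h {minutes % 60:02d} min"


if __name__ == "__main__":
    print("Total reported downtime:", format_hm(total_downtime(OUTAGES)))
```

Under these figures the three outages add up to roughly 13 hours, most of it from the second day.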
DeepSeek Goes Down Again; Industry Speculates That V4 Stealth Testing Is the Cause
The Economic Observer (经济观察报) · 2026-03-31 10:06
Core Viewpoint - DeepSeek has experienced significant service interruptions while the release of its V4 model is still awaited, raising concerns about its infrastructure and stability [1][2][3].
Group 1: Service Interruptions
- On March 31, DeepSeek faced a major service interruption lasting 12 hours, the longest since the service launched [2][3].
- The disruption was linked to ongoing grayscale (staged-rollout) testing of the V4 model, which has not yet been officially released despite multiple anticipated launch windows [2][3].
- During the interruption, users reported persistent loading screens and server errors, and core functionality was limited [3][4].
Group 2: Technical Developments
- DeepSeek has made significant updates to its model, including expanding the context window from 128K to 1M tokens and moving its knowledge cutoff to May 2025 [2].
- The technical community believes DeepSeek has prepared the infrastructure needed to test the V4 model, which is expected to include a new "native reasoning layer" [2][3].
- Despite the interruptions, DeepSeek's API services maintained 100% operational status, indicating that the issues primarily affected consumer-facing (C-end) services [4][6].
Group 3: Industry Impact
- The interruption is viewed as critical, akin to a traditional data center outage, because it can disrupt business logic that depends on the DeepSeek model [4].
- The incident did not significantly affect API users, who continued to operate normally during the downtime [6].
Is DeepSeek Making Another Move? A Mysterious AI Model Sparks Debate Among Developers Worldwide
ifeng Finance (凤凰网财经) · 2026-03-18 13:21
Core Viewpoint - A new AI model named "Hunter Alpha" has sparked speculation that it is connected to the upcoming DeepSeek V4, owing to its impressive performance metrics and anonymous release [3][4][6].
Group 1: Performance Metrics
- Hunter Alpha has a parameter scale of 1 trillion, placing it among the leading models in the industry [4].
- The model claims a context window of up to 1 million tokens, far larger than that of most commercial models, which lets it handle longer texts and more complex tasks [4].
- As of the latest statistics, Hunter Alpha has processed over 160 billion tokens, indicating rapid adoption among developers [5].
Group 2: Connection to DeepSeek
- The model identifies itself as a "Chinese AI model trained primarily in Chinese" and reports a knowledge cutoff of May 2025, both of which align with the specifications of DeepSeek's existing models [6].
- Some developers suggest that Hunter Alpha's reasoning style may reveal its "heritage," and that its scale and memory capacity match expectations for DeepSeek V4 [7].
- Despite the similarities, some analysts remain cautious about definitively linking Hunter Alpha to DeepSeek V4, noting differences in token behavior and architectural patterns [9][10].
Group 3: Industry Practices
- Releasing AI models anonymously to gather real-world feedback has become standard practice in the industry, with platforms like OpenRouter facilitating testing across multiple AI systems [8].
- A notice on Hunter Alpha's profile states that all prompts and completions are recorded for model improvement, a common practice in the field [9].
Is DeepSeek Making Another Move? A Mysterious AI Model Sparks Debate Among Developers Worldwide
Wallstreetcn (华尔街见闻) · 2026-03-18 04:22
Core Viewpoint - An AI model named "Hunter Alpha" has appeared on the OpenRouter platform and is speculated to be a secret test of DeepSeek's next-generation system ahead of its official release [1][2].
Group 1: Model Specifications
- Hunter Alpha was released on March 11 as a "stealth model" and is currently available to developers for free [2].
- The model has 1 trillion parameters and a context window of up to 1 million tokens, far larger than that of most commercial models, which lets it handle longer texts and more complex tasks [4].
- The model claims to be trained primarily in Chinese, with a knowledge cutoff of May 2025, in line with DeepSeek's existing models [2][6].
Group 2: Market Impact and Usage
- The combination of high performance and zero cost has led to rapid adoption among developers, with over 160 billion tokens processed by the model as of the last report [5].
- The model's performance metrics have triggered significant discussion in the market, highlighting its potential impact [3].
Group 3: Connection to DeepSeek
- Clues linking Hunter Alpha to DeepSeek include its underlying data characteristics and operational logic, particularly its training data cutoff date and reasoning style [6][7].
- Some developers believe the model's reasoning style may reveal its "heritage," suggesting a connection to DeepSeek's anticipated V4 model, which is expected to be released soon [7].
Group 4: Industry Practices
- Releasing models anonymously to gather real-world feedback has become standard practice in the AI industry, with platforms like OpenRouter facilitating the process [9].
- A notice on Hunter Alpha's profile states that all prompts and completions are recorded for model improvement, further supporting the notion of a grayscale (staged-rollout) testing mechanism [10].
DeepSeek V4 Keeps Slipping: Why Is China's Open-Source Champion Getting Slower?
Core Viewpoint - DeepSeek's development has slowed significantly, raising concerns among developers and the AI community about its future competitiveness against players such as OpenAI and Anthropic [5][8][18].
Group 1: DeepSeek's Development Timeline
- DeepSeek V4 is expected to launch in April 2026, following multiple delays in its announcement timeline [6][14].
- The previous version, DeepSeek V3.2, was released on December 1, 2025, marking a high point for the company with rapid updates and significant community engagement [8][11].
- Since the release of V3.2, updates have been minimal, focusing on small adjustments rather than major advancements, leading to community frustration [12][13].
Group 2: Comparison with Competitors
- OpenAI and Anthropic have maintained a rapid release cycle, with OpenAI launching multiple updates and products almost monthly, while DeepSeek has not released any major updates since V3.2 [15][18].
- The competitive landscape has shifted, with DeepSeek lagging behind in update frequency and innovation, which could affect its market position [42].
Group 3: Challenges Faced by DeepSeek
- The transition from releasing base models to developing a comprehensive system has increased the complexity and duration of DeepSeek's development cycles [21][25].
- DeepSeek is under pressure to meet high expectations from the open-source community, where any perceived failure could significantly damage its reputation [28][31].
- Each release needs to be impactful, as minor updates may not suffice in a competitive environment [32].
Group 4: Strategic and Technical Considerations
- The upcoming V4 is expected to focus on multimodal capabilities, long-term memory, and enhanced coding abilities, alongside deep adaptation to domestic chipsets [38][42].
- The development of V4 is seen as a response to both external technological pressures and internal resource limitations, which may extend the research and development timeline [39][40].
- The ability to adapt to the evolving hardware ecosystem is crucial for DeepSeek's future success in the AI landscape [37].
Is Liang Wenfeng Delaying V4 to Cure the "Lobster's" Amnesia?
Huxiu (虎嗅APP) · 2026-03-17 00:08
Core Viewpoint - Anticipation is building around the release of DeepSeek's V4, in particular its Long-Term Memory (LTM) feature, which aims to enhance the AI's contextual understanding and memory capabilities and to set it apart from competitors like OpenClaw [7][8][17].
Group 1: V4 Development and Features
- DeepSeek's V4 is expected to include a significant architectural overhaul with 1 trillion parameters and native multimodal capabilities, and is set to be released in April [7][8].
- The core innovation of V4 is the Long-Term Memory (LTM) system, which allows the AI to retain user interactions and preferences over time, improving its contextual understanding [8][11].
- The LTM aims to address the limitations of existing models, particularly OpenClaw, which struggles with memory retention and context management [9][10][22].
Group 2: Challenges and Competitor Analysis
- The AI industry is evolving rapidly, with competitors releasing new features and models, putting pressure on DeepSeek to catch up [38].
- DeepSeek currently lacks multimodal capabilities and is primarily a text-based model, while competitors have advanced to support audio and video processing [39][43].
- The company faces challenges in agent capabilities, AI programming, and search functionality, all of which are critical for remaining competitive [45][48][51].
Group 3: Memory and Learning Capabilities
- Current AI models, including OpenClaw, have significant limitations in memory management, leading to problems with context retention and task continuity [18][30].
- Research indicates that many leading models struggle to learn effectively from context, highlighting a gap in their ability to use information dynamically [32][34].
- A robust memory system within V4 could transform how AI learns and interacts, making it more adaptable and user-friendly [30][35].
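Industry Weekly: Weekly View: Government Work Report Mentions "Computing and Electricity Collaboration" for the First Time; Watch for Investment Opportunities Across the Industry Chain (2026-03-15)
Kaiyuan Securities · 2026-03-15 14:02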
Investment Rating - The industry investment rating is "Positive" (maintained) [2]
Core Insights
- The government work report mentioned "computing and electricity collaboration" for the first time, promoting the green and low-carbon transformation of AI infrastructure. This includes building large-scale intelligent computing clusters and enhancing nationally integrated monitoring and scheduling of computing power [5][12].
- The report emphasizes that data centers place extreme demands on the continuity and stability of power supply, because their loads fluctuate at the millisecond level [6][13].
- AI algorithms and big data analysis are used to form automated scheduling strategies for computing and electricity collaboration, optimizing the scheduling of computing tasks in time and space [7][14].
Summary by Sections
Industry Overview - The government work report highlights the importance of "computing and electricity collaboration" in driving the green transformation of AI infrastructure, with a target for new data centers to reach an 80% green electricity consumption rate [5][12].
Market Review - In the week from March 9 to March 13, 2026, the CSI 300 index rose by 0.19%, while the computer index fell by 0.92% [16].
Investment Recommendations - The report suggests focusing on investment opportunities across the industry chain, particularly in power-sector informatization and virtual power plants, with recommended beneficiaries including Guoneng Rixin, Nanfang Digital, and others [8][15].
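Rumor: An Automaker's Suspected "Lobster Raising" Sends Employees' Computers Out of Control En Masse; Are People Swapping iPhone Screens and Returning the Phones for a Profit? Apple Responds; Liang Wenfeng's DeepSeek V4 Reportedly Set to Collide with Yao Shunyu
Leiphone (雷峰网) · 2026-03-13 00:35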
Key Points
- A car company faced issues with employees' computers being remotely controlled, leading to software deletions. Speculation linked the incident to the "lobster raising" (养龙虾) AI tool, although no evidence supports this claim [4][5].
- Apple addressed concerns about individuals profiting from returning iPhones after screen replacements, stating that returned products undergo inspection to ensure they are undamaged [8][9].
- Tencent launched SkillHub, a local mirror site for ClawHub, and clarified that it intends to support the ecosystem while addressing accusations of data scraping [10].
- Liang Wenfeng's DeepSeek V4 is set to launch in April, focusing on improvements in coding capabilities and long-term memory, while Tencent's Yao Shunyu is also expected to release a new model [11][12].
- Cambricon announced a cash dividend of 632 million yuan, with a significant increase in revenue and a return to profitability [12][13].
- Baidu's RoboSense is set to implement high-line-count lidar in its autonomous driving service, indicating a shift towards advanced sensor technology in the industry [14][15].
- BYD is conducting a large-scale recruitment drive, seeking over 2,000 workers with competitive salaries, reflecting its expanding production capacity [16].
- Kuaishou is ramping up recruitment for AI-related positions, indicating growing demand for talent in the generative AI space [17].
- Li Auto aims for over 20% sales growth in 2026, with plans to launch new models and enhance its sales management strategy [18].
- Xiaomi's new SU7 model is expected to have a production capacity of 16,000 units in March, highlighting the company's focus on electric vehicles [19].
- Stellantis is exploring partnerships with Chinese companies to restructure its European operations, potentially allowing for Chinese investment in its brands [34][35].
- Apple's iPhone Fold is entering mass production, with significant orders for memory chips despite rising costs, indicating strong market confidence [36].
- AMD's CEO is visiting South Korea to secure semiconductor supply chains, emphasizing the importance of partnerships in the competitive AI infrastructure market [37].
- Google's Project Genie, aimed at generative AI for gaming, has limitations and cannot yet fully develop video games, which clarifies misconceptions in the industry [38][39].
Is DeepSeek V4 Really Coming? A Trillion-Parameter Model Opens for Anonymous Testing and Runs "Lobster" for Free
Synced (机器之心) · 2026-03-12 11:00
Core Insights
- Anticipation is growing around the upcoming release of DeepSeek V4, as hinted by a Twitter user, although the authenticity of the information is uncertain [1][2].
- Two new models, "Hunter Alpha" and "Healer Alpha," have been introduced on the OpenRouter platform, sparking speculation about their origins and capabilities [3][4].
Model Specifications
- **DeepSeek V4**: Expected to have approximately 1 trillion parameters, a context window of 320 billion tokens, and a multimodal capability [3].
- **Hunter Alpha**: Features 1 trillion parameters and a context window of 1 million tokens, designed for complex tasks and deep tool usage [4].
- **Healer Alpha**: An omni-modal model with a context window of 262,144 tokens, capable of processing visual and audio inputs and executing complex multi-step tasks [6][7].
Speculation on Origins
- The community is speculating about the origins of the new models, with suggestions that Healer Alpha may be related to DeepSeek V4 or a Xiaomi model, while Hunter Alpha could be linked to several other models including DeepSeek V4, Kimi K3, and Grok 4.2 [14][16][20].
- The models are currently free to use, with providers tracking prompts and outputs for potential model improvements [12].
Market Context
- The models arrive at a time when other companies have already released theirs, leading to speculation that DeepSeek V4 may be the only major release left for the year [18][20].
- Users are closely monitoring the performance and capabilities of these models, particularly how they apply to various workflows [8][12].
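DeepSeek V4 Multimodal Large Model to Be Released, Deeply Adapted to Domestic Chips from Huawei and Cambricon; Musk Confirms SpaceX IPO Target Valuation Above $1.75 Trillion | AI Weekly
Cyzone (创业邦) · 2026-03-08 04:20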
Core Viewpoint - The article surveys significant developments in the AI industry, highlighting the key investments, product launches, and technological advances shaping the current market landscape [5].
Group 1: Major Company Developments
- Elon Musk confirmed SpaceX's IPO target valuation exceeding $1.75 trillion, a significant milestone for the commercial space industry [7].
- DeepSeek plans to launch its new multimodal model V4, optimized for Huawei and Cambricon chips, reinforcing its position in efficient AI computing [10].
- OpenAI released GPT-5.4 and GPT-5.4 Pro, enhancing its capabilities in various professional tasks and tools [11][12].
- OpenAI announced a new investment round at a valuation of $730 billion, raising $110 billion from major investors including SoftBank and NVIDIA [14].
- Ant Group and Tsinghua University released the AReaL v1.0 framework for reinforcement learning, allowing seamless integration with various agent frameworks [10].
Group 2: Investment and Financing Trends
- There were 55 AI financing events globally, with a total financing scale of 784.09 billion RMB, a significant increase from the previous period [34].
- The highest domestic financing amount was 9.068 billion RMB, led by Galaxy General, which completed a 2.5 billion RMB B+ round [43].
- Overseas AI financing totaled 774.572 billion RMB, with OpenAI's 110 billion USD funding being the largest disclosed [44].
Group 3: Market Insights and Predictions
- IDC predicts that the global intelligent robot hardware market will approach 30 billion USD by 2026, with China expected to lead the growth [33].
- OpenAI's annual revenue surpassed 25 billion USD, a 17% increase from the previous year [29].
- Cursor, an AI programming assistant, achieved annual revenue of over 2 billion USD, with enterprise clients contributing 60% of its revenue [29].