Workflow
DeepSeek V4 Lite
icon
Search documents
消息称 DeepSeek V4 模型打破惯例:华为等国内厂商可早期访问,不让英伟达 AMD 先用
Xin Lang Cai Jing· 2026-02-27 10:36
Group 1 - DeepSeek has not shared its upcoming flagship model with US chip manufacturers like Nvidia and AMD, which breaks industry norms [1][4][5] - Instead, DeepSeek V4 has provided early access to domestic suppliers, including Huawei Technologies [1][4] - AI developers typically share pre-release versions of major models with chip manufacturers to ensure software efficiency on widely used hardware [5] Group 2 - DeepSeek is currently testing the V4 Lite model, codenamed "Sealion-lite," which features a context window of 1 million tokens and supports native multimodal reasoning [5] - The latest update from DeepSeek has expanded its knowledge base to May 2025, allowing it to accurately output news from April 2025 in offline mode [2][5]
DeepSeek又一论文上新!新模型V4更近了?
Di Yi Cai Jing· 2026-02-27 07:01
Core Insights - The paper introduces an innovative inference system called DualPath, aimed at optimizing the inference performance of large language models (LLMs) under agent workloads, significantly enhancing efficiency in AI applications [3][4] - The DualPath system improves offline inference throughput by 1.87 times and increases the average number of agent operations per second in online services by 1.96 times [3] Group 1: Technological Advancements - The introduction of a "dual-path reading KV-Cache" mechanism reallocates storage network load, addressing the core issue of speed being hindered by data reading during agent tasks [4] - The shift from traditional human-LLM interaction to human-LLM-environment interaction necessitates a transformation in inference workloads, allowing for multiple rounds of interaction that can accumulate extensive context [3] Group 2: Market Reactions and Expectations - There are mixed opinions within the industry regarding the optimization efforts by DeepSeek, with some viewing it as a necessary response to hardware limitations, while others see value in cost reduction for broader AI adoption [5] - Speculation around the release of DeepSeek's next flagship model, V4, has generated significant market interest, with various timelines being discussed, from early February to March [5][6] - DeepSeek has not publicly commented on the rumors surrounding the V4 model, leading to heightened anticipation and concern among investors about potential market volatility upon its release [6]