炸锅了!DeepSeek MODEL1 引发全网大猜测,R2 or V4?
程序员的那些事·2026-01-21 04:21

Core Viewpoint - The sudden emergence of a new model named "MODEL1" from DeepSeek has generated significant excitement in the domestic large model sector, indicating potential advancements in AI technology and competition in the market [1][3]. Group 1: MODEL1 Features - MODEL1 has been revealed to include advanced technologies such as optimized KV cache layout and support for FP8 sparse decoding, which are expected to greatly enhance inference efficiency and reduce memory usage [2]. - The model integrates a long-context optimization mechanism, addressing the common issue of large models struggling to retain long texts [2]. Group 2: Speculations and Expectations - There is speculation among the community that MODEL1 could either be the long-awaited R2 model, which has faced delays due to chip shortages, or the upcoming V4 model, following the naming convention after V3.2 [3]. - DeepSeek has not officially responded to these speculations, but there are indications that the new model may be released around the Chinese New Year [3].

炸锅了!DeepSeek MODEL1 引发全网大猜测,R2 or V4? - Reportify