Workflow
DeepSeek-R2为什么还没发?
量子位·2025-06-27 08:09

Core Viewpoint - The release of DeepSeek-R2 has been delayed due to CEO Liang Wenfeng's dissatisfaction with its performance and a shortage of Nvidia H20 chips, which are critical for its development [1][2][4]. Development Timeline - The anticipation for R2 began after the release of the DeepSeek-V3 model in December last year, which was considered a benchmark for cost-performance [5]. - An upgrade to V3 was announced in March 2023, leading to speculation that R2 would be released in April [11]. - Despite the release of a paper on scaling laws in early April, there has been no official update on R2 since then [12][16]. Technical Specifications - R1's training utilized 30,000 H20 chips, 10,000 H800 chips, and 10,000 H100 chips, indicating the significant computational resources required for R2 [3]. - Leaked parameters for R2 suggested it would have 1.2 trillion parameters and utilize 5.2 petabytes of training data, although the authenticity of these claims remains uncertain [17]. Community Reactions - Following the news of the delay, community responses varied, with some expressing belief that the delay is justified, while others speculated that R2 might wait for the release of V4 [26][30].