steve

Search documents
DeepSeek-R2!?神秘模型惊现竞技场,真实身份引网友猜测
量子位· 2025-07-03 04:26
Core Viewpoint - The article discusses the emergence of a mysterious model named "steve" from DeepSeek, sparking speculation about its identity and performance in comparison to other models like R2 and V4 [1][5][19]. Group 1: Model Identity and Speculation - Users are speculating about the identity of "steve," with suggestions ranging from it being R2, V4, or an upgraded version of an older model [3][19]. - "steve" has been confirmed to be associated with DeepSeek, although further details about its identity remain undisclosed [8][19]. - The model's presence is not visible on the public page, but traces of it can be found in the front-end code [5][6]. Group 2: Performance Comparison - Initial tests show that "steve" has passed certain intelligence tests, but it has also failed some questions [11]. - Comparisons between "steve" and V3 indicate that "steve" produced approximately 300 lines of game code, while V3 generated around 800 lines [13]. - Overall, "steve's" performance is perceived as underwhelming compared to V3 and R1, leading to doubts about it being R2 [22][19]. Group 3: Development and Release Timeline - The anticipated release of R2 has been delayed again, attributed to dissatisfaction from CEO Liang Wenfeng regarding its performance [25]. - The slow progress of R2's development may be linked to a shortage of NVIDIA H20 chips [26]. - Speculation about R2's capabilities includes parameters such as 1.2 trillion parameters and 5.2 petabytes of training data, although these claims remain unverified [32].