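DeepSeek-V3.1's adaptation to next-generation domestic chips ignites the market: which domestic chips is the large model going "independent and controllable" with this time?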
36Kr·2025-09-01 11:37

Core Insights
- DeepSeek officially launched DeepSeek-V3.1 on August 21, featuring a hybrid reasoning architecture, improved thinking efficiency, and enhanced agent capabilities [1]
- The release sparked significant market activity, with FP8 concept stocks surging, including companies like Cambricon, Hezhong Technology, and Jiadu Technology [1]

Group 1: DeepSeek-V3.1 Features
- The hybrid reasoning architecture allows the model to support both thinking and non-thinking modes [1]
- DeepSeek-V3.1-Think demonstrates higher efficiency, providing answers in a shorter time compared to its predecessor, DeepSeek-R1-0528 [1]
- Enhanced agent capabilities are achieved through post-training optimization, improving performance in tool usage and agent tasks [1]

Group 2: FP8 and UE8M0 FP8
- FP8, or Floating-Point 8, is a format that uses 8 bits to balance range and precision, with the introduction of UE8M0 FP8 specifically designed for upcoming domestic chips [4][8]
- UE8M0 FP8 prioritizes dynamic range while sacrificing some precision, making it suitable for stable training on non-NVIDIA architectures [22]
- The shift to FP8 is driven by the need for lower precision formats to reduce memory usage and improve computational speed, especially in AI applications [9][15]

Group 3: Market Impact and Collaboration
- The announcement of DeepSeek-V3.1 and its FP8 capabilities led to a surge in interest from domestic chip manufacturers, indicating a collaborative effort between model developers and chip manufacturers [17][22]
- The compatibility of UE8M0 FP8 with domestic chips is seen as a strategic move to enhance the stability and efficiency of AI model training in the context of export restrictions on NVIDIA technology [22]
- The collaboration aims to establish a robust FP8 ecosystem within China, facilitating the development of AI infrastructure independent of foreign technology [22][23]
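
To make the range-versus-precision trade-off described under Group 2 concrete, here is a minimal Python sketch of a UE8M0-style code. It rests on assumptions that the article does not state: the name is read as "unsigned, 8 exponent bits, 0 mantissa bits", and the bias of 127 and the `0xFF` NaN code are borrowed from the E8M0 scale type in the OCP Microscaling Formats specification. The function names `encode_ue8m0` and `decode_ue8m0` are hypothetical, and this is an illustration only, not DeepSeek's or any chip vendor's implementation.

```python
import math

E8M0_BIAS = 127   # assumption: bias taken from the OCP Microscaling E8M0 scale type
E8M0_NAN = 0xFF   # assumption: the all-ones code is reserved for NaN


def encode_ue8m0(x: float) -> int:
    """Map a positive finite float to the 8-bit code of the nearest power of two.

    "Nearest" is measured in log2 space; out-of-range exponents are clamped.
    """
    if math.isnan(x):
        return E8M0_NAN
    if x <= 0 or math.isinf(x):
        # No sign bit, no zero, and no infinity under the assumed layout.
        raise ValueError("value is not representable in UE8M0")
    exponent = round(math.log2(x))
    exponent = max(-E8M0_BIAS, min(E8M0_BIAS, exponent))  # clamp to [-127, 127]
    return exponent + E8M0_BIAS


def decode_ue8m0(code: int) -> float:
    """Turn an 8-bit UE8M0 code back into a float (always an exact power of two)."""
    if not 0 <= code <= 0xFF:
        raise ValueError("code must fit in 8 bits")
    if code == E8M0_NAN:
        return math.nan
    return math.ldexp(1.0, code - E8M0_BIAS)  # 1.0 * 2**(code - bias)


if __name__ == "__main__":
    for value in (0.25, 1.0, 3.0, 1000.0):
        code = encode_ue8m0(value)
        print(f"{value:>8} -> code {code:3d} -> {decode_ue8m0(code)}")
```

Under these assumptions all 8 bits go to the exponent, so the representable values span roughly 2^-127 to 2^127, but every value snaps to a power of two (3.0 decodes as 4.0). That is the "wide dynamic range, reduced precision" behaviour the summary attributes to UE8M0 FP8.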