中国AI芯片在推理赛道寻突破
Zhong Guo Jing Ying Bao·2025-11-25 14:36

Core Insights - The demand for AI computing power is shifting from training to inference, with inference expected to become the main driver of AI computing growth starting in 2025 [1][4] - Domestic AI chip companies are focusing on differentiation in the inference market, particularly in video generation, edge computing, and industry applications, despite the dominance of NVIDIA and AMD in the general AI computing market [1][3] Group 1: Industry Challenges - Chinese AI chip industry faces challenges due to geopolitical factors, with limitations in advanced processes, high bandwidth memory (HBM), packaging technology, and design tools [2] - Current domestic AI chips primarily use 12nm and 7nm processes, while North America is advancing towards 2nm, resulting in domestic chips having only about 30% of the computing power of their North American counterparts [2] Group 2: Technological Innovations - Domestic industry is innovating through technological pathways, such as computing power networking and super-node architecture, achieving overall computing power that is 2.1 times that of similar North American systems with 384 card deployments [2] - The shift towards inference chips is seen as a strategic opportunity for Chinese chip companies, as the demand for inference computing is experiencing explosive growth [4][5] Group 3: Market Dynamics - The ratio of computing power demand between training and inference is expected to reverse from 6:4 to favor inference by 2025, indicating a significant market shift [4] - The complexity of intelligent AI tasks requires higher performance, energy efficiency, and compatibility from inference chips, as they will need to handle more tokens and multiple model calls compared to traditional methods [4] Group 4: Future Directions - The focus for domestic AI chip companies is shifting from merely being available to being effective and cost-efficient, which is crucial for breaking through in the inference market [5] - The market for inference chips emphasizes scenario adaptability, low power consumption, and cost control, aligning with the strengths of Chinese chip companies in specific fields [5]