CUDA兼容
Search documents
中国推理芯片突围与成本革命:破“内存墙”、兼容CUDA
2 1 Shi Ji Jing Ji Bao Dao· 2026-02-04 09:09
Core Insights - The article discusses the shift in the global AI computing power focus from training to inference, indicating a competitive landscape for cost-effective and energy-efficient chips [1][2] - The consensus in the industry is that inference chips will dominate AI evolution in the next five to ten years, with companies like Google and Nvidia leading the charge [1][3] - CloudWalk Technology has announced its strategic focus on AI inference chips, aiming to significantly reduce the cost of processing tokens, which are becoming a core productivity driver in the AI landscape [2][3] Industry Trends - The demand has shifted from relying on high-performance GPUs to a pressing need for high-cost performance inference chips [2] - The past year has seen a dramatic increase in the computational requirements for large models, with token processing needs growing hundreds of times, highlighting the importance of inference over training [2][3] - Nvidia's strategic acquisition of Groq's core assets for $20 billion reflects the growing importance of inference chips, with Groq's valuation skyrocketing from $7 billion to $20 billion in just four months [3] Company Strategy - CloudWalk Technology's CEO, Chen Ning, emphasizes the goal of reducing the cost of processing one million tokens by 100 times, aiming for a transformative impact on industrial productivity by 2030 [3][4] - The company is developing a new processor architecture, GPNPU, designed to optimize inference for large models while addressing cost, efficiency, and deployment challenges [5][6] - The GPNPU architecture aims to maintain compatibility with existing CUDA programs, lowering the barrier for integration into production systems [5][6] Product Development - CloudWalk Technology plans to launch the DeepVerse 100, 200, and 300 series chips over the next five years, targeting major clients across various industries [6] - The company is focusing on modular chip design through a "power building block" approach, allowing for scalable and flexible computing solutions [6] - The company has established a strong domestic production capacity, ensuring supply chain security for large-scale chip production and delivery [6]