国产推理芯片
Search documents
英伟达的“神秘芯片”背后:推理时代开启“四大算力新趋势”
Hua Er Jie Jian Wen· 2026-03-01 13:53
Core Insights - Nvidia is shifting the AI computing competition focus from training to inference, with plans to unveil a new inference chip integrated with Groq's LPU technology at the upcoming GTC developer conference [1] - OpenAI has agreed to become a major customer for Nvidia's new processor, indicating a strong demand for dedicated inference capacity [1] - The report from Shenwan Hongyuan highlights four key trends in inference computing: increased deployment of pure CPU scenarios, the rise of specialized architectures like LPU, accelerated breakthroughs in domestic computing chips, and a shift in demand structure towards mass token consumption [2] Inference Demand Explosion - The demand for inference has surged, driven by the monetization of large models and the rapid deployment of agents in real-world applications, requiring substantial inference computing power [3] - Data shows a significant increase in inference volume during the Chinese New Year, with major models reaching record token consumption [3] LPU's Emergence - Nvidia's acquisition of Groq's core technology for $20 billion signifies the growing importance of pure inference chips, with LPU architecture offering efficiency advantages in inference scenarios [6] - The future AI chip landscape is expected to differentiate between training and inference, with training continuing to use GPU-HBM combinations while inference evolves towards ASIC+LPU-SRAM+SSD configurations [6] System-Level Innovations - The upgrade in inference computing also involves a shift from single chips to system-level innovations, with a three-layer network architecture emerging to meet the demands of low latency and high throughput [7] - Nvidia is expanding its collaboration with Meta Platforms to support large-scale pure CPU deployments, moving beyond a single GPU sales model [7] Domestic Chip Breakthroughs - Domestic inference chips are experiencing significant technological upgrades, with new designs supporting low-precision data formats and enhanced interconnect bandwidth [9] - The supply chain for domestic chips is also improving, as evidenced by the rapid growth in revenue from high-performance computing chip packaging services [9]
游族网络与国产GPU厂商曦望达成战略合作
Xin Lang Cai Jing· 2026-01-28 10:20
Core Insights - Recently, Youzu Interactive has formed a strategic partnership with domestic GPU manufacturer Sunrise to collaborate on digital economy computing power synergy [1] - Sunrise, a fully self-developed AI computing power chip company, was previously the chip division of SenseTime and will operate independently by the end of 2024 [1] - Youzu Interactive is set to invest in Sunrise in 2025, with plans to explore the integration of domestic inference chips into game research and operation processes [1]