Core Insights - The company is among the first globally to propose and commercialize NPU-driven AI inference chips, having completed four generations of NPU development and commercialization [1] - The upcoming Nova 500 series will upgrade the GPNPU architecture, enhancing compatibility, performance, and energy efficiency for AI inference applications [1] - The IPU-X6000 accelerator card, set to launch in 2024, is already in development with multiple clients, aiming to integrate AI inference capabilities into broader enterprise digital processes [1] Industry Trends - Inference heterogeneity has become an industry trend, prompting the company to develop the fifth generation GPNPU architecture, which combines GPU versatility with NPU energy efficiency [2] - The core innovation focuses on "computing power building blocks" design and 3D stacked storage, aiming to enhance capital and operational expenditure token conversion rates [2] - The goal is to provide core computing power support for large model applications and composite intelligent agent deployments, achieving "extreme cost-effectiveness for millions of tokens" [2]
云天励飞:目前在研Nova 500系列将全面升级GPNPU架构