Workflow
英伟达祭出下一代GPU,狂飙百万token巨兽,投1亿爆赚50亿
NvidiaNvidia(US:NVDA) 3 6 Ke·2025-09-11 02:45

Core Insights - NVIDIA has launched the Rubin CPX, a new CUDA GPU designed for massive context AI, marking the entry into the "million-token era" for large model inference [1][3] - The Rubin CPX is expected to significantly enhance AI computing capabilities, creating a new category of processors [4][12] Performance Metrics - The Rubin CPX offers over twice the performance of the Vera Rubin NVL144 platform and 7.5 times that of the Blackwell Ultra-based GB300 NVL72 system [3] - It features 8 EFLOPS of NVFP4 computing power, 100TB of high-speed memory, and 1.7 PB/s memory bandwidth, along with 128GB of GDDR7 memory [3][16] - The attention mechanism processing capability is three times greater than that of the NVIDIA GB300 NVL72 system [19] Economic Impact - The Rubin CPX can generate a return on investment (ROI) of 30-50 times, effectively rewriting the economics of inference [5][12] - For every $100 million invested, it can potentially yield up to $5 billion in token revenue [3] Technological Advancements - The Rubin CPX is designed to address the "long context" bottleneck in AI, enabling inference across millions of knowledge tokens simultaneously [3][4] - It supports multi-step inference, persistent memory, and long-term context, making it suitable for complex tasks in software development, video generation, and deep research [4][12] Infrastructure and Ecosystem - The Rubin CPX is part of the NVIDIA Vera Rubin NVL144 platform, which integrates with NVIDIA Vera CPUs and Rubin GPUs for a complete high-performance inference solution [15][22] - The platform is expected to be available by the end of 2026, unlocking new capabilities for developers and redefining the construction of next-generation generative AI applications [22][24]