Core Insights - NVIDIA unveiled its new Vera Rubin architecture at CES 2026, boasting a 5x increase in inference performance and a 3.5x increase in training performance compared to the previous Blackwell architecture, while reducing costs by 90% [1][3][8] - The Rubin platform is designed to address the urgent demand for AI computing power, with large-scale production set to begin in the second half of 2026 [3][10][47] Group 1: Vera Rubin Architecture - The Vera Rubin architecture integrates CPU, GPU, networking, storage, and security into a cohesive system, moving away from merely stacking GPUs to creating a unified AI supercomputer [13] - Key components of the Rubin platform include the Vera CPU, Rubin GPU, NVLink 6, ConnectX-9 SuperNIC, BlueField-4 DPU, and Spectrum-6 Ethernet, all designed to enhance AI performance [14][16] - The Rubin GPU achieves 50 PFLOPS of NVFP4 computing power, significantly outperforming the Blackwell GPU [16][27] Group 2: Performance Enhancements - The Rubin architecture's training speed reaches 35 petaflops, while inference tasks can achieve up to 50 petaflops, marking a substantial improvement over Blackwell [27][28] - The architecture's HBM4 memory bandwidth has increased to 22 TB/s, and the NVLink interconnect bandwidth has doubled to 3.6 TB/s, facilitating efficient multi-GPU training [27][29] - The platform reduces the number of GPUs needed for training MoE models by 75%, leading to significant energy savings [28][32] Group 3: AI Applications and Innovations - NVIDIA introduced AlphaMayo, an end-to-end autonomous driving AI capable of reasoning and decision-making without human intervention [49][55] - The company is also launching a comprehensive open-source suite for physical AI, which includes models and frameworks for various applications, including robotics [62][64] - The new DGX SuperPOD, featuring multiple Rubin NVL72 racks, can handle thousands of AI agents and millions of tokens, providing a robust AI infrastructure [41][39] Group 4: Market Impact and Future Outlook - Major cloud providers like AWS, Microsoft Azure, and Google Cloud are expected to be the first to deploy the Rubin architecture, with widespread commercial use anticipated by late 2026 [47] - The advancements in AI infrastructure are expected to drive a significant increase in investment in AI, with estimates of $3 to $4 trillion over the next five years [8] - NVIDIA's innovations are set to redefine the AI landscape, making high-performance computing more accessible and affordable, akin to electricity [8][71]
今夜无显卡,老黄引爆Rubin时代,6颗芯狂飙5倍算力