老黄All in物理AI!最新GPU性能5倍提升,还砸掉了智驾门槛
NvidiaNvidia(US:NVDA) 量子位·2026-01-06 01:01

Core Viewpoint - NVIDIA is shifting its focus entirely towards AI, as evidenced by its absence of gaming graphics cards at CES 2026 and the introduction of new AI products and architectures [2][10]. Group 1: AI Product Launches - NVIDIA unveiled the next-generation Rubin architecture GPU, which boasts inference and training performance that are 5 times and 3.5 times better than the Blackwell GB200, respectively [4][17]. - The company introduced five new product families targeting various AI applications, including the NVIDIA Nemotron for Agentic AI, NVIDIA Cosmos for physical AI, and NVIDIA Alpamayo for autonomous driving [6][8][39]. - The Vera Rubin NVL72 architecture was officially launched, featuring six core components designed to enhance AI data center capabilities [14][15]. Group 2: Performance Metrics - The Rubin GPU achieves an inference performance of 50 PFLOPS and a training performance of 35 PFLOPS under the NVFP4 data type, significantly surpassing its predecessor [17]. - Each Rubin GPU is equipped with 288GB of HBM4 memory and offers a bandwidth of 22 TB/s, supporting the high computational demands of modern AI models [18]. - The overall architecture of the Vera Rubin NVL72 can deliver 3.6 exaFLOPS of NVFP4 inference performance and 2.5 exaFLOPS of training performance [37]. Group 3: Networking and Connectivity - The introduction of NVLink 6 enhances interconnect bandwidth to 3.6 TB/s per GPU, with a total bandwidth of 260 TB/s across the entire NVL72 rack [20][21]. - The Vera CPU integrates 88 custom Arm cores and features a bandwidth of 1.8 TB/s for NVLink C2C interconnect, facilitating efficient communication between CPU and GPU [22]. Group 4: AI Model Developments - The Alpamayo model, a large-scale open-source visual-language-action model for autonomous driving, was launched with 10 billion parameters [41]. - The Nemotron series expanded to include specialized models for speech recognition, visual-language processing, and safety, enhancing AI applications across various sectors [49][51]. - The Cosmos model for robotics was upgraded to generate synthetic data that adheres to real-world physical laws, aiding in the development of AI agents [54][58]. Group 5: Industry Impact and Future Outlook - NVIDIA's comprehensive approach to AI, integrating models, data, and tools, is expected to strengthen its competitive edge and ecosystem lock-in [10]. - The company plans to begin mass production of the Vera Rubin NVL72 in the second half of 2026, indicating a strong commitment to advancing AI infrastructure [38].