Core Insights
- NVIDIA CEO Jensen Huang used his CES 2026 keynote to highlight how next-generation accelerated computing and AI are transforming industries [2]
- Demand for AI training and inference compute is surging. The Rubin architecture has entered full-scale production and is expected to launch in the second half of 2026, offering up to a 10x reduction in token costs compared with the previous Blackwell generation [2][4][5]

NVIDIA Rubin Platform
- The NVIDIA Rubin platform comprises six new chips built through extreme co-design, significantly reducing training times and inference token costs [4]
- The six chips are the NVIDIA Vera CPU, NVIDIA Rubin GPU, NVIDIA NVLink 6 Switch, NVIDIA ConnectX-9 SuperNIC, NVIDIA BlueField-4 DPU, and NVIDIA Spectrum-6 Ethernet Switch [4]
- Innovations in the Rubin platform include the latest generation of NVIDIA NVLink interconnect technology, a Transformer Engine, confidential computing, and a RAS engine [4]

AI Model Advancements
- The Rubin platform accelerates agentic AI, advanced reasoning, and large-scale mixture-of-experts (MoE) model inference, and cuts the number of GPUs needed to train MoE models by a factor of four compared with the previous generation [5]
- The platform introduces a new generation of AI-native storage architecture designed for gigascale inference context, improving responsiveness and throughput [5]

Market Deployment and Partnerships
- NVIDIA Rubin products will be available through partners such as AWS, Google Cloud, Microsoft, and others in the second half of 2026 [5]
- CoreWeave will work with NVIDIA to take advantage of Rubin's advances in inference and MoE models, while major server manufacturers including Cisco, Dell, HPE, Lenovo, and Supermicro are expected to launch Rubin-based servers [6]

Physical AI and Open Source Models
- Huang announced the arrival of "physical AI's ChatGPT moment," with machines beginning to understand and act on real-world data [12][13]
- NVIDIA introduced Cosmos, an open-source physical AI foundation model pre-trained on vast datasets to understand how the world works [13]
- The Alpamayo series of open-source AI models aims to accelerate the development of safe, reasoning-based autonomous vehicles and has drawn interest from industry leaders [14]

Robotics and Ecosystem Development
- Global robotics leaders are developing products based on NVIDIA's Isaac platform and GR00T foundation model, with applications ranging from industrial to consumer robotics [15]
- NVIDIA emphasizes the importance of building an open-source AI ecosystem, pointing to models such as DeepSeek R1 as examples of rapid industry adoption and collaboration [15]

Industry Implications
- The introduction of the Vera Rubin platform is expected to drive demand for high-speed optical modules and co-packaged optics (CPO) technology, and companies in the supply chain are already preparing for this shift [9][10]
- The Rubin GPU's higher power draw, estimated at around 1,800 watts, will place greater demands on power supply and cooling systems [10]
Jensen Huang Pitches Rubin: Which A-Share Stocks Stand to Benefit?