Rubin Ultra NVL576

Search documents
AI算力“卖水人”专题系列(7):从Blackwell到Rubin:计算、网络、存储持续升级
Guohai Securities· 2025-09-17 11:02
Investment Rating - The report maintains a "Buy" rating for the computer industry [1] Core Insights - The demand for AI computing power is expected to grow significantly, driven by advancements in large model training and the introduction of new architectures like GB300 and Vera Rubin [11] - NVIDIA's revenue for FY2026 Q2 reached $46.7 billion, a year-on-year increase of 56%, indicating strong market demand for AI computing solutions [5][59] - The report highlights the performance improvements of NVIDIA's new GPU architectures, with the GB300 achieving a 1.5x increase in FP4 computing power compared to its predecessor [30] Summary by Sections Section 1: GPU Core - The GB300 GPU, based on the Blackwell Ultra architecture, utilizes TSMC's 4NP process and features a floating-point performance of 15 PFLOPS, which is 1.5 times that of the B200 [5][26] - The Rubin Ultra NVL576 is expected to launch in 2027, offering significant performance enhancements over the GB300 NVL72 [11][31] Section 2: Server Details - The GB300 NVL72 system consists of 18 compute trays and 9 switch trays, integrating 72 Blackwell Ultra GPUs and 36 Grace CPUs, with potential performance improvements of up to 50 times compared to previous architectures [6][80] - The report discusses the transition from HGX to MGX server designs, allowing for more efficient AI and HPC applications [67] Section 3: Networking - The introduction of CPO technology is set to replace traditional pluggable optical modules, enhancing energy efficiency by 3.5 times and deployment speed by 1.3 times [7] - The Rubin architecture will utilize NVLink 6.0 technology, doubling the speed to 3.6 TB/s, facilitating high-speed interconnects for AI applications [7] Section 4: HBM - HBM4 is expected to achieve mass production in 2026, with SK Hynix leading the market, and collaborations with major clients like NVIDIA and Microsoft [8] Section 5: Liquid Cooling - The GB300 NVL72 employs a full liquid cooling solution, enhancing thermal efficiency and operational cost-effectiveness [9] Section 6: Investment Recommendations and Related Companies - The report identifies potential beneficiaries in the AI computing supply chain, including companies involved in AI chips, server systems, HBM, and cooling technologies [12]
【招商电子】英伟达GTC 2025跟踪报告:2028年全球万亿美金Capex可期,关注CPO、正交背板等新技术趋势
招商电子· 2025-03-20 02:51
Core Insights - The event highlighted the transformative shift in data centers towards AI-driven computing, with projected capital expenditures exceeding $1 trillion by 2028 for data center construction, primarily focused on accelerated computing chips [2][12][13] - NVIDIA's Blackwell architecture is fully operational, showcasing significant performance improvements and a roadmap for future products like Rubin and Feynman, which promise substantial enhancements in computational power and efficiency [3][42][45] - The introduction of the Quantum-X CPO switch and Spectrum-X technology aims to revolutionize networking capabilities, reducing energy consumption and increasing deployment efficiency [5][46] - The advancements in AI applications, particularly in autonomous driving and robotics, are supported by NVIDIA's new systems and frameworks, enhancing the development and training processes [6][26][24] Capital Expenditure and AI Infrastructure - Data center capital expenditures are expected to reach $1 trillion by 2028, with a significant portion allocated to accelerated computing chips [2][12] - NVIDIA plans to deliver 1.3 million Hopper GPUs to major cloud service providers in 2024, with an increase to 3.6 million Blackwell GPUs in 2025 [2][3] AI Model Training and Inference - The demand for computational power for AI training and inference has surged, with estimates suggesting a 100-fold increase in required computing resources compared to the previous year [10][11] - NVIDIA outlines three levels of AI: Generative AI, Agentic AI, and Physical AI, each representing a different stage of AI development and application [8][10] Product Development and Future Roadmap - Blackwell has been fully launched, with significant customer demand and performance improvements, including a 40-fold increase in inference performance compared to previous models [3][42] - Future products like Vera Rubin and Rubin Ultra are set to enhance computational capabilities further, with expected performance increases of up to 15 times [45][42] Networking Innovations - The Quantum-X CPO switch is anticipated to launch in late 2025, offering substantial energy savings and improved network efficiency [5][46] - Spectrum-X technology will provide high bandwidth and low latency, integrating seamlessly into NVIDIA's computing architecture [5][46] AI Applications in Autonomous Driving and Robotics - NVIDIA's Halos system aims to enhance safety in autonomous vehicles, while the open-source Isaac Groot N1 model supports robotics development [6][24] - The integration of Omniverse and Cosmos platforms accelerates the development of AI for autonomous driving, enabling end-to-end training capabilities [26][24] Data Center Evolution - The transition of data centers into AI factories is underway, focusing on processing, analyzing, and generating AI-driven applications [12][13] - NVIDIA's Dynamo operating system is designed to optimize AI factory operations, enhancing efficiency and performance [35][36]
英伟达(NVDA):发布GB300、Rubin,软件持续迭代
SINOLINK SECURITIES· 2025-03-19 07:54
Investment Rating - The report maintains a "Buy" rating for the company, indicating an expected price increase of over 15% in the next 6-12 months [4]. Core Insights - The company is expected to benefit as a leading AI chip manufacturer due to rapid hardware iteration and a rich software ecosystem, which enhances its competitive edge against rivals [4]. - The demand for AI computing power is anticipated to remain strong, driven by the complexity of models in the inference stage, which require significantly more computing resources compared to earlier generative AI models [2][4]. Summary by Sections Performance Review - The company held the GTC 2025 event on March 18, 2025, showcasing future product launches including GB300 (GB Ultra), Vera Rubin, Rubin Ultra GPUs, and CPO switches for Infiniband and Ethernet [1]. Operational Analysis - The transition from simple generative AI to assistant AI is expected to sustain demand for computing power, with inference requiring 100 times more tokens than before. The company anticipates strong customer demand, with major cloud providers expected to purchase 3.6 million Blackwell GPU dies in 2024 [2]. - Upcoming product releases include GB300 NVL72 in H2 2025, which will feature 288GB HBM3e memory and 1.5 times the computing power of GB200 NVL72. Vera Rubin NVL144 is expected in H2 2026, offering 3.3 times the computing power of GB300 NVL72, and Rubin Ultra NVL576 is projected for H2 2027, with 14 times the computing power of GB300 NVL72 [2]. Software Ecosystem - The company continues to enhance its software ecosystem, launching libraries tailored for various industries, such as cuLitho for lithography and CUDA-Q for quantum computing. Additionally, the introduction of the Dynamo system aims to improve GPU efficiency by assisting with prefill and decode tasks [3]. Profit Forecast and Valuation - The company forecasts net profits of $122.2 billion, $156.9 billion, and $177.9 billion for FY26, FY27, and FY28, respectively, with corresponding P/E ratios of 23, 18, and 16 [4][6].
英伟达发布新一代AI芯片Rubin,预计2026年下半年推出
2 1 Shi Ji Jing Ji Bao Dao· 2025-03-19 04:19
Core Insights - Nvidia announced the upcoming release of its next-generation AI chip, Rubin, expected to ship in the second half of 2026 [3] - The Rubin platform will utilize NVLink 144 technology, promising a performance increase of 100% compared to its predecessor [3] - Rubin will achieve a processing speed of 50 petaflops during inference, significantly surpassing the current Blackwell chip's 20 petaflops [3] - The chip will support up to 288 GB of fast memory [3] - Following Rubin, Nvidia plans to release Rubin Ultra NVL576 in the second half of 2027, which is projected to be 14 times more powerful than the GB 300 NVL72 [3]