Core Insights - NVIDIA's CEO Jensen Huang announced that the company will unveil "world's first" new chip products at the upcoming GTC conference, sparking significant market interest in NVIDIA's next-generation product roadmap [1] - The GTC keynote will take place on March 15 in San Jose, California, focusing on the next phase of the AI infrastructure race [1] Potential New Products - The new products are speculated to fall into two main categories: 1. Derivative chips from the Rubin series, such as the previously leaked Rubin CPX, following the recent launch of the Vera Rubin AI series, which includes six chips now in full production [2] 2. The potentially revolutionary Feynman architecture chip, which may utilize broader SRAM integration and possibly 3D stacking technology for Language Processing Units (LPU), although this has not been officially confirmed [2] Market Demand and Product Evolution - NVIDIA is responding to changing computational demands, with a shift from pre-training to inference capabilities becoming central, as indicated by the introduction of Grace Blackwell Ultra and Vera Rubin [3] - The Feynman architecture is expected to be deeply optimized for inference scenarios, addressing performance bottlenecks related to latency and memory bandwidth, which will significantly impact cloud service providers and enterprise customers reliant on AI inference capabilities [3] - Huang emphasized the importance of broader partnerships and investment strategies, indicating NVIDIA's transition from a chip supplier to an AI ecosystem builder, aiming to maintain a leading position in the AI infrastructure competition through acquisitions and collaborations [3]
黄仁勋预告“前所未见”的芯片新品,下一代Feynman架构或成焦点