Core Viewpoint
- Nvidia is facing competition from Google's TPU and Amazon's Trainium, prompting the company to reinforce its market position through a series of technical validations and public responses, including the claim that its GPU technology is "a generation ahead" of the industry [2][5].

Group 1: GB200 NVL72 Technology Advantages
- The GB200 NVL72 system can boost the performance of leading open-source AI models by up to 10x, addressing the scalability challenges that Mixture of Experts (MoE) models face in production environments [2][9].
- The system integrates 72 NVIDIA Blackwell GPUs, delivering 1.4 exaflops of AI performance and 30 TB of fast shared memory, with 130 TB/s of internal GPU-to-GPU communication bandwidth [9].
- Top-performing open-source models such as Kimi K2 Thinking and DeepSeek-R1 have shown significant performance gains when deployed on the GB200 NVL72 system [9][10].

Group 2: Market Concerns and Client Dynamics
- Nvidia's recent technical assertions are widely read as a direct response to market concerns, particularly reports that key client Meta is considering Google's TPU for large-scale data center use, which could threaten Nvidia's dominant market share [5].
- Despite Nvidia's efforts to address these concerns, the company's stock price has declined nearly 10% over the past month [6].

Group 3: Cloud Service Provider Deployment
- The GB200 NVL72 system is being deployed by major cloud service providers and Nvidia's cloud partners, including Amazon Web Services, Google Cloud, and Microsoft Azure [12].
- CoreWeave and Fireworks AI have highlighted the efficiency and performance benchmarks the GB200 NVL72 system sets for MoE model serving [12].
Taking on TPU and Trainium? Nvidia publishes another "self-validation": GB200 NVL72 can boost open-source AI model performance by up to 10x