AI工厂

Search documents
NVIDIA GTC 2025:GPU、Tokens、合作关系
Counterpoint Research· 2025-04-03 02:59
Core Viewpoint - The article discusses NVIDIA's advancements in AI technology, emphasizing the importance of tokens in the AI economy and the need for extensive computational resources to support complex AI models [1][2]. Group 1: Chip Developments - NVIDIA has introduced the "Blackwell Super AI Factory" platform GB300 NVL72, which offers 1.5 times the AI performance compared to the previous GB200 NVL72 [6]. - The new "Vera" CPU features 88 custom cores based on Arm architecture, delivering double the performance of the "Grace" CPU while consuming only 50W [6]. - The "Rubin" and "Rubin Ultra" GPUs will achieve performance levels of 50 petaFLOPS and 100 petaFLOPS, respectively, with releases scheduled for the second half of 2026 and 2027 [6]. Group 2: System Innovations - The DGX SuperPOD infrastructure, powered by 36 "Grace" CPUs and 72 "Blackwell" GPUs, boasts AI performance 70 times higher than the "Hopper" system [10]. - The system utilizes the fifth-generation NVLink technology and can scale to thousands of NVIDIA GB super chips, enhancing its computational capabilities [10]. Group 3: Software Solutions - NVIDIA's software stack, including Dynamo, is crucial for managing AI workloads efficiently and enhancing programmability [12][19]. - The Dynamo framework supports multi-GPU scheduling and optimizes inference processes, potentially increasing token generation capabilities by over 30 times for specific models [19]. Group 4: AI Applications and Platforms - NVIDIA's "Halos" platform integrates safety systems for autonomous vehicles, appealing to major automotive manufacturers and suppliers [20]. - The Aerial platform aims to develop a native AI-driven 6G technology stack, collaborating with industry players to enhance wireless access networks [21]. Group 5: Market Position and Future Outlook - NVIDIA's CUDA-X has become the default programming language for AI applications, with over one million developers utilizing it [23]. - The company's advancements in synthetic data generation and customizable humanoid robot models are expected to drive new industry growth and applications [25].
黄仁勋年度演讲来了,Scaling Law失效只是假象,推理需求暴涨100倍,AI模型优化迎来新挑战|GTC 2025
AI科技大本营· 2025-03-19 01:49
作者 | 王启隆 出品 | CSDN(ID:CSDNnews) 北京时间 3 月 19 日凌晨,NVIDIA GTC 2025 的主会开场演讲来了! 在黄仁勋的这场演讲前,英伟达股票还是 119.53 美元 。刷推的时候又发现,马斯克的 Grok AI 都 在和网友们吐槽英伟达今年开年不济,相当艰难,需要一场演讲拯救股市,振奋投资者。还有些直 播,直接开了个股市页面实时盯着 NVDA 涨涨停停,画面相当喜感。 两小时的演讲结束后,股价居然还跌了将近 3%…… 今年的演讲主题是「 AI 工厂 」。 英伟达创始人兼 CEO 黄仁勋身穿标志性的皮衣,潇洒上台。 下面先简单总结演讲的内容有哪些(正好黄仁勋自己在最后强调了一遍本次主会的 五大亮点 ),后 文我们再来个 "事无巨细"的 全面回顾 ,带大家云体验一遍全程。 Blackwell 全面投入生产 第一代 Blackwell 芯片还没热乎,英伟达就推出了下一代 Blackwell Ultra,旨在 提升训练和扩展 推理能力。主会上展示了两个版本: 顺带一提,看外媒的现场返图,英伟达这次在 GTC 大会会馆前 摆了个摊卖煎饼 ,黄仁勋 亲自上阵 边吃边卖, 里面穿着 ...