人工智能推理

Search documents
OpenAI研究负责人诺姆·布朗:基准测试比数字大小毫无意义,未来靠token成本衡量模型智能|GTC 2025
AI科技大本营· 2025-03-24 08:39
责编 | 王启隆 出品丨AI 科技大本营(ID:rgznai100) 今年英伟达大会(GTC 2025)邀请到了 OpenAI 的人工智能推理研究负责人、OpenAI o1 作者 诺姆·布朗(Noam Brown) 参与圆桌对话。 他先是带着大家回顾了自己早期发明"德扑 AI"的工作,当时很多实验室都在研究玩游戏的 AI,但大家都觉得摩尔定律或者扩展法则(Scaling Law)这 些算力条件才是突破关键。诺姆则在最后才顿悟发现,范式的更改才是真正的答案:" 如果人们当时就找到了正确的方法和算法,那多人扑克 AI 会提前 20 年实现 。 " 究其根本原因,其实还是很多研究方向曾经被忽视了。" 在项目开始前,没有人意识到 推理计算会带来这么大的差异。 " 毕竟,试错的代价是非常惨痛的,诺姆·布朗用一句很富有哲思的话总结了直到现在都适用的一大问题:" 探索全新的研究范式,通常不需要大量的计算 资源。但是,要大规模地验证这些新范式,肯定需要大量的计算投入。 " 左为英伟达专家布莱恩·卡坦扎罗,中为诺姆·布朗,右为主持人瓦尔蒂卡 在和英伟达专家的对话过程中,诺姆还对自己加入 OpenAI 之前、成为" 德扑 AI ...
不止芯片!英伟达,重磅发布!现场人山人海,黄仁勋最新发声
21世纪经济报道· 2025-03-19 03:45
Core Viewpoint - The article highlights NVIDIA's GTC 2025 event, emphasizing the shift in AI focus from training to inference, showcasing new hardware and software innovations aimed at enhancing AI capabilities and applications [1][3][30]. Group 1: Key Innovations and Products - NVIDIA introduced the Blackwell Ultra GPU series and the next-generation architecture Rubin, with plans for the Vera Rubin NLV144 platform to launch in the second half of 2026 and Rubin Ultra NV576 in the second half of 2027 [5][10]. - The Blackwell Ultra architecture significantly enhances AI performance, achieving a 1.5x improvement in AI performance compared to the previous generation, and offers a 50x increase in revenue opportunities for AI factories [8][10]. - The new CPO switch technology aims to reduce data center power consumption by 40MW and improve network transmission efficiency, laying the groundwork for future large-scale AI data centers [13][14]. Group 2: AI Inference and Software Upgrades - NVIDIA's new AI inference service software, Dynamo, is designed to maximize token revenue in AI models, achieving a 40x performance improvement over the previous Hopper generation [19][21]. - The introduction of AI agents and the Ll ama Nemo tr o n series models aims to facilitate complex inference tasks, enhancing capabilities in various applications such as automated customer service and scientific research [20][30]. Group 3: Robotics and Physical AI - NVIDIA launched the GROOT N1, the world's first open-source humanoid robot model, designed for various tasks such as material handling and packaging, indicating a significant step towards the commercialization of humanoid robots [25][30]. - The company also introduced new desktop AI supercomputers, DGX Spark and DGX Station, aimed at providing high-performance AI computing capabilities for researchers and developers [23][24]. Group 4: Market Sentiment and Future Outlook - Despite the significant technological advancements presented at GTC 2025, NVIDIA's stock price fell by 3.43% post-event, reflecting ongoing market concerns regarding AI spending and competition [28][29]. - Analysts suggest that while there are concerns about AI capital expenditure growth in 2026, the overall sentiment may improve due to the innovations showcased at the event [29][30].
速递|与微软再对弈,OpenAI向CoreWeave注资120亿美元
Z Potentials· 2025-03-11 03:27
Core Viewpoints - OpenAI has signed a five-year agreement worth $11.9 billion with CoreWeave, which includes a $350 million equity stake in CoreWeave, separate from its planned IPO [1][2] - CoreWeave's revenue is heavily reliant on Microsoft, which accounted for 62% of its income in 2024, growing to $1.9 billion from $228.9 million in 2023, an increase of nearly eight times [2] - The partnership with OpenAI is expected to alleviate investor concerns regarding CoreWeave's dependency on a single client, potentially boosting its IPO prospects [2] Company Dynamics - CoreWeave, initially a cryptocurrency mining company, has significant debt of $7.9 billion and aims to use IPO proceeds to repay some of this debt [6] - The relationship between Microsoft and OpenAI is becoming increasingly competitive, with both companies vying for enterprise clients and developing competing AI models [4][5] - CoreWeave operates a cloud service designed for AI, supported by Nvidia, and has expanded its GPU resources significantly, including the latest Nvidia Blackwell products [2][5]