Nvidia H100 GPU

Search documents
26天倒计时:OpenAI即将关停GPT-4.5Preview API
3 6 Ke· 2025-06-18 07:34
近日,OpenAI向开发者发了一封邮件,宣布将于7月14日正式移除 GPT-4.5 Preview API。 图注:OpenAI邮件。图源网络 对于那些已经将GPT-4.5深度集成到自己产品或工作流中的开发者来说,这无异于一次震撼。他们必须在不到一个月的时间内,从OpenAI提供的近40个模 型中,重新寻找一个替代品。 为什么非关不可? 许多人将矛头指向了高昂的计算成本。毕竟,一个性能优越、但商业上不划算的模型,在任何一家公司的账本上都不会长久。 图注:GPT模型一览 GPT-4.5 API 定价高达 75 美元 / 百万输入 tokens,150 美元 / 百万输出 tokens,几乎是 GPT-4.1 的多倍。 OpenAI官方称,这次移除计划早在4月发布GPT-4.1时就已公布。GPT-4.5从始至终都是一个"实验性"产品,其使命是为未来的模型迭代提供经验,尤其是 在创意和写作的细微之处。邮件只是按计划发送的提醒。 不够,GPT-4.5 预览版将继续作为选项,通过应用程序顶部的下拉模型选择菜单,提供给个人 ChatGPT 用户使用。 图注:用户表示GPT-4.5是最喜欢的模型之一。 最近,OpenAI公 ...
NVIDIA Powers World's Largest Quantum Research Supercomputer
GlobeNewswire News Room· 2025-05-19 04:43
Core Insights - NVIDIA has launched the Global Research and Development Center for Business by Quantum-AI Technology (G-QuAT), featuring the ABCI-Q supercomputer, which is the largest research supercomputer dedicated to quantum computing globally [1][14] - The ABCI-Q supercomputer integrates 2,020 NVIDIA H100 GPUs connected via the NVIDIA Quantum-2 InfiniBand networking platform, facilitating unprecedented quantum-GPU computing capabilities [3][2] - The collaboration between NVIDIA and Japan's National Institute of Advanced Industrial Science and Technology (AIST) aims to advance quantum error correction and application development, essential for building practical quantum supercomputers [4][5] Industry Impact - Quantum processors are expected to enhance AI supercomputers in addressing complex challenges across various sectors, including healthcare, energy, and finance [2] - The integration of quantum hardware with AI supercomputing is anticipated to accelerate the realization of quantum computing's potential [4] - ABCI-Q will enable researchers to tackle core challenges in quantum computing technologies, expediting the development of practical use cases [5]
拥有20万GPU的集群建好了,只用了122天
半导体行业观察· 2025-05-09 01:13
如果您希望可以时常见面,欢迎标星收藏哦~ 来源:本文 编译自 tomshardware ,谢谢。 埃隆·马斯克的 xAI 孟菲斯超级集群一期项目刚刚达到满负荷运营,现场变电站已投入运营并连接 到主电网。据大孟菲斯商会称,该站点将从孟菲斯电力、燃气和水务局 (MLGW) 和田纳西河谷管 理局 (TVA) 获得 150 兆瓦的电力。除此之外,xAI Colossus 超级计算机还拥有另外 150 兆瓦的 Megapack 电池作为备用电源,使其能够在断电或用电需求增加时持续供电。 马 斯 克于去 年 7 月 首 次 启 动 他的 AI 集 群 , 该 集 群 在 单 一 架 构 上 搭 载 了 10 万 块 Nvidia H100 GPU。这台 xAI 超级计算机的搭建速度非常快,公司只用了 19 天就将其投入运行——而 Nvidia 首席执行官黄仁勋表示,这通常需要四年时间。然而,如此快的速度意味着它不得不走一些捷径, 比如在没有电网供电的情况下启动,因此该站点使用了大量天然气涡轮发电机来满足其电力需求。 初步报告称,该站点内停放了 14 台发电机,每台输出功率为 2.5 兆瓦,但一些居民最近抱怨说, 附近发现 ...
Meta, Microsoft, Alphabet, and Amazon Just Delivered Incredible News for Nvidia Stock Investors
The Motley Fool· 2025-05-05 22:05
Core Viewpoint - Nvidia has faced significant stock volatility in 2025, with a year-to-date decline of 15%, primarily due to concerns over potential demand reduction for its data center chips amid tariff implications [1][9] Group 1: Tariff Impact and Customer Spending - Although semiconductors are exempt from aggressive tariffs, Nvidia's customers may still experience increased costs, potentially leading to reduced capital expenditures [2] - Major customers like Meta, Microsoft, Alphabet, and Amazon have provided positive updates on their AI spending plans for 2025, indicating continued demand for Nvidia's chips [2][12] - Meta raised its 2025 capex forecast to $64 billion to $72 billion, Microsoft plans to spend around $80 billion, Alphabet maintains a $75 billion forecast, and Amazon is set to spend approximately $105 billion [12] Group 2: Nvidia's Technological Advancements - Nvidia's H100 GPU was the leading AI data center chip in 2023 and most of 2024, but has been succeeded by the more advanced Blackwell and Blackwell Ultra architectures, with the latter offering up to 50 times faster AI inference in specific configurations [4][6] - The upcoming Rubin GPUs, expected in 2026, are projected to deliver 3.3 times more compute performance, further enhancing Nvidia's position in the AI market [7] Group 3: Market Position and Future Growth - Nvidia generated $115.2 billion in data center revenue for fiscal 2025, marking a 142% increase from the previous year, with predictions of data center spending exceeding $1 trillion annually by 2028 [14] - Demand for Nvidia's chips currently exceeds supply, making it difficult for companies to cancel orders without risking a competitive disadvantage in AI [16] - Nvidia's stock is viewed as a buying opportunity, trading at a P/E ratio of 39, significantly lower than its 10-year average above 50 [11]
GPU告急!亚马逊自建“调度帝国”
半导体芯闻· 2025-04-22 10:39
来源:内容 编译自 businessinsider. ,谢谢。 去年,亚马逊庞大的零售业务面临一个重大问题:它无法获得足够的AI芯片来完成关键工作。 据《商业内幕》获取的一系列亚马逊内部文件显示,由于多个项目被延迟,这家西方世界最大的电 商公司发起了一场激进的内部流程和技术改革,以解决这一问题。 这项举措罕见地揭示了一家科技巨头是如何在英伟达等行业供应商的支持下,在内部协调GPU组 件供需的细节。 2024年初,生成式AI热潮全面爆发,成千上万家公司争夺用于部署这项强大新技术的基础设施资 源。 如果您希望可以时常见面,欢迎标星收藏哦~ "随时可开工" 根据《商业内幕》获得的文件,亚马逊现在要求每一项GPU请求都必须提供详细数据和投资回报 证明。 项目将根据多个因素进行"优先排序和排名",包括所提供数据的完整性以及每颗GPU带来的财务 收 益 。 项 目 还 必 须 " 随 时 可 开 工 " ( 即 已 获 得 开 发 批 准 ) , 并 证 明 自 己 处 于 一 场 " 抢 占 市 场 的 竞 争"中,还要明确说明何时能实现预期成果。 一份2024年末的内部文件提到,亚马逊零售部门计划在2025年第一季度 ...
DeepSeek-R1与Grok-3:AI规模扩展的两条技术路线启示
Counterpoint Research· 2025-04-09 13:01
自今年二月起,DeepSeek 便因其开源旗舰级推理模型DeepSeek-R1 而引发全球瞩目——该模型性能 堪比全球前沿推理模型。其独特价值不仅体现在卓越的性能表现,更在于仅使用约2000块NVIDIA H800 GPU 就完成了训练(H800 是H100 的缩减版出口合规替代方案),这一成就堪称效率优化的 典范。 几天后,Elon Musk 旗下xAI 发布了迄今最先进的Grok-3 模型,其性能表现略优于DeepSeek-R1、 OpenAI 的GPT-o1 以及谷歌的Gemini 2。与DeepSeek-R1 不同,Grok-3 属于闭源模型,其训练动用 了惊人的约20万块H100 GPU,依托xAI "巨像"超级计算机完成,标志着计算规模实现了巨大飞跃。 xAI "巨像" 数据中心 Grok-3 展现了无妥协的规模扩张——约200,000块NVIDIA H100 显卡追求前沿性能提升。而 DeepSeek-R1 仅用少量计算资源就实现了相近的性能,这表明创新的架构设计和数据策展能够 与蛮力计算相抗衡。 效率正成为一种趋势性策略,而非限制条件。DeepSeek 的成功重新定义了AI扩展方式的讨 论。我 ...
IREN (IREN) Update / Briefing Transcript
2023-11-22 00:02
IREN (IREN) Update / Briefing November 21, 2023 06:00 PM ET Speaker0 Good day, and welcome to the IRIS Energy Investor Update Conference Call. At this time, all participants are in a listen only mode. After the speakers' presentation, there will be a question and answer session. To ask a question during the session, you will need to press 11 on your telephone. You will then hear an automated message advising your hand is raised. To withdraw your question, please press 11 again. Please be advised that today' ...