大模型推理
Search documents
英伟达:Blackwell收入超预期,2025年推理爆发主导GPU需求-20250304
First Shanghai Securities· 2025-03-04 10:43
Investment Rating - The report assigns a "Buy" rating to the company with a target price of $160, representing a potential upside of 33.17% from the current price of $120.15 [2][31]. Core Insights - The company is expected to experience significant growth driven by the demand for its Blackwell products, particularly in the AI and data center sectors. The revenue for fiscal year 2025 is projected to be $393 billion, a year-over-year increase of 77.9%, surpassing previous guidance and market expectations [3][5][10]. - The gross margin for the latest quarter was reported at 73.0%, slightly below expectations due to higher short-term costs associated with ramping up Blackwell production. However, margins are expected to improve as production stabilizes [5][10]. - The company anticipates a revenue guidance midpoint of $430 billion for the next quarter, reflecting a year-over-year growth of 65.1% [10][15]. Financial Performance Summary - For the fiscal year ending January 26, 2025, total revenue is forecasted to reach $393 billion, with a net profit of $221 billion, resulting in a GAAP diluted EPS of $0.89, exceeding Bloomberg consensus estimates [3][6]. - The company generated free cash flow of $155 billion in the latest quarter, up from $115 billion in the same period last year, and returned $81 billion to shareholders through buybacks and dividends [6][10]. - The data center business saw revenue of $355.8 billion, a 93.3% increase year-over-year, driven by demand for large models and AI applications [15][19]. Product and Market Developments - The Blackwell platform is highlighted as the fastest ramping product in the company's history, with Q4 revenue reaching $110 billion, exceeding expectations. The transition from Hopper to Blackwell is noted to be more challenging, but improvements in gross margins are anticipated as production scales [10][19]. - The company launched Project DIGITS, a personal AI computer capable of running large models, showcasing its commitment to innovation in AI technology [26][20]. - The automotive business reported a revenue increase of 102.8% year-over-year, driven by rising demand for smart driving chips, with a projected market space of $5 billion for autonomous driving chips in 2025 [27][26]. Future Outlook - The company expects a compound annual growth rate (CAGR) of 29% for revenue and EPS over the next three years, supported by strong capital expenditure growth from major clients like Microsoft and Google [32][31]. - The report emphasizes the need for continuous product development and iteration to maintain competitive advantages in the rapidly evolving AI and semiconductor markets [32][19].
天翼云CPU实例部署DeepSeek-R1模型最佳实践
量子位· 2025-03-03 07:58
文章来源:天翼云网站 量子位 | 公众号 QbitAI 本文介绍了 英特尔 ® 至强 ® 处理器在AI推理领域的优势,如何使用一键部署的镜像进行纯CPU环境下基于AMX加速后的 DeepSeek-R1 7B蒸馏模型推理,以及纯CPU环境下部署DeepSeek-R1 671B满血版模型实践。 大模型因其参数规模庞大、结构复杂,通常需要强大的计算资源来支持其推理过程,这使得算力成为大模型应用的核心要素。随着DeepSeek-R1模型 的问世,各行各业纷纷展开了关于如何接入大模型能力的广泛调研与探索,市场对大模型推理算力的需求呈现出爆发式增长的趋势。 例如在医疗、金融、零售等领域,企业迫切希望通过接入DeepSeek大模型来提升决策效率和业务能力,从而推动行业的创新发展。在这一背景下,算 力的供给和优化成为推动大模型落地应用的重要因素。 近年来,CPU制程和架构的提升以及 英特尔 ® 高级矩阵扩展AMX(Advanced Matrix Extensions)加速器的面世带来了算力的快速提升。英特尔对大 模型推理等多个AI领域持续深入研究,提供全方位的AI软件支持,兼容主流AI软件且提供多种软件方式提升CPU的AI性 ...
两台运行“满血版”DeepSeek,第四范式推出大模型推理一体机解决方案SageOne IA
IPO早知道· 2025-02-28 04:11
此 外 , 一 体 机 解 决 方 案 还 集 成 了 智 能 算 力 池 化 技 术 , 在 支 持 DeepSeek V3/R1 、 QWen2.5 、 LLama3.3等主流大模型的基础上,企业可灵活在满血版和多个蒸馏模型之间切换,GPU利用率提升 30%以上,推理性能平均提升5-10倍;同时内置大模型应用开发平台,并搭载了丰富的开箱即用AI 应用套件,帮助开发者高效开发企业级的生成式AI应用,让企业享受高效的大模型应用服务,加速AI 智能化落地进程。 具体来讲:SageOne IA大模型推理一体机解决方案,具备三大核心优势: 1) 智能算力池化,资源动态调度,突破物理机架构 大模型应用成本"一降再降"。 本文为IPO早知道原创 作者| Stone Jin 微信公众号|ipozaozhidao 据IPO早知道消息,第四范式日前推出大模型推理一体机解决方案SageOne IA,进一步减低了大模 型推理成本。如满血版的DeepSeek V3/R1仅需要两台一体机即可使用。 方案支持企业按需选择DeepSeek V3/R1、QWen2.5、LLama3.3等主流大模型,还预装了丰富的 AI应用套件,包括AIG ...