Workflow
实时推理
icon
Search documents
AI芯片公司,拿下OpenAI百亿美元大单
半导体行业观察· 2026-01-15 01:38
OpenAI计划使用Cerebras公司设计的芯片为其热门聊天机器人提供动力,两家公司周三宣布了这一 消息。OpenAI已承诺在未来三年内从Cerebras购买高达750兆瓦的计算能力。据知情人士透露,这笔 交易价值超过100亿美元。 Cerebras公司设计的人工智能芯片声称,其运行AI模型和生成响应的速度比行业领导者英伟达更快。 OpenAI首席执行官奥特曼是Cerebras的个人投资者,两家公司曾在2017年探讨过合作事宜。 公众号记得加星标⭐️,第一时间看推送不会错过。 据报道,OpenAI 已达成一项数十亿美元的协议,将从初创公司 Cerebras Systems 购买计算能力。 Cerebras Systems 由首席执行官Sam Altman支持。这是 ChatGPT 制造商 OpenAI 签署的一系列芯 片和云交易中的最新一笔。 据OpenAI公告,Cerebras 构建专用人工智能系统,旨在加速人工智能模型的长时间输出。其独特的 速度优势源于将海量计算能力、内存和带宽集成于单个巨型芯片上,从而消除了传统硬件上导致推理 速度下降的瓶颈。 将 Cerebras 集成到我们的计算解决方案组合中,旨 ...
Mini-Omni-Reasoner:实时推理,定义下一代端到端对话模型
机器之心· 2025-09-20 04:37
Core Viewpoint - The article introduces Mini-Omni-Reasoner, a new real-time reasoning paradigm designed for dialogue scenarios, which allows models to think and express simultaneously, enhancing interaction quality while maintaining logical depth [4][11][25]. Group 1: Introduction to Mini-Omni-Reasoner - Mini-Omni-Reasoner is inspired by human cognitive processes, where individuals often think and speak simultaneously rather than waiting to complete their thoughts before speaking [7][25]. - The model employs a "Thinking-in-Speaking" paradigm, contrasting with traditional models that follow a "thinking-before-speaking" approach, which can lead to delays in interaction [11][25]. Group 2: Model Architecture and Mechanism - The architecture of Mini-Omni-Reasoner consists of two components: Thinker, responsible for logic and reasoning, and Talker, focused on dialogue, allowing for efficient task execution [12][15]. - The model alternates between generating response tokens and reasoning tokens in a 2:8 ratio, balancing reasoning depth with real-time speech synthesis [13][15]. Group 3: Data and Training Process - A comprehensive data pipeline, including the Spoken-Math-Problems-3M dataset, was developed to address the "Anticipation Drift" issue, ensuring the model does not prematurely reveal conclusions [17][19]. - The training process is divided into five stages, progressively aligning text reasoning capabilities with speech modalities to ensure effective performance [19][20]. Group 4: Experimental Validation - Mini-Omni-Reasoner was tested against various models, demonstrating significant performance improvements over the baseline model Qwen2.5-Omni-3B [21][24]. - The model's ability to maintain natural and concise responses while ensuring high-quality reasoning was validated through comparative analysis [24]. Group 5: Future Directions - The article emphasizes that Mini-Omni-Reasoner is a starting point for further exploration into reasoning capabilities in dialogue systems, encouraging ongoing research in this area [26][28].