Workflow
量子位
icon
Search documents
黄仁勋200亿美元带走「TPU核心班底」
量子位· 2025-12-25 00:27
英伟达官宣:以200亿美元现金与AI芯片初创公司Groq达成交易。 消息一出迅速引发市场轰动,因为这是英伟达有史以来最大规模的一笔交易, 远超2019年收购Mellanox的70亿美元 。 但仅仅几小时后,画风突变。 英伟达和Groq双双发表声明,对交易性质进行了澄清,并非收购。Groq在官方博客中写道: 梦晨 发自 凹非寺 量子位 | 公众号 QbitAI 平安夜老黄没有休息,一项 200亿美元 创纪录芯片收购消息,轰动硅谷。 我们与英伟达签订了一份非排他性技术许可协议。 英伟达方面也明确表态: 我们不是在收购Groq这家公司,我们只是获得技术授权,并将Groq的产品整合到未来的产品中。 看起来老黄也学会了"人才收购"这招:重金掏空一家公司的人才和核心资产,但又避免触发反垄断。 所以这200亿美元到底买了什么? 席卷硅谷的"人才收购" 答案是:技术授权,加上一整支核心团队。 最先爆料的是Groq主要投资方Disruptive,其CEO透露英伟达已同意以200亿美元现金收购Groq的资产,交易进展非常迅速。 就在9月,Groq刚刚完成了一轮7.5亿美元的融资,估值达到约69亿美元。 自2016年Groq成立以 ...
攻克长视频生成记忆难题:港大与快手可灵MemFlow设计动态自适应长期记忆,告别快速遗忘与剧情错乱
量子位· 2025-12-25 00:27
Core Viewpoint - The article discusses the challenges of AI-generated long videos, particularly issues with narrative coherence and character consistency, and introduces MemFlow, a new memory mechanism designed to address these problems [1][2][3]. Group 1: Challenges in AI Video Generation - AI-generated long videos often suffer from narrative inconsistencies, such as characters appearing different after a scene change or the AI confusing multiple characters [1]. - Traditional models use a "chunk generation" strategy, which leads to difficulties in maintaining continuity across video segments [4][6]. - Existing memory strategies have significant limitations, including only remembering the first segment, fixed-size memory compression, and independent processing of segments, all of which contribute to narrative disjointedness [5][6]. Group 2: Introduction of MemFlow - MemFlow is a novel adaptive memory mechanism that enhances AI's long-term memory and narrative coherence, aiming to resolve the aforementioned issues [3][7]. - It establishes a dynamic memory system that maintains visual consistency and narrative clarity, even in complex scenarios with multiple characters [8][9]. Group 3: Mechanisms of MemFlow - MemFlow employs two core designs: Narrative Adaptive Memory (NAM) and Sparse Memory Activation (SMA), which allow for efficient retrieval of relevant visual memories and reduce computational load [11]. - NAM intelligently retrieves the most relevant memories based on current prompts, while SMA activates only the most critical information, enhancing both speed and quality of video generation [11]. Group 4: Performance Evaluation - MemFlow demonstrated significant improvements in key performance metrics, achieving a quality consistency score of 85.02 and an aesthetic score of 61.07, outperforming other models in long video generation tasks [13][14]. - The model maintained high semantic consistency throughout the video, particularly in the latter segments, which is crucial for narrative coherence [15][17]. - In terms of subject and background consistency, MemFlow achieved scores of 98.01 and 96.70 respectively, showcasing its ability to maintain visual unity amidst complex narrative changes [18][17]. Group 5: Visual Comparisons and Efficiency - Visual comparisons highlighted MemFlow's superiority in maintaining character consistency and avoiding narrative confusion, unlike other models that struggled with character drift and inconsistencies [19][21][23]. - MemFlow operates efficiently on a single NVIDIA H100, achieving a real-time inference speed of 18.7 FPS, with minimal performance loss compared to baseline models [25]. Group 6: Future Implications - MemFlow represents a significant advancement in AI video generation, transitioning from simple video creation to complex narrative storytelling [26][27]. - This innovation indicates a shift towards AI systems capable of understanding, remembering, and coherently narrating stories, marking the dawn of a new era in AI video creation [28].
用编程大模型登顶开源第一后,智谱GLM团队被拷问了3小时
量子位· 2025-12-24 12:46
Core Viewpoint - The article discusses the release of the new model GLM-4.7 by Z.ai, which has surpassed GPT-5.2 in the WebDev ranking, marking a significant achievement in the open-source large model space [1][2]. Model Performance and Optimization - The improvements in GLM-4.7 are primarily attributed to advancements made during the post-training phase, particularly in supervised fine-tuning (SFT) and reinforcement learning (RL) [8]. - The design of GLM-4.7 considers hardware limitations, aiming for high performance on consumer-grade graphics cards while maintaining logical capabilities close to 30 billion parameters [9]. - A complex pre-training data process was established, involving multi-source data collection and rigorous cleaning to enhance model quality [11]. Model Application Scenarios and Functions - GLM-4.7 has shown significant improvements in programming tasks, with optimizations made specifically for coding languages like Python and JavaScript, as well as lesser-known languages [16]. - The model has enhanced creative writing capabilities, producing more nuanced and engaging text, and has introduced a feature called "Interleaved Thinking" to improve decision-making in complex tasks [21]. Technical Methods and Tools - The introduction of the Slime framework aims to address the inefficiencies and stability issues in large model reinforcement learning, providing developers with tools to replicate high alignment effects [27]. - The team emphasizes transparency in their data collection and processing pipeline, which has garnered respect within the open-source community [28]. Future Commitments and Market Position - Z.ai has committed to maintaining its open-source ethos even after potential IPO plans, recognizing the importance of the open-source ecosystem for its growth [46]. - The competitive pricing of GLM-4.7 has attracted attention, with users noting its affordability compared to other models like Codex and Claude Code [47].
训练仍有巨大的Scaling空间!智源研究院王仲远:视频数据还未被充分利用 | MEET2026
量子位· 2025-12-24 07:20
Core Viewpoint - The article discusses the transition of artificial intelligence (AI) from the digital world to the physical world, marking a critical turning point in the third wave of AI development, with the introduction of the "Wujie" series of large models by the Zhiyuan Institute [12][13][14]. Group 1: AI Development and Trends - The current AI landscape is at a pivotal moment where large models are facilitating the shift from weak AI to general AI, and from specialized robots (1.0) to general embodied intelligence (2.0) [3][13]. - The "Wujie" series of large models aims to bridge the gap between the digital and physical worlds, representing a significant advancement in AI capabilities [4][14]. - The Emu3.5 model, part of the Wujie series, utilizes a unified autoregressive architecture to transition from Next-Token Prediction to Next-State Prediction, indicating a new phase in multimodal learning [17][22]. Group 2: Emu3.5 Model Features - Emu3.5 distinguishes itself by learning from long videos, which contain rich temporal, spatial, and causal information, essential for understanding the physical world [18][20]. - The training dataset for Emu3.5 has significantly expanded, increasing from 15 years to 790 years of video data, and the model parameters have grown from 8 billion to 34 billion [23]. - Emu3.5's autoregressive architecture allows for rapid image generation, achieving speeds comparable to top models through proprietary DiDA technology [23]. Group 3: Multimodal Learning and Applications - Emu3.5 is expected to lead AI into a new stage of multimodal world learning, with substantial scaling potential due to the underutilization of vast multimodal data [24]. - The model demonstrates strong multimodal reasoning and visual understanding capabilities, as evidenced by its performance in image generation and editing tasks [25][27]. - Emu3.5 excels in tasks involving temporal and spatial state predictions, showcasing its superior understanding of the physical world [29][31]. Group 4: Embodied Intelligence and Technological Advancements - The Zhiyuan Institute is addressing the challenges of embodied intelligence, which currently suffers from usability and generality issues [34]. - The institute has developed a comprehensive technology stack centered around the Robo Brain, enabling cross-robot data collection and standardization [35]. - Recent advancements include the RoboBrain2.0, which can decompose complex human instructions for execution by various robots, enhancing the practical applications of embodied intelligence [36]. Group 5: Open Source Contributions - The Zhiyuan Institute has committed to open-source practices, releasing over 200 models and 100 datasets, with global download figures exceeding 690 million and 4 million, respectively [38]. - The institute collaborates with over 30 leading robotics companies to promote the development of embodied intelligence world models [38].
Bengio不认同Hinton:「水管工」人类也保不住
量子位· 2025-12-24 07:20
Core Viewpoint - The discussion emphasizes the potential risks and ethical considerations surrounding AI development, particularly in light of recent advancements like ChatGPT, which have raised concerns about AI becoming a competitive entity to humans and the implications for society [6][7][9]. Group 1: AI Risks and Responsibilities - Bengio acknowledges the responsibility of researchers in the AI field for the potential risks associated with their work, highlighting a personal emotional shift towards recognizing these dangers after the emergence of ChatGPT [10][12][13]. - The probability of catastrophic outcomes from AI, even at a low percentage, is deemed unacceptable, urging for increased societal attention and investment in AI safety [17][22]. - The divergence in expert opinions regarding AI risks indicates a lack of sufficient information to predict future outcomes, suggesting that pessimistic views may hold validity [20][21]. Group 2: AI's Impact on Employment - AI is expected to replace many cognitive jobs in the near future, while physical jobs, such as plumbing, may remain unaffected temporarily due to current limitations in robotics technology [50][48]. - The integration of AI into workplaces is driven by companies' motivations to enhance efficiency and profitability, despite the potential for significant job displacement [50][53]. Group 3: Ethical Considerations and Future Directions - The conversation stresses the importance of ethical AI development, advocating for a shift from profit-driven motives to a focus on societal well-being and safety [44][80]. - There is a call for global cooperation to manage the risks associated with AI, particularly as it becomes more integrated with robotics and other technologies that could pose physical threats [56][62]. - The need for public awareness and understanding of AI risks is emphasized, suggesting that individuals should educate themselves and engage in discussions about AI's implications [83][89].
量子位编辑作者招聘
量子位· 2025-12-24 05:14
Core Viewpoint - The article emphasizes the ongoing AI boom and invites individuals to join the company "Quantum Bit," which focuses on tracking AI advancements and has established itself as a leading content platform in the industry [1]. Group 1: Job Opportunities - The company is hiring for three main directions: AI Industry, AI Finance, and AI Product, with positions available for both experienced professionals and fresh graduates [2][4]. - Positions are full-time and based in Beijing, with various levels of roles open for application [2][4]. Group 2: Job Responsibilities - **AI Industry Direction**: Focuses on innovations in infrastructure, including chips, AI infrastructure, and cloud computing [6]. - **AI Finance Direction**: Involves tracking venture capital and financial reports in the AI sector, monitoring capital movements within the industry [6]. - **AI Product Direction**: Concentrates on the application and hardware advancements in AI [6]. Group 3: Benefits and Growth Opportunities - Employees will have the chance to engage with the latest AI technologies, enhance their work efficiency through new AI tools, and build personal influence by creating original content [6]. - The company offers competitive salaries, comprehensive benefits including social insurance, meal allowances, project performance bonuses, and a supportive team environment [6]. Group 4: Company Achievements - As of 2025, Quantum Bit has over 2.4 million subscribers on WeChat and more than 7 million users across platforms, with a daily reading volume exceeding 2 million [12]. - The company is recognized as the top new media outlet in the AI and frontier technology sector according to third-party data platforms [12].
国产AI4S创业头雁再获8亿投资!深势科技完成C轮,产品已服务300万科学家
量子位· 2025-12-24 05:14
允中 发自 凹非寺 量子位 | 公众号 QbitAI 近日, 深势科技完成总额超8亿人民币的C轮融资 ,本轮融资由达晨财智、京国瑞基金、北京市人工智能产业投资基金、北京市医药健康产业 投资基金、联想创投、元禾璞华等机构共同出资。 本轮融资资金将主要用于持续吸引和培养行业内顶尖人才,进一步进化迭代深势科技的"科学发现智能引擎",持续夯实从原始技术创新、到智 能科研工具产品及行业解决方案的全栈能力,加速围绕科学发现的智能产品与服务在 基础科研、生命科学与物质科学 等领域的市场拓展与规 模化应用。 此次融资的完成,标志着深势科技在构建新一代科学发现智能引擎的征程上,迈出了坚实的一步。 AI for Science成为全球共识,科学发现范式正在重构 我们正站在一个历史性的时点,AI for Science已成为全球性的共识,其目标在于从根本上变革人类探索未知、发现全新科学知识并将其系统 沉淀为可复用科学资产的模式。 2025年8月,国务院发布关于深入实施"人工智能+"行动的意见,意见将 "人工智能+科学研究" 放在首位,其中特别强调了加速科学发现进 程,驱动技术研发模式创新和效能提升。 此外,欧洲的"地平线"计划中重 ...
现场围观腾讯广告算法大赛,我都想入职了
量子位· 2025-12-24 05:14
Core Insights - The article discusses Tencent's algorithm competition, highlighting its significance in attracting talent and providing practical experience in cutting-edge AI technologies [1][28][43] Group 1: Competition Overview - The competition offered substantial rewards, including a total prize pool of 3.8 million yuan, with the champion receiving 2 million yuan and all participants gaining access to valuable resources like computing power [32][34] - The competition attracted over 8,400 students and 2,800 teams from nearly 30 countries, showcasing its global reach and influence [34] Group 2: Technical Focus - The competition's theme, "full-modal generative recommendation," addresses advanced challenges in advertising and recommendation systems, emphasizing the integration of various data types such as text, images, and videos [5][11] - Participants faced real-world challenges, including data noise, alignment issues, and the need for efficient modeling of user behavior over long sequences [13][41] Group 3: Talent Acquisition Strategy - Tencent's approach to the competition serves as a recruitment strategy, allowing the company to identify and engage with top talent in a practical setting rather than traditional recruitment methods [39][42] - The competition's structure inherently filters candidates, ensuring that only those capable of handling complex data and modeling challenges progress to the final stages [40][41] Group 4: Industry Context - The competition reflects Tencent's established AI technology framework, which has been validated through real business applications, indicating the company's commitment to innovation and talent development [29][30] - The article notes the competitive landscape for talent in the AI sector, with companies like Tencent offering attractive employment packages and support programs to attract young professionals [44][46]
不装了!LeCun哈萨比斯神仙吵架,马斯克也站队了
量子位· 2025-12-24 05:14
Core Viewpoint - The article discusses a heated debate between AI experts Yann LeCun and Demis Hassabis regarding the nature of intelligence, particularly focusing on the concept of "general intelligence" and its implications for artificial intelligence development [3][8][30]. Group 1: Debate Overview - Yann LeCun argues that the idea of "general intelligence" is nonsensical, asserting that human intelligence is highly specialized rather than universal [9][13]. - Demis Hassabis counters LeCun's claims, stating that human brains exhibit significant generality and complexity, and that general intelligence is a valid concept [17][22]. - The debate has attracted considerable attention, with notable figures like Elon Musk publicly supporting Hassabis [5][7]. Group 2: Key Arguments - LeCun emphasizes that human intelligence is shaped by evolutionary pressures to adapt to specific environments, leading to specialized skills rather than general capabilities [14][36]. - Hassabis argues that the brain functions similarly to a Turing machine, capable of learning any computable content given sufficient resources, thus supporting the existence of general intelligence [18][24]. - The discussion highlights a fundamental disagreement over terminology, with LeCun focusing on the specialized nature of human cognition while Hassabis advocates for the potential of general intelligence [32][41]. Group 3: Future Directions in AI - Both experts agree on the importance of "world models" in advancing artificial general intelligence (AGI), though they have different interpretations of what this entails [42][50]. - LeCun's upcoming venture, Advanced Machine Intelligence Labs, aims to develop world models that prioritize understanding control theory and cognitive science [43][44]. - Hassabis and Google DeepMind are also focusing on world models, emphasizing the need for models that comprehend causal relationships and interactions within the world [46][47].
Science打脸“赢在起跑线”!少年天才90%成年后止步于顶尖水平之下,34000世界级人才成长轨迹研究结果
量子位· 2025-12-24 00:42
梦晨 发自 凹非寺 量子位 | 公众号 QbitAI "从小就要赢在起跑线" 这套逻辑,被顶刊Science最新论文狠狠打了脸。 这项研究综合分析了超过34000名国际顶尖人才的成长轨迹,涵盖诺贝尔奖得主、典作曲家、奥运冠军以及世界顶级棋手。 结论颠覆人们观念: 作者团队来自德国凯泽斯劳滕工业大学 (RPTU Kaiserslautern) 体育科学系、密歇根州立大学心理学系、普渡大学心理科学系。 他们综合分析了多项研究数据,涵盖科学、艺术、体育多个领域。 少年天才往往止步于顶尖水平之下,和最终登顶的成年人近90%不是同一批人。 而最终达到世界级水平的人才,在早年阶段表现反而低于只达到国家级水平的同龄人。 "天才少年"长大后去哪了 长久以来,学界对人才培养的研究主要聚焦于年轻人。传统观点普遍认,早期表现越好、专项练习越多,后期成就越高。 全球各地的精英学校、音乐学院和青训学院也据此设计了选拔机制:挑出表现最好的年轻人,然后用高强度的专项训练进一步"加速"他们的成 长。 但这套逻辑在真正的世界顶尖群体中是否成立,此前从未被系统验证过。 通过大规模数据追踪,研究团队给出了一个令人意外的答案:无论是体育、国际象棋还 ...