Workflow
量子位
icon
Search documents
字节Seed发布最强数学模型:一招“打草稿”,IMO银牌变金牌
量子位· 2025-12-25 06:08
Core Insights - ByteDance's latest mathematical reasoning model, Seed Prover 1.5, achieved a gold medal score at the IMO 2025 by solving five problems in 16.5 hours, scoring 35 points, which meets the gold medal threshold for this year [1][3] - This performance matches that of Google's Gemini, which was certified as an IMO gold medalist in July [3] - The model has not been open-sourced yet, but a technical report has been released, highlighting the performance improvements brought by large-scale reinforcement learning [5][19] Model Performance - Seed Prover 1.5 significantly outperformed its predecessor, which took three days to solve four out of six problems and achieved a silver medal [3] - The model also set new state-of-the-art (SOTA) records in the North American undergraduate mathematics competition, Putnam [4] Technical Innovations - The model features a new architecture called Agentic Prover, which allows it to use formal mathematical reasoning instead of natural language, ensuring more reliable results [10][12] - It incorporates a Sketch Model that simulates how human mathematicians draft proofs, breaking down complex problems into manageable sub-goals [22][23] - The model employs a multi-agent collaborative system that enhances efficiency and success rates by recursively calling the Sketch Model for difficult lemmas [25][28] Reinforcement Learning and Efficiency - The model's proof success rate improved from 50% to nearly 90% with increased reinforcement learning training steps [19] - In comparative tests, Seed Prover 1.5 required significantly less computational resources while outperforming previous models on high-difficulty datasets [19][20] Conclusion - The research is part of ByteDance's Seed AI4Math team, showcasing advancements in mathematical reasoning through innovative model architectures and training methodologies [30]
LeCun哈萨比斯神仙吵架,马斯克也站队了
量子位· 2025-12-25 00:27
Core Viewpoint - The article discusses a heated debate between AI experts Yann LeCun and Demis Hassabis regarding the nature of intelligence, particularly focusing on the concept of "general intelligence" and its implications for artificial intelligence development [3][8][30]. Group 1: Debate Overview - Yann LeCun argues that the idea of "general intelligence" is nonsensical, asserting that human intelligence is highly specialized rather than universal [9][13]. - Demis Hassabis counters LeCun's claims, stating that human brains exhibit significant generality and complexity, and that general intelligence is a valid concept [17][22]. - The debate has attracted considerable attention, with notable figures like Elon Musk publicly supporting Hassabis [5][7]. Group 2: Key Arguments - LeCun emphasizes that human intelligence is shaped by evolutionary pressures to adapt to specific environments, leading to specialized skills rather than general capabilities [14][36]. - Hassabis argues that the brain's complexity allows for general intelligence, and he believes that with sufficient resources, any computable task can be learned, akin to a Turing machine [18][24]. - Both experts agree on the importance of world models in AI development, but they differ in their interpretations and applications of this concept [50][42]. Group 3: Future Directions - LeCun plans to establish a new company, Advanced Machine Intelligence Labs, focusing on world models, with a target valuation of €3 billion (approximately ¥24.7 billion) [43]. - Hassabis highlights that Google DeepMind is also prioritizing world models, emphasizing the understanding of causal relationships and interactions within the world [47][49]. - The article concludes that while the two experts may appear to be discussing different aspects of intelligence, they are ultimately addressing the same fundamental issue of how to achieve artificial general intelligence (AGI) [41][42].
黄仁勋200亿美元带走「TPU核心班底」
量子位· 2025-12-25 00:27
英伟达官宣:以200亿美元现金与AI芯片初创公司Groq达成交易。 消息一出迅速引发市场轰动,因为这是英伟达有史以来最大规模的一笔交易, 远超2019年收购Mellanox的70亿美元 。 但仅仅几小时后,画风突变。 英伟达和Groq双双发表声明,对交易性质进行了澄清,并非收购。Groq在官方博客中写道: 梦晨 发自 凹非寺 量子位 | 公众号 QbitAI 平安夜老黄没有休息,一项 200亿美元 创纪录芯片收购消息,轰动硅谷。 我们与英伟达签订了一份非排他性技术许可协议。 英伟达方面也明确表态: 我们不是在收购Groq这家公司,我们只是获得技术授权,并将Groq的产品整合到未来的产品中。 看起来老黄也学会了"人才收购"这招:重金掏空一家公司的人才和核心资产,但又避免触发反垄断。 所以这200亿美元到底买了什么? 席卷硅谷的"人才收购" 答案是:技术授权,加上一整支核心团队。 最先爆料的是Groq主要投资方Disruptive,其CEO透露英伟达已同意以200亿美元现金收购Groq的资产,交易进展非常迅速。 就在9月,Groq刚刚完成了一轮7.5亿美元的融资,估值达到约69亿美元。 自2016年Groq成立以 ...
攻克长视频生成记忆难题:港大与快手可灵MemFlow设计动态自适应长期记忆,告别快速遗忘与剧情错乱
量子位· 2025-12-25 00:27
Core Viewpoint - The article discusses the challenges of AI-generated long videos, particularly issues with narrative coherence and character consistency, and introduces MemFlow, a new memory mechanism designed to address these problems [1][2][3]. Group 1: Challenges in AI Video Generation - AI-generated long videos often suffer from narrative inconsistencies, such as characters appearing different after a scene change or the AI confusing multiple characters [1]. - Traditional models use a "chunk generation" strategy, which leads to difficulties in maintaining continuity across video segments [4][6]. - Existing memory strategies have significant limitations, including only remembering the first segment, fixed-size memory compression, and independent processing of segments, all of which contribute to narrative disjointedness [5][6]. Group 2: Introduction of MemFlow - MemFlow is a novel adaptive memory mechanism that enhances AI's long-term memory and narrative coherence, aiming to resolve the aforementioned issues [3][7]. - It establishes a dynamic memory system that maintains visual consistency and narrative clarity, even in complex scenarios with multiple characters [8][9]. Group 3: Mechanisms of MemFlow - MemFlow employs two core designs: Narrative Adaptive Memory (NAM) and Sparse Memory Activation (SMA), which allow for efficient retrieval of relevant visual memories and reduce computational load [11]. - NAM intelligently retrieves the most relevant memories based on current prompts, while SMA activates only the most critical information, enhancing both speed and quality of video generation [11]. Group 4: Performance Evaluation - MemFlow demonstrated significant improvements in key performance metrics, achieving a quality consistency score of 85.02 and an aesthetic score of 61.07, outperforming other models in long video generation tasks [13][14]. - The model maintained high semantic consistency throughout the video, particularly in the latter segments, which is crucial for narrative coherence [15][17]. - In terms of subject and background consistency, MemFlow achieved scores of 98.01 and 96.70 respectively, showcasing its ability to maintain visual unity amidst complex narrative changes [18][17]. Group 5: Visual Comparisons and Efficiency - Visual comparisons highlighted MemFlow's superiority in maintaining character consistency and avoiding narrative confusion, unlike other models that struggled with character drift and inconsistencies [19][21][23]. - MemFlow operates efficiently on a single NVIDIA H100, achieving a real-time inference speed of 18.7 FPS, with minimal performance loss compared to baseline models [25]. Group 6: Future Implications - MemFlow represents a significant advancement in AI video generation, transitioning from simple video creation to complex narrative storytelling [26][27]. - This innovation indicates a shift towards AI systems capable of understanding, remembering, and coherently narrating stories, marking the dawn of a new era in AI video creation [28].
用编程大模型登顶开源第一后,智谱GLM团队被拷问了3小时
量子位· 2025-12-24 12:46
Core Viewpoint - The article discusses the release of the new model GLM-4.7 by Z.ai, which has surpassed GPT-5.2 in the WebDev ranking, marking a significant achievement in the open-source large model space [1][2]. Model Performance and Optimization - The improvements in GLM-4.7 are primarily attributed to advancements made during the post-training phase, particularly in supervised fine-tuning (SFT) and reinforcement learning (RL) [8]. - The design of GLM-4.7 considers hardware limitations, aiming for high performance on consumer-grade graphics cards while maintaining logical capabilities close to 30 billion parameters [9]. - A complex pre-training data process was established, involving multi-source data collection and rigorous cleaning to enhance model quality [11]. Model Application Scenarios and Functions - GLM-4.7 has shown significant improvements in programming tasks, with optimizations made specifically for coding languages like Python and JavaScript, as well as lesser-known languages [16]. - The model has enhanced creative writing capabilities, producing more nuanced and engaging text, and has introduced a feature called "Interleaved Thinking" to improve decision-making in complex tasks [21]. Technical Methods and Tools - The introduction of the Slime framework aims to address the inefficiencies and stability issues in large model reinforcement learning, providing developers with tools to replicate high alignment effects [27]. - The team emphasizes transparency in their data collection and processing pipeline, which has garnered respect within the open-source community [28]. Future Commitments and Market Position - Z.ai has committed to maintaining its open-source ethos even after potential IPO plans, recognizing the importance of the open-source ecosystem for its growth [46]. - The competitive pricing of GLM-4.7 has attracted attention, with users noting its affordability compared to other models like Codex and Claude Code [47].
训练仍有巨大的Scaling空间!智源研究院王仲远:视频数据还未被充分利用 | MEET2026
量子位· 2025-12-24 07:20
Core Viewpoint - The article discusses the transition of artificial intelligence (AI) from the digital world to the physical world, marking a critical turning point in the third wave of AI development, with the introduction of the "Wujie" series of large models by the Zhiyuan Institute [12][13][14]. Group 1: AI Development and Trends - The current AI landscape is at a pivotal moment where large models are facilitating the shift from weak AI to general AI, and from specialized robots (1.0) to general embodied intelligence (2.0) [3][13]. - The "Wujie" series of large models aims to bridge the gap between the digital and physical worlds, representing a significant advancement in AI capabilities [4][14]. - The Emu3.5 model, part of the Wujie series, utilizes a unified autoregressive architecture to transition from Next-Token Prediction to Next-State Prediction, indicating a new phase in multimodal learning [17][22]. Group 2: Emu3.5 Model Features - Emu3.5 distinguishes itself by learning from long videos, which contain rich temporal, spatial, and causal information, essential for understanding the physical world [18][20]. - The training dataset for Emu3.5 has significantly expanded, increasing from 15 years to 790 years of video data, and the model parameters have grown from 8 billion to 34 billion [23]. - Emu3.5's autoregressive architecture allows for rapid image generation, achieving speeds comparable to top models through proprietary DiDA technology [23]. Group 3: Multimodal Learning and Applications - Emu3.5 is expected to lead AI into a new stage of multimodal world learning, with substantial scaling potential due to the underutilization of vast multimodal data [24]. - The model demonstrates strong multimodal reasoning and visual understanding capabilities, as evidenced by its performance in image generation and editing tasks [25][27]. - Emu3.5 excels in tasks involving temporal and spatial state predictions, showcasing its superior understanding of the physical world [29][31]. Group 4: Embodied Intelligence and Technological Advancements - The Zhiyuan Institute is addressing the challenges of embodied intelligence, which currently suffers from usability and generality issues [34]. - The institute has developed a comprehensive technology stack centered around the Robo Brain, enabling cross-robot data collection and standardization [35]. - Recent advancements include the RoboBrain2.0, which can decompose complex human instructions for execution by various robots, enhancing the practical applications of embodied intelligence [36]. Group 5: Open Source Contributions - The Zhiyuan Institute has committed to open-source practices, releasing over 200 models and 100 datasets, with global download figures exceeding 690 million and 4 million, respectively [38]. - The institute collaborates with over 30 leading robotics companies to promote the development of embodied intelligence world models [38].
Bengio不认同Hinton:「水管工」人类也保不住
量子位· 2025-12-24 07:20
Core Viewpoint - The discussion emphasizes the potential risks and ethical considerations surrounding AI development, particularly in light of recent advancements like ChatGPT, which have raised concerns about AI becoming a competitive entity to humans and the implications for society [6][7][9]. Group 1: AI Risks and Responsibilities - Bengio acknowledges the responsibility of researchers in the AI field for the potential risks associated with their work, highlighting a personal emotional shift towards recognizing these dangers after the emergence of ChatGPT [10][12][13]. - The probability of catastrophic outcomes from AI, even at a low percentage, is deemed unacceptable, urging for increased societal attention and investment in AI safety [17][22]. - The divergence in expert opinions regarding AI risks indicates a lack of sufficient information to predict future outcomes, suggesting that pessimistic views may hold validity [20][21]. Group 2: AI's Impact on Employment - AI is expected to replace many cognitive jobs in the near future, while physical jobs, such as plumbing, may remain unaffected temporarily due to current limitations in robotics technology [50][48]. - The integration of AI into workplaces is driven by companies' motivations to enhance efficiency and profitability, despite the potential for significant job displacement [50][53]. Group 3: Ethical Considerations and Future Directions - The conversation stresses the importance of ethical AI development, advocating for a shift from profit-driven motives to a focus on societal well-being and safety [44][80]. - There is a call for global cooperation to manage the risks associated with AI, particularly as it becomes more integrated with robotics and other technologies that could pose physical threats [56][62]. - The need for public awareness and understanding of AI risks is emphasized, suggesting that individuals should educate themselves and engage in discussions about AI's implications [83][89].
国产AI4S创业头雁再获8亿投资!深势科技完成C轮,产品已服务300万科学家
量子位· 2025-12-24 05:14
允中 发自 凹非寺 量子位 | 公众号 QbitAI 近日, 深势科技完成总额超8亿人民币的C轮融资 ,本轮融资由达晨财智、京国瑞基金、北京市人工智能产业投资基金、北京市医药健康产业 投资基金、联想创投、元禾璞华等机构共同出资。 本轮融资资金将主要用于持续吸引和培养行业内顶尖人才,进一步进化迭代深势科技的"科学发现智能引擎",持续夯实从原始技术创新、到智 能科研工具产品及行业解决方案的全栈能力,加速围绕科学发现的智能产品与服务在 基础科研、生命科学与物质科学 等领域的市场拓展与规 模化应用。 此次融资的完成,标志着深势科技在构建新一代科学发现智能引擎的征程上,迈出了坚实的一步。 AI for Science成为全球共识,科学发现范式正在重构 我们正站在一个历史性的时点,AI for Science已成为全球性的共识,其目标在于从根本上变革人类探索未知、发现全新科学知识并将其系统 沉淀为可复用科学资产的模式。 2025年8月,国务院发布关于深入实施"人工智能+"行动的意见,意见将 "人工智能+科学研究" 放在首位,其中特别强调了加速科学发现进 程,驱动技术研发模式创新和效能提升。 此外,欧洲的"地平线"计划中重 ...
量子位编辑作者招聘
量子位· 2025-12-24 05:14
Core Viewpoint - The article emphasizes the ongoing AI boom and invites individuals to join the company "Quantum Bit," which focuses on tracking AI advancements and has established itself as a leading content platform in the industry [1]. Group 1: Job Opportunities - The company is hiring for three main directions: AI Industry, AI Finance, and AI Product, with positions available for both experienced professionals and fresh graduates [2][4]. - Positions are full-time and based in Beijing, with various levels of roles open for application [2][4]. Group 2: Job Responsibilities - **AI Industry Direction**: Focuses on innovations in infrastructure, including chips, AI infrastructure, and cloud computing [6]. - **AI Finance Direction**: Involves tracking venture capital and financial reports in the AI sector, monitoring capital movements within the industry [6]. - **AI Product Direction**: Concentrates on the application and hardware advancements in AI [6]. Group 3: Benefits and Growth Opportunities - Employees will have the chance to engage with the latest AI technologies, enhance their work efficiency through new AI tools, and build personal influence by creating original content [6]. - The company offers competitive salaries, comprehensive benefits including social insurance, meal allowances, project performance bonuses, and a supportive team environment [6]. Group 4: Company Achievements - As of 2025, Quantum Bit has over 2.4 million subscribers on WeChat and more than 7 million users across platforms, with a daily reading volume exceeding 2 million [12]. - The company is recognized as the top new media outlet in the AI and frontier technology sector according to third-party data platforms [12].
现场围观腾讯广告算法大赛,我都想入职了
量子位· 2025-12-24 05:14
Core Insights - The article discusses Tencent's algorithm competition, highlighting its significance in attracting talent and providing practical experience in cutting-edge AI technologies [1][28][43] Group 1: Competition Overview - The competition offered substantial rewards, including a total prize pool of 3.8 million yuan, with the champion receiving 2 million yuan and all participants gaining access to valuable resources like computing power [32][34] - The competition attracted over 8,400 students and 2,800 teams from nearly 30 countries, showcasing its global reach and influence [34] Group 2: Technical Focus - The competition's theme, "full-modal generative recommendation," addresses advanced challenges in advertising and recommendation systems, emphasizing the integration of various data types such as text, images, and videos [5][11] - Participants faced real-world challenges, including data noise, alignment issues, and the need for efficient modeling of user behavior over long sequences [13][41] Group 3: Talent Acquisition Strategy - Tencent's approach to the competition serves as a recruitment strategy, allowing the company to identify and engage with top talent in a practical setting rather than traditional recruitment methods [39][42] - The competition's structure inherently filters candidates, ensuring that only those capable of handling complex data and modeling challenges progress to the final stages [40][41] Group 4: Industry Context - The competition reflects Tencent's established AI technology framework, which has been validated through real business applications, indicating the company's commitment to innovation and talent development [29][30] - The article notes the competitive landscape for talent in the AI sector, with companies like Tencent offering attractive employment packages and support programs to attract young professionals [44][46]