通用人工智能(AGI)
Search documents
谷歌DeepMind CEO哈萨比斯:世界模型是未来,AI泡沫真实存在
Sou Hu Cai Jing· 2025-11-20 08:14
Core Insights - Google has officially launched its latest large model, Gemini 3 Pro, aimed at creating a comprehensive foundational model that addresses shortcomings in programming, logical reasoning, and mathematical capabilities [1][3] - Gemini 3 Pro is considered a key component in the pursuit of Artificial General Intelligence (AGI) [1][3] Model Performance and User Engagement - Gemini 3 demonstrates enhanced reasoning coherence in multi-step tasks and can dynamically generate customized interactive interfaces for users [3] - The monthly active users of Gemini have surpassed 650 million, and when including users accessing Gemini through the "AI Overviews" feature, the number reaches 2 billion [3] Future Developments and Research Focus - Demis Hassabis has shifted his research focus to World Models, which are being used internally at Google for training robots and other agents [3][4] - Hassabis predicts a significant breakthrough in World Models akin to a "ChatGPT moment," but highlights challenges related to cost and current technological limitations [4] Market Dynamics and Investment Outlook - Hassabis notes the existence of a bubble in the private market, citing unsustainable valuations for startups without substantial outputs [4] - He emphasizes that Google is well-positioned to navigate market fluctuations, having integrated AI research into its core products, leading to rapid commercial returns [4] Long-term Vision for AGI - Despite advancements with Gemini 3, Hassabis maintains that achieving true AGI will require 5 to 10 more years and one or two critical breakthroughs [5] - He acknowledges diminishing returns from merely increasing model parameters but asserts that ongoing investments remain valuable and yield high returns [5] Security Considerations - The enhancement of model capabilities introduces new risks, particularly in cybersecurity, necessitating increased caution to prevent malicious misuse of technology [5]
本周六,围观学习NeurIPS 2025论文分享会,最后报名了
机器之心· 2025-11-20 06:35
2025年,AI 的演进正从"能力突破"迈向"系统构建"阶段。 自主智能体开始尝试真实任务闭环,世界模型在复杂环境中持续验证,推理架构与训练范式不断重构——技术的焦点,已不再只是"能不能做",而是"如何做得更 可靠、更可解释、更可持续"。 在这一背景下,NeurIPS 作为全球人工智能与机器学习领域最具影响力的学术会议之一, 再度成为洞察前沿趋势的重要风向标。今年大会共收到 21575 份有效投 稿,最终接收 5290 篇,整体录用率为 24.52%。大会将于 2025 年 12 月 2 日到 7 日在美国圣地亚哥举办,并且首次设置了第二个官方分会场墨西哥城,标志着全 球 AI 学术生态的多元化布局正在加速成型。 为了服务中国 AI 社区,近年来机器之心持续举办了多场 NeurIPS、CVPR、ACL、ICLR 论文分享会,受到了海内外 AI 社区的极大关注,众多高校、企业都积极 参与。 本次「NeurIPS 2025 论文分享会」专为国内 AI 人才打造,精心设置了 Keynote、论文分享、圆桌对话、Poster 交流及企业展台互动等多元环节。今天,论文分享 会的全日程、Keynote 分享嘉宾、演讲主题 ...
LLM 没意思,小扎决策太拉垮,图灵奖大佬 LeCun 离职做 AMI
AI前线· 2025-11-20 06:30
Core Insights - Yann LeCun, a Turing Award winner and a key figure in deep learning, announced his departure from Meta to start a new company focused on Advanced Machine Intelligence (AMI) research, aiming to revolutionize AI by creating systems that understand the physical world, possess persistent memory, reason, and plan complex actions [2][4][11]. Departure Reasons & Timeline - LeCun's departure from Meta was confirmed after rumors circulated, with the initial report coming from the Financial Times on November 11, indicating his plans to start a new venture [10][11]. - Following the announcement, Meta's market value dropped approximately 1.5% in pre-market trading, equating to a loss of about $44.97 billion (approximately 320.03 billion RMB) [11]. - The decision to leave was influenced by long-standing conflicts over AI development strategies within Meta, particularly as the focus shifted towards generative AI (GenAI) products, sidelining LeCun's foundational research efforts [11][12]. Research Philosophy & Future Vision - LeCun emphasized the importance of long-term foundational research, which he felt was being undermined by Meta's shift towards rapid product development under the leadership of younger executives like Alexandr Wang [12][13]. - He expressed skepticism towards large language models (LLMs), viewing them as nearing the end of their innovative potential and advocating for a focus on world models and self-supervised learning to achieve true artificial general intelligence (AGI) [14][15]. - LeCun's vision for AMI includes four key capabilities: understanding the physical world, possessing persistent memory, true reasoning ability, and the capacity to plan actions rather than merely predicting sequences [16][15]. Industry Context & Future Outlook - The article suggests a growing recognition in the industry that larger models are not always better, with a potential shift towards smaller, more specialized models that can effectively address specific tasks [18]. - Delangue, co-founder of Hugging Face, echoed LeCun's sentiments, indicating that the current focus on massive models may lead to a bubble, while the true potential of AI remains largely untapped [18][15]. - Meta acknowledged LeCun's contributions over the past 12 years and expressed a desire to continue benefiting from his research through a partnership with his new company [22].
杨立昆官宣离职,感谢一圈Meta领导,只字不提亚历山大·王
3 6 Ke· 2025-11-20 01:52
Core Insights - Yang Li-Kun, a Turing Award winner and Chief Scientist at Meta AI, announced his departure from Meta to establish a startup focused on Advanced Machine Intelligence (AMI) by the end of the year [1][3][4] - The new venture aims to create systems that can understand the physical world, possess persistent memory, reason, and plan complex action sequences, with Meta as a partner [1][3] Summary by Sections Departure and New Venture - Yang Li-Kun will leave Meta after 12 years, where he led the foundational AI research lab (FAIR) and contributed significantly to AI long-term research [3][4] - His new startup will analyze information beyond network data to better represent the physical world and its attributes [1][3] Background on AMI - AMI, a concept introduced by Yang, is Meta's internal term for AGI, focusing on understanding the physical world, common sense, persistent memory, reasoning, and planning [3][4] - Yang's departure follows the exit of another key figure, Soumith Chintala, indicating a trend of talent loss at Meta [3][4] Meta's Strategic Shift - Meta has been undergoing significant changes, including layoffs and a shift in focus towards faster model deployment, which may have influenced Yang's decision to leave [12][14] - CEO Mark Zuckerberg's strategy includes hiring top talent from other companies and restructuring the AI division, which contrasts with Yang's vision for AI development [12][14] Future Implications - Yang's new venture may serve as a balance between Meta's current direction and his vision for AI, potentially addressing the ongoing technical route conflicts within the industry [18]
Gemini 3负责人最新访谈:不做情感陪伴,只做最强生产力工具
3 6 Ke· 2025-11-20 00:03
Core Insights - Google has launched the Gemini 3 model, which introduces Generative UI capabilities, allowing users to create interactive pages and customized tools like mortgage calculators based on queries [1][2][8] - The model shows significant improvements in reasoning capabilities, maintaining coherent logic over 10 to 15 steps in complex tasks, and achieving a score of 37.5% in the "Humanity's Last Exam," surpassing its predecessor and competitors [2][4][9] - Gemini 3 Pro excels in visual intelligence, scoring 72.7% in the ScreenSpot-Pro test, indicating its ability to understand UI elements and enhance automation tasks [3][4] Performance Metrics - In various benchmark tests, Gemini 3 Pro outperformed previous models and competitors in multiple categories, including: - Humanity's Last Exam: 37.5% (up from 21.6% for Gemini 2.5 Pro) [2][4][9] - SimpleQA Verified: 72.1% accuracy, significantly higher than GPT-5.1 and Claude Sonnet 4.5 [2][4] - ScreenSpot-Pro: 72.7%, nearly 20 times better than GPT-5.1 [3][4] Strategic Positioning - Google positions Gemini 3 as a productivity-enhancing tool rather than an emotional companion, focusing on task completion metrics rather than user engagement [5][10] - The model integrates deeply with user data, allowing it to assist in email management and other tasks, evolving from a simple assistant to a more autonomous digital colleague [5][10][11] Development and Future Outlook - Google has introduced a new development platform, "Google Antigravity," which utilizes Gemini 3 to generate functional and aesthetically pleasing code based on natural language prompts [4][11] - The company emphasizes that while Gemini 3 is a significant advancement, achieving AGI still requires further breakthroughs in reasoning depth and memory mechanisms [14][16]
如何看待人工智能生态系统中的“竞合”态势?世界经济论坛首席技术官答一财
Di Yi Cai Jing· 2025-11-19 08:28
马思远认为,科技巨头间的密切合作一方面是对人工智能的潜力抱有极高期待,另一方面,行业已经意 识到需要通过战略合作才能应对目前在算力和部署上的诸多瓶颈。 人工智能发展正经历一场深度重构,美国科技巨头们打破传统竞争边界,频频组成以算力和基建为纽带 的战略联盟,该如何理解这一"合纵连横"?随着技术变革的浪潮重塑劳动力市场,身处其中的年轻人又 将如何面对未来? 在近日于浙江杭州举行的世界经济论坛"产业转型升级新动力"论坛上,第一财经记者专访了世界经济论 坛执⾏董事、⾸席技术官马思远(Stephan Mergenthaler)。 马思远认为,科技巨头间的密切合作基于两个因素,一方面是对人工智能的潜力抱有极高期待,另一方 面,行业已经意识到需要通过战略合作才能应对目前在算力和部署上的诸多瓶颈。 而对于人工智能时代下的年轻人就业困境,他的态度则十分乐观。他表示,具备与人工智能协作的思维 和能力的年轻人将对企业有极强吸引力,而这些能力的发挥也将催生新的职业形态和价值。 第一财经:在人工智能驱动产业转型上,中国、欧盟和美国分别处于什么阶段,面临哪些挑战和机遇? 但与此同时,各方也在激烈竞争,由此形成了一种颇为有趣的、近乎"竞合 ...
谷歌抢跑L3级AI,Gemini连续工作40分钟,Agent自动生成评审百条创意
3 6 Ke· 2025-11-19 08:03
Core Insights - Google's Gemini AI system is advancing towards L3 AI capabilities, allowing for extended task execution and multi-agent collaboration [15][18] - The Gemini system can run for 40 minutes on a single task, generating over 100 creative ideas and providing structured evaluation reports [2][10] Group 1: Gemini's Functionality - Gemini employs a multi-agent competition system that generates and ranks ideas based on user input, significantly reducing the time spent on iterative feedback [4][7] - The system's process includes a 40-minute cycle of generation, competition, and selection, resulting in a comprehensive output rather than a single response [7][10] - Two primary applications of this system are creative generation and collaborative research, enhancing the scope of tasks it can handle [9][10] Group 2: L3 AI Development - The transition to L3 AI, characterized by autonomous task execution over extended periods, is exemplified by Gemini's ability to operate continuously for 40 minutes [15][18] - This capability positions Gemini closer to the L3 definition, with potential future developments suggesting even longer operational durations [15][17] - The ongoing development of collaborative research features may further elevate Gemini towards L4 AI capabilities [18]
新模型“屠榜” 对话谷歌团队:AI“新旗手”如何诞生
Di Yi Cai Jing· 2025-11-19 04:41
11月19日,预热已久、全网热议的Gemini 3终于正式亮相。谷歌这次打出的不是小修小补的普通升级,而是一张"王牌"——在几乎所有主流基准测试中实现 全面领先,大模型的竞争格局可能就此改写。甚至有业内人士预言:"未来六个月内,很难有公司能够超越这一成绩。" 发布不久,OpenAI CEO 奥尔特曼与特斯拉CEO 马斯克便先后公开表示祝贺。奥尔特曼称其"看起来是个很棒的模型",评论区则调侃"这句来自竞争对手的 夸奖真是暖心"。马斯克也一如既往地送上"Nice work"的评价。 一向风格严谨的谷歌,这次也显得格外高调。官方博客标题直接打出"开启智慧新纪元",内容中多次强调"最佳""最先进"。谷歌员工也纷纷在社交媒体上为 自家产品助阵,谷歌CEO桑达尔·皮查伊(Sundar Pichai)今天已经连发了8条帖子介绍Gemini 3。 : center;"> 今天凌晨皮查伊发了条帖子,内容只有一张图,但这张图足够有说服力,Gemini 3 Pro几乎"屠榜",在所有主要竞技场排行榜上排名第一。 : center;"> 在正式发布前,第一财经参与了谷歌面向媒体的小范围沟通会,尽管对模型进展已有预期,但行业的热烈反响 ...
新模型“屠榜”,对话谷歌团队:AI“新旗手”如何诞生
Di Yi Cai Jing· 2025-11-19 04:33
Core Insights - Google has officially launched Gemini 3, a significant advancement in AI, which is expected to redefine the competitive landscape in the AI industry, with predictions that it will be hard for competitors to surpass its performance in the next six months [1][3][21] Performance Metrics - Gemini 3 Pro has achieved top rankings across major benchmarks, outperforming competitors like GPT-5.1 and Claude Sonnet 4.5 in various tests, including a 37.5% score in "Humanity's Last Exam" and 91.9% in the GPQA Diamond test [4][5][6] - In multimodal understanding, Gemini 3 Pro scored 81% in MMMU-Pro and 87.6% in Video-MMMU, setting new records in these areas [6] User Experience and Applications - Users have reported exceptional experiences with Gemini 3 Pro, noting its ability to generate complex tasks and code with minimal prompts, showcasing its advanced capabilities in practical applications [7][10] - The model is designed to assist users in handling multi-step complex tasks, which is seen as one of its key strengths [12] Strategic Moves - Google has integrated Gemini 3 into its search engine and launched a new AI programming product called Antigravity, indicating the model's readiness for commercial applications [13][16] - The company aims to leverage its extensive user base and product ecosystem to drive AI adoption, with over 650 million monthly active users and 13 million developers building applications based on Gemini [18][19] Competitive Landscape - The launch of Gemini 3 positions Google as a potential leader in the AI space, especially as it has caught up with competitors like OpenAI and Anthropic, which previously held a lead in AI programming [17][21] - Analysts have noted that Google's advancements may shift market dynamics, with increased interest from investors, as evidenced by Loop Capital upgrading Google's stock rating [18]
Gemini3发布后哈萨比斯首发声:谷歌重回第一阵营,但AI确实有泡沫
3 6 Ke· 2025-11-19 03:06
北京时间11月19日,在谷歌发布Gemini 3系列模型之后,《纽约时报》旗下科技播客《Hard Fork》发布 特别节目,由主持人凯文·罗兹(Kevin Roose)和凯西·牛顿(Casey Newton)专访谷歌DeepMind首席执 行官德米斯・哈萨比斯(Demis Hassabis)与谷歌Gemini团队负责人乔希・伍德沃德(Josh Woodward)。 本次访谈聚焦谷歌最新发布的旗舰AI模型Gemini 3(实际为Gemini 3.0系列中的Pro版本),这是谷歌在 经历Bard失败、Gemini 1.x和2.x追赶阶段之后,首次被业界广泛认为重新夺回技术与产品领先地位的里 程碑式发布。 两位负责人详细阐述了Gemini 3在多步推理、代码生成(尤其是前端与"氛围编码")、动态生成交互界 面等方面的突破,强调谷歌已将最强模型快速推向搜索、Gmail、Workspace等数十亿用户产品,重塑竞 争壁垒。 访谈核心观点包括: Gemini 3完全符合预期发展轨迹,距离通用人工智能(AGI)仍需5至10年及1至2次重大研 究突破; 以下为访谈内容精简版: 罗兹:凯西,我们今天临时加播一期特别节目,主题是 ...