Language Models
“Multimodal Approaches Cannot Achieve AGI”
AI前线· 2025-06-14 04:06
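Author | Benjamin  Translator | 王强  Planning | 褚杏娟
"In projecting language back onto a model of thought, we overlook the tacit embodied cognition that underpins our intelligence."
First, although Othello moves can provably be used to infer the full state of an Othello board, we have no reason to believe there is a way to infer a complete picture of the physical world from linguistic descriptions. What sets Othello apart from many tasks in the physical world is that Othello inherently lives in the symbolic domain and is only implemented with physical tokens to make it easier for humans to play. A full game of Othello can be played with pen and paper, but nobody can sweep a floor, wash dishes, or drive a car with pen and paper. To solve these tasks, you need a conception of the physical world that goes beyond what humans describe in language. Whether that conception of the world is encoded in a formal world model or, for example, in a value function is still open to debate, but it is clear that many problems in the physical world cannot be fully represented by a symbolic system and solved by pure symbol manipulation.
The recent success of generative AI models has convinced some people that artificial general intelligence (AGI) is imminent. Although these models seem to capture the essence of human intelligence, they defy even our most basic intuitions about intelligence. They emerged not because they are well-considered solutions to the problem of intelligence, but because they scaled effectively on the hardware we already had. Some people, absorbed in the results of scaling, have come to believe that it provides a path to AGI ...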
Just In: CVPR 2025 Awards Announced; Oxford & Meta PhD Student 王建元 Wins Best Paper, 谢赛宁 Takes the Young Researcher Award
机器之心· 2025-06-13 15:45
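Report by 机器之心, 机器之心 editorial team
CVPR 2025, held in Nashville, Tennessee, USA, has just announced its best paper and other awards.
This year 14 papers were shortlisted for the best paper selection, and 5 of them won awards: 1 Best Paper and 4 Best Paper Honorable Mentions. The conference also presented 1 Best Student Paper and 1 Best Student Paper Honorable Mention.
According to the organizers, the conference received 13,008 submissions from more than 40,000 authors this year. Compared with last year (11,532), submissions grew by 13%. In the end 2,872 papers were accepted, an overall acceptance rate of about 22.1%. Among the accepted papers there are 96 Orals (3.3%) and 387 Highlights (13.7%).
The boom in computer vision has put unprecedented pressure on the review process. The numbers of submitting authors, paper reviewers, and area chairs (ACs) all reached record highs.
More than 9,000 researchers attended in person this year, coming from over 70 countries and regions.
CVPR published the acceptance statistics for each subfield, as shown in the figure below. Image and video generation had the most accepted papers this year, while the highest acceptance rates were in 3D from multi-view and sensors and 3D from single images.
This time, the members of the best paper award committee include AI ...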
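The percentages quoted in this summary can be recomputed from the raw counts. The short Python sketch below does that; the helper name `pct` and the variable names are illustrative, and only the counts themselves come from the summary.

```python
def pct(part, whole):
    """Return part/whole as a percentage rounded to one decimal place."""
    return round(100 * part / whole, 1)

# Counts as quoted in the summary above.
submissions_2025 = 13008
submissions_2024 = 11532
accepted = 2872
orals = 96
highlights = 387

growth = pct(submissions_2025 - submissions_2024, submissions_2024)
acceptance = pct(accepted, submissions_2025)
oral_share = pct(orals, accepted)
highlight_share = pct(highlights, accepted)

print(f"submission growth: {growth}%")
print(f"acceptance rate:   {acceptance}%")
print(f"oral share:        {oral_share}%")
print(f"highlight share:   {highlight_share}%")
```

Taken against accepted papers, the Highlights share comes out near 13.5% rather than the quoted 13.7%, so the quoted figure may use a different denominator or rounding; the other three figures match the summary.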
ICML 2025 | 1000x Length Generalization! Ant Group's New Attention Mechanism GCA Achieves Precise Understanding of 16M Long Context
机器之心· 2025-06-13 15:45
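The first author of this work is 胡翔, associate researcher at 蚂蚁技术研究院 (Ant Group's technology research institute), and the corresponding author is 武威, senior researcher at the same institute.
With large language models in full swing, long-text modeling remains a highly challenging problem. At root, this is partly because the Transformer architecture of mainstream LLMs has quadratic complexity and an inference-time memory cost that grows linearly with sequence length, and partly because full attention has limited extrapolation ability and struggles to generalize to inputs far longer than those seen in pre-training.
Handling long contexts efficiently matters beyond the industry's straightforward need to cut costs and raise efficiency. It also touches a core problem of artificial general intelligence (AGI): agents with permanent memory. If all the information a person receives from birth is treated as one long context, then human memory amounts to accessing that context. Memory can therefore be viewed as the ability to access an ultra-long context, and an agent that remembers every conversation with its user could well build a data moat for large language model companies (in fact, OpenAI has already released a similar capability).
Recently, a research team at Ant Group offered a new approach to this problem. Just as a person taking an open-book exam picks out only the key pages relevant to the current question, a language model can attend only to the past chunks that are relevant to the current context. Starting from this idea, they propose GCA (Grouped Cross Attention), an attention mechanism based on causal retrieval, which learns fully end-to-end how to ...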
After a Year of Burning Cash, Has 李飞飞's "Spatial Intelligence" Vision Changed?
机器之心· 2025-06-13 12:02
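01. One year into her startup, how does 李飞飞 describe World Labs' vision?
What progress has the year-old World Labs released? Has World Labs' vision changed? Is spatial intelligence finally within reach? ...
02. Why is AI without spatial intelligence incomplete?
In a recent interview hosted by a16z general partner Erik Torenberg, 李飞飞 and early World Labs investor Martin Casado discussed her understanding of AI technology around the topics of "world models" and "spatial intelligence", and reintroduced World Labs' mission and vision one year after the startup launched.
2. 李飞飞 pointed out that current language models have clear limitations in describing and understanding the three-dimensional physical world, whereas spatial intelligence goes beyond language models as a key component of intelligence and is the core capability by which world models understand, reconstruct, and generate the physical world.
① Language is a powerful encoding of thoughts and information, but for the 3D physical world it is a "lossy encoding" that cannot effectively describe or manipulate three-dimensional space. Spatial intelligence represents an older and more fundamental form of intelligence and is a key component of AI.
3. Within this framework, World Labs is trying to build something that can understand ...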
Daily Institutional Analysis: June 13
Xin Hua Cai Jing· 2025-06-13 08:29
Group 1
- HSBC's foreign exchange strategy head indicates that geopolitical risks are putting pressure on the British pound, which is seen as a risk-sensitive currency, dropping to around 1.3530 against the US dollar [1]
- Danske Bank analysts report that the recent 30-year US Treasury auction showed strong demand, alleviating concerns about long-term US Treasury demand and pushing yields below the critical 5% level [1]
- The Swedish Nordea Bank anticipates that the Swedish central bank will lower interest rates in June, reflecting expectations among fixed-income investors [2]
Group 2
- Analysts from Mizuho Securities highlight that the current geopolitical tensions have not been fully reflected in market volatility, with risks of full-scale conflict increasing [2]
- HSBC Global Research predicts that the Philippine central bank will lower its policy rate to 5.25%, differing from previous expectations of maintaining rates, due to low inflation and slow economic growth [2]
- Economists from Wilmington Trust suggest that long-term impacts of US tariffs are more likely to lead to economic weakness rather than inflation, with consumers beginning to cut back on non-essential spending [2]
Group 3
- RSM's chief economist notes that rising prices in the US appliance market reflect cost increases from previous import tariffs, emphasizing the importance of consumer behavior in determining inflation persistence [3]
- Goldman Sachs analysts report that the US data center securitization market has surged from $5 billion to $30 billion, driven by increased capital expenditure in cloud computing and policy support [3]
- The data center market is expected to peak in occupancy rates by mid-2026, with growth primarily fueled by large investments in facilities equipped with thousands of GPUs for large language models [3]
The World's Largest Listed Hedge Fund Group Makes Its Move!
Zhong Guo Ji Jin Bao· 2025-06-13 07:00
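Man Group (英仕曼集团), the world's largest publicly listed hedge fund group, recently announced that its wholly owned subsidiary 英仕曼(上海)投资管理有限公司 has launched its first self-managed equity index-enhanced strategy product in the Chinese market: 英仕曼美量中证500指数增强策略, a CSI 500 index-enhanced strategy. The product has been filed with the Asset Management Association of China (中国证券投资基金业协会, "the Association") and is offered to qualified investors.
Since registering as a private securities fund manager in mainland China in 2017, Man Group has seen its pace of development there fluctuate. In a press release issued on June 12, Man Group said the product launch marks a new phase in the group's important strategic positioning in the Chinese investment market.
Launching a self-managed index-enhanced product in the Chinese market
Man Group further stated that the product applies the systematic quantitative investment approach of its Numeric team, built on long-term live trading experience worldwide, to investing in China's A-share market. The Numeric team reportedly has more than 30 years of quantitative investment experience. As of March 31, 2025, the global equity strategies it manages held more than US$40 billion in assets.
方子昂, senior portfolio manager at Man Numeric, said that with China's economy growing steadily, the A-share market, as the world's second-largest stock market, not only offers significant allocation potential but also provides rich sources of alpha for quantitative strategies.
杨海翔, portfolio manager at Man Numeric and lead fund manager of 英仕曼美量中证500指数增强策略, said that the investment strategy builds on quantitative models and integrates inputs including company fundamentals, industry alternative ...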
OpenAI Flips the Table: New Model Outperforms Google, o3 Drops to a Rock-Bottom Price
3 6 Ke· 2025-06-13 06:07
Core Insights
- OpenAI has launched o3-pro, an enhanced version of its reasoning model, following a 9-hour outage of ChatGPT, aiming to provide more reliable responses and extended thinking time [1][2][4].
Model Performance
- o3-pro has been made available to all ChatGPT and API Pro users, with usage limits for Plus users increased from 100 to 200 times per week [2].
- In expert evaluations, o3-pro outperformed its predecessor o3 in all tested categories, particularly in science, education, programming, business, and writing assistance [2][6].
- The model supports both text and image inputs, with a context window size of 200k and a maximum output token count of 100k [11].
Competitive Landscape
- OpenAI's performance is under scrutiny, especially with Google's Gemini 2.5 Pro entering the market, which has been noted for its competitive pricing and capabilities [4][24].
- In internal tests, o3-pro surpassed Gemini 2.5 Pro in mathematical benchmarks and outperformed Anthropic's Claude 4 Opus in doctoral-level science tests [27].
Pricing Strategy
- o3-pro is priced at $20 per million tokens for input and $80 for output, significantly lower than its predecessor o1-pro, which is expected to be phased out [24][27].
- Following the launch of o3-pro, OpenAI announced an 80% price reduction for o3, making it more competitive against Gemini 2.5 Pro [27].
User Experience
- Users have reported that o3-pro is slower in response times compared to other models, taking several minutes for simple queries, which has raised concerns about its efficiency [15][17].
- Despite the slower response, o3-pro has demonstrated strong analytical capabilities and proficiency in using tools for complex problem-solving [19][22].
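To make the quoted o3-pro prices concrete, here is a minimal Python sketch of per-request cost at $20 per million input tokens and $80 per million output tokens. The function name `request_cost_usd` and the example token counts are hypothetical; only the two prices come from the summary above.

```python
# Prices quoted in the summary, in US dollars per one million tokens.
INPUT_PRICE_PER_MILLION = 20
OUTPUT_PRICE_PER_MILLION = 80

def request_cost_usd(input_tokens, output_tokens):
    """Estimate the cost of one request from its token counts (illustrative helper)."""
    total = (input_tokens * INPUT_PRICE_PER_MILLION
             + output_tokens * OUTPUT_PRICE_PER_MILLION)
    return total / 1_000_000

# Hypothetical request: 100,000 input tokens and 20,000 output tokens.
example_cost = request_cost_usd(100_000, 20_000)
print(f"${example_cost:.2f}")
```

Because output tokens cost four times as much as input tokens, a long-reasoning response can dominate the bill even when the prompt is much larger than the answer.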
Toward an Epistemology of Artificial Intelligence: Does No One Really Understand How the Black Box of Large Language Models (LLMs) Works?
3 6 Ke· 2025-06-13 06:01
Group 1
- The core issue revolves around the opacity of large language models (LLMs) like GPT-4, which function as "black boxes," making their internal decision-making processes largely inaccessible even to their creators [1][4][7]
- Recent research highlights the disconnect between the reasoning processes of LLMs and the explanations they provide, raising concerns about the reliability of their outputs [2][3][4]
- The discussion includes the emergence of human-like reasoning strategies within LLMs, despite the lack of transparency in their operations [1][3][12]
Group 2
- The article explores the debate on whether LLMs exhibit genuine emergent capabilities or if these are merely artifacts of measurement [2][4]
- It emphasizes the importance of understanding the fidelity of chain-of-thought (CoT) reasoning, noting that the explanations provided by models may not accurately reflect their actual reasoning paths [2][5][12]
- The role of the Transformer architecture in supporting reasoning and the unintended consequences of alignment techniques, such as Reinforcement Learning from Human Feedback (RLHF), are discussed [2][5][12]
Group 3
- Methodological innovations are being proposed to bridge the gap between how models arrive at answers and how they explain themselves, including circuit-level attribution and quantitative fidelity metrics [5][6][12]
- The implications for safety and deployment in high-risk areas, such as healthcare and law, are examined, stressing the need for transparency in AI systems before their implementation [6][12][13]
- The article concludes with a call for robust verification and monitoring standards to ensure the safe deployment of AI technologies [2][6][12]
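This Year's “First AGI Stock in Hong Kong” Is Confirmed! After Five Years of Pursuing an IPO, Yunzhisheng (云知声) Finally Passes the HKEX Listing Hearing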
Sou Hu Cai Jing· 2025-06-13 00:36
Core Viewpoint
- Yunzhisheng Intelligent Technology Co., Ltd. is set to become the first "AGI stock" in Hong Kong this year after passing the Hong Kong Stock Exchange hearing and disclosing relevant information [2][3]
Company Overview
- Founded in 2012, Yunzhisheng specializes in providing intelligent voice technology and comprehensive AI solutions, focusing on the smart voice sector [6]
- The company has developed several key products, including the UniCore language model and the UniOne AI chip series, and recently launched a self-developed 600 billion parameter model [6]
- Yunzhisheng's AI computing cluster has over 184 PFLOPS and more than 10 PB of storage capacity, supporting its technology development [6]
Business Model and Market Position
- The company primarily serves the life and medical sectors, with clients including China's top three insurance groups [7]
- Yunzhisheng offers AI capabilities through a MaaS model, providing API services and customized AI technology platforms [7]
- According to Frost & Sullivan, Yunzhisheng is the fourth largest AI solution provider in China by revenue in 2024, with a market share of 0.6% [9]
Financial Performance
- The company has completed 11 rounds of financing totaling over $340 million, with a valuation around 10 billion [9]
- Revenue for 2022, 2023, and 2024 is projected at 601 million, 727 million, and 939 million respectively, with a compound annual growth rate of 25% [9]
- Despite revenue growth, the company reported net losses of 375 million, 376 million, and 454 million for the same years [9]
Funding and Future Outlook
- The IPO proceeds will be used to enhance R&D capabilities, invest in emerging business opportunities, and support international expansion [13]
- The company has raised over 700 million RMB in its D3 financing round in 2023, ensuring sufficient operational funds for at least the next 12 months [10]
- Yunzhisheng anticipates continued net losses due to ongoing R&D investments and financing costs related to redeemable securities [10]
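The 25% compound annual growth rate quoted above follows from the 2022 and 2024 revenue figures via CAGR = (end / start)^(1 / periods) - 1. A minimal Python sketch follows; the function and variable names are illustrative, and the revenue figures are taken from the summary in millions, with the currency as given there.

```python
def cagr(start, end, periods):
    """Compound annual growth rate over the given number of yearly periods."""
    return (end / start) ** (1 / periods) - 1

# Revenue by year, in millions, as quoted in the summary.
revenue = {2022: 601, 2023: 727, 2024: 939}

# Two yearly periods separate 2022 from 2024.
rate = cagr(revenue[2022], revenue[2024], periods=2)
print(f"CAGR 2022-2024: {rate:.1%}")
```

Note that the rate depends only on the first and last years; the 2023 figure does not enter the formula.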
万马科技20250612
2025-06-12 15:07
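Summary
万马科技 entered the connected-vehicle (Internet of Vehicles) sector by acquiring 有方科技. Its connected-vehicle revenue grew from RMB 50 million in 2021 to RMB 260 million in 2024, profit rose markedly as well, and the company has built a complete data closed-loop toolchain and an intelligent-driving computing center.
Connected-vehicle penetration is about 80% in China and below 30% in overseas markets. As intelligent driving requires more data, both the domestic and overseas markets have considerable room to grow. Robotaxi in particular places higher demands on real-time data monitoring and technology, and raises per-vehicle value significantly.
优卡科技 offers two major solutions, 蓝海全球车联 (global vehicle connectivity) and 云自动驾驶数据闭环 (a cloud-based autonomous-driving data closed loop), and supports 14 million vehicles. Its customers include Geely, SAIC, Dongfeng, and Li Auto, and it supports the business deployment of Robotaxi companies worldwide.
Robotaxi is regarded as the "crown jewel" of the connected-vehicle industry, and Goldman Sachs forecasts that China's Robotaxi market will grow at an annualized rate of 96%. Regular operations are already under way in Beijing, Wuhan, Guangzhou, Hong Kong, Dubai, and elsewhere, and Tesla is also about to launch a related service.
Robotaxi operations place extremely high demands on network quality, covering operational safety, user interaction, compliance, autonomous-driving data collection, and operations and maintenance. They require high-definition maps, vehicle-road coordination, remote assistance for stuck vehicles, and support from massive amounts of data.
... data monitoring needs are high, and the requirements for technology and data volume are also higher; in terms of per-vehicle value ...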