机器之心
WithAnyone Goes Open Source: Possibly the Most Natural AI Group-Photo Model You've Seen
机器之心· 2025-11-16 04:01
Core Viewpoint
- Fudan University, in collaboration with Jieyue Xingchen, has launched WithAnyone, a new AI photo generation model that lets users generate natural, seamless AI photos by simply uploading a picture [2][4].

Group 1: WithAnyone Overview
- WithAnyone is a personalized AI photo generation method that can create various angles and expressions of a person from a single photo, or generate a group photo with multiple individuals without any sense of incongruity [4].
- Previous models such as InstantID and PuLID struggled to generate varied expressions and angles, often producing a "copy-paste" effect [5].

Group 2: Breakthrough Features
- WithAnyone breaks the "copy-paste" curse by achieving both ID consistency and controllability [13].
- The model's effectiveness is demonstrated through impressive group photos that harmoniously combine multiple individuals in a single image [22].

Group 3: Problem Identification and Solution
- The research team found that existing AI portrait generation methods often produced outputs too similar to the reference, leading to a lack of diversity in generated results [26].
- To quantify this issue, the team introduced MultiID-Bench and a "copy-paste" metric that measures the distance between generated results and reference inputs [27][29].

Group 4: Data and Training Innovations
- The team collected a dataset of 500,000 group photos, each with hundreds of different angles and expressions, along with an additional million unpaired photos for training [31].
- Training proceeded from traditional reconstruction training to paired-data training, followed by fine-tuning on high-quality data to produce the WithAnyone model [34].

Group 5: Open Source and Community Engagement
- WithAnyone has been fully open-sourced, with code, model weights, sample datasets, and evaluation benchmarks released to facilitate community replication and extension [36].
- The project aims to enhance the emotional and narrative quality of AI-generated photos, encouraging users to create personalized images with the technology [36].
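The "copy-paste" metric described above quantifies how closely a generated face tracks the literal reference input. A minimal sketch of the idea, assuming identities are compared via cosine similarity of face embeddings; the paper's exact formulation may differ, and `copy_paste_score` is a hypothetical name:

```python
import numpy as np

def cosine_sim(a, b):
    """Cosine similarity between two embedding vectors."""
    return float(np.dot(a, b) / (np.linalg.norm(a) * np.linalg.norm(b)))

def copy_paste_score(gen_emb, ref_emb, target_emb):
    """Hypothetical copy-paste score: how much more the generated face
    resembles the raw reference image than the ground-truth target
    (same identity, different pose). Large positive values suggest the
    model reproduced the reference verbatim."""
    return cosine_sim(gen_emb, ref_emb) - cosine_sim(gen_emb, target_emb)
```

Under this sketch, a score near zero indicates the output matches the target identity without copying the reference pixel-for-pixel.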
Absurd: Building a $1B+ Unicorn That Started with Humans Pretending to Be AI
机器之心· 2025-11-16 04:01
Core Insights
- The article recounts the unconventional startup journey of Fireflies.ai, which began with two entrepreneurs manually pretending to be an AI assistant to validate their business idea [7][10].

Group 1: Company Background
- Fireflies.ai was founded by two entrepreneurs who had been through six failed startups before pivoting to an AI meeting assistant [2][3].
- Initially, the founders had no actual AI technology; they joined meetings themselves, took notes, and presented them as AI-generated [4][6].

Group 2: Business Model and Growth
- Pretending to be AI allowed the founders to validate their business model and generate enough revenue to sustain operations, eventually leading to genuine automation of their services [6][9].
- Fireflies.ai has reached a valuation exceeding $1 billion, with over 20 million users across 500,000 organizations, and has been profitable since 2023 [9].

Group 3: Product Features
- The AI assistant now achieves transcription accuracy of up to 95%, supports 69 languages, and offers features such as intelligent summarization and seamless integration with other tools [9].

Group 4: Ethical Concerns
- The initial practice of having humans impersonate AI raised significant ethical concerns, including user privacy, data security risks, and the implications of misleading clients [13][14][15].
- Critics note that such an approach could invite legal repercussions and foster a culture of deception in the industry [17][18].
In the Era of LLMs, Is "Continual Learning" the Optimal Solution to the "Memory" Problem?
机器之心· 2025-11-16 01:30
Group 1
- The article discusses "Nested Learning," a concept proposed by Google that addresses memory management in large language models (LLMs) and the challenge of catastrophic forgetting [5][6][8].
- Nested Learning frames a model as a multi-layered optimization problem, a series of interconnected sub-problems, allowing new skills to be learned while avoiding the loss of previously acquired knowledge [6][7].
- The research introduces the Continuum Memory System (CMS), which treats memory as a set of modules updating at different frequencies, improving the model's ability to manage memory effectively [6][7].

Group 2
- The article highlights the importance of improving LLMs' memory capabilities to enable continual learning, allowing AI to retain contextual experiences, semantic knowledge, and procedural skills [8].
- A proposed three-layer memory architecture comprises model weights for general knowledge, the KV cache for intermediate results, and the context for relevant background information, together enabling appropriate responses from the model [8].
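The idea of memory modules that update at different frequencies can be pictured with a toy Python sketch; the module names, periods, and consolidation rule below are illustrative assumptions, not the design in Google's paper:

```python
class MemoryModule:
    """Toy memory module that consolidates its state every `period` steps."""
    def __init__(self, period):
        self.period = period
        self.state = []        # raw observations accumulated so far
        self.snapshot = None   # consolidated memory, refreshed periodically

    def observe(self, step, item):
        self.state.append(item)
        if step % self.period == 0:
            # consolidate: keep only the last `period` observations
            self.snapshot = list(self.state[-self.period:])

class ContinuumMemory:
    """Fast, medium, and slow modules updating at different frequencies."""
    def __init__(self):
        self.modules = {"fast": MemoryModule(1),
                        "medium": MemoryModule(4),
                        "slow": MemoryModule(16)}

    def step(self, t, item):
        for m in self.modules.values():
            m.observe(t, item)
```

The fast tier refreshes every step while the slow tier changes rarely, which is the frequency separation the CMS idea relies on.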
Toward Compute Freedom: openEuler Releases the World's First Supernode Operating System, Purpose-Built for AI
机器之心· 2025-11-15 09:23
Core Viewpoint
- The operating-system conference, themed "Intelligent Leap Without Boundaries, Open Source for a Better Future," gathered industry leaders to promote the development of the openEuler operating system and accelerate the global open-source software ecosystem [2].

Group 1: Development and Growth of openEuler
- The openEuler community has grown significantly over the past six years, with over 2,100 member organizations, more than 23,000 global contributors, and over 5.5 million users served [2].
- Cumulative openEuler installations are expected to exceed 16 million sets by the end of 2025, establishing it as a preferred operating system for digital transformation across industries [2].
- The community is embarking on a new five-year development path, launching an operating system tailored for supernodes by the end of 2025, aiming to lead in the AI era and expand globalization efforts [2][12].

Group 2: Strategic Importance of Basic Software
- Academician Ni Guangnan emphasized the strategic nature of basic software, advocating independent innovation, collaborative ecosystem building, and sustained long-term investment [3].
- The transition to supernodes is recognized as a mainstream trend in computing infrastructure, with operating systems playing a crucial role in connecting hardware and applications in the intelligent era [3].

Group 3: Collaboration and Ecosystem Building
- The core of open source is collaboration; the ecosystem's future relies on co-creation and sharing among hardware partners, software vendors, and global developers [5].
- Huawei's CEO highlighted the rapid transformation brought by AI and the need for operating systems that can support supernodes, contributing key capabilities to the openEuler community [6][10].
Group 4: Technological Innovations and Solutions
- The openEuler community has introduced the Intelligence BooM full-stack open-source AI solution, improving inference efficiency by 10% to 30% through heterogeneous collaboration [16].
- In industrial automation, openEuler has evolved its embedded capabilities, achieving microsecond response times and deployments in several well-known enterprises [16].

Group 5: Globalization Efforts
- New donors to the openEuler community include major chip manufacturers such as AMD, further strengthening the community's resources [18].
- The community has established deep technical cooperation with 15 global open-source organizations in areas such as AI, cloud computing, and embedded systems, enhancing its global presence [20].
NeurIPS 2025 Spotlight | NYU Proposes QSVD: Purely Mathematical Compression for Lighter, Faster, More Stable Models
机器之心· 2025-11-15 09:23
Core Insights
- The article presents QSVD, a novel framework for efficient compression of vision-language models (VLMs) that combines singular value decomposition (SVD) with quantization, reducing computational cost while maintaining model performance [3][29].

Group 1: Background and Motivation
- Vision-language models serve as a crucial engine connecting visual understanding and language generation, enabling applications such as image description and visual question answering [2].
- Their large parameter counts, often in the billions, impose heavy memory and compute demands, making practical deployment challenging [2][6].

Group 2: QSVD Framework
- QSVD performs a joint SVD over the query-key-value (QKV) matrices, yielding a unified low-rank approximation that reduces storage and computation requirements [10][24].
- The framework introduces cross-layer rank allocation, which assigns ranks according to the importance of different layers, optimizing the compression budget [13][14].

Group 3: Technical Innovations
- QSVD integrates low-bit quantization with outlier smoothing to improve hardware efficiency while maintaining high accuracy during quantization [15][18].
- By caching only a shared representation of the K/V values, the method halves the KV-cache memory footprint during inference [12][19].

Group 4: Experimental Results
- Evaluations on models including LLaVA-v1.5 and SmolVLM show that QSVD achieves over 10% higher accuracy than existing methods such as ASVD and SVD-LLM [20][22].
- The results indicate that QSVD not only compresses models but also enhances their intelligence, with inference speed-ups of up to 13x [23][19].
Group 5: Conclusion and Future Directions
- QSVD represents a significant advance in efficient VLM compression, focusing on self-attention layers to improve inference efficiency while minimizing accuracy loss [29].
- Future work aims to extend the optimizations to cross-module joint compression and adaptive optimization, improving the deployability and accessibility of powerful models [29].
Is 3D Vision Over-Engineered? ByteDance's Depth Anything 3 Arrives, with Praise from Saining Xie
机器之心· 2025-11-15 09:23
Core Insights
- The article covers the release of Depth Anything 3 (DA3), a model that simplifies 3D visual perception using a single depth-ray representation and a standard Transformer architecture, eliminating the need for complex specialized designs [5][12][9].

Group 1: Key Findings of Depth Anything 3
- DA3 achieves a 44% improvement in pose estimation and a 25% improvement in geometric estimation over prior state-of-the-art methods [7].
- The model predicts spatially consistent geometry from any number of visual inputs, with or without known camera poses [12].
- DA3 sets new state-of-the-art (SOTA) results across 10 tasks, with a 35.7% gain in camera pose accuracy and a 23.6% gain in geometric accuracy [14].

Group 2: Model Architecture and Training
- The architecture uses a standard pre-trained visual Transformer as the backbone, with an input-adaptive cross-view self-attention mechanism for efficient information exchange across views [13].
- DA3 is trained with a teacher-student paradigm on diverse data sources, including real-world depth-camera data and synthetic data, to generate high-quality pseudo-depth maps [14].
- The design allows known camera poses to be incorporated flexibly, adapting to a range of real-world scenarios [13].

Group 3: Applications and Potential
- DA3 demonstrates video reconstruction capabilities, recovering visual space from complex video inputs [17].
- The model improves SLAM performance in large-scale environments, significantly reducing drift compared to previous methods [19].
- Its ability to estimate stable, fusable depth maps from multiple camera views can improve environmental understanding in autonomous vehicles and robotics [21].
Group 4: Community Response
- Following the release of DA3, many developers have expressed interest in integrating this efficient, straightforward approach into their projects, indicating its practical applicability [22].
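Cross-view self-attention amounts to letting tokens from all views attend to one another; one way to picture it is to flatten the view axis before a standard attention step. A toy single-head numpy sketch, not DA3's actual implementation:

```python
import numpy as np

def softmax(x, axis=-1):
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def cross_view_attention(tokens):
    """tokens: (n_views, n_tokens, dim). Merge the view axis so every
    token can attend to tokens from all views, run plain single-head
    self-attention, then restore the per-view layout."""
    v, n, d = tokens.shape
    x = tokens.reshape(v * n, d)
    attn = softmax(x @ x.T / np.sqrt(d))   # (v*n, v*n) attention weights
    return (attn @ x).reshape(v, n, d)
```

The appeal of this formulation is that it reuses a standard Transformer block unchanged: multi-view reasoning comes from the token layout, not from a bespoke architecture.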
NeurIPS 2025 | When AI Learns to Trade Stocks: Reproducing Emergent Financial-Market Phenomena with a Thousand Virtual Investors
机器之心· 2025-11-15 09:23
Core Insights
- The article introduces TwinMarket, a scalable behavioral and social simulation platform for financial markets driven by large language models (LLMs), aiming to replicate human-like decision-making and social interaction in trading environments [2][4].

Group 1: Traditional Market Simulation Limitations
- Traditional market simulations rely on preset rules, leading to three fundamental limitations: behavioral homogeneity, lack of social interaction, and black-box cognitive processes [5][6].
- Such models often assume a "standard investor," failing to capture the heterogeneity of real market participants [6].
- Social-media influence and the complexity of information dissemination are inadequately modeled in traditional frameworks [6].

Group 2: TwinMarket's Innovations
- TwinMarket adopts the Belief-Desire-Intention (BDI) cognitive framework, marking a paradigm shift from rule-based to cognitively grounded agents [7][10].
- The BDI framework lets agents reflect on their decisions, enhancing their learning through cognitive updates rather than gradient descent [12].

Group 3: Data-Driven Simulation Environment
- TwinMarket is grounded in real data, initializing user profiles from the trading records of 639 investors covering 11,965 transactions [15][19].
- The platform incorporates multiple data sources, including stock recommendations and news articles, to simulate a realistic trading environment [20].

Group 4: Micro and Macro Behavioral Insights
- The simulation shows wealth inequality emerging and widening naturally within a fair virtual market, with the Gini coefficient rising over time [25][26].
- Frequent trading correlates with poorer returns, reflecting human behavioral biases such as overconfidence and emotional decision-making [27].
Group 5: Stylized Facts Validation
- TwinMarket reproduces four stylized facts of real markets: fat-tailed return distributions, the leverage effect, the volume-price relationship, and volatility clustering [31][32][33][34].
- The simulation captures collective behavior driving market volatility, showing how individual biases can amplify into macro-level crises [36].

Group 6: Scalability and Practical Applications
- TwinMarket scales well, maintaining high correlation with real market price movements even in large-scale experiments with 1,000 agents [44][46].
- The platform is a valuable tool for studying complex socio-economic systems, letting researchers test theories and evaluate regulatory impacts in a controlled environment [52][56].

Group 7: Future Directions
- Future work aims to enrich market mechanisms and introduce macroeconomic interactions, extending the simulation's applicability to a wider range of financial ecosystems [64][65].
- Cross-disciplinary applications, including political and public-health simulations, are also envisioned [66].
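The Gini coefficient used above to track emergent wealth inequality is a standard statistic; a self-contained sketch of its computation from agents' wealth levels:

```python
def gini(wealth):
    """Gini coefficient via the sorted-index formula: 0 means perfect
    equality; values approaching 1 mean extreme concentration."""
    xs = sorted(wealth)
    n = len(xs)
    total = sum(xs)
    if total == 0:
        return 0.0
    # sum of (2i - n - 1) * x_i over sorted values, 1-indexed i
    weighted = sum((2 * i - n - 1) * x for i, x in enumerate(xs, start=1))
    return weighted / (n * total)
```

Computing this per simulation step over the 1,000 agents' wealth would yield the rising-inequality curve the paper describes.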
EMNLP 2025 | BIGAI (通研院) Demystifies MoE Interpretability to Improve Context Faithfulness
机器之心· 2025-11-15 06:23
Core Insights
- The article explores applying mechanistic interpretability to Mixture-of-Experts (MoE) models, arguing that understanding the underlying mechanisms is key to improving both performance and explainability [4][5][6].

Group 1: Mechanistic Interpretability and MoE
- Many teams work on MoE models, but few focus on mechanistic interpretability, making this a rare and valuable line of research [4].
- The article proposes a method called "Router Lens & CEFT" to improve the context faithfulness of language models; the work has been accepted to EMNLP 2025 [7][9].
- The research identifies experts within MoE models that are particularly adept at using contextual information, termed "context-faithful experts" [14][18].

Group 2: Context Faithfulness and Expert Specialization
- Context faithfulness refers to the model's ability to generate responses grounded strictly in the provided context, avoiding irrelevant information [10].
- The study confirms that context-faithful experts exist within MoE models and shows that adjusting expert activation can significantly improve context utilization [18][20].
- The Router Lens method identifies these experts by calibrating routing behavior to reflect their true capabilities [16].

Group 3: Performance Improvements and Efficiency
- CEFT, which fine-tunes only the identified context-faithful experts, matches or exceeds full-parameter fine-tuning while greatly reducing the number of trainable parameters [41][44].
- CEFT trains only about 500 million parameters versus 6.9 billion for full fine-tuning, a 13.8x reduction in parameter count [44].
- CEFT also resists catastrophic forgetting better than full fine-tuning, as shown by performance across multiple benchmarks [46].
Group 4: Future Applications and Research Directions
- The Router Lens method can be applied to identify and analyze other types of experts, such as those specialized in reasoning or programming [50].
- It can also help debug MoE models by locating poorly performing or misleading experts [51].
- Combining Router Lens with other interpretability techniques could further illuminate expert behavior and knowledge distribution within models [51].
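The selection step behind identifying context-faithful experts can be pictured as comparing each expert's average routing probability on context-grounded inputs against a generic baseline; a toy sketch, where the function name is hypothetical and the actual Router Lens calibration is more involved:

```python
import numpy as np

def find_context_faithful_experts(routing_ctx, routing_base, top_n=2):
    """routing_ctx / routing_base: (n_samples, n_experts) arrays of
    router probabilities on context-grounded vs. generic inputs.
    Experts whose mean activation rises most when context matters are
    flagged as context-faithful candidates."""
    gap = routing_ctx.mean(axis=0) - routing_base.mean(axis=0)
    ranked = np.argsort(gap)[::-1][:top_n]
    return [int(i) for i in ranked]
```

A CEFT-style fine-tune would then freeze everything except the parameters of the experts this step flags.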
When AI Redefines "Research Impact": Rethinking and Reshaping CSRankings
机器之心· 2025-11-15 06:23
Core Viewpoint
- The article traces the evolution of academic ranking systems, emphasizing the shift from quantity-based metrics, such as publication counts, to quality-based assessments that reflect true academic impact and influence [2][12].

Group 1: Issues with Current Ranking Systems
- Traditional rankings such as USNews rely on subjective surveys, while CSRankings uses objective metrics like publication counts, fueling competition over quantity rather than quality [2][3].
- Citation counts as a proxy for academic influence have their own drawbacks, since not all citations indicate a significant contribution to the field [3][4].

Group 2: New Approaches to Measuring Impact
- Researchers from Oregon State University and the University of California, Santa Cruz have built a new academic ranking system that uses large language models (LLMs) to assess the impact of papers [5][7].
- The LLM analyzes top AI conference papers from 2020-2025 and, for each paper, identifies the five most important references it cites, surfacing the foundational works that drive innovation in the field [7][8].

Group 3: Implementation of the New Ranking System
- The system maps the identified key references back to their authors and institutions, awarding academic-influence points based on how often a paper is cited as a key reference by new research [10][12].
- This approach rewards institutions behind groundbreaking discoveries and foundational research, shifting the focus from raw publication counts to genuine academic influence [12][13].

Group 4: Results and Rankings
- The resulting rankings highlight institutions that have significantly shaped their fields, offering a more nuanced picture of academic contribution [12][14].
- The article provides specific rankings of institutions based on their impact scores, illustrating the effectiveness of this new methodology in recognizing true academic excellence [16][21].
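The scoring step, tallying influence points for institutions whose papers are repeatedly flagged as key references, can be sketched with a simple counter; the field names here are hypothetical:

```python
from collections import Counter

def influence_scores(papers_key_refs, affiliation):
    """papers_key_refs: iterable of lists, each holding the reference
    ids an LLM flagged as a new paper's most important citations.
    affiliation: maps a referenced paper id to its institution.
    Every time a paper appears as a key reference, its institution
    earns one influence point."""
    scores = Counter()
    for key_refs in papers_key_refs:
        for ref in key_refs:
            inst = affiliation.get(ref)
            if inst is not None:
                scores[inst] += 1
    return scores
```

Sorting the resulting counter would produce an institution ranking weighted by foundational influence rather than raw publication volume.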
From "Behavioral Data" to "AI Memory": Which Route Is More Likely to Deliver AI's "Lifetime Memory" of Users?
机器之心· 2025-11-15 02:30
Core Viewpoint
- The article surveys the ongoing competition over long-term memory systems in the AI industry, highlighting the different approaches companies take to improve user experience and differentiate their products [1].

Group 1: From "Behavior Data" to "AI Memory"
- Current AI products, such as assistants and virtual companions, mostly operate on a one-off interaction basis, which erodes user trust and engagement [4].
- Long-term memory should be a core design element from the outset rather than an afterthought, as Artem Rodichev of Ex-human emphasizes [4].
- Effective memory systems must balance retaining significant events, updating based on user interactions, and giving users control over memory management [4].
- The real challenge of product differentiation lies not in replicating features but in how a product learns and adapts through memory [4].
- Mainstream personal-assistant systems layer memory into short-term, mid-term, and long-term tiers, deepening their understanding of user behavior over time [4].
- The interplay of these layers creates a "behavioral compounding" effect that makes the resulting contextual depth hard for competitors to replicate [4].
- Companies are making strategic choices about what to remember, for whom, and for how long, seeking a competitive edge through distinctive memory systems [4].

Group 2: Routes to Achieve AI's "Lifetime Memory"
- Various product routes have emerged around AI long-term memory, each emphasizing a different strategic narrative: privacy, cost efficiency, speed, or integration [5].
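The short-/mid-/long-term layering described above can be sketched as tiers with different retention horizons, where items that recur are promoted upward; the class name and promotion rule are illustrative assumptions, not any vendor's design:

```python
from collections import deque

class LayeredMemory:
    """Toy three-tier memory: short-term is a small rolling window,
    items seen again while still in the window accumulate evidence in
    mid-term, and sufficiently confirmed items graduate to long-term."""
    def __init__(self, short_size=5, promote_after=2):
        self.short = deque(maxlen=short_size)  # recent interactions only
        self.mid = {}                          # item -> repeat count
        self.long = set()                      # durable user facts
        self.promote_after = promote_after

    def observe(self, item):
        if item in self.short:
            self.mid[item] = self.mid.get(item, 0) + 1
            if self.mid[item] >= self.promote_after:
                self.long.add(item)
        self.short.append(item)
```

In this sketch, one-off mentions expire with the rolling window while repeated signals compound into durable memory, mirroring the "behavioral compounding" effect described above.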