Workflow
大型语言模型(LLM)
icon
Search documents
喝点VC|a16z谈搜索大变局:搜索迈入由语言模型主导的“生成式引擎优化(GEO)”全新范式
Z Potentials· 2025-06-12 04:24
Core Insights - The article discusses the transition from traditional Search Engine Optimization (SEO) to Generative Engine Optimization (GEO), highlighting the impact of large language models (LLMs) on search behavior and marketing strategies [3][5][21] - It emphasizes that the SEO market, valued at over $80 billion, is facing challenges as search behavior shifts from browsers to LLM platforms, fundamentally altering how exposure and content optimization are defined [3][5][9] Transition from Links to Language Models - Traditional search relied on link-based ranking, while GEO focuses on language and direct answers generated by models [4][5] - The average query length has increased significantly to 23 words, compared to just 4 words in traditional searches, indicating deeper user engagement [4] - LLMs provide personalized responses through memory and reasoning capabilities, changing the content discovery and optimization logic [4][5] New Metrics and Competitive Focus - The focus of competition has shifted from click-through rates to "model citation rates," where brands need to be encoded into AI layers to build new competitive barriers [5][12] - Emerging platforms like Profound and Goodie help brands analyze their presence in AI-generated answers and track sentiment in model outputs [12][13] Brand Strategy Evolution - A new brand strategy is emerging that prioritizes model recognition over public recognition, with "unprompted awareness" becoming a key metric in the AI era [12][14] - Tools like Ahrefs' Brand Radar and Semrush's AI toolkit are adapting to help brands monitor their visibility and mentions in generative platforms [13][14] The Rise of GEO Tools - GEO tools are not just about data measurement but also about actively shaping LLM behavior through insights and iterative feedback loops [20] - Companies that excel in GEO will create actionable infrastructures for real-time marketing activities and content optimization [20][21] Timing and Market Dynamics - The article notes that the transition to GEO is still in its early stages, with significant opportunities for brands to adapt as advertising budgets shift rapidly [21][22] - The ultimate question for marketers in the AI-driven landscape is whether models will remember their brands [22]
本周WWDC推出新Siri无望?华尔街质疑苹果AI能力
Hua Er Jie Jian Wen· 2025-06-09 02:43
Core Insights - Apple's upcoming WWDC on June 9 is expected to disappoint investors due to ongoing challenges in upgrading Siri and integrating advanced large language models (LLM) into its AI functionality, "Apple Intelligence" [1][4] - The integration of LLMs to enhance Siri's conversational abilities has faced significant technical difficulties, leading to numerous bugs that competitors like OpenAI and Google have not encountered [3][8] - The delay in launching the upgraded Siri has resulted in a decline of approximately 18% in Apple's stock price since the beginning of 2025, making it the worst performer among the "Tech Seven" giants [4] Siri Upgrade Challenges - Apple is attempting to improve Siri's capabilities to respond more like a human, but the integration process has been plagued by bugs, which has hindered progress [3] - A former Apple executive criticized the gradual development approach, stating that it cannot fundamentally transform Siri [3] - Analysts suggest that it may take Apple three years or more to deliver a modernized AI assistant, significantly lagging behind competitors [8] Market Reactions - Investor sentiment has soured due to repeated delays in the "Apple Intelligence" feature, leading to low expectations for the upcoming WWDC [4] - Analysts from Morgan Stanley and Bank of America have expressed concerns about Apple's ability to meet its previous commitments regarding AI advancements [4][8] Strategic Focus Shift - The upcoming WWDC may focus more on brand restructuring rather than significant technological breakthroughs, with plans to rebrand operating systems and repackage existing features as "AI-driven" [9] - Apple is expected to announce the opening of its foundational models to third-party developers, although its LLM capabilities are significantly less complex than those of competitors [9] - Internal sources indicate that expectations for the AI segment of the conference are low, raising concerns about Apple's visibility in the AI space [9]
ICML 2025 Spotlight | 谁导致了多智能体系统的失败?首个「自动化失败归因」研究出炉
机器之心· 2025-05-30 03:28
问题来了:到底是哪个 Agent 出了错?又是在对话流程的哪一环节?调试这样的多智能体系统如同大海捞针,需要翻阅大量复杂日志,极其耗时。 这并非虚构。在多智能体 LLM 系统中,失败常见但难以诊断。随着这类系统愈加普及,我们急需新方法快速定位错误。正因如此,ICML 2025 的一篇 Spotlight 论 文提出了「自动化失败归因(Automated Failure Attribution)」的新研究方向,目标是让 AI 自动回答:是谁、在哪一步导致了失败。 该工作由 Penn State、Duke、UW、Goolge DeepMind 等机构的多位研究人员合作完成。 论文标题:Which Agent Causes Task Failures and When? On Automated Failure Attribution of LLM Multi-Agent Systems 背景挑战 LLM 驱动的多智能体系统在诸多领域展现出巨大潜力,从自动化助手协同办公到多 Agent 合作完成 Web 复杂操作等。然而,这些系统 脆弱性 也逐渐显现:多个 Agent 之间的误解、信息传递错误或决策不当,都可能导致 ...
全球首个宠物翻译器,上线爆火
3 6 Ke· 2025-05-23 00:47
近期,谷歌推出 DolphinGemma 大模型, 称将让人类听懂海豚的语言,实现人与海豚在水下的实时交流。另一个由华人团队研发、面向全球英文用户的 人狗交流应用Traini在去年6月出现,成为全球首个实现人宠语言互译的AI原生应用。AI正步入跨物种交流领域,拓宽着人们对非人类语言理解的边界。 一条联系了Traini的CEO孙邻家,他是80后、中国人,老家在吉林长白山。我们与他聊了聊AI新技术对人宠交流领域的影响,从0到1探索过程中的挑战, 以及他三年来身处行业内部的感受。 除此之外,我们还想知道:当人类暂时离开语言的中心,开始尝试建立起与非人类语言平等对话的可能,在新奇感过后,AI+跨物种交流对我们具有怎样 的意义? 孙邻家,80后,吉林长白山人 知名投行高盛近期的一份报告显示,中国的宠物数量首次超过 4 岁以下婴幼儿总量。 同时 根据艾媒咨询的数据, 2023 年中国宠物经济产业规模 就已经 达到 5928 亿元。 根据《2025宠物品牌网红营销生态报告》,以年轻群体为代表的养宠人多将宠物视作"孩子"与"朋友",呈现出情感消费与拟人化养宠的趋势。 这样的需求也催生了相关产业,比如几年前备受争议的宠物灵媒师 ...
仅需1个数据,就能让大模型的数学推理性能大大增强?
机器之心· 2025-05-09 09:02
论文发现,只在 RLVR 训练中使用一个训练数据(称作 1-shot RLVR),就可以在 MATH500 上,将 Qwen2.5-Math-1.5B 的表现从 36.0% 提升到 73.6%,以及把 Qwen2.5-Math-7B 的表现从 51.0% 提升到 79.2% 。 这个表现和使用 1.2k 数据集(包括这一个数据)的 RLVR 效果差不多。 使用两个训练样本的 RLVR 甚至略微超过了使用 1.2k 数据集(称作 DSR-sub)的表现, 和使用 7.5k MATH 训练集的 RLVR 表现相当。这种表现可以在 6 个常用的数学推理任务上都可以观察到。 本文第一作者王宜平是华盛顿大学的博士生,其导师、通讯作者杜少雷为华盛顿大学Assistant Professor;另外两位通讯作者 Yelong Shen 和 Shuohang Wang 是 Microsoft GenAI 的Principal Researcher。 最近, 大型语言模型(LLM)在推理能力方面取得了显著进展,特别是在复杂数学任务上。推动上述进步的关键方法之一就是带可验证奖励的强化学习 (Reinforcement Learni ...
AI智能体协议全面综述:从碎片化到互联互通的智能体网络
Core Viewpoint - The article discusses the evolution and categorization of AI agent protocols, emphasizing the need for standardized communication to enhance collaboration and problem-solving capabilities among AI agents across various industries [1][9]. Summary by Sections AI Agent Protocols Overview - The report introduces a systematic two-dimensional classification framework for existing AI agent protocols, distinguishing between context-oriented protocols and inter-agent protocols, as well as general-purpose and domain-specific protocols [1]. Model Context Protocol (MCP) - MCP represents a centralized approach where a core "MCP travel client" agent coordinates all external services, leading to a star-shaped information flow. While it is simple and easy to control, it lacks flexibility and scalability, making it challenging to adapt to complex tasks [2][3]. Agent-to-Agent Protocol (A2A) - A2A promotes a distributed and collaborative model, allowing agents to communicate directly without a central coordinator. This flexibility supports dynamic responses to changing needs but may face challenges when crossing organizational boundaries [4][5]. Agent Network Protocol (ANP) - ANP standardizes cross-domain interactions, enabling agents from different organizations to collaborate effectively. It formalizes the request and response process, making it suitable for diverse and secure environments [6]. Agora Protocol - Agora focuses on translating user natural language requests into standardized protocols for execution by specialized agents. This three-stage process enhances adaptability and allows agents to concentrate on their core functions [7][8]. Future Trends in AI Agent Protocols - The development of AI agent protocols is expected to evolve towards more adaptive, privacy-focused, and modular systems. Short-term goals include establishing unified evaluation frameworks and enhancing privacy protection mechanisms [9][10]. - Mid-term trends may involve embedding protocol knowledge into large language models and developing layered protocol architectures to improve interoperability [11][12]. - Long-term aspirations include creating a collective intelligence infrastructure and specialized data networks to facilitate structured, intent-driven information exchange among agents [13][14][15]. Conclusion - The exploration of AI agent protocols indicates a clear trajectory towards a more intelligent, autonomous, and collaborative future, with significant implications for technology, society, and economic models [16][17].
微软正式开源UFO²,Windows桌面迈入「AgentOS 时代」
机器之心· 2025-05-06 08:04
近年来,图形用户界面(GUI)自动化技术正在逐步改变人机交互和办公自动化的生态。然而,以 Robotic Process Automation(RPA)为代表的传统自动化工具通 常依赖固定脚本进行操作,存在界面变化敏感、维护成本高昂、用户体验欠佳等明显问题。 同时,近年来兴起的基于大型语言模型(LLM)的计算机智能体(Computer-Using Agents,CUA)虽然展现出灵活的自动化潜力,但多数方案仍停留在概念验证 或原型阶段,缺乏与操作系统深度集成的能力,制约了其在实际工作环境中的规模化应用。 针对这些行业痛点,作为前代纯 GUI 桌面智能体 UFO 的全面升级版, 微软研究团队近日正式开源了业内首个深度集成 Windows 操作系统的桌面智能体平 台 ——UFO² AgentOS 。 该平台不仅继承了 UFO 的强大 GUI 操作能力,还在系统层面进行了深度优化,显著提升了智能体在 Windows 环境下的操作效率与稳定 性。 本论文第一作者为微软 DKI 团队的 Chaoyun Zhang,其为 Windows 平台首个智能体系统 ——UFO 的核心开发者,该项目已在 GitHub 上开源并获得 ...
过去四周,AI推理爆了,GPU在燃烧,英伟达依旧供不应求
Hua Er Jie Jian Wen· 2025-04-27 10:38
Group 1 - Investor sentiment has deteriorated due to macroeconomic and supply chain risks, but demand for NVIDIA's GPUs has surged due to the significant need for inference chips driven by large language models (LLMs) [1] - Token generation has increased over five times since the beginning of the year, creating immense pressure on the ecosystem and driving a surge in investment to handle these workloads [1] - AI companies are experiencing explosive user growth, with many forced to compete for GPU resources to meet the massive demand for inference software [1] Group 2 - Morgan Stanley has lowered its target price for NVIDIA to $160 from $162, reflecting overall valuation declines in the peer group rather than changes in the company's fundamentals [2] - Despite strong demand, supply constraints for NVIDIA's Blackwell chips, particularly the GB200/300 models, are limiting the ability to meet the explosive growth in demand [2][4] - Morgan Stanley has raised its revenue forecast for fiscal year 2026 by 10.7% and adjusted earnings per share up by 11.9%, indicating that these figures may still be conservative [5]
人工智能芯片大赢家
半导体芯闻· 2025-04-07 11:07
如果您希望可以时常见面,欢迎标星收藏哦~ 来源:内容 编译自 semiengineering ,谢谢。 2025 年初,我认为 AI 被过度炒作,ASIC 仍是小众产品,市场回调不可避免。我的长期观点发生 了巨大变化。AI 技术和应用正在以惊人的速度加速发展。GenAI/LLM 领导者之一 Nvidia 将在 2030 年成为首家市值达到 10 万亿美元的公司。 大型语言模型 (LLM) 在功能和成本效率方面都在迅速提高。目前每周有超过 5 亿用户,其中 ChatGPT 领先,而且这个数字还在快速增长。这种指数级增长推动了数据中心使用量和资本支出 的大幅增加,主要由领先的 CSP 推动——亚马逊、微软、谷歌、Meta 和现在的 OpenAI。其中四 家是市值达万亿美元的公司。他们将挑选半导体赢家。 类别细分和主要参与者: 1 GPU/AI 加速器 赢家: Nvidia 在 GTC 2025 上,Nvidia 首席执行官黄仁勋预测,到 2028 年全球数据中心资本支出将达到 1 万 亿美元。按照这个速度,到 2030 年数据中心资本支出可能达到约 1.4 万亿美元。我在这次分析中 寻找的是大局——5 年后的数字 ...