上下文工程
Search documents
看完 Manus、Cursor 分享后的最大收获:避免 Context 的过度工程化才是关键
Founder Park· 2026-01-09 12:34
毫无疑问,上下文工程的优化,仍然是 Agent 创业公司在新一年都在「卷」的重点。 在实际落地开发中,上下文信息的质量,很大程度上决定了 Agent 的表现。 Manus 的首席科学家季逸超在之前访谈中提到过一个观点: 初创公司真的应该尽可能长时间地依赖通用模型和上下文工程,而不是过早地构建专用模型,也包括微调。 上下文工程是应用层和模型层之 间最清晰、最实用的边界。 做好上下文工程,开发者能够在不触及模型底层权重的前提下,灵活驾驭模型,同时还能适应快速变化的产品需求。 最近,Cursor 也发表了一篇文章《Dynamic context discovery》,分享了他们是怎么做上下文管理的。 结合 Manus、Cursor 这两家 Agent 领域头部团队的思路,我们整理了如何做好上下文工程的一些关键要点。 Cursor 原文:https://cursor.com/cn/blog/dynamic-context-discovery 此前 Founder Park 分享的文章 《来自 Manus 的一手分享:如何构建 AI Agent 的上下文工程?》 ⬆️关注 Founder Park,最及时最干货的创业分 ...
对话 Kuse: 没融资 3 个月 1000 万美金 ARR,用 NotebookLM 的方法重做 Notion
投资实习所· 2026-01-05 03:54
Core Insights - Kuse has achieved significant growth, reaching nearly $10 million in ARR within three months without external funding, indicating a strong demand for structuring unstructured data [1][17] - The product focuses on a "Context First" approach, allowing users to upload various types of content to create reusable contextual assets, which enhances AI-generated outputs and workflow iterations [3][4] Product Differentiation - Kuse differentiates itself from general AI agents by emphasizing asset accumulation rather than one-time generation, targeting knowledge workers and enterprise scenarios [2][4] - The latest version of Kuse has shifted from a general AI tool to a native "Context First" file management and asset accumulation system, organizing materials in a Finder-like structure [4][6] User Experience and Functionality - Kuse's "Chaos in, Genius out" philosophy transforms complex inputs into clear, consumable web pages and documents, focusing on document and webpage generation rather than application development [6][10] - The formatting engine AI simplifies the process of creating structured documents, significantly reducing the time required for tasks like generating exam papers [7][8] Market Strategy - Kuse's growth strategy leverages Meta's Threads and Instagram, with a unique approach of employing interns to create numerous accounts that share practical use cases, targeting the Taiwanese and Hong Kong markets [18][22] - The product is designed to meet high-frequency needs in document generation, focusing on interactive web pages, resumes, and administrative notifications, aligning closely with traditional office tasks [22] Target Audience and Use Cases - Kuse has expanded its user base from designers to professionals in consulting, education, and law, who require high-precision, template-driven document creation [16][18] - The platform's ability to accumulate context over time enhances user interactions, making it a valuable tool for knowledge workers [15][16]
别了,大模型;你好,Agent:读懂Meta收购Manus的范式转移
创业邦· 2026-01-03 10:22
Core Viewpoint - Meta's acquisition of Manus for billions of dollars highlights the shifting landscape of AI, emphasizing the need for practical applications over mere conversational capabilities [7][14][20]. Group 1: Manus's Journey and Team - Manus, founded in Wuhan and developed in Beijing, has transitioned to a Singapore-based company, showcasing a modern narrative of Chinese tech talent navigating geopolitical challenges [7][18]. - The core team of Manus, led by founder Xiao Hong and chief scientist Peak Ji, is characterized by exceptional engineering skills and insights into user behavior, rather than traditional academic AI backgrounds [8][10]. - Peak Ji's philosophy of "orthogonality" emphasizes building applications that leverage existing models rather than competing directly with them, leading to innovative solutions in AI [12]. Group 2: Technological Innovations - Manus distinguishes itself from traditional chatbots by developing an "Agent" capable of performing complex tasks, such as market research and data analysis, rather than just engaging in conversation [16]. - The company has created a virtual operating system that enhances AI capabilities, addressing limitations in memory and operational accuracy, which has proven to be a significant engineering success [16]. Group 3: Geopolitical and Economic Challenges - The decision to relocate Manus's headquarters to Singapore and lay off Chinese staff reflects the harsh realities of geopolitical tensions, particularly regarding access to critical technology and funding [18][19]. - Manus's shift away from China is driven by the need for advanced computing power and capital, which are increasingly restricted for Chinese companies due to U.S. export controls [19]. Group 4: Implications for the Chinese AI Industry - The acquisition of Manus by Meta signifies a loss for the Chinese AI sector, as talented engineers are compelled to contribute to foreign companies due to local constraints [22]. - Manus's success illustrates the potential of Chinese engineers to innovate independently, yet the current environment hampers the growth of local ecosystems and market opportunities [22][25].
AI Coding 生死局:Spec 正在蚕食人类编码,Agent 造轮子拖垮效率,Token成本失控后上下文工程成胜负手
3 6 Ke· 2025-12-30 09:21
2025 年的 AI Coding 生态,正在为 2026 年的程序员定义一个新角色。答案可能藏在一堆冒烟的 Markdown 文件里。 这半年,Spec 驱动开发火到爆炸。仓库里迅速堆起一层层面向 Agent 的"Markdown 脚手架",它被捧为 AI Coding 的最前沿解法:用一份契约,逼 Agent 真的干活。 但问题来了:这套契约,真能接住软件工程几十年的复杂度吗?还是说,程序员的终极价值,将从"写 代码"转向"定义规则"——用 AI 听得懂的自然语言,驯服这场技术革命? 1 补全的天花板,与 Agent 的必然上位 AI Coding 的演进,已经清晰地分为了两个时代。 第一波由 Copilot 与 Cursor 开创:这是一种以人为主导的编程方式,AI 的角色是预测"下一个字 符"或"下一个编辑位置",在局部范围内提高速度和流畅度。 这种范式的边界其实非常清楚。补全必须足够丝滑,才能不打断心流,这意味着端到端时延被严格压在 几百毫秒量级,可用模型规模和上下文长度都受到天然约束:模型参数不能太大,上下文长度也远不可 能用全。 与此同时,补全的能力却在不断被拉长——从行内预测走向跨行、跨函数、 ...
近两百万人围观的Karpathy年终大语言模型清单,主角是它们
机器之心· 2025-12-21 03:01
编辑|杜伟 2025 年还有 10 天就要结束,这意味着是时候进行一波年终总结了。 对于人工智能领域而言,2025 年是大语言模型(LLM)快速演进、重磅事件密集出现的一年。 就在昨天,知名 AI 学者 Karpathy 列出了一份清单,记录了他个人认为最重要、也多少有些出乎意料的「范式转变」。 这些真正改变了行业格局、并在概念层面让 Karpathy 印象深刻的变化会落在哪些领域呢?我们接下来一一来看(以第一人称)。 可验证奖励强化学习(RLVR) 2025 年初,几乎所有实验室的 LLM 生产训练流程都像下面这样: 这套流程稳定、可靠,曾长期被视为「工业级 LLM」的标准做法。 预训练(类似 2020 年的 GPT-2/3); 监督微调(SFT,类似 2022 年的 InstructGPT) 基于人类反馈的强化学习(RLHF,约 2022 年) 但在 2025 年,一种新的阶段浮出水面,并迅速成为事实上的标配: 可验证奖励强化学习(Reinforcement Learning from Verifiable Rewards,RLVR) 。 RLVR 的核心做法是,让模型在可自动验证的环境中接受强化学习训练 ...
Manus 8 个月突破 1 亿美金 ARR,让我眼前一亮的语音 AI 产品种子轮拿了 4000 多万美金
投资实习所· 2025-12-18 05:35
在 8 月份宣布突破 9000 万美金年化收入后《 Manus 年化收入突破 9000 万美金,红杉中国投了一个 AI 床垫 》, Manus 昨晚宣布其 ARR 已经突破了 1 亿美金 ,成为从 0 到 1 亿美金 ARR 最快的初创公司。 前几天更新的版本则可以直接通过 Manus 来做移动开发了,这块我之前介绍过专注于开发 App 的 AI Coding 产品,《 创立 6 个月的 AI 卖了 8000 万美 金没融资一个创始人,App 的 AI Coding 也火了 》。 有网友说通过 Manus 开发了一个完整的移动 App,我简单体验了一下让它构建一个 AI 笔记产品,效果还不错,包括后端、数据库这些全都自己设计好 了,我只需要集成一个 OpenAI 的 API 即可,并且给出了发布到 App Store 的指南。 Manus 没有公布用户的分布情况,不过我在 X 上简单做了一下搜索,发现有不少日本用户分享了通过 Manus 做各种产品的帖子,特别是用它来做移动 App 和 Web 产品这块,我估计来自日本的用户有不少比例。 Manus 的总收入年化运行率则超过了 1.25 亿美金,这块数据包含了 ...
12月,我们推荐这 7 款 AI 新品
Founder Park· 2025-12-17 14:28
Group 1 - The article discusses the launch of seven innovative AI products at the Geek Park Innovation Conference, highlighting their uniqueness and recent developments [1][2] - These products are part of the Founder Park's AI Product Marketplace, which has recommended over 150 AI products since April, attracting over 17,000 industry professionals [3] Group 2 - Flomo, an AI note-taking product, recently upgraded its "AI Insights" feature to "Multi-Perspective Insights," allowing users to interpret their notes through various therapeutic lenses [4][5] - Flomo emphasizes the importance of personal context in note-taking, avoiding AI-generated content to maintain authenticity [7][8] Group 3 - Doka Camera, an AI-powered photography app, aims to return creative control to users by providing AI-assisted composition guidance without imposing a specific aesthetic [14][22] - Doka has achieved significant user engagement, ranking first in the photography category in Taiwan without any advertising spend [14][17] Group 4 - Remio, a personal office assistant, focuses on creating a comprehensive digital memory by automatically capturing context from users' activities, enhancing productivity [27][30] - Remio's technology allows for seamless integration of local documents and web browsing history, providing a structured context for AI interactions [34][35] Group 5 - Pallas AI is designed to assist brands with AI marketing, transforming the approach from passive search visibility to proactive recommendations [37][39] - The platform offers a comprehensive data analysis and visualization panel, enabling brands to monitor their performance across various AI platforms [43][45] Group 6 - MuleRun is an AI Agent Marketplace that connects developers and users, allowing for the monetization of AI agents and addressing mid-tail market needs [46][49] - The platform has rapidly gained traction, reaching 500,000 registered users within a month of launch [47][55] Group 7 - OdyssLife introduces the Odyss N1, an AI necklace that monitors users' dietary and exercise habits, aiming to improve health management through unobtrusive tracking [56][58] - The product provides personalized health recommendations based on real-time data analysis of users' eating patterns and physical activities [62][63] Group 8 - LavieAI focuses on generating visual content for the fashion industry using AI, significantly reducing production costs and time while maintaining aesthetic quality [65][68] - The company integrates artistic guidance into its AI models to ensure that generated content meets industry standards for visual appeal [71][72]
AI智能体时代中的记忆:形式、功能与动态综述
Xin Lang Cai Jing· 2025-12-17 04:42
记忆已成为并将继续成为基于基础模型的智能体的核心能力。它支撑着长程推理、持续适应以及与复杂环境的有效交互。随着智能体记忆研究的快速扩张 并吸引空前关注,该领域也日益呈现碎片化。当前统称为"智能体记忆"的研究工作,在动机、实现、假设和评估方案上往往存在巨大差异,而定义松散的 记忆术语的激增进一步模糊了概念上的清晰度。诸如长/短期记忆之类的传统分类法已被证明不足以捕捉当代智能体记忆系统的多样性和动态性。 在这些智能体的核心能力中,记忆 尤为关键,它明确地促成了从静态大语言模型(其参数无法快速更新)到自适应智能体的转变,使其能够通过环境交 互持续适应(Zhang et al., 2025r; Wu et al., 2025g)。从应用角度看,许多领域都要求智能体具备主动的记忆管理能力,而非短暂、易忘的行为:个性化聊 天机器人(Chhikara et al., 2025; Li et al., 2025b)、推荐系统(Liu et al., 2025b)、社会模拟(Park et al., 2023; Yang et al., 2025)以及金融调查(Zhang et al., 2024)都依赖于智能体处理、存储和管 ...
Google全链路赋能出海:3人团队调度千个智能体,可成独角兽|MEET2026
量子位· 2025-12-17 03:38
Core Insights - The future will be characterized by autonomous collaboration among intelligent agents, solving complex problems, automating workflows, and autonomously issuing tasks, creating a new business model [1] - AI agents are becoming new productivity units, injecting new meaning into the globalization logic of startups [2] - The intelligent agent sector is just beginning, with significant changes expected in the next one to two years, presenting a major opportunity for Chinese startups to go global [3] Google’s Integrated Solutions for Startups - Google has launched AI-driven integrated solutions to empower startups for efficient globalization [4] - The MEET2026 conference attracted nearly 1,500 offline attendees and over 3.5 million online viewers, highlighting the significant interest in the topic [6] - Startups face various challenges during globalization, and Google’s ecosystem can support them at every stage [7] Stages of Startup Globalization - The five stages of startup globalization include: 1. **Ideation and Strategic Planning**: Founders gather information and analyze competitors, often using Gemini for market research [8] 2. **Product Launch**: Google Cloud provides stable cloud infrastructure support [9] 3. **Market Validation**: Google Ads assists in reaching target customers [9] 4. **Market Expansion**: Google Play and other services support expansion into new markets [9] 5. **IPO Maturity**: Google’s data analysis tools aid in the final push before going public [10] Challenges and Innovations in AI - The AI field is evolving rapidly, with challenges such as hallucination (inaccurate or fabricated information) being addressed through better model training and engineering practices [11] - The introduction of the A2A (Agent-to-Agent) protocol aims to facilitate communication between intelligent agents across different enterprises [16] - The shift from SaaS subscription models to outcome-based payment models reflects a fundamental change in business logic, allowing small teams to scale significantly [18] Gemini's Evolution and Capabilities - Gemini has evolved from its initial version to Gemini 3, which has achieved significant advancements in reasoning, understanding, and problem-solving capabilities [15] - Key capabilities of Gemini 3 include: 1. **Extended Context Window**: Supports 1 million tokens, emphasizing the importance of context engineering [21] 2. **Native Multimodal Capability**: Understands text, video, images, and audio with improved clarity and accuracy [22] 3. **Function Calling Ability**: Enables intelligent agents to utilize external tools and services [23] - Gemini 3 is considered the safest model to date, having undergone comprehensive safety assessments [24]
硅谷人工智能研究院院长皮埃罗·斯加鲁菲:2025年AI智能体将重塑数字劳动力
Jin Rong Jie· 2025-12-10 08:41
Core Insights - The "EVOLVE 2025" summit showcased the roadmap for enterprise-level AI agents and introduced a "3+2+2" product matrix to facilitate rapid development of AI agents for businesses [1] - The summit emphasized the collaboration among major cloud service providers to create a sustainable AI ecosystem through the "Super Connection" global partner program [1] Group 1: AI Development Trends - Piero Scaruffi highlighted a clear trend of technological integration in generative AI by 2025, with innovations like diffusion Transformers and multi-modal capabilities becoming standard [3] - The emergence of new technologies such as thinking chains and expert mixtures is reshaping the landscape of AI applications [3] Group 2: Evolution of AI Agents - The distinction between traditional AI products and advanced AI agents was made, with the latter being likened to autonomous driving, capable of executing complex workflows independently [4] - The operational mechanism of these AI agents is summarized as a cycle of perception, decision-making, action, and learning, allowing them to adapt to various environmental changes [4] Group 3: Multi-Agent Systems - The transition from applications to multi-agent systems introduces challenges in orchestration, necessitating a new technology stack that includes hardware, cloud services, and orchestration layers [5] - The concept of "context engineering" is emphasized, requiring AI agents to understand organizational structures and goals beyond executing single tasks [5] Group 4: Industry Applications - Various sectors are witnessing innovative applications of AI, particularly in customer support, where intelligent systems can understand context and emotions, enhancing user experience [6] - Companies like Johnson Controls have developed integrated AI systems that significantly improve efficiency in maintenance and troubleshooting [6] Group 5: Trust in AI - The "Waymo effect" illustrates the growing trust in AI as autonomous vehicles become more prevalent, laying a foundation for broader AI agent applications [7] - Scaruffi envisions a future where multiple AI agents collaborate dynamically, akin to human social interactions, to achieve common goals [7]