上下文工程
Search documents
看完 Manus、Cursor 分享后的最大收获:避免 Context 的过度工程化才是关键
Founder Park· 2026-01-09 12:34
Core Insights - The optimization of context engineering remains a key focus for Agent startups in the new year [2] - The quality of contextual information significantly determines the performance of Agents in practical development [3] - Manus's chief scientist emphasizes that startups should rely on general models and context engineering for as long as possible before building specialized models [4] Context Engineering Strategies - "Context reduction" is identified as the most direct and effective strategy during the construction of Agents [7] - The phenomenon of "context rot" occurs as the context length continues to grow, leading to performance degradation [10] - A consensus in the industry suggests "context offloading" as a solution, which involves transferring information outside the Agent's short-term memory for precise retrieval when needed [10][11] - Cursor's approach involves converting lengthy tool results and chat records into files, allowing the Agent to reference these files instead of overloading the context [12][14] - Manus has developed a structured, reversible context reduction system that monitors context length and triggers actions based on a predefined threshold [19][20] Action Space Flexibility - As Agent capabilities increase, the diversity of tools also expands, necessitating a flexible action space [30] - Cursor's strategy involves file-based documentation of all tool descriptions, allowing Agents to discover tools dynamically [32] - Manus proposes a layered action space design, categorizing Agent capabilities into function calls, sandbox tools, and APIs [41][42] Multi-Agent Collaboration - The challenge of multi-Agent collaboration is addressed by ensuring context isolation, allowing each sub-Agent to operate independently [50] - Manus introduces two collaboration modes: task delegation through communication and information synchronization via shared context [53][55] - A structured output schema is essential for ensuring consistent and accurate results from multiple sub-Agents [59][60] Design Philosophies - Cursor's "Dynamic Context Discovery" philosophy emphasizes that less is more, advocating for minimal initial detail to allow Agents to autonomously gather relevant context [62] - Manus's approach focuses on simplifying context engineering to make the model's work easier rather than more complex [63][64] - Both companies aim to create an information-rich, easily navigable external environment for Agents rather than merely increasing the amount of information fed into the context [65]
对话 Kuse: 没融资 3 个月 1000 万美金 ARR,用 NotebookLM 的方法重做 Notion
投资实习所· 2026-01-05 03:54
Core Insights - Kuse has achieved significant growth, reaching nearly $10 million in ARR within three months without external funding, indicating a strong demand for structuring unstructured data [1][17] - The product focuses on a "Context First" approach, allowing users to upload various types of content to create reusable contextual assets, which enhances AI-generated outputs and workflow iterations [3][4] Product Differentiation - Kuse differentiates itself from general AI agents by emphasizing asset accumulation rather than one-time generation, targeting knowledge workers and enterprise scenarios [2][4] - The latest version of Kuse has shifted from a general AI tool to a native "Context First" file management and asset accumulation system, organizing materials in a Finder-like structure [4][6] User Experience and Functionality - Kuse's "Chaos in, Genius out" philosophy transforms complex inputs into clear, consumable web pages and documents, focusing on document and webpage generation rather than application development [6][10] - The formatting engine AI simplifies the process of creating structured documents, significantly reducing the time required for tasks like generating exam papers [7][8] Market Strategy - Kuse's growth strategy leverages Meta's Threads and Instagram, with a unique approach of employing interns to create numerous accounts that share practical use cases, targeting the Taiwanese and Hong Kong markets [18][22] - The product is designed to meet high-frequency needs in document generation, focusing on interactive web pages, resumes, and administrative notifications, aligning closely with traditional office tasks [22] Target Audience and Use Cases - Kuse has expanded its user base from designers to professionals in consulting, education, and law, who require high-precision, template-driven document creation [16][18] - The platform's ability to accumulate context over time enhances user interactions, making it a valuable tool for knowledge workers [15][16]
别了,大模型;你好,Agent:读懂Meta收购Manus的范式转移
创业邦· 2026-01-03 10:22
Core Viewpoint - Meta's acquisition of Manus for billions of dollars highlights the shifting landscape of AI, emphasizing the need for practical applications over mere conversational capabilities [7][14][20]. Group 1: Manus's Journey and Team - Manus, founded in Wuhan and developed in Beijing, has transitioned to a Singapore-based company, showcasing a modern narrative of Chinese tech talent navigating geopolitical challenges [7][18]. - The core team of Manus, led by founder Xiao Hong and chief scientist Peak Ji, is characterized by exceptional engineering skills and insights into user behavior, rather than traditional academic AI backgrounds [8][10]. - Peak Ji's philosophy of "orthogonality" emphasizes building applications that leverage existing models rather than competing directly with them, leading to innovative solutions in AI [12]. Group 2: Technological Innovations - Manus distinguishes itself from traditional chatbots by developing an "Agent" capable of performing complex tasks, such as market research and data analysis, rather than just engaging in conversation [16]. - The company has created a virtual operating system that enhances AI capabilities, addressing limitations in memory and operational accuracy, which has proven to be a significant engineering success [16]. Group 3: Geopolitical and Economic Challenges - The decision to relocate Manus's headquarters to Singapore and lay off Chinese staff reflects the harsh realities of geopolitical tensions, particularly regarding access to critical technology and funding [18][19]. - Manus's shift away from China is driven by the need for advanced computing power and capital, which are increasingly restricted for Chinese companies due to U.S. export controls [19]. Group 4: Implications for the Chinese AI Industry - The acquisition of Manus by Meta signifies a loss for the Chinese AI sector, as talented engineers are compelled to contribute to foreign companies due to local constraints [22]. - Manus's success illustrates the potential of Chinese engineers to innovate independently, yet the current environment hampers the growth of local ecosystems and market opportunities [22][25].
AI Coding 生死局:Spec 正在蚕食人类编码,Agent 造轮子拖垮效率,Token成本失控后上下文工程成胜负手
3 6 Ke· 2025-12-30 09:21
Core Insights - The evolution of AI Coding is leading to a new role for programmers, focusing on defining rules rather than just writing code, as the complexity of software engineering increases [1] - The rise of Spec-driven development is reshaping the AI Coding landscape, with a shift from traditional coding practices to a more structured approach that emphasizes the importance of context and specifications [8][9] Group 1: AI Coding Evolution - AI Coding has transitioned from a human-led paradigm, where tools like Copilot and Cursor assist in code completion, to an Agent-driven model that takes over tasks from requirement analysis to code generation [2][3] - The limitations of the completion paradigm are becoming apparent, as it requires significant developer attention and has a narrow scope compared to the broader capabilities of Agents [3] - The integration of IDE, CLI, and Cloud capabilities in programming tools reflects the need for a comprehensive task delivery system across different environments [4] Group 2: Spec-Driven Development - The concept of "Spec" has evolved, with various interpretations ranging from better prompts to detailed product requirement documents, highlighting the need for clear guidance in AI Coding [8][10] - Spec is seen as a critical component in providing stable context for Agents, ensuring they understand what needs to be built and the constraints involved [9][12] - The challenge lies in standardizing Spec across different contexts, as its effectiveness depends on the application scenario and the balance between flexibility and rigor [11][12] Group 3: Context Engineering - Context is increasingly recognized as a vital element in AI Coding, with many teams noting that the lack of context, rather than specifications, is a significant barrier to effective AI code generation [9][10] - The development of "living contracts" for Spec emphasizes the need for dynamic, iterative documentation that evolves alongside the coding process, rather than static documents [14] - The focus on context management is crucial, as it directly impacts the efficiency and cost of AI coding, with a need to maximize cache hit rates and minimize redundant computations [22][23] Group 4: Token Economics - The cost structure of using AI tools is shifting, with Token consumption becoming a critical factor in pricing and operational strategies for platforms [18][19] - The transition from simple question-answer interactions to complex Agent tasks has increased the overall Token costs, as multiple interactions and tool calls are required to complete tasks [20][21] - Effective context management is essential to control Token costs, as it determines how information is organized and reused throughout the coding process [26][27]
近两百万人围观的Karpathy年终大语言模型清单,主角是它们
机器之心· 2025-12-21 03:01
编辑|杜伟 2025 年还有 10 天就要结束,这意味着是时候进行一波年终总结了。 对于人工智能领域而言,2025 年是大语言模型(LLM)快速演进、重磅事件密集出现的一年。 就在昨天,知名 AI 学者 Karpathy 列出了一份清单,记录了他个人认为最重要、也多少有些出乎意料的「范式转变」。 这些真正改变了行业格局、并在概念层面让 Karpathy 印象深刻的变化会落在哪些领域呢?我们接下来一一来看(以第一人称)。 可验证奖励强化学习(RLVR) 2025 年初,几乎所有实验室的 LLM 生产训练流程都像下面这样: 这套流程稳定、可靠,曾长期被视为「工业级 LLM」的标准做法。 预训练(类似 2020 年的 GPT-2/3); 监督微调(SFT,类似 2022 年的 InstructGPT) 基于人类反馈的强化学习(RLHF,约 2022 年) 但在 2025 年,一种新的阶段浮出水面,并迅速成为事实上的标配: 可验证奖励强化学习(Reinforcement Learning from Verifiable Rewards,RLVR) 。 RLVR 的核心做法是,让模型在可自动验证的环境中接受强化学习训练 ...
Manus 8 个月突破 1 亿美金 ARR,让我眼前一亮的语音 AI 产品种子轮拿了 4000 多万美金
投资实习所· 2025-12-18 05:35
在 8 月份宣布突破 9000 万美金年化收入后《 Manus 年化收入突破 9000 万美金,红杉中国投了一个 AI 床垫 》, Manus 昨晚宣布其 ARR 已经突破了 1 亿美金 ,成为从 0 到 1 亿美金 ARR 最快的初创公司。 前几天更新的版本则可以直接通过 Manus 来做移动开发了,这块我之前介绍过专注于开发 App 的 AI Coding 产品,《 创立 6 个月的 AI 卖了 8000 万美 金没融资一个创始人,App 的 AI Coding 也火了 》。 有网友说通过 Manus 开发了一个完整的移动 App,我简单体验了一下让它构建一个 AI 笔记产品,效果还不错,包括后端、数据库这些全都自己设计好 了,我只需要集成一个 OpenAI 的 API 即可,并且给出了发布到 App Store 的指南。 Manus 没有公布用户的分布情况,不过我在 X 上简单做了一下搜索,发现有不少日本用户分享了通过 Manus 做各种产品的帖子,特别是用它来做移动 App 和 Web 产品这块,我估计来自日本的用户有不少比例。 Manus 的总收入年化运行率则超过了 1.25 亿美金,这块数据包含了 ...
12月,我们推荐这 7 款 AI 新品
Founder Park· 2025-12-17 14:28
Group 1 - The article discusses the launch of seven innovative AI products at the Geek Park Innovation Conference, highlighting their uniqueness and recent developments [1][2] - These products are part of the Founder Park's AI Product Marketplace, which has recommended over 150 AI products since April, attracting over 17,000 industry professionals [3] Group 2 - Flomo, an AI note-taking product, recently upgraded its "AI Insights" feature to "Multi-Perspective Insights," allowing users to interpret their notes through various therapeutic lenses [4][5] - Flomo emphasizes the importance of personal context in note-taking, avoiding AI-generated content to maintain authenticity [7][8] Group 3 - Doka Camera, an AI-powered photography app, aims to return creative control to users by providing AI-assisted composition guidance without imposing a specific aesthetic [14][22] - Doka has achieved significant user engagement, ranking first in the photography category in Taiwan without any advertising spend [14][17] Group 4 - Remio, a personal office assistant, focuses on creating a comprehensive digital memory by automatically capturing context from users' activities, enhancing productivity [27][30] - Remio's technology allows for seamless integration of local documents and web browsing history, providing a structured context for AI interactions [34][35] Group 5 - Pallas AI is designed to assist brands with AI marketing, transforming the approach from passive search visibility to proactive recommendations [37][39] - The platform offers a comprehensive data analysis and visualization panel, enabling brands to monitor their performance across various AI platforms [43][45] Group 6 - MuleRun is an AI Agent Marketplace that connects developers and users, allowing for the monetization of AI agents and addressing mid-tail market needs [46][49] - The platform has rapidly gained traction, reaching 500,000 registered users within a month of launch [47][55] Group 7 - OdyssLife introduces the Odyss N1, an AI necklace that monitors users' dietary and exercise habits, aiming to improve health management through unobtrusive tracking [56][58] - The product provides personalized health recommendations based on real-time data analysis of users' eating patterns and physical activities [62][63] Group 8 - LavieAI focuses on generating visual content for the fashion industry using AI, significantly reducing production costs and time while maintaining aesthetic quality [65][68] - The company integrates artistic guidance into its AI models to ensure that generated content meets industry standards for visual appeal [71][72]
AI智能体时代中的记忆:形式、功能与动态综述
Xin Lang Cai Jing· 2025-12-17 04:42
记忆已成为并将继续成为基于基础模型的智能体的核心能力。它支撑着长程推理、持续适应以及与复杂环境的有效交互。随着智能体记忆研究的快速扩张 并吸引空前关注,该领域也日益呈现碎片化。当前统称为"智能体记忆"的研究工作,在动机、实现、假设和评估方案上往往存在巨大差异,而定义松散的 记忆术语的激增进一步模糊了概念上的清晰度。诸如长/短期记忆之类的传统分类法已被证明不足以捕捉当代智能体记忆系统的多样性和动态性。 在这些智能体的核心能力中,记忆 尤为关键,它明确地促成了从静态大语言模型(其参数无法快速更新)到自适应智能体的转变,使其能够通过环境交 互持续适应(Zhang et al., 2025r; Wu et al., 2025g)。从应用角度看,许多领域都要求智能体具备主动的记忆管理能力,而非短暂、易忘的行为:个性化聊 天机器人(Chhikara et al., 2025; Li et al., 2025b)、推荐系统(Liu et al., 2025b)、社会模拟(Park et al., 2023; Yang et al., 2025)以及金融调查(Zhang et al., 2024)都依赖于智能体处理、存储和管 ...
Google全链路赋能出海:3人团队调度千个智能体,可成独角兽|MEET2026
量子位· 2025-12-17 03:38
Core Insights - The future will be characterized by autonomous collaboration among intelligent agents, solving complex problems, automating workflows, and autonomously issuing tasks, creating a new business model [1] - AI agents are becoming new productivity units, injecting new meaning into the globalization logic of startups [2] - The intelligent agent sector is just beginning, with significant changes expected in the next one to two years, presenting a major opportunity for Chinese startups to go global [3] Google’s Integrated Solutions for Startups - Google has launched AI-driven integrated solutions to empower startups for efficient globalization [4] - The MEET2026 conference attracted nearly 1,500 offline attendees and over 3.5 million online viewers, highlighting the significant interest in the topic [6] - Startups face various challenges during globalization, and Google’s ecosystem can support them at every stage [7] Stages of Startup Globalization - The five stages of startup globalization include: 1. **Ideation and Strategic Planning**: Founders gather information and analyze competitors, often using Gemini for market research [8] 2. **Product Launch**: Google Cloud provides stable cloud infrastructure support [9] 3. **Market Validation**: Google Ads assists in reaching target customers [9] 4. **Market Expansion**: Google Play and other services support expansion into new markets [9] 5. **IPO Maturity**: Google’s data analysis tools aid in the final push before going public [10] Challenges and Innovations in AI - The AI field is evolving rapidly, with challenges such as hallucination (inaccurate or fabricated information) being addressed through better model training and engineering practices [11] - The introduction of the A2A (Agent-to-Agent) protocol aims to facilitate communication between intelligent agents across different enterprises [16] - The shift from SaaS subscription models to outcome-based payment models reflects a fundamental change in business logic, allowing small teams to scale significantly [18] Gemini's Evolution and Capabilities - Gemini has evolved from its initial version to Gemini 3, which has achieved significant advancements in reasoning, understanding, and problem-solving capabilities [15] - Key capabilities of Gemini 3 include: 1. **Extended Context Window**: Supports 1 million tokens, emphasizing the importance of context engineering [21] 2. **Native Multimodal Capability**: Understands text, video, images, and audio with improved clarity and accuracy [22] 3. **Function Calling Ability**: Enables intelligent agents to utilize external tools and services [23] - Gemini 3 is considered the safest model to date, having undergone comprehensive safety assessments [24]
硅谷人工智能研究院院长皮埃罗·斯加鲁菲:2025年AI智能体将重塑数字劳动力
Jin Rong Jie· 2025-12-10 08:41
Core Insights - The "EVOLVE 2025" summit showcased the roadmap for enterprise-level AI agents and introduced a "3+2+2" product matrix to facilitate rapid development of AI agents for businesses [1] - The summit emphasized the collaboration among major cloud service providers to create a sustainable AI ecosystem through the "Super Connection" global partner program [1] Group 1: AI Development Trends - Piero Scaruffi highlighted a clear trend of technological integration in generative AI by 2025, with innovations like diffusion Transformers and multi-modal capabilities becoming standard [3] - The emergence of new technologies such as thinking chains and expert mixtures is reshaping the landscape of AI applications [3] Group 2: Evolution of AI Agents - The distinction between traditional AI products and advanced AI agents was made, with the latter being likened to autonomous driving, capable of executing complex workflows independently [4] - The operational mechanism of these AI agents is summarized as a cycle of perception, decision-making, action, and learning, allowing them to adapt to various environmental changes [4] Group 3: Multi-Agent Systems - The transition from applications to multi-agent systems introduces challenges in orchestration, necessitating a new technology stack that includes hardware, cloud services, and orchestration layers [5] - The concept of "context engineering" is emphasized, requiring AI agents to understand organizational structures and goals beyond executing single tasks [5] Group 4: Industry Applications - Various sectors are witnessing innovative applications of AI, particularly in customer support, where intelligent systems can understand context and emotions, enhancing user experience [6] - Companies like Johnson Controls have developed integrated AI systems that significantly improve efficiency in maintenance and troubleshooting [6] Group 5: Trust in AI - The "Waymo effect" illustrates the growing trust in AI as autonomous vehicles become more prevalent, laying a foundation for broader AI agent applications [7] - Scaruffi envisions a future where multiple AI agents collaborate dynamically, akin to human social interactions, to achieve common goals [7]