Workflow
Coding Agent
icon
Search documents
Zed 为什么不用自己造 Agent?OpenAI 架构师给出答案:Codex 重划 IDE × Coding Agent 的分工边界
AI前线· 2026-01-21 07:00
Core Insights - Coding Agents are a rapidly evolving area within applied AI, with a focus on maintaining resilience and rapid iteration amidst changing ecosystems [2] - OpenAI's Codex offers a solution through the co-development of models and Harness, emphasizing the importance of understanding model behavior [4][5] Group 1: Composition of Coding Agents - A Coding Agent consists of three main components: user interface, model, and Harness, where the user interface can be command-line tools or integrated development environments [4] - The model refers to recent releases like the GPT-5.1 series, while the Harness acts as the core agent loop that interacts with the model [4] Group 2: Challenges in Building Harness - Building an efficient Harness is complex, facing challenges such as adapting to new tools that the model may not be familiar with, and managing prompt adjustments based on model characteristics [8][9] - Delays in model processing and the need for effective prompt design to enhance user experience are significant challenges [9][10] Group 3: Codex as a Harness/Agent - Codex is designed to function across various programming environments, allowing for complex tasks such as navigating code repositories and executing commands [12] - The integration of Codex into an agent system simplifies the development of features like parallel tool calls and security management [12][18] Group 4: Future of Codex and SDK Development - The future of Codex is promising, with expectations for models to handle more complex tasks without supervision, and the SDK evolving to support these capabilities [19] - Companies can leverage Codex to create customized agents, enhancing their products with advanced coding capabilities [15][18]
深度|OpenAI产品经理谈Codex爆发式增长背后的AI协作:实现AGI级生产力的真正瓶颈是人类的打字速度!
Z Potentials· 2026-01-19 03:02
Core Insights - Codex, a powerful coding agent developed by OpenAI, has experienced a 20-fold growth since the release of ChatGPT5 in August 2023, processing trillions of characters weekly [3][19]. - The primary goal of Codex is to enhance human productivity by enabling proactive task completion rather than merely responding to commands [9][17]. - OpenAI's organizational structure emphasizes a bottom-up approach, allowing for flexibility and rapid experimentation, which has been crucial for Codex's development [12][14]. Group 1: Codex's Development and Growth - Codex has become a core tool for software engineering teams, functioning as an initial team member capable of writing, testing, and deploying code [15][16]. - The product has seen explosive growth, with usage increasing over 10 times since August, now reaching 20 times, and it is the most utilized code generation model [19][20]. - The integration of product and research teams has facilitated collaborative iterations, leading to more effective experiments and product enhancements [19][26]. Group 2: Proactive Collaboration and User Interaction - Codex aims to function as a proactive collaborator, akin to a new intern, participating in the entire software development lifecycle [16][17]. - The focus is on creating a seamless integration into developers' workflows, allowing Codex to assist without requiring constant user prompts [18][22]. - The feedback loop established through local interactions enhances user experience and encourages gradual adaptation to AI-assisted development [22][23]. Group 3: Future Vision and Market Position - The vision for Codex extends beyond code writing to include capabilities such as scheduling and task management, positioning it as a comprehensive AI assistant [28][29]. - OpenAI is exploring the potential of a "chatter-driven development" model, where communication and collaboration drive coding processes rather than rigid specifications [38][39]. - The company recognizes the need for Codex to adapt to various user environments and preferences, ensuring it remains a valuable tool for diverse teams [25][33].
Zed 为什么不用自己造 Agent?OpenAI 架构师给出答案:Codex 重划 IDE × Coding Agent 的分工边界
AI前线· 2026-01-17 06:25
Core Insights - Coding Agents have become one of the most active areas in applied AI, with a focus on maintaining rapid iteration and resilience amidst changing ecosystems [2] - OpenAI's Codex proposes a solution through the co-development of models and Harness, emphasizing the importance of understanding model behavior [4][6] Composition of Coding Agents - A Coding Agent consists of three main components: User Interface, Models, and Harness. The User Interface can be a command-line tool, integrated development environment (IDE), or cloud-based agent. Models include the latest GPT-5.1 series and others. Harness is a more complex part that interacts directly with the model, serving as the core agent loop [3][5] Importance of Harness - The Harness acts as the interface layer between the model and users, facilitating interaction and code generation. Building an efficient Harness is challenging due to issues like AV tool compatibility, latency management, and API changes [6][9] Challenges in Building Harness - Adapting models to the Harness requires extensive prompt design, as the model's training can lead to specific habits that must be understood for effective interaction. The relationship between steerability, intelligence, and habit is crucial for prompt engineering [7][8] Codex Capabilities - Codex is designed to function across various programming environments, allowing users to convert ideas into executable code, navigate code repositories, and execute commands. Its Harness must handle complex tasks, including parallel tool calls and security management [9][10] Future of Codex - Codex is rapidly evolving, currently serving hundreds of billions of tokens weekly, and is expected to handle more complex tasks with increased trust. The future will focus on large codebases and non-standard libraries, with continuous improvements in SDK capabilities [16][17] Building Custom Agents with Codex - Companies looking to integrate Codex into their agents can benefit from a model where the Harness serves as a new abstraction layer, allowing for easier updates and differentiation in product features [12][14] Successful Collaborations - Top partners like GitHub have successfully integrated Codex, allowing for direct interaction and optimization of their systems. The SDK facilitates various integrations, enhancing the capabilities of custom agents [15][16]
MINIMAX-WP午前拉升逾10% 宣布开源代码智能体系统性评测集OctoCodingBench
Zhi Tong Cai Jing· 2026-01-16 05:19
Core Insights - MINIMAX-WP's stock surged over 10%, currently up 8.16% at 387.2 HKD, with a trading volume of 352 million HKD, following the announcement of its open-source evaluation benchmark for coding agents, OctoCodingBench [1] - The evaluation results indicate that some open-source models are performing exceptionally well in "process compliance," approaching or even surpassing certain closed-source models, highlighting a shift in industry focus towards "data and evaluation paradigms" in the evolution of AI towards agents [1] Company Performance - CITIC Securities reports that MINIMAX-WP is emerging from industry competition with a "counter-consensus" strategic focus on model intelligence breakthroughs, positioning itself strongly in the generative AI wave [2] - As one of the first companies in Shanghai to receive large model registration, MINIMAX-WP demonstrates strong growth potential through technological depth and commercial foresight [2] - Revenue is projected to maintain over 90% high-speed growth from 2025 to 2027, with Non-GAAP gross margin expected to rise to 55% and net loss rate continuing to narrow [2] - The company is anticipated to expand its market space in AI-native applications through optimization of reasoning costs and the implementation of next-generation multimodal models [2]
AI Coding 生死局:Spec 正在蚕食人类编码,Agent 造轮子拖垮效率,Token成本失控后上下文工程成胜负手
3 6 Ke· 2025-12-30 09:21
Core Insights - The evolution of AI Coding is leading to a new role for programmers, focusing on defining rules rather than just writing code, as the complexity of software engineering increases [1] - The rise of Spec-driven development is reshaping the AI Coding landscape, with a shift from traditional coding practices to a more structured approach that emphasizes the importance of context and specifications [8][9] Group 1: AI Coding Evolution - AI Coding has transitioned from a human-led paradigm, where tools like Copilot and Cursor assist in code completion, to an Agent-driven model that takes over tasks from requirement analysis to code generation [2][3] - The limitations of the completion paradigm are becoming apparent, as it requires significant developer attention and has a narrow scope compared to the broader capabilities of Agents [3] - The integration of IDE, CLI, and Cloud capabilities in programming tools reflects the need for a comprehensive task delivery system across different environments [4] Group 2: Spec-Driven Development - The concept of "Spec" has evolved, with various interpretations ranging from better prompts to detailed product requirement documents, highlighting the need for clear guidance in AI Coding [8][10] - Spec is seen as a critical component in providing stable context for Agents, ensuring they understand what needs to be built and the constraints involved [9][12] - The challenge lies in standardizing Spec across different contexts, as its effectiveness depends on the application scenario and the balance between flexibility and rigor [11][12] Group 3: Context Engineering - Context is increasingly recognized as a vital element in AI Coding, with many teams noting that the lack of context, rather than specifications, is a significant barrier to effective AI code generation [9][10] - The development of "living contracts" for Spec emphasizes the need for dynamic, iterative documentation that evolves alongside the coding process, rather than static documents [14] - The focus on context management is crucial, as it directly impacts the efficiency and cost of AI coding, with a need to maximize cache hit rates and minimize redundant computations [22][23] Group 4: Token Economics - The cost structure of using AI tools is shifting, with Token consumption becoming a critical factor in pricing and operational strategies for platforms [18][19] - The transition from simple question-answer interactions to complex Agent tasks has increased the overall Token costs, as multiple interactions and tool calls are required to complete tasks [20][21] - Effective context management is essential to control Token costs, as it determines how information is organized and reused throughout the coding process [26][27]
Codex负责人打脸Cursor CEO“规范驱动开发论”,18天造Sora爆款,靠智能体24小时不停跑,曝OpenAI狂飙内幕
3 6 Ke· 2025-12-17 02:45
自 8 月 GPT-5 发布以来,Codex展现出惊人的爆发力,用户增长 20 倍,每周处理数万亿 tokens,成为 了 Open AI 最受欢迎的编程智能体。 "Codex 能快速实现 20倍的增长,不只是因为模型变强了,还因为我们理解了,真正的智能体不是一个 模型,而是模型、API 和框架共同努力的结果。"在最新播客中,OpenAI 的编程智能体 Codex 产品负 责人 Alexander Embiricos 揭露背后的秘密。 比如,Codex 在长时任务能力上的突破。为了让它能够连续工作十几个小时甚至数天,团队设计了名 为"压缩"的机制——模型负责提炼关键信息,API 承接任务链路,框架负责稳定运行。三层像齿轮般咬 合,使 Codex 能够完成传统大模型难以支撑的长时编程任务。 正是这样的底层逻辑,让 Codex 在业务实战中有惊人表现。 Andrej Karpathy 曾公开分享,他被一个 bug 困住数小时,最终交给 Codex 处理,一小时内就完成了修 复。 Sora 团队更是依靠 Codex,在短短 28 天时间,从 0 到 1 完成 Android 应用的上线,直接冲到 App Store ...
智能体崛起,AI+软件研发到新拐点了?
AI前线· 2025-11-18 05:34
Core Insights - The article discusses the transformative impact of large language models (LLMs) on software development processes, emphasizing the shift from AI as an auxiliary tool to a core productivity driver [2][3] - It highlights the current state of AI in development as being at a "halfway point," indicating that while significant advancements have been made, a true paradigm shift has not yet occurred [5][9] Group 1: AI's Role in Development - AI is primarily seen as a tool for efficiency in testing rather than a replacement for human roles, with the industry still far from a "native development era" [9][10] - The emergence of various AI programming products indicates a growing integration of AI in code production, with some teams reporting over 50% of their code being AI-generated [6][10] - The effectiveness of AI varies significantly among users, with some leveraging it for simple tasks while others utilize it for more complex processes [6][7] Group 2: Challenges and Limitations - AI's current capabilities are limited in handling complex tasks, particularly in existing codebases, where it often struggles with intricate logic and dependencies [5][10] - The stability and reliability of AI outputs remain significant concerns, impacting its adoption in real-world applications [20][21] - AI's role in testing is still largely supportive, with challenges in fully automating complex testing scenarios due to the need for human judgment [9][10] Group 3: Future Directions - The evolution from AI assistants to intelligent agents capable of executing complete development cycles is seen as a key future trend [28][31] - The integration of AI into existing workflows is expected to be gradual, with a focus on plugin-based ecosystems rather than monolithic platforms [32][33] - The article suggests that the future of software development will require professionals to adapt by enhancing their skills in prompt engineering and knowledge management to effectively collaborate with AI [23][24][39]
智能体崛起,AI+软件研发到新拐点了?
3 6 Ke· 2025-11-13 04:51
Core Insights - The article discusses the transformative impact of large language models (LLMs) on software development processes, highlighting the shift from AI as a mere tool to becoming a core productivity driver in the development lifecycle [1][2]. Group 1: LLM Native Development Era - Many experts believe that AI's role in coding is still seen as an advanced autocomplete rather than a paradigm shift, indicating that the industry is on the brink of a significant change [2][3]. - AI excels in small, well-defined tasks but struggles with complex, large-scale projects, particularly when integrating with existing codebases [2][4]. - The proportion of AI-generated code in teams is rapidly increasing, with some teams reporting over 50% of their code being AI-generated, indicating a deep integration of AI into coding practices [3][4]. Group 2: AI's Role in Development Processes - AI is increasingly being used in various forms beyond traditional IDEs, such as integrated tools in DevOps platforms, which is changing development habits [3][4]. - The effectiveness of AI varies significantly among users, with some leveraging it for simple tasks while others utilize it for more complex processes like building intelligent agents [3][4]. - AI's involvement in development is still evolving, and while it has improved efficiency, it has not yet achieved a true paradigm shift [5][6]. Group 3: AI in Testing - AI is primarily seen as a tool for enhancing efficiency in testing rather than a replacement for human testers, with significant challenges remaining before reaching a fully autonomous development era [5][7]. - AI performs well in generating test cases for straightforward tasks but struggles with complex testing scenarios that require deep domain knowledge [7][8]. - The current state of AI in testing is more about assistance than collaboration, with a long way to go before achieving a fully integrated development environment [7][8]. Group 4: Challenges in AI Implementation - The main challenges in implementing AI in real business scenarios include stability, reliability, and the need for teams to adapt to new workflows [16][18]. - Users often face difficulties in effectively communicating their needs to AI, leading to inconsistent results and a lack of trust in AI tools [18][19]. - The computational power available for AI applications significantly affects user experience and the overall effectiveness of AI tools [18][19]. Group 5: Future of AI in Development - The evolution from AI assistants to intelligent agents signifies a shift towards more autonomous systems capable of executing complete development cycles [24][27]. - The integration of AI into development processes is expected to enhance collaboration and efficiency, but achieving a fully automated workflow will take time [27][29]. - The future landscape will likely favor lightweight, plugin-based ecosystems over monolithic platforms, allowing for gradual integration of AI capabilities into existing workflows [28][29]. Group 6: Value and Skills in the AI Era - The introduction of AI in development roles is reshaping job functions, emphasizing the need for engineers to possess a deeper understanding of both technology and business [33][34]. - Engineers who can effectively leverage AI tools will see their value increase, as AI can handle repetitive tasks, allowing them to focus on more strategic aspects of their roles [35][36]. - The ability to communicate effectively with AI and understand its limitations will be crucial for maximizing productivity and ensuring quality in software development [36][37].
Codegen Tools and Production Challenges
Greylock· 2025-09-25 15:54
I'm already using codegen tools like cursor. Can I just extend that to solve my production problems. >> Codegen tools are sort of, you know, designed to operate on the sort of the addressible universe of code, right.Production system is sort of like a living breathing animal, right. It's more than just code, right. It's it's really sort of emergent behavior that comes from like a bunch of these things interacting with each other, right. Like the code, the infrastructure, the deployments, the you know the th ...
从模型为王到应用为王:AI 中间件的基建之战 | 直播预告
AI前线· 2025-09-20 05:33
Core Viewpoint - The article emphasizes that the true competition in AI is the "landing efficiency" of applications, highlighting the ongoing "infrastructure battle" regarding AI middleware [2][6]. Group 1: Event Details - A live broadcast is scheduled for September 23, from 20:00 to 21:30, focusing on the transition from "model-centric" to "application-centric" approaches in AI middleware [2]. - The event will feature experts from the industry, including a senior technical expert from Ant Group and the CTO of Memory Tensor [3]. Group 2: Key Challenges - The article raises questions about how enterprises can transition smoothly from "cloud-native" to "intelligent-native" systems [3]. - It discusses the challenges developers face in capturing the current opportunities and becoming core talents in the intelligent era [6]. Group 3: Live Broadcast Content - The live session will cover topics such as the engineering framework for Agent applications and practical implementations of the RAG framework [7]. - Participants will have the opportunity to ask questions to the instructors during the live session [8].