Deep Research

Search documents
喝点VC|a16z对话OpenAI研究员:GPT-5的官方解析,高质量使用场景将取代基准测试成为AGI真正衡量标准
Z Potentials· 2025-08-21 03:09
Core Viewpoint - The release of ChatGPT-5 marks a significant advancement in AI capabilities, particularly in reasoning, programming, and creative writing, with notable improvements in reliability and behavior design [3][4][6]. Group 1: Model Improvements - ChatGPT-5 has shown a substantial reduction in issues related to flattery and hallucination, indicating a more reliable interaction model [4][14]. - The model's programming capabilities have seen a qualitative leap, allowing users to create applications with minimal coding knowledge, which is expected to foster the emergence of many small businesses [6][17]. - The team emphasizes the importance of user experience and practical applications as key metrics for evaluating model performance, rather than just benchmark scores [20][21]. Group 2: Training and Development - The development process for ChatGPT-5 involved a focus on desired capabilities, with the team designing assessments to reflect real user value [22][23]. - The integration of deep research capabilities into the model has enhanced its ability to perform complex tasks efficiently, leveraging high-quality data and reinforcement learning [16][26]. - Mid-training techniques have been introduced to update the model's knowledge and improve its performance without the need for extensive retraining [45]. Group 3: Future Implications - The advancements in ChatGPT-5 are expected to unlock new use cases and increase daily usage among a broader audience, which is seen as a critical indicator of progress towards AGI [21][15]. - The model's ability to assist in creative writing has been highlighted, showcasing its potential to help users with complex writing tasks [29][31]. - The future of AI is anticipated to be characterized by the rise of autonomous agents capable of performing real-world tasks, with ongoing research focused on enhancing their capabilities [36][41].
很多创业者都没意识到,Deep Research 也是做 Go-to-Market 的利器
Founder Park· 2025-08-18 08:27
Core Insights - The article emphasizes the importance of utilizing Deep Research to enhance the efficiency of AI product go-to-market (GTM) strategies, highlighting its ability to condense hours of work into minutes [2][3] - It provides practical tips and a guide from former Meta strategy director Torsten Walbaum on how to effectively use Deep Research for customized analysis [2][3] Group 1: Key Techniques for Effective Deep Research - Technique 1: Indicate high-quality information sources to improve output quality, including writing effective prompts and selecting appropriate tools for specific scenarios [5][11] - Technique 2: Provide sufficient background information to obtain tailored insights, treating the AI as a human colleague by sharing necessary context [11][12] - Technique 3: Request a research plan before starting to ensure alignment with expectations, particularly useful in tools like Gemini Deep Research [20][23] Group 2: Deep Research Tools and Use Cases - ChatGPT is identified as the best general-purpose Deep Research tool, especially after the release of GPT-5 and its Agent Mode, which allows effective interaction with websites [38][40] - Use Case 1: Creating step-by-step guides for large internal projects, enabling quick understanding and planning for unfamiliar tasks [44][45] - Use Case 2: Conducting in-depth research on competitors' advertising strategies using tools like Agent Mode to access detailed ad libraries [51][52] Group 3: Structuring Effective Prompts - A structured prompt template is provided to guide users in crafting effective Deep Research requests, ensuring clarity in goals, context, and desired outputs [26][29] - Emphasis on specifying sources and instructions to enhance the relevance and accuracy of the research output [32][67] Group 4: Market Evaluation for International Expansion - A two-step approach is recommended for evaluating markets for international expansion, involving framework development and high-quality data source compilation [72][75] - The importance of using recent and credible data sources is highlighted to ensure the accuracy of market assessments [74][76]
喝点VC|红杉对谈OpenAI Agent团队:将Deep Research与Operator整合成主动为你做事的最强Agent
Z Potentials· 2025-08-14 03:33
Core Insights - The article discusses the integration of OpenAI's Deep Research and Operator projects to create a powerful AI Agent capable of executing complex tasks for up to one hour [2][5][6] - The AI Agent utilizes a virtual computer with various tools, including a text browser, GUI browser, terminal access, and API calling capabilities, allowing it to perform tasks that typically require human effort [6][7][24] - The model is designed to facilitate user interaction, enabling users to interrupt, correct, and clarify tasks during execution, which enhances its flexibility and effectiveness [7][22] Integration of Deep Research and Operator - The combination of Deep Research and Operator leverages the strengths of both projects, with Operator excelling in visual interactions and Deep Research in text-based information processing [9][10] - The integration allows the AI Agent to access paid content and perform tasks that require both browsing and interaction with web elements [10][11] - The collaboration has resulted in a more versatile toolset, enabling the AI Agent to perform a wider range of tasks, including generating reports, making purchases, and creating presentations [11][14] Real-World Applications - The AI Agent is designed for both consumer and professional use, targeting "prosumer" users who are willing to wait for detailed reports [15] - Examples of its application include data extraction from spreadsheets, online shopping, and generating financial models based on web-sourced information [16][18] - The model's ability to handle complex tasks autonomously is highlighted, with a recent task taking 28 minutes to complete, showcasing its potential for longer, more intricate assignments [19][20] Training and Development - The AI Agent is trained using reinforcement learning, where it learns to use various tools effectively by completing tasks that require their use [24][25] - The training process involves a significant increase in computational resources and data, allowing for more sophisticated model capabilities [45] - The development team emphasizes the importance of collaboration between research and application teams to ensure the model meets user needs from the outset [30][35] Future Directions - OpenAI aims to enhance the AI Agent's capabilities further, focusing on improving accuracy and performance across diverse tasks [37][49] - The potential for new interaction paradigms between users and the AI Agent is anticipated, with the goal of making the Agent more proactive in assisting users [49][42] - The team is excited about the ongoing exploration of the Agent's capabilities and the discovery of new use cases as it evolves [40][49]
量子位智库2025上半年AI核心成果及趋势报告
2025-08-05 03:19
Summary of Key Points from the AI Industry Report Industry Overview - The report discusses the rapid development of artificial intelligence (AI) and its significance as one of humanity's most important inventions, highlighting the interplay between technological breakthroughs and practical applications in the industry [4][7]. Application Trends - General-purpose agents are becoming mainstream, with specialized agents emerging in various sectors [4][9]. - AI programming is identified as a core application area, significantly changing software production methods, with record revenue growth for leading programming applications [14][15]. - The introduction of Computer Use Agents (CUA) represents a new path for general-purpose agents, integrating visual operations to enhance user interaction with software [10][12]. - Vertical applications are beginning to adopt agent-based functionalities, with natural language control becoming integral to workflows in sectors like travel, design, and fashion [13]. Model Trends - The report notes advancements in reasoning model capabilities, particularly in multi-modal abilities and the integration of tools for enhanced performance [18][21]. - The Model Context Protocol (MCP) is accelerating the adoption of large models by providing standardized interfaces for efficient and secure external data access [16]. - The emergence of small models is highlighted, which aim to reduce deployment barriers and enhance cost-effectiveness, thus accelerating model application [33]. Technical Trends - The importance of reinforcement learning is increasing, with a shift in resource investment towards post-training and reinforcement learning, while pre-training still holds optimization potential [38][39]. - Multi-Agent systems are emerging as a new paradigm, enhancing efficiency and robustness in dynamic environments [42][43]. - The report discusses the evolution of transformer architectures, focusing on optimizing attention mechanisms and feedforward networks, with multiple industry applications [45]. Industry Dynamics - The competitive landscape is evolving, with leading players like OpenAI, Google, and others narrowing the gap in model capabilities [4]. - AI programming is becoming a critical battleground, with significant revenue growth and market validation for applications like Cursor, which has surpassed $500 million in annual recurring revenue [15]. - The report emphasizes the need for practical evaluation metrics that reflect real-world application value, moving beyond traditional static benchmarks [34]. Additional Insights - The report highlights the challenges of data quality and the diminishing returns of human-generated data, suggesting a shift towards models that learn from real-time interactions with the environment [44]. - The integration of visual and textual reasoning capabilities is advancing, with models like OpenAI's o3 excelling in visual reasoning tasks [24][25]. - The report concludes with a focus on the future of AI, emphasizing the potential for models to autonomously develop tools and enhance their problem-solving capabilities [21][44].
OpenAI迎来“Agent时刻”:智能体大战的路线选择
Hu Xiu· 2025-08-04 02:47
Core Insights - OpenAI has officially launched its ChatGPT Agent, marking a significant moment in the evolution of general-purpose AI agents, integrating deep research and execution tools, although it still faces challenges such as slow speed and lack of personalization [1][4][36] - The architecture of ChatGPT Agent is fundamentally a combination of a browser and a sandbox virtual machine, which contrasts with other agents like Manus and Genspark, highlighting different technical paths and capabilities [1][4][12] Architecture Comparison - The main types of AI agents currently available include browser-based agents, sandbox agents, and workflow-integrated agents, each with distinct advantages and limitations [12][26] - OpenAI's browser-based product is noted for its strong capabilities, achieving over 50% on the Browsing Camp benchmark, while competitors like Perplexity and Genspark have lower scores [4][6] - Browser-based agents are versatile but slow, while sandbox agents can execute tasks efficiently but often lack internet access [14][17] User Experience and Performance - User experience varies significantly among agents like Pokee, Genspark, Manus, and OpenAI's ChatGPT Agent, with Pokee being the fastest, potentially 4-10 times quicker than its competitors [36][40] - Manus and ChatGPT Agent share a common drawback of slow performance due to their reliance on browser navigation, with tasks taking upwards of 30 minutes [28][31] - Genspark has shifted towards a template-based approach, which may limit its general-purpose capabilities but improves speed and efficiency [34][33] Market Dynamics and Future Trends - The rise of AI agents is expected to transform internet traffic distribution, potentially reducing reliance on traditional web browsing and search engines [52][56] - Companies are increasingly motivated to open API interfaces to facilitate the integration of AI agents, which could lead to a decline in direct web traffic to traditional sites [52][58] - The advertising landscape is anticipated to evolve, with agents potentially compensating content creators directly, altering the traditional revenue models [64][66]
OpenAI杀入通用AI Agent的背后:四大技术流派与下一个万亿流量之战
3 6 Ke· 2025-08-03 09:57
Core Insights - OpenAI officially launched ChatGPT Agent on July 17, marking its entry into the general AI Agent market, which is anticipated to reshape the internet landscape and become a trillion-dollar traffic entry point [1][50] - The emergence of ChatGPT Agent raises questions about whether the market will be dominated by tech giants or if startups can maintain a foothold due to technological barriers and differentiated approaches [1][39] Summary by Categories 1. ChatGPT Agent Launch - The introduction of ChatGPT Agent signifies the opening of the general AI Agent battlefield, with OpenAI's CEO Sam Altman and researchers presenting the product in a live stream [1] - The launch is seen as a strategic move ahead of the anticipated GPT-5 release, suggesting a competitive response to emerging AI startups [1] 2. Functionality and Tools - ChatGPT Agent can assist users in various tasks, such as ordering products online or generating presentations, driven by two tools: Deep Research and Operator [2][4] - Deep Research focuses on in-depth analysis and report generation, while Operator allows users to perform specific actions on the web [4] 3. Technical Approaches - The article outlines four main technical approaches in the AI Agent space: - **Browser-based Approach**: OpenAI's ChatGPT Agent operates primarily through web browsers, allowing extensive access to online information but suffers from slow performance and high token consumption [7][12] - **Sandbox + Browser Approach**: Manus combines a sandbox environment with browser capabilities, offering high local execution efficiency but limited external access [14][20] - **Large Model + Sandbox Approach**: GensPark utilizes a large language model within a sandbox, sacrificing generality for speed and stability, focusing on specific tasks [24][28] - **Workflow + Tool Integration Approach**: Companies like Pokee integrate pre-designed workflows with third-party tools, resulting in faster execution but limited generality [32][34] 4. Future of AI Agents - The competition in the AI Agent market is expected to intensify, with the potential for agents to become the primary means of internet interaction, leading to a decline in traditional web traffic [39][41] - The concept of "ghost clicks" suggests that future internet traffic will be driven by agents rather than human users, fundamentally altering advertising and information dissemination models [41][45] 5. Market Dynamics - OpenAI's entry into the general AI Agent market is seen as a pivotal moment, with implications for both existing companies and new entrants aiming to capture market share [1][42] - The article emphasizes the need for companies to enhance user retention and reliability through specialized workflows and tools, rather than solely relying on broad capabilities [36][37]
OpenAI杀入通用AI Agent背后:四大技术流派与下一个万亿流量之战
Hu Xiu· 2025-08-03 08:22
Core Insights - The introduction of ChatGPT Agent marks the beginning of a competitive landscape for general AI agents, potentially reshaping the market dynamics and becoming a significant traffic entry point for the next generation of the internet [2][3][64]. Group 1: ChatGPT Agent Overview - OpenAI's ChatGPT Agent was introduced on July 17, showcasing its ability to assist users in various tasks, such as ordering products or generating presentations [4][5]. - The ChatGPT Agent integrates two previously separate tools, Deep Research and Operator, to combine search and execution capabilities [8][10]. Group 2: Technical Approaches in AI Agents - There are four main technical approaches in the AI agent landscape: browser-based, sandbox virtual machine, large model with sandbox, and workflow plus tool integration [11][59]. - The browser-based approach, exemplified by OpenAI's ChatGPT Agent, offers high versatility but suffers from slow performance and high token consumption [12][15][20]. - The sandbox virtual machine approach, represented by Manus, provides high local execution efficiency but has limited access to external services [23][33][38]. - The large model with sandbox approach, as seen in GensPark, sacrifices generality for speed and stability, focusing on specific workflows [40][51]. - The workflow plus tool integration approach, utilized by companies like Pokee, emphasizes speed and delivery but lacks general applicability [52][57]. Group 3: Market Dynamics and Future Trends - The competition in the AI agent market is expected to intensify, with the potential for new companies to emerge as leaders [66][69]. - The concept of "ghost clicks" suggests that future internet traffic will be driven by agents rather than human users, leading to significant changes in advertising and content monetization [67][72]. - OpenAI's ChatGPT currently handles approximately 2.5 billion user commands daily, equating to an annualized volume of 912.5 billion, which represents 18% of Google's annual search volume [75][76].
Manus还活着,还上新了
虎嗅APP· 2025-08-01 10:26
虎嗅独家获悉,这是在Manus内部花最长时间做的功能,耗时超两个月。 就在上个月,行业巨头OpenAI也推出了自家的ChatGPT Agent,尽管上线后市场反响褒贬不一,但其抢占 Agent赛道制高点的野心昭然若揭。其中,"Deep Research"更是被作为核心卖点大肆宣传。 8月1日凌晨,Manus首席科学家Peak季逸超罕见发声——Manus发布新功能Wide Research(广度研究),该 功能目前仅对Pro用户开放,未来会陆续向Basic和Plus用户开放。而目前还没有向免费用户开放的计划。 距离上一次Peak为产品录制视频,还要追溯到今年3月Manus的横空出世。在这5个月里,Manus经历了种种 跌宕起伏的事件。而如今Wide Research功能的上线,似乎是在向大众证明,"我还很好地活着"。 而不到一个月的时间,Manus的Wide Research便横空出世,其"广度研究"的定位,无疑是对OpenAI"深度研 究"的一次精准反击,甚至有点挑战OpenAI霸主的意味。 从实际测试结果来看,这是一场"深"与"广"的对决。 当被要求列出全球前100的MBA学校时,ChatGPT Agent ...
OpenAI会杀死Manus们吗?
创业邦· 2025-07-22 03:02
Core Viewpoint - OpenAI's release of ChatGPT Agent marks a significant advancement in AI capabilities, allowing for complex task execution and planning, which poses challenges for existing AI startups in the agent space [5][9][45]. Group 1: OpenAI's ChatGPT Agent - ChatGPT Agent can autonomously plan and execute tasks, utilizing various tools for functions such as data retrieval, itinerary planning, and hotel booking [5]. - OpenAI founder Sam Altman described the ChatGPT Agent as a significant step towards achieving AGI (Artificial General Intelligence) [9]. - The model is designed to integrate task planning, tool invocation, and document generation within a single system, distinguishing it from other AI agents that rely on context management [9][25]. Group 2: Competitive Landscape - Startups like Manus and Genspark are actively competing with OpenAI, claiming superior performance in task completion and response times [13][21]. - Manus has publicly compared its capabilities with ChatGPT Agent, asserting that it outperforms OpenAI in various tasks, including data organization and financial analysis [20][24]. - Genspark also reported faster response times and higher quality outputs compared to ChatGPT Agent, emphasizing its competitive edge despite being a smaller company [21]. Group 3: Market Implications - The AI Agent market is projected to grow significantly, from $5.1 billion in 2024 to $47.1 billion by 2030, with a CAGR of 44.8% [46]. - Major tech companies are already integrating AI agents into their operations, leading to substantial workforce reductions, as seen with Microsoft and Klarna [45][46]. - The introduction of AI agents raises concerns about privacy and security, as these systems can access sensitive user information [46][48]. Group 4: Technical Aspects - OpenAI's ChatGPT Agent has demonstrated superior performance in academic tests, achieving high scores in various assessments, indicating its advanced capabilities compared to previous models [29][32]. - The agent's ability to perform complex tasks is attributed to its end-to-end training, which provides a unified model advantage over the iterative improvements seen in many startups [29][33]. - Startups are focusing on application innovation and user experience, while OpenAI emphasizes foundational model capabilities [33][34].
OpenAI上新Manus撤退 AI智能体两面
Bei Jing Shang Bao· 2025-07-20 14:31
Core Insights - OpenAI has not released GPT-5 as planned, instead launching the ChatGPT Agent, which possesses autonomous thinking and action capabilities [2][3] - Manus, a previously popular AI agent, has cleared its social media content and is reportedly relocating its headquarters to Singapore, leading to significant layoffs in China [2][6] Group 1: ChatGPT Agent Features - The ChatGPT Agent can autonomously select tools from its skill set to complete complex tasks, such as analyzing competitors and creating presentations [3] - It integrates functionalities from previous features like Operator and Deep Research into a unified system, enhancing its ability to interact with websites and process information [3][4] - The system includes various tools for web interaction, text processing, and code execution, but trading and sensitive operations are restricted to prevent financial losses [4][5] Group 2: Manus Market Exit - Manus has exited the Chinese market, clearing its social media and indicating a shift in focus to operational efficiency by relocating to Singapore [6][7] - The decision to move may be influenced by U.S. investment restrictions and the challenges of maintaining different product versions for domestic and international markets [7] - Manus's co-founder reflected on the challenges faced in developing AI agents, emphasizing the complexity of building effective systems [6][7] Group 3: Industry Trends and Predictions - The global AI agent market is projected to reach $5.4 billion by 2024, with expectations for significant growth as major companies commercialize AI agent products [8] - Analysts predict 2025 could mark the "year of the AI agent," with foundational large models being crucial for agent capabilities [8][9] - Concerns exist regarding the sustainability of the AI agent market, with predictions that over 40% of projects may be canceled by the end of 2027 due to market corrections [8][9]