Workflow
Responses API
icon
Search documents
深度|OpenAI API华人工程负责人:模型会把你的脚手架当早餐吃掉,为模型的未来而构建,而非为模型的当下而构建
Z Potentials· 2026-02-24 03:21
Core Insights - The article discusses the transformative impact of AI, particularly OpenAI's Codex, on software engineering, highlighting a shift from traditional coding to a more management-oriented role for engineers [3][5][12]. Group 1: AI Integration in Software Development - Currently, 95% of OpenAI engineers use Codex, with 100% of code merge requests being reviewed by it, indicating a significant reliance on AI for coding tasks [5][7]. - Engineers are evolving into technical leaders, managing multiple AI agents rather than writing code directly, which reflects a paradigm shift in software development [5][13]. - The expectation is that within the next 12 to 18 months, AI models will be capable of executing complex tasks over several hours, fundamentally changing the nature of software products [5][12]. Group 2: Efficiency and Productivity Gains - Engineers who utilize Codex submit 70% more pull requests (PRs) compared to those who do not, demonstrating increased productivity through AI tools [7][20]. - Codex has automated code reviews, reducing the time required for this task from 10-15 minutes to just 2-3 minutes, allowing engineers to focus on more engaging work [20][22]. Group 3: Evolution of Engineering Roles - The role of engineers is shifting towards that of managers who oversee AI agents, requiring new skills to ensure effective collaboration with AI [12][13]. - The metaphor of engineers as "wizards" using "spells" (code) to command AI reflects the growing complexity and capability of AI tools in software development [14][15]. Group 4: Challenges and Best Practices - A team at OpenAI is experimenting with a codebase entirely written by Codex, facing challenges in ensuring the AI meets specific requirements without a fallback option [18][19]. - Successful AI deployment in organizations requires both top-down strategic support and bottom-up employee engagement to foster a culture of AI utilization [44][45]. Group 5: Future of Startups and Entrepreneurship - The concept of "one-person billion-dollar startups" suggests that individuals leveraging AI tools can achieve significant productivity, potentially leading to a surge in small startups [30][31]. - The article posits that as software development becomes easier, there may be a proliferation of small companies, leading to a B2B SaaS boom [31][32]. Group 6: Management Philosophy and AI - The management philosophy emphasizes empowering high-performing employees, akin to supporting a lead surgeon in an operating room, to enhance productivity and innovation [39][40]. - AI tools can assist managers in predicting potential bottlenecks and proactively addressing issues, thereby improving team efficiency [41][42]. Group 7: AI Deployment Challenges - Many companies face negative ROI from AI projects due to a disconnect between management directives and employee capabilities, highlighting the need for a dedicated "AI task force" to bridge this gap [43][44]. - The ideal "AI task force" should consist of technically inclined individuals who are not necessarily software engineers but can effectively leverage AI tools [46].
OpenAI高管:工程师变成“魔法师”,AI将开启新一轮创业狂潮
Hua Er Jie Jian Wen· 2026-02-15 08:01
Core Insights - OpenAI's internal data reveals that 95% of its engineers are using Codex for programming, with 100% of pull requests (PRs) being reviewed by Codex, indicating a significant shift in software engineering practices [4][9][19] - The company is experimenting with a team maintaining a codebase entirely written by Codex, which could fundamentally change development methodologies [4][12] - Engineers are evolving from traditional coding roles to managing multiple AI agents, likening their work to that of "wizards" casting spells to accomplish tasks [5][6][10] Group 1: AI Integration and Impact - The deep integration of AI tools has led to engineers who use Codex generating 70% more PRs than those who do not, with this gap widening over time [4][18] - OpenAI emphasizes the need for developers to build for the future capabilities of AI models rather than their current state, as many existing scaffolding solutions may become obsolete [4][14][15] - The company views itself as an ecosystem platform aimed at enhancing the overall landscape rather than stifling startups through competition [8] Group 2: Future of Software Engineering - The next 12 to 24 months are expected to see AI models capable of executing complex tasks for several hours, marking a significant advancement in AI capabilities [7] - The rise of "one-person billion-dollar startups" is anticipated, with a corresponding increase in small SaaS companies catering to these individuals, potentially transforming the venture capital ecosystem [7][43] - The emergence of a B2B SaaS golden age is predicted, where the ease of software creation will lead to a proliferation of micro-companies [7][43][44] Group 3: Management and Workforce Dynamics - As AI tools enhance productivity, top performers are expected to leverage these tools to achieve greater efficiency, leading to a wider distribution of team productivity [36][37] - Management roles are evolving, with leaders spending more time supporting top performers and ensuring they have the resources needed to excel [37][41] - The integration of AI tools is likely to enable managers to oversee larger teams, similar to how engineers manage multiple AI agents [38][39]
盘点2025:模型服务,成为基础设施
第一财经· 2025-12-30 10:15
Core Insights - The article emphasizes the rapid growth of the Model as a Service (MaaS) market, with major players like OpenAI, Google Cloud, and Volcano Engine capturing significant market shares by 2025 [1][3] - Volcano Engine has achieved a remarkable daily token call volume of 63 trillion, positioning itself as a leading Chinese player in the AI cloud market [3][6] - The introduction of the Doubao model has led to exponential growth in token usage, highlighting the increasing importance of MaaS as a foundational infrastructure in AI [4][11] Market Dynamics - By October 2025, OpenAI, Google Cloud, and Volcano Engine are projected to hold 65% of the global MaaS market, with respective shares of 31%, 19%, and 15% [1] - Volcano Engine's daily token call volume of 30 trillion places it third globally, following OpenAI and Google Cloud [3] - The MaaS market is still perceived as "thin" and "narrow," indicating potential for further growth and competition [3] Company Performance - Volcano Engine has reported a 100% year-on-year revenue growth, exceeding 20 billion, and has revised its revenue target for 2030 upwards by several percentage points [6] - The company has prioritized MaaS as its strategic focus, leading to significant investments in resources and technology [6][16] - The introduction of the Doubao model API service has drastically reduced pricing, marking a shift from "per count" to "per milligram" pricing, with a reduction of up to 99.3% [6] Technological Advancements - The launch of the DeepSeek-R1 model has further enhanced Volcano Engine's capabilities, allowing it to capitalize on the growing demand for model inference services [7][10] - Continuous iterations of the Doubao model have led to increased token call volumes, with new models being released every three months [10][11] - The company is focusing on optimizing AI application accessibility and cost-effectiveness through advanced tools like Prompt Pilot and Model Router [27][28] Future Outlook - Volcano Engine aims to maintain its leadership in the MaaS market while expanding into deeper industry applications, particularly in sectors like smart manufacturing and consumer electronics [27] - The company is developing a new architecture centered around agents, which will enhance the integration of models into existing workflows [28][30] - The potential market for agents is vast, with estimates suggesting it could significantly expand beyond traditional IT budgets into areas like global customer service and programming [30]
科创板块全线回暖!广发基金科创50、科创100、科创200及科创成长等ETF全产品矩阵,助力布局科创板硬科技龙头标的
Xin Lang Cai Jing· 2025-12-22 06:34
Group 1 - The core viewpoint of the news highlights the positive momentum in the Chinese equity market driven by improving corporate earnings, capital allocation shifts, and policy optimization, with a focus on key sectors such as AI, new energy, and quantum technology [1][3] - The U.S. labor statistics indicate a slight increase in the unemployment rate to 4.6%, the highest since September 2021, which supports the rationale for a recent 25 basis point interest rate cut [1] - The upcoming 2026 economic work plan emphasizes expanding domestic demand, strengthening industries, and promoting reforms, with a particular focus on technology development and market-driven initiatives [3] Group 2 - Haidong International's updates on the volcanic engine and AI agent platform enhancements aim to reduce integration costs and clarify project boundaries, facilitating business value creation [2] - The demand for AI computing power is driving growth in the optical module industry, with 800G products entering mass production and 1.6T products poised for large-scale deployment, indicating a significant upward trend in the industry [2] - The performance of various ETFs, particularly those tracking the Sci-Tech sector, shows strong upward movement, with notable increases in individual stocks such as Zhuojing Technology and Zhongxin International [4][5][6] Group 3 - The Sci-Tech ETFs are designed to provide exposure to a basket of leading Sci-Tech stocks, with features such as daily trading and no restrictions on account assets or investment duration [6][7] - The Sci-Tech 50 ETF, Sci-Tech 100 ETF, and Sci-Tech 200 ETF focus on different segments of the Sci-Tech market, reflecting the performance of large-cap, mid-cap, and small-cap companies respectively [7][8] - The Sci-Tech Growth ETF targets high-growth companies within the Sci-Tech sector, emphasizing those with strong revenue and profit growth metrics [8]
Build Hour: Responses API
OpenAI· 2025-10-14 13:08
Responses API Overview - OpenAI introduced the Responses API to evolve beyond the Chat Completions API, addressing design limitations and enabling new functionalities for building agentic applications [1] - The Responses API combines the simplicity of chat completions with the ability to perform more agentic tasks, simplifying workflows like tool use, code execution, and state management [1] - The core of the Responses API is an agentic loop, allowing multiple actions within a single API request, unlike Chat Completions which only allows one model sample per request [2] - The Responses API uses "items" for everything, including messages, function calls, and MCP calls, making coding easier compared to Chat Completions where function calling was bolted onto messages [2] - The Responses API is purpose-built for reasoning models, preserving reasoning from request to request, boosting tool calling performance by 5% in primary tool calling eval tobench [2] - The Responses API facilitates multimodal workflows, making it easier to work with images and other multimodal content, including support for context stuffing with files like PDFs [2] - Streaming is rethought in the Responses API, emitting a finite number of strongly typed events, simplifying development compared to Chat Completions' object deltas [2] - Long multi-turn rollouts with the Responses API are 20% faster and less expensive due to the ability to rehydrate context from request to request, preserving the chain of thought [2] Agent Platform and Tools - OpenAI is changing deployment with its agent platform, centering on the Responses API and Agents SDK for building embeddable, customizable UIs [3] - Agent Builder and Chatkit, built on the Responses API, make it easy to build workflows into applications with minimal effort [3] - The Responses API is at the core of the improvement flywheel, enabling distillation and reinforcement fine-tuning using stateful data, along with tools like web search and file search [3]
Wall Street Brunch: Hooray For DevDay
Seeking Alpha· 2025-10-05 18:16
Economic Impact of Government Shutdown - The government shutdown is predicted to last nearly 21 days, with a 64% chance of exceeding 15 days and a 40% chance of lasting more than 25 days [3] - Goldman Sachs estimates that each week of the shutdown will reduce fourth-quarter annualized real GDP growth by approximately 0.15 percentage points, with a corresponding positive effect on growth in the first quarter if the shutdown ends before then [4] AI Sector Developments - OpenAI, backed by Microsoft, reached a private market valuation of $500 billion, making it the world's most valuable startup [6] - Speculation surrounds OpenAI's upcoming DevDay event, where CEO Sam Altman may unveil a new AI-powered browser, potentially named Aura, to compete with Google's Chrome [5] Earnings Reports and Company Performance - PepsiCo is expected to report EPS of $2.26 on revenue of $23.86 billion, with activist investor Elliott Management advocating for strategic changes to improve bottling efficiencies [8] - Delta Air Lines is forecasted to post EPS of $1.53 with revenue of $15.94 billion, with analysts noting its strong execution compared to peers, presenting a potential buy opportunity [9] Cryptocurrency Market Trends - Bitcoin has surpassed $125,000 for the first time, driven by strong inflows into bitcoin ETFs and renewed institutional interest [11] Consumer Sentiment and Economic Indicators - The upcoming FOMC minutes will be closely monitored, with an 85% chance of two more quarter-point rate cuts this year [10] - The University of Michigan will release its preliminary measure of October consumer sentiment, following a decline in consumer confidence to a five-month low [10] Pharmaceutical Developments - Costco will offer weight loss medications Wegovy and Ozempic at half the list price to its members, following a partnership with Novo Nordisk [12]
GPT-5差评启示录:用户与AI交互方式还停留在上一个时代
3 6 Ke· 2025-08-21 08:49
Core Insights - GPT-5 has received mixed reviews since its launch on August 8, with users expressing dissatisfaction despite its technical advancements [1][5][7] - The official stance from OpenAI is that the issues stem from users not adapting to the new interaction model required by GPT-5, which has evolved into a more autonomous "digital mind" [9][78] - The release of a prompt guide by OpenAI aims to help users better engage with GPT-5, emphasizing the importance of updated communication methods [8][9] Group 1: Performance and Capabilities - GPT-5 demonstrates significant improvements in areas such as mathematics, coding, and multi-modal understanding, showcasing its capabilities as a "full-stack engineer" [4][13] - Despite its higher IQ, GPT-5 exhibits instability, sometimes making errors on simple tasks and lacking emotional intelligence, which has led to concerns about its practical usability [5][6][10] - OpenAI has reported a performance increase in the Tau-Bench test, with scores rising from 73.9% to 78.2%, indicating better efficiency and lower costs [23][24] Group 2: User Interaction and Guidelines - The prompt guide outlines four key areas of evolution for GPT-5: agentic task performance, coding ability, raw intelligence, and steerability, which are crucial for effective user interaction [10][15][17] - Users are encouraged to adjust parameters like reasoning effort and verbosity to optimize GPT-5's performance based on task complexity [53][70] - The guide suggests methods for users to either constrain or empower GPT-5's capabilities, depending on the task requirements, highlighting the need for a more nuanced approach to AI interaction [29][32][36] Group 3: Challenges and Solutions - The dual-edged nature of GPT-5's capabilities means that improper use can lead to inefficiencies, necessitating users to become adept "trainers" of the AI [26][27] - OpenAI emphasizes the importance of clear and structured prompts to avoid conflicts that could lead to performance degradation [54][56] - The guide provides practical solutions for common user challenges, such as managing verbosity and reasoning depth, to enhance the overall interaction experience [50][52][68]
AI加速落地,算力产业链确定性高
Mei Ri Jing Ji Xin Wen· 2025-05-27 00:50
Group 1 - The core viewpoint of the article highlights the acceleration of AI applications and capital expenditures by major companies, indicating a positive trend in the industry [3][4]. - Major AI companies are releasing new models and applications, with Google's Gemini series being upgraded and set to launch across multiple platforms [3]. - OpenAI's announcement of the Responses API supporting MCP is expected to enhance AI Agent development efficiency and interaction capabilities, further driving the demand for the AIDC industry chain [3]. Group 2 - In Q1 2025, major overseas companies showed strong capital expenditures: Meta's CAPEX was $13.7 billion (up 104% YoY, down 8% QoQ), Amazon's was $26.3 billion (up 74% YoY, down 7% QoQ), and Google's was $17.2 billion (up 43% YoY, up 20% QoQ) [3]. - Domestic companies also increased their capital expenditures significantly: Alibaba's CAPEX was 24.6 billion yuan (up 120.6% YoY, down 22.6% QoQ), while Tencent's was 27.5 billion yuan (up 91% YoY, down 25% QoQ) [4]. - The ongoing investment in IDC construction by both domestic and international companies suggests a high level of certainty in the domestic AIDC computing power industry chain [4].
腾讯研究院AI速递 20250523
腾讯研究院· 2025-05-22 15:09
Group 1: OpenAI Innovations - OpenAI's Responses API now supports MCP services, allowing developers to connect external services with simple configurations, significantly reducing development complexity [1] - The updated API enhances security controls through the allowed_tools parameter and permission management to ensure safe tool usage by agents [1] - New features include image generation, Code Interpreter, file search, background mode, inference summaries, and encrypted inference items [1] Group 2: Microsoft's Magentic-UI - Microsoft launched the open-source Web Agent project Magentic-UI, enabling automatic web browsing, file reading/writing, and code execution, with user monitoring and control [2] - The system employs a collaborative planning and execution mechanism, generating task plans for user confirmation and allowing real-time intervention during execution [2] - The project integrates innovative technologies like neural style engines, component DNA mapping, and performance prediction for intelligent style conversion and component reuse [2] Group 3: Mistral's Devstral Model - Mistral, in collaboration with All Hands AI, released the open-source language model Devstral, featuring 24 billion parameters and capable of running on a single RTX 4090 or a 32GB RAM Mac [3] - Devstral scored 46.8% on the SWE-Bench Verified benchmark, outperforming GPT-4.1-mini and other open-source models, showcasing excellent code understanding and problem-solving abilities [3] - The model is released under the Apache 2.0 license for commercial use, with pricing set at $0.10 per million input tokens and $0.30 per million output tokens [3] Group 4: xAI's Live Search API - xAI introduced the Live Search API, providing real-time data access for Grok AI, enabling retrieval of the latest information from X platform, web content, and breaking news [4][5] - The API offers flexible search control features, including enabling/disabling searches, limiting result numbers, and specifying time ranges and domains, combined with DeepSearch for inference display [5] - A Python SDK is available, with free beta testing until June 5, 2025, allowing developers to implement real-time information queries and research assistance [5] Group 5: OpenAI's Acquisition of Jony Ive's Team - OpenAI acquired AI device startup io for $6.5 billion, gaining a hardware team led by former Apple Chief Design Officer Jony Ive, with the deal expected to close by summer [6] - io is developing new forms of AI devices aimed at reducing screen time, including headphones, wearables, and AI home devices, with a projected release in 2026 [6] - The associated company LoveFrom will continue to operate independently while taking on more design responsibilities for OpenAI, including ChatGPT interface and voice interaction products [6] Group 6: Kunlun Wanwei's Skywork Super Agents - Kunlun Wanwei launched the Skywork Super Agents, integrating five expert agents and one general agent for one-stop generation of documents, PPTs, and spreadsheets [7] - The product's core is based on deep research technology, supporting deep information retrieval and traceable content generation at only 40% of OpenAI's costs, with the framework open-sourced [7] - System features include automated requirement clarification, information tracing, and personal knowledge base functionality, allowing users to upload various file formats to build knowledge bases [7] Group 7: Microsoft's Aurora Model - Microsoft introduced the first large-scale atmospheric foundation model, Aurora, trained on millions of hours of atmospheric data, achieving computation speeds 5000 times faster than the most advanced numerical forecasting systems [8] - Aurora excels in predicting air quality, wave patterns, tropical cyclone trajectories, and high-resolution weather, maintaining high accuracy even in data-scarce regions and extreme weather [8] - The model utilizes a 3D Swin Transformer architecture, allowing fine-tuning for different application areas, with a training cycle of only 4-8 weeks, and future expansion into ocean circulation and seasonal weather predictions [8] Group 8: Gartner's Principles for Intelligent Applications - Gartner identified that GenAI will drive enterprise software from auxiliary tools to intelligent agents, outlining five principles for building intelligent applications: adaptive experience, embedded intelligence, autonomous orchestration, interconnected data, and composable architecture [9] - Intelligent applications emphasize personalized experiences and proactive services, enabling cross-system tasks through natural language interactions, with AI capabilities deeply embedded in business logic for process optimization [9] - Enterprises need to maintain balanced investments in the five principles while upgrading foundational data, processes, architecture, and experiences to ensure intelligent applications transition from pilot demonstrations to scalable value applications [9] Group 9: a16z's Insights on AI Programming - The AI coding market has become the second-largest AI market after chatbots, valued at approximately $3 trillion, with developers rapidly adopting this tool as early technology adopters [10] - AI programming will not completely replace traditional programming; understanding foundational abstractions and system architecture remains crucial, with developer roles shifting towards product management or QA engineering [10] - New demographics and methods are fostering a new software paradigm, similar to the WordPress era, where AI lowers the barrier to "writing code," yet the depth and complexity of software development still require professional knowledge [10]