Core Viewpoint - OpenAI's new AI browser, Atlas, is built on a restructured architecture called OWL, which separates the Chromium runtime from the main application process, aiming to enhance browser performance and user experience [1][3][11]. Group 1: Foundation and Architecture - OpenAI emphasizes that Chromium serves as a foundational building block, providing advanced web engine capabilities, security models, performance, and compatibility, supported by a global developer community [5]. - The OWL architecture allows Chromium's browser process to run independently from the Atlas main application process, enhancing modularity and performance [12][14]. - OpenAI's approach involves a complete redesign of the Chromium integration, focusing on rapid development and maintaining engineering culture [10][11]. Group 2: User Experience Enhancements - Atlas aims to redefine the browser experience with features like instant startup speed, smooth performance even with multiple tabs, and a strong foundation for agent scenarios [7]. - The user interface of Atlas is almost entirely rebuilt from scratch, incorporating modern native frameworks rather than merely re-skinning the open-source Chromium interface [9][10]. - The architecture allows for faster loading times, crash isolation, and reduced merge conflicts, facilitating a quicker development cycle [18]. Group 3: Technical Implementation - Atlas operates as an OWL client, while the Chromium browser process acts as the OWL host, communicating through Mojo, a process communication system [17]. - The OWL client library provides a simplified Swift API for key functionalities, ensuring a clean codebase and modern application design [18]. - Input events are captured and forwarded efficiently, maintaining a seamless interaction between the Atlas interface and the Chromium rendering engine [30][32]. Group 4: Agent Mode and Security - The Agent mode in Atlas presents unique challenges, requiring complete screen images for input while ensuring security through sandboxing and session isolation [36][37]. - Each Agent session operates independently, clearing all cookies and data upon completion, allowing multiple concurrent sessions without interference [37]. Conclusion - OpenAI reiterates the critical role of the global Chromium community in enabling these advancements, with OWL paving the way for a decoupled engine and application architecture that combines top-tier web platforms with modern native frameworks [38].
「套壳」的最高境界:OpenAI揭秘Atlas浏览器架构OWL
机器之心·2025-10-31 03:01