GUI范式
Search documents
手机Agent的两种范式:API与GUI
GOLDEN SUN SECURITIES· 2025-12-07 08:24
Investment Rating - The report maintains an "Accumulate" rating for the computer industry [4]. Core Insights - The mobile interaction paradigm is transitioning from GUI to Agentic interaction, allowing users to express their intentions in natural language, which the mobile agent then executes [1][12]. - Two main technical routes for mobile agents are identified: API paradigm and GUI paradigm, each with distinct advantages and challenges [1][2][24]. - The rise of mobile agents signifies a reshuffling of mobile internet traffic among mobile manufacturers, large model manufacturers, and application developers, leading to complex interactions among these parties [3][26]. Summary by Sections Mobile Agent and Interaction Paradigm - The shift from GUI to Agentic interaction is driven by the increasing complexity of applications and the need for more efficient user interactions [1][12]. - Users can now communicate their needs through natural language, with mobile agents handling the execution of tasks across different applications [1][12]. API Paradigm Analysis - The API paradigm involves creating standardized semantic interfaces that require app developers to adapt and expose functionalities for agent use [16][18]. - Apple's App Intents framework exemplifies this approach, emphasizing privacy and structured integration [16][17]. GUI Paradigm Analysis - The GUI paradigm operates without developer cooperation, using visual models to simulate user actions on the screen [2][19]. - Recent advancements in multi-modal models, such as Google's Gemini 3 Pro, have significantly improved the ability to understand and interact with UI elements [19][21]. Comparison of API and GUI Agents - GUI agents offer higher generality, allowing them to operate across various applications without developer adaptation, while API agents excel in reliability, performance, and privacy [2][24]. - API agents can complete complex tasks in a single call, whereas GUI agents may require multiple steps, leading to higher computational costs and potential delays [24]. Evolution of Business Models - The emergence of mobile agents is reshaping the competitive landscape, with mobile manufacturers seeking to leverage traffic entry points and large model manufacturers aiming to create comprehensive applications [3][27][28]. - Application developers face a dual challenge of collaborating with mobile and model manufacturers while protecting their own interests [31]. Recommendations for Attention - Key players in the GUI agent space include ByteDance, Google, Alibaba, and ZTE, while Tencent, Alibaba, and Google are notable in the API agent domain [7][33].