Interactions API
Google's Open Gambit: On GPT-5.2 Launch Day, It Unveils the "Deepest" Research Agent to Date
美股IPO· 2025-12-12 07:34
Core Viewpoint
- Google has launched Gemini Deep Research, billed as the most powerful deep research agent to date. The aim is to become the infrastructure-level entry point for agents, pointing to a future where users rely on their agents to run searches instead of searching manually [1][7].
Group 1: Product Launch and Strategic Timing
- Google timed the release of Gemini Deep Research to coincide with OpenAI's anticipated launch of GPT-5.2, showcasing advanced capabilities built on the Gemini 3 Pro model [3][4].
- The launch is seen as a calculated response to OpenAI's developments, positioning Gemini Deep Research as a product with significant strategic implications [4].
Group 2: Features and Capabilities
- Gemini Deep Research is not just a tool for generating research reports; it is designed to handle larger contexts, process vast amounts of information, and perform long-chain reasoning tasks that can last for minutes or hours [5].
- The new Interactions API lets developers integrate Deep Research into their applications, effectively packaging search, multi-step reasoning, and evaluation into an operating-system-level service [5].
Group 3: Integration and Future Vision
- Deep Research will gradually be integrated into Google services including Google Search, Google Finance, the Gemini app, and NotebookLM [6].
- The vision is that users will no longer need to perform searches themselves; instead, their agents will handle all search tasks [7].
Group 4: Addressing Challenges in AI
- Google aims to tackle hallucination, a major challenge for AI agents. It claims that Deep Research benefits from the higher factual accuracy of the Gemini 3 Pro model, which helps reduce distortions in long-chain reasoning tasks [8].
Group 5: Performance Benchmarks
- Google has introduced new benchmarks, such as DeepSearchQA, to test multi-step information retrieval, and has open-sourced them [9].
- In benchmark tests, the new agent outperformed competitors, although OpenAI's ChatGPT 5 Pro came close, particularly on the BrowseComp test [10].
Group 6: Competitive Landscape
- The simultaneous announcements from Google and OpenAI mark direct competition, with Google aiming to establish a foothold in the rapidly evolving agent landscape [11].
- The competition has shifted from model superiority to who can become the foundational infrastructure for future methods of information access [12].
Google Steps Up on Agents: Enhanced Gemini Deep Research and a Dedicated API Arrive
量子位· 2025-12-12 06:41
Core Insights
- OpenAI and Google are both making significant updates in the AI space, with Google launching an enhanced version of Gemini Deep Research aimed at reducing hallucinations and excelling in complex information retrieval and analysis tasks [1][3][10].
Group 1: Gemini Deep Research Enhancements
- The enhanced Gemini Deep Research is built on Gemini 3 Pro and will soon be integrated into Google services such as Google Search, NotebookLM, Google Finance, and the upgraded Gemini App [3][8].
- This version of Gemini Deep Research can perform iterative reasoning: it generates queries, reads and integrates search results, and identifies knowledge gaps, which significantly improves its web search capabilities [10][12].
- In benchmark tests such as HLE, BrowseComp, and DeepSearchQA, the enhanced model achieved state-of-the-art (SOTA) results, showing strong performance in complex research tasks [10][12].
Group 2: DeepSearchQA Benchmark
- Google has released the DeepSearchQA benchmark dataset to provide a more comprehensive evaluation standard for deep search and research tasks, addressing the limitations of existing benchmarks [5][12].
- The dataset includes 900 manually designed causal chain tasks from 17 domains. Each task requires a detailed answer set, which better measures a model's multi-step reasoning and information fusion capabilities [12].
Group 3: Interactions API
- Google has introduced the Interactions API, designed to give developers a unified interface for interacting with Gemini 3 Pro and Deep Research agents [6][16].
- The API is particularly suited to scenarios requiring multi-step reasoning, tool invocation, and long-running task execution, enhancing the capabilities of existing models [17][18].
- The Interactions API simplifies workflows and adapts better to developer environments by expanding the core capabilities of content generation and supporting server-side state, interpretable data models, and remote tools [18].
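The DeepSearchQA description above says each task requires a detailed answer set rather than a single answer. As a rough illustration of why that is a stricter test, the sketch below scores one task by comparing a predicted set against an expected set. The function name `score_answer_set`, the normalization rule, and the choice of precision, recall, and F1 are all assumptions made for this example; the summary does not state how DeepSearchQA is actually scored.

```python
from typing import Iterable, Tuple


def _normalize(text: str) -> str:
    """Lowercase and collapse whitespace so trivial formatting differences do not count as misses."""
    return " ".join(text.lower().split())


def score_answer_set(predicted: Iterable[str], expected: Iterable[str]) -> Tuple[float, float, float]:
    """Return (precision, recall, f1) for one task.

    Hypothetical scoring rule for illustration only; it is not
    taken from the DeepSearchQA benchmark itself.
    """
    pred = {_normalize(p) for p in predicted}
    gold = {_normalize(g) for g in expected}
    hits = len(pred & gold)

    precision = hits / len(pred) if pred else 0.0
    recall = hits / len(gold) if gold else 0.0
    f1 = 2 * precision * recall / (precision + recall) if hits else 0.0
    return precision, recall, f1


if __name__ == "__main__":
    # One spurious item lowers precision even though every expected item was found.
    p, r, f = score_answer_set(["Paris", "berlin ", "Rome"], ["paris", "Berlin"])
    print(f"precision={p:.2f} recall={r:.2f} f1={f:.2f}")
```

Under a rule like this, an agent is penalized both for missing items and for padding its answer with extras, which is one plausible reason a set-valued answer measures information fusion better than a single-fact lookup.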
Countering OpenAI's GPT-5.2, Google Launches the Gemini Deep Research Agent
Huan Qiu Wang Zi Xun· 2025-12-12 03:53
Core Insights
- Google has launched Gemini Deep Research, an advanced AI research agent, following the release of OpenAI's GPT-5.2, marking a significant step towards industrial application of AI in complex research tasks [1][3].
Group 1: Gemini Deep Research Features
- Gemini Deep Research is built on Gemini 3 Pro and is optimized for long-cycle content collection and synthesis. It achieves a 40% reduction in hallucination rates compared to previous models, making it Google's most factually accurate model to date [3].
- The AI agent uses multi-step reinforcement learning to navigate complex information environments with higher precision, enabling deep information mining through iterative planning of research paths [3].
- In benchmark tests, Gemini Deep Research scored 46.4% in Google's new benchmarks and performed comparably to GPT-5 Pro in BrowseComp, while costing approximately one-tenth as much [3].
Group 2: Applications and Industry Impact
- The AI agent has demonstrated significant value across industries. In financial services, it automates early-stage information collection and improves research efficiency by integrating market signals and compliance risks [4].
- In biotechnology, Axiom Bio has used the AI for literature analysis related to drug toxicity prediction, resulting in deeper and more granular research and accelerating drug development [4].
- The AI's strong information integration capabilities have also improved decision-making in market research [4].
Group 3: Interactions API and Future Developments
- The newly launched Interactions API allows developers to use Gemini Deep Research to build next-generation automated research tools, with capabilities such as unified information synthesis and structured output [5].
- Future upgrades will include native chart output and expanded support for custom data sources, with plans to launch Deep Research services on the Vertex AI platform for broader industry application [6].
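The summaries above describe the agent as working iteratively: it generates queries, reads results, identifies knowledge gaps, and plans the next step of the research path. A minimal sketch of that loop, assuming a pluggable search function, might look like the following. Everything here is hypothetical and for illustration only: `iterative_research`, `SearchFn`, `toy_search`, and `TOY_CORPUS` are invented names, the search backend is a hard-coded dictionary, and none of it reflects Google's implementation or the Interactions API.

```python
from typing import Callable, Dict, List, Optional, Tuple

# A search function returns (answer or None, follow-up questions it surfaced).
# This signature is an assumption made for the sketch.
SearchFn = Callable[[str], Tuple[Optional[str], List[str]]]


def iterative_research(question: str, search: SearchFn, max_rounds: int = 5) -> Dict[str, str]:
    """Answer a question by repeatedly searching and chasing newly found gaps."""
    findings: Dict[str, str] = {}
    gaps: List[str] = [question]   # questions still unanswered
    seen = {question}              # avoid re-asking the same question

    for _ in range(max_rounds):
        if not gaps:
            break                  # nothing left to investigate
        next_gaps: List[str] = []
        for query in gaps:
            answer, follow_ups = search(query)
            if answer is not None:
                findings[query] = answer
            for follow_up in follow_ups:
                if follow_up not in seen:
                    seen.add(follow_up)
                    next_gaps.append(follow_up)
        gaps = next_gaps           # plan the next round from the gaps just found
    return findings


# Toy stand-in for a real search backend; facts are taken from the summaries above.
TOY_CORPUS: Dict[str, Tuple[Optional[str], List[str]]] = {
    "Who makes Gemini Deep Research?": ("Google", ["Which model is it built on?"]),
    "Which model is it built on?": ("Gemini 3 Pro", []),
}


def toy_search(query: str) -> Tuple[Optional[str], List[str]]:
    return TOY_CORPUS.get(query, (None, []))


if __name__ == "__main__":
    for q, a in iterative_research("Who makes Gemini Deep Research?", toy_search).items():
        print(f"{q} -> {a}")
```

The `max_rounds` cap stands in for whatever budget a real agent would use; without some bound, a loop that keeps discovering follow-up questions could run indefinitely.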