Gemini 2.5 Pro和Flash模型

Search documents
四点速读2025谷歌开发者大会
第一财经· 2025-05-21 03:22
Core Insights - Google has made significant advancements in AI technology, integrating it into its ecosystem through model upgrades, content generation tools, and hardware updates [1]. Group 1: Gemini Model Upgrade - The Gemini model has been upgraded to Gemini 2.5 Pro and Flash, enhancing multimodal capabilities with support for audiovisual input and native audio output [2]. - Developers can utilize the Live API preview to customize dialogue experiences, including tone, accent, and speaking style [2]. - The Deep Think mode introduces an enhanced reasoning mechanism, improving the model's ability to handle mathematical, programming, and multimodal tasks by considering multiple possibilities before answering [2]. Group 2: Generative Content Tools Upgrade - Google introduced the Veo 3 video generation model, which supports native audio generation, allowing for the creation of high-definition videos with background music, sound effects, and dialogue [3]. - The Imagen 4 image generation model has made significant improvements in detail and text output quality, capable of rendering intricate details and supporting various styles and aspect ratios up to 2K resolution [3]. Group 3: AI Agents for Convenience - The Project Mariner AI agent tool has been updated to handle multiple tasks simultaneously, enabling users to purchase tickets or groceries without visiting third-party websites [4]. - Google launched the Google Beam video calling platform, featuring a six-camera array and custom light field display, allowing for 3D rendering of video calls with real-time voice translation [4]. Group 4: XR Smart Glasses - Google has partnered with brands like Xreal and Samsung to launch Android XR smart glasses, which integrate AI assistant features for real-time translation, navigation, and information prompts [5]. Group 5: Subscription Plan - Google has introduced a monthly subscription plan priced at $249.99 for AI Ultra, providing access to advanced AI features such as Gemini 2.5 Pro's Deep Think mode and Veo 3 video generation tools, along with higher usage limits and additional storage [6].