Workflow
Gemini Live
icon
Search documents
X @Demis Hassabis
Demis Hassabis· 2025-09-25 18:55
RT Logan Kilpatrick (@OfficialLoganK)Introducing our latest Gemini Live model 🔊, built on all the things you love about Gemini, with significantly improved function calling and more natural feeling / sounding conversations (thanks to native audio)!Try out the new model at https://t.co/FpVyUQ9oEY ...
X @Demis Hassabis
Demis Hassabis· 2025-08-20 22:05
Product Updates - Gemini Live 正在成为一个更有帮助、更自然和更直观的助手 [1] - 新的视觉指导功能允许 GeminiApp 在用户分享相机时,直接在屏幕上高亮显示内容 [1] - 语音表达更加自然和富有表现力,包括改进的语调、节奏和音高等方面 [1] - Gemini 可以连接到更多的 Google 应用,如 Messages、Phone 和 Clock(需用户授权) [1]
X @Demis Hassabis
Demis Hassabis· 2025-08-20 19:28
Product Innovation - Google's AI advancements are making Pixel devices more helpful than ever [1] - New visual guidance in Gemini Live enhances user experience [1] - Voice Translate on phone calls improves communication accessibility [1]
X @Demis Hassabis
Demis Hassabis· 2025-08-12 02:00
Product Update - Gemini Live now integrates with Google apps, enabling users to share their camera or screen for real-time assistance [1] Functionality - The integration aims to provide instant help to users anytime [1]
Pick the perfect pairing. Gemini Live can look at the menu & make a suggestion based on your taste.
Google· 2025-07-25 17:34
Wine Recommendation & Pairing - Gemini can provide wine recommendations based on a wine list and meal pairing requests [1] - Gemini recommended Grignolino (La Casaccia, Poggeto '23) for pasta dishes due to its light body, high acidity, and red fruit notes [1] - The recommended wine has bright cherry and raspberry flavors, with hints of rose and subtle herbs [2] Wine Characteristics - Grignolino is described as a light-bodied red wine [1] - The wine has high acidity and red fruit notes [1] - The wine is versatile and pairs well with lighter pasta dishes [1]
速递|Anthropic推出Claude语音模式,卡位AI语音入口
Z Potentials· 2025-05-28 02:43
Core Insights - Anthropic has launched a voice mode for its AI model Claude, allowing users to interact using voice and choose from five unique tones [1] - This feature enhances user experience by enabling natural and intuitive conversations, similar to offerings from other AI companies like OpenAI and Google [2] Group 1 - The voice mode allows users to discuss documents and images, with the ability to switch between text and voice at any time [1] - Voice interactions are subject to usage limits, with most free users allowed 20-30 conversations [2] - Only paid subscribers can access the Google Workspace connector, which integrates with Google Calendar and Gmail, while Google Docs integration is exclusive to enterprise users [2] Group 2 - Anthropic's Chief Product Officer, Mike Krieger, confirmed the development of the voice feature in early March during an interview with the Financial Times [2] - The company is reportedly in discussions with Amazon, its main investor and partner, and AI startup ElevenLabs to enhance future voice capabilities for Claude [2]
每月1800元!谷歌推出最贵AI全家桶,谁买单?
Di Yi Cai Jing· 2025-05-21 09:16
Core Insights - Google faces significant challenges in successfully implementing its high-priced AI strategy, particularly with the introduction of its AI Ultra subscription service priced at $249.99 per month, which is $50 more expensive than ChatGPT Pro [3][16][17] Group 1: AI Model Developments - Google's Gemini 2.5 Pro and the newly released 2.5 Flash preview are leading the large model arena, surpassing ChatGPT-4o, but groundbreaking advancements like GPT-4 are unlikely to occur again [3][4] - The Gemini 2.5 Pro model has been updated and is currently ranked first in the large model arena, with a focus on integrating the best models into products quickly [4][5] - The Deep Think 2.5 Pro model has shown impressive performance, achieving a score of 40.4% in the challenging USAMO math competition, indicating gradual improvements in model capabilities [6] Group 2: AI Applications and Services - Gemini Live, a key product from Google, allows for real-time voice and visual processing, enabling users to interact naturally without needing to type [8] - Google has integrated AI capabilities into its search engine and Chrome browser, enhancing user experience by allowing quick content summarization [8] - New products include Google Beam, a 3D video communication platform, and Jules, an asynchronous AI code assistant [8] Group 3: Hardware Innovations - Google introduced two smart hardware devices, Project Moohan and XR glasses, emphasizing their compatibility with Gemini and potential to revolutionize spatial computing [9][16] Group 4: Market Position and Challenges - Despite being a pioneer in AI, Google faces significant competition and regulatory challenges, including antitrust lawsuits that threaten its market dominance [18][19] - Google's stock has seen a decline of nearly 20% since its peak in January, reflecting investor concerns about the company's ability to match AI investments with growth [19] - The search business, which generated $507 billion in revenue in Q1 2025, is under pressure from competitors and evolving AI technologies [19][20] Group 5: User Engagement and Future Outlook - Google aims to transform its AI offerings into a universal AI assistant, but the high price of its services may limit user adoption [16][17] - The company has reported a significant increase in monthly active users for Gemini applications, reaching over 400 million, but still trails behind ChatGPT's 600 million users [21] - The success of Google's AI strategy will depend on its ability to convert technological advantages into sustainable commercial value amidst intense competition [22]
2025谷歌开发者大会有哪些值得关注的内容?
Jin Shi Shu Ju· 2025-05-21 04:06
Core Insights - Google held its annual developer conference, Google I/O 2025, showcasing updates across its product lines, including Android, Chrome, Google Search, YouTube, and AI chatbot Gemini [1] Group 1: Gemini Ultra and Features - Gemini Ultra, available only in the U.S., offers the highest level of access to Google AI applications and services for a monthly fee of $249.99, including features like the Veo 3 video generator and the upcoming Gemini 2.5 Pro's Deep Think mode [1] - Subscribers of Gemini Ultra will receive enhanced quotas for NotebookLM and Whisk, along with 30TB of storage across Google services [2] Group 2: AI Enhancements - The Deep Think mode in Gemini 2.5 Pro is an enhanced reasoning mode that improves model performance by synthesizing multiple answers, similar to OpenAI's models [3] - Veo 3, a video generation AI, can create sound effects and voiceovers, and will be available exclusively to Gemini Ultra subscribers [4] - Imagen 4, a faster image generation AI, supports high-resolution outputs and detailed textures, enhancing video creation tools like Flow [5] Group 3: Gemini Application Updates - The Gemini series applications have surpassed 400 million monthly active users [6] - Gemini Live will soon allow all iOS and Android users to share their screens and engage in near real-time voice interactions with AI [7] Group 4: New AI Tools and Projects - Stitch is a new AI tool for designing web and mobile app front-ends, allowing users to generate UI elements and code from simple prompts [8] - Project Mariner, an experimental AI agent, can now handle multiple tasks simultaneously, enabling users to complete online shopping through AI interactions [9] - Project Astra, a low-latency multimodal AI project, is being developed in collaboration with companies like Samsung [10] Group 5: AI Mode and Search Enhancements - AI Mode, an experimental search feature, allows users to pose complex multi-part questions and will support visual search queries later this summer [11] Group 6: Video Conferencing and Communication - Beam, a 3D video conferencing tool, uses multiple cameras to create lifelike remote meetings and will integrate with Google Meet for real-time translation [12] Group 7: Integration and Updates - Gemini will be integrated into Chrome as a new AI browsing assistant, enhancing user experience across various Google applications [14] - Wear OS 6 introduces a unified font and improved interface consistency, while Google Play adds new tools for Android developers [15][16] - Android Studio will incorporate new AI features to assist in app development and quality insights [17]