Gemini Live

Search documents
X @Demis Hassabis
Demis Hassabis· 2025-08-20 22:05
RT Google (@Google)Gemini Live is becoming an even more helpful, natural and visual assistant:👀 New visual guidance: Now, when you share your camera, @GeminiApp not only sees what you see, but can highlight things directly on your screen🗣️ More natural and expressive speech — with improved including intonation, rhythm and pitch🤝 Connect to even more of your Google apps with your permission, like Messages, Phone and Clock#MadeByGoogle ...
X @Demis Hassabis
Demis Hassabis· 2025-08-20 19:28
RT Google (@Google)From Voice Translate on phone calls to new visual guidance in Gemini Live, here’s how AI is making Pixel 10 more helpful than ever → https://t.co/JY8TTGO3aM #MadeByGoogle https://t.co/6ONccitPAY ...
X @Demis Hassabis
Demis Hassabis· 2025-08-12 02:00
RT Google Gemini App (@GeminiApp)Gemini Live now connects with your favorite apps from @Google - just share your camera or screen to get instant help, anytime. ...
Pick the perfect pairing. Gemini Live can look at the menu & make a suggestion based on your taste.
Google· 2025-07-25 17:34
Wine Recommendation & Pairing - Gemini can provide wine recommendations based on a wine list and meal pairing requests [1] - Gemini recommended Grignolino (La Casaccia, Poggeto '23) for pasta dishes due to its light body, high acidity, and red fruit notes [1] - The recommended wine has bright cherry and raspberry flavors, with hints of rose and subtle herbs [2] Wine Characteristics - Grignolino is described as a light-bodied red wine [1] - The wine has high acidity and red fruit notes [1] - The wine is versatile and pairs well with lighter pasta dishes [1]
速递|Anthropic推出Claude语音模式,卡位AI语音入口
Z Potentials· 2025-05-28 02:43
Core Insights - Anthropic has launched a voice mode for its AI model Claude, allowing users to interact using voice and choose from five unique tones [1] - This feature enhances user experience by enabling natural and intuitive conversations, similar to offerings from other AI companies like OpenAI and Google [2] Group 1 - The voice mode allows users to discuss documents and images, with the ability to switch between text and voice at any time [1] - Voice interactions are subject to usage limits, with most free users allowed 20-30 conversations [2] - Only paid subscribers can access the Google Workspace connector, which integrates with Google Calendar and Gmail, while Google Docs integration is exclusive to enterprise users [2] Group 2 - Anthropic's Chief Product Officer, Mike Krieger, confirmed the development of the voice feature in early March during an interview with the Financial Times [2] - The company is reportedly in discussions with Amazon, its main investor and partner, and AI startup ElevenLabs to enhance future voice capabilities for Claude [2]
每月1800元!谷歌推出最贵AI全家桶,谁买单?
Di Yi Cai Jing· 2025-05-21 09:16
Core Insights - Google faces significant challenges in successfully implementing its high-priced AI strategy, particularly with the introduction of its AI Ultra subscription service priced at $249.99 per month, which is $50 more expensive than ChatGPT Pro [3][16][17] Group 1: AI Model Developments - Google's Gemini 2.5 Pro and the newly released 2.5 Flash preview are leading the large model arena, surpassing ChatGPT-4o, but groundbreaking advancements like GPT-4 are unlikely to occur again [3][4] - The Gemini 2.5 Pro model has been updated and is currently ranked first in the large model arena, with a focus on integrating the best models into products quickly [4][5] - The Deep Think 2.5 Pro model has shown impressive performance, achieving a score of 40.4% in the challenging USAMO math competition, indicating gradual improvements in model capabilities [6] Group 2: AI Applications and Services - Gemini Live, a key product from Google, allows for real-time voice and visual processing, enabling users to interact naturally without needing to type [8] - Google has integrated AI capabilities into its search engine and Chrome browser, enhancing user experience by allowing quick content summarization [8] - New products include Google Beam, a 3D video communication platform, and Jules, an asynchronous AI code assistant [8] Group 3: Hardware Innovations - Google introduced two smart hardware devices, Project Moohan and XR glasses, emphasizing their compatibility with Gemini and potential to revolutionize spatial computing [9][16] Group 4: Market Position and Challenges - Despite being a pioneer in AI, Google faces significant competition and regulatory challenges, including antitrust lawsuits that threaten its market dominance [18][19] - Google's stock has seen a decline of nearly 20% since its peak in January, reflecting investor concerns about the company's ability to match AI investments with growth [19] - The search business, which generated $507 billion in revenue in Q1 2025, is under pressure from competitors and evolving AI technologies [19][20] Group 5: User Engagement and Future Outlook - Google aims to transform its AI offerings into a universal AI assistant, but the high price of its services may limit user adoption [16][17] - The company has reported a significant increase in monthly active users for Gemini applications, reaching over 400 million, but still trails behind ChatGPT's 600 million users [21] - The success of Google's AI strategy will depend on its ability to convert technological advantages into sustainable commercial value amidst intense competition [22]
2025谷歌开发者大会有哪些值得关注的内容?
Jin Shi Shu Ju· 2025-05-21 04:06
Core Insights - Google held its annual developer conference, Google I/O 2025, showcasing updates across its product lines, including Android, Chrome, Google Search, YouTube, and AI chatbot Gemini [1] Group 1: Gemini Ultra and Features - Gemini Ultra, available only in the U.S., offers the highest level of access to Google AI applications and services for a monthly fee of $249.99, including features like the Veo 3 video generator and the upcoming Gemini 2.5 Pro's Deep Think mode [1] - Subscribers of Gemini Ultra will receive enhanced quotas for NotebookLM and Whisk, along with 30TB of storage across Google services [2] Group 2: AI Enhancements - The Deep Think mode in Gemini 2.5 Pro is an enhanced reasoning mode that improves model performance by synthesizing multiple answers, similar to OpenAI's models [3] - Veo 3, a video generation AI, can create sound effects and voiceovers, and will be available exclusively to Gemini Ultra subscribers [4] - Imagen 4, a faster image generation AI, supports high-resolution outputs and detailed textures, enhancing video creation tools like Flow [5] Group 3: Gemini Application Updates - The Gemini series applications have surpassed 400 million monthly active users [6] - Gemini Live will soon allow all iOS and Android users to share their screens and engage in near real-time voice interactions with AI [7] Group 4: New AI Tools and Projects - Stitch is a new AI tool for designing web and mobile app front-ends, allowing users to generate UI elements and code from simple prompts [8] - Project Mariner, an experimental AI agent, can now handle multiple tasks simultaneously, enabling users to complete online shopping through AI interactions [9] - Project Astra, a low-latency multimodal AI project, is being developed in collaboration with companies like Samsung [10] Group 5: AI Mode and Search Enhancements - AI Mode, an experimental search feature, allows users to pose complex multi-part questions and will support visual search queries later this summer [11] Group 6: Video Conferencing and Communication - Beam, a 3D video conferencing tool, uses multiple cameras to create lifelike remote meetings and will integrate with Google Meet for real-time translation [12] Group 7: Integration and Updates - Gemini will be integrated into Chrome as a new AI browsing assistant, enhancing user experience across various Google applications [14] - Wear OS 6 introduces a unified font and improved interface consistency, while Google Play adds new tools for Android developers [15][16] - Android Studio will incorporate new AI features to assist in app development and quality insights [17]