Google's Gemini Learns to Compose Music from Images, So Your WeChat Moments Can Have Their Own BGM
量子位 (QbitAI) · 2026-02-19 07:03

Core Viewpoint
- Google has transformed Gemini into a comprehensive creative tool capable of generating music and album covers based on user input, significantly simplifying the creative process [1][2][4].

Group 1: Features of Gemini
- The latest Lyria 3 model integrated into Gemini allows users to create music by simply providing text or images, producing complete songs with lyrics, melodies, and vocals in seconds [2][4].
- The audio sampling rate of Lyria 3 is 48 kHz, ensuring high-fidelity sound quality for generated music, enhancing the overall user experience [5][7].
- Users can upload photos, and the AI can generate music that captures the essence of the image, providing a personalized soundtrack for social media [7][9].

Group 2: Creative Capabilities
- Gemini can generate songs in various styles, from nostalgic African beats to classic Motown soul, showcasing its versatility in music production [10][13].
- The AI can produce natural-sounding vocals and lyrics, making it feel like users have a personal music producer at their disposal [11][12].
- The integration of the Nano Banana model allows for the automatic creation of album covers that match the generated music, further streamlining the creative process [3][15].

Group 3: Strategic Intent of Google
- Google aims to establish Gemini as a "super entry point" for digital life, integrating various services like cloud storage, photo albums, and YouTube into a single platform [16][18].
- This comprehensive approach reduces the need for users to switch between different applications, enhancing efficiency and convenience [17][18].
- By creating a seamless user experience, Google strengthens its position in the market, making it less likely for users to seek out independent applications [17][18].
