Workflow
实测Gemini图片转视频新功能,终于蹲到经典梗图后续了(doge)
量子位·2025-07-12 04:57

Core Viewpoint - The article discusses the new feature of Gemini that allows users to convert images into videos with sound, showcasing its capabilities and performance through various tests and examples [54]. Group 1 - Gemini has integrated the Veo 3 Fast technology, enabling video generation of approximately 7-8 seconds in length, with a generation speed of about 1-2 minutes [54]. - Users can generate videos three times a day under the Google AI Pro membership, with retries also counting against this limit [54]. - The sound effects produced by Gemini are noted to be impressive, although more specific descriptions are needed for better accuracy in sound generation [55]. Group 2 - The article highlights various tests conducted with the new feature, including opening different types of boxes and the resulting animations, which often include humorous or unexpected elements [5][20][24]. - The performance ratings for generated videos vary, with some achieving high scores in speed and fun, while others have lower ratings for visual effects [17][22][26]. - There are limitations noted, such as the inability to generate specific human likenesses and the need for detailed prompts to achieve desired outcomes [56][57].