瞄准 Sora 2，谷歌发布 Veo 3.1，功能大更新，但硬刚还差点儿

Core Insights - Google has released its latest AI video generation model, Veo 3.1, which enhances audio and narrative control, as well as visual quality compared to its predecessor [2][3] Group 1: Model Improvements - Veo 3.1 offers richer audio and narrative control, improving support for dialogue and environmental sound effects [7] - The model maintains a basic generation duration of 8 seconds, extendable to 30 seconds, but with issues in audio continuity during extensions [4][12] - The core model quality has not significantly improved, remaining behind competitors like Sora2 [4] Group 2: New Features - Users can now generate longer clips, with the potential to extend videos beyond 30 seconds, maintaining continuity from the last frame of previous clips [11][19] - The introduction of native audio generation allows for better control over video emotion, rhythm, and narrative tone during the creation phase [12] - Enhanced input capabilities include support for text prompts, images, and video clips, allowing for more precise control over the generated output [13] Group 3: Deployment and Pricing - Veo 3.1 is accessible through various Google AI services, including Flow and Gemini API, with a pricing structure consistent with the previous version [15][17] - The model supports video outputs at 720p or 1080p resolution, with a frame rate of 24 fps [16] - Pricing is set at $0.40 per second for the standard model and $0.15 per second for the fast model, with charges applied only after successful video generation [18]