Core Insights - The launch of the Keling 2.6 model introduces a groundbreaking "audio-visual synchronization" capability, transforming the traditional AI video generation workflow from "silent video followed by manual voiceover" to a more efficient process that generates complete videos with natural language, sound effects, and ambient sounds in a single output [1][4]. Group 1: Model Features - The Keling 2.6 model upgrades two main functions: text-to-sound and image-to-sound, allowing users to generate videos with voice, sound effects, and ambient sounds directly from text or images [4][6]. - The model supports both Chinese and English voice generation, with a maximum video length of 10 seconds, significantly enhancing the efficiency of video creation for users [4][6]. Group 2: Performance and Quality - The Keling 2.6 model excels in audio-visual synchronization, audio quality, and semantic understanding, ensuring that the generated videos align closely with the rhythm of speech and environmental sounds, avoiding the disjointed experience typical of traditional workflows [6][7]. - The audio quality produced by the model is cleaner and richer in layers, closely resembling professional mixing effects, thus meeting high demands for sound detail in professional creative work [6]. Group 3: Industry Applications - The Keling 2.6 model is applicable across various sectors, including advertising, self-media, and e-commerce, significantly improving content creation efficiency [7][8]. - In advertising, the model can generate short promotional videos with integrated narration, dialogue, and sound effects, reducing production costs and enhancing efficiency [7]. - For self-media creators, the model facilitates diverse content types, such as interviews, dramas, and musical performances, thereby lowering the cost and complexity of content creation [7][8]. - In the e-commerce sector, the model enables the creation of product showcase and explanation videos through capabilities like solo narration and commentary, improving operational efficiency for businesses [8].
可灵2.6模型推出“音画同出”能力 重构AI视频创作工作流
Yang Guang Wang·2025-12-05 06:47