可灵2.6模型推出“音画同出”能力中文语音生成效果全球领先

Core Insights - The article highlights the launch of the Keling 2.6 model on December 3, which introduces a groundbreaking "audio-visual synchronization" capability, transforming the traditional AI video generation workflow [1] Group 1: Model Features - The Keling 2.6 model allows for the simultaneous generation of videos that include natural language, sound effects, and ambient audio, significantly enhancing creative efficiency [1] - The model upgrades two main functions: text-to-sound and image-to-sound generation [1] - The model supports voice generation in both Chinese and English, with the maximum video length reaching 10 seconds [1] Group 2: Performance and Competitive Edge - The Keling 2.6 model demonstrates impressive performance in audio-visual collaboration, audio quality, and semantic understanding [1] - It maintains a global leading position in Chinese voice generation effectiveness [1]