通义万相Wan2.5 preview系列模型
Search documents
通义万相2.5系列模型发布,可一键P图、生成BGM视频
Xin Lang Ke Ji· 2025-09-24 05:20
Core Insights - Alibaba launched the Tongyi Wanshang Wan2.5 preview series models at the 2025 Hangzhou Yunqi Conference, which includes four major models: text-to-video, image-to-video, text-to-image, and image editing [1] - The Tongyi Wanshang 2.5 video generation model can create videos with synchronized audio, effectively lowering the barrier for high-quality video production [1] - The new model enhances creative capabilities, increasing video generation time from 5 seconds to 10 seconds, and supports 24 frames per second in 1080P HD video [1] Summary by Category Product Features - The Tongyi Wanshang 2.5 model can generate human voices, sound effects, and background music that match the visuals, making video storytelling more vivid [1] - It has improved instruction-following capabilities, allowing for complex continuous changes in video generation tasks and enabling one-click effects in image editing tasks [1] - The model can generate Chinese and English text, complex layouts, artistic posters, flowcharts, and architecture diagrams, along with image editing capabilities [1] Performance Metrics - The Tongyi Wanshang model family now supports over 10 visual creation capabilities, including text-to-image, text-to-video, and action generation, with a total of 390 million images and 70 million videos generated [2] - Since February of this year, the Tongyi Wanshang has open-sourced over 20 models, achieving over 30 million downloads across open-source communities and third-party platforms [2]