Baidu's MuseSteamer Video Generation Model Upgraded to Version 2.0, Priced as Low as 70% of the Industry Level
Xin Lang Ke Ji·2025-08-21 07:33

Core Insights
- Baidu's MuseSteamer audio-video integration model has completed an upgrade, achieving the industry's first multi-person audio-video generation [2]
- The upgraded versions, including Turbo, Lite, Pro, and a full audio version, are now available for users through Baidu search or the "Hui Xiang" platform [2]
- The model is the world's first Chinese audio-video integration I2V model, featuring innovative Latent Multi-Modal Planner technology [2]

Group 1
- The MuseSteamer can autonomously coordinate multiple roles, emotions, and interaction logic, achieving over 98% accuracy in rendering Chinese speech details and emotional expressions [2]
- It delivers cinematic-quality HD video, realistic environmental sound effects, and natural character voices in synchronized output [2]
- The model has been applied in various scenarios, including Baidu search and marketing, with pricing reduced to 70% of industry standards [2]

Group 2
- Industry experts note that the upgrade not only enhances quality but also significantly reduces creative costs [2]
- Renowned visual effects supervisor Yao Qi showcased a sci-fi short film "Return" created with MuseSteamer 2.0, stating that it eliminates the need for million-dollar budgets for Hollywood-level shots [2]