Baidu MuseSteamer
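Baidu MuseSteamer 2.0 video generation models launch across the full lineup, an industry first for integrated multi-character audio-video generation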
Cai Jing Wang· 2025-08-21 13:05
Core Viewpoint
- Baidu's MuseSteamer has received a major upgrade, with multiple new versions that generate audio and video together. The ability to create multi-character audio-visual content is presented as an industry milestone [1][2]

Group 1: Technological Breakthroughs
- MuseSteamer is billed as the world's first Chinese-language integrated audio-video I2V (image-to-video) generation model, supporting environmental sound effects and multi-character voice generation [2]
- Five core technological breakthroughs:
  1. Multi-character audio-visual generation with millisecond precision across voice, lip-sync, expressions, and actions [2]
  2. Latent Multi-Modal Planner technology for coordinating character identities, emotions, and interaction logic [2]
  3. Over 98% accuracy in rendering Chinese voice details and emotional expressions [2]
  4. End-to-end generation of movie-level video quality with precise dynamic character portrayal [2]
  5. Master-level camera control, with dozens of professional camera-language techniques responding accurately to text commands [2]

Group 2: Cost Structure Transformation
- The upgrade fundamentally changes the cost structure, significantly reducing traditional filmmaking expenses such as actors, venue rentals, and post-production [3]
- The cost of producing high-quality special effects has dropped to as low as hundreds of yuan, making Hollywood-level production accessible without a million-dollar budget [3]

Group 3: Competitive Pricing and User Engagement
- Baidu has introduced a competitive pricing system, with prices up to 70% below comparable products in the industry [4]
- New users receive free imagination points upon registration, and weekly events offer chances to win additional points [4]

Group 4: Ecosystem Impact and Application
- MuseSteamer's development is driven by application needs and strengthens several ecosystems, including search, content, commercial, and cloud [5]
- The technology lets users easily create videos from scripts, lowering professional barriers and enabling individual creativity [5]
- Businesses can use the technology to produce high-quality, low-cost marketing content, as in the successful FAW-Volkswagen campaign [6]
Baidu's Q2 AI Cloud revenue reaches RMB 6.5 billion; AI search monetization begins early testing
Nan Fang Du Shi Bao· 2025-08-20 16:25
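Baidu Core revenue was RMB 26.3 billion, down 2% year over year and up 3% quarter over quarter. Net income attributable to Baidu Core was RMB 7.4 billion, up 35% year over year. Driven by AI, revenue from AI new businesses, including AI Cloud, grew strongly, exceeding RMB 10 billion for the first time, up 34% year over year.

"In the second quarter, our AI Cloud business achieved healthy growth, thanks to our continually strengthening full-stack AI capabilities and end-to-end AI products and solutions," said Baidu founder Robin Li (李彦宏). During the quarter, Baidu accelerated the AI transformation of search and the global expansion of Apollo Go (萝卜快跑). "Baidu always focuses on the new AI areas with the greatest potential for long-term value creation, so that our technology and innovation produce the most meaningful and lasting impact."

Baidu, Inc. (in millions, except per ADS amounts; unaudited)

| | Q2 2024 (RMB) | Q1 2025 (RMB) | Q2 2025 (RMB) | Q2 2025 (US$) | YoY | QoQ |
| --- | --- | --- | --- | --- | --- | --- |
| Total revenues | 33,931 | 32,452 | 32,713 | 4,567 | (4%) | 1% |
| Operating income | 5,944 | 4,508 ...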