Workflow
Vidu 2.0
icon
Search documents
Vidu Q2的参考生视频,是AI视频多参党的胜利。
数字生命卡兹克· 2025-10-22 01:33
Core Viewpoint - Vidu Q2 has significantly improved the multi-image reference video capabilities, establishing itself as a leader in this new paradigm of AI video workflow [1][8][84]. Group 1: Consistency - The consistency in multi-image reference videos has greatly evolved, allowing for better handling of multiple subjects without losing individual characteristics [11][12]. - The previous version, Vidu Q1, struggled with multiple subjects, often resulting in incomplete or unrealistic representations [14][15]. - Vidu Q2 successfully showcases multiple characters together while maintaining their unique traits, demonstrating a marked improvement in consistency [29][15]. Group 2: Emotional Performance - Vidu Q2 enhances emotional expression in videos, allowing for more nuanced performances from characters [30][37]. - The platform enables users to create stable character representations by uploading multiple images from different angles, improving the management of character assets [32][33]. - The emotional depth in performances has been notably enhanced, with characters displaying a wider range of emotions and subtleties compared to previous versions [38][45]. Group 3: Multi-Style Expressiveness - Vidu Q2 excels in producing videos across various animation styles, reinforcing its reputation as a leader in AI-generated anime content [58][70]. - The platform allows for seamless integration of different styles, maintaining both character and stylistic consistency [70]. - The advanced camera movements and effects in Vidu Q2 enhance the overall visual storytelling, making it suitable for dynamic scenes [71][75]. Group 4: Pricing and Accessibility - The pricing model for Vidu Q2 is competitive, with a monthly subscription costing 59 yuan for 800 points, making it one of the most affordable AI video models available [79][80]. - The introduction of an app for interactive features similar to Sora2 adds to the user experience, allowing for collaborative video creation [82].
对话生数科技创始人兼首席科学家朱军:AI视频生成正迈入“高可控”时代
Mei Ri Jing Ji Xin Wen· 2025-03-29 13:17
Core Insights - The development of large models, particularly in AI video generation, is rapidly evolving, with significant advancements expected in 2025 [1][3] Company Insights - Shengshu Technology has officially launched the Vidu Q1, the first high-controllability video large model in the industry, which is set to go global in April [1][3] - Vidu Q1 has achieved major technical breakthroughs, allowing for spatial layout information as input, significantly enhancing video generation controllability [3][4] - The company’s SaaS products have reached over 10 million users within 100 days of launch, marking the fastest growth globally [4] - Shengshu Technology is focusing on multi-modal large model development, with video being one of the expressions of this technology [4][6] - The company emphasizes the importance of continuous innovation and the ability to meet user demands in different stages of development [4][7] Industry Insights - The commercialization path for video large models is expected to be more diverse compared to language models, with a faster market acceptance due to broader application scenarios [6][7] - The video industry is unlikely to see a dominant player like DeepSeek in language models, as the competition is more diversified and less crowded [7][8] - The industry is moving towards longer, more narrative-driven video content, transitioning from short videos to more complex storytelling [8]