Phased DMD
Search documents
国产芯片也能跑AI视频实时生成了,商汤Seko 2.0揭秘幕后黑科技
机器之心· 2025-12-15 08:10
Core Insights - The article discusses the competitive landscape of video generation models, highlighting the advancements made by various tech companies, including Google, Runway, and Kuaishou, while questioning the readiness of these models as productivity tools [2][9] - SenseTime's Seko 2.0 version is introduced as a significant advancement, enabling AI short drama creation with minimal human input, effectively allowing a single person to manage the production [2][4][7] Group 1: Industry Developments - Major tech companies are racing to release enhanced versions of video generation models before the end of the year, with Google launching Veo 3.1 and Runway introducing Gen-4.5 [2] - SenseTime's Seko 2.0 has been successfully deployed in over a hundred short drama studios, showcasing its capability to generate scripts, storyboards, and videos rapidly [7][9] Group 2: Technical Challenges - The article outlines the "impossible triangle" of video generation, where efficiency, cost, and quality are at odds, making it difficult for AI video generation models to meet commercial demands [11][13] - Current models, even at the Sora 2 level, require several minutes to generate just 10 seconds of video, which hampers rapid iteration and real-time feedback essential for industrial production [11][12] Group 3: Innovations in Video Generation - SenseTime's LightX2V framework is highlighted as a breakthrough in real-time video generation, achieving generation times of under 5 seconds for 5-second videos, significantly faster than current industry standards [16][17] - The framework employs Phased DMD technology, which enhances video quality and consistency while maintaining high generation speeds [19][20] Group 4: Engineering and Optimization - LightX2V incorporates a comprehensive optimization strategy across five dimensions: model, scheduling, computation, storage, and communication, enabling low-cost and real-time video generation [31][32] - The framework's architecture allows for efficient use of consumer-grade GPUs, achieving real-time generation capabilities with a memory requirement of less than 8GB [36][37] Group 5: Domestic Chip Adaptation - SenseTime's Seko 2.0 has achieved full compatibility with domestic AI chips, allowing for a cost-effective alternative to NVIDIA chips while maintaining comparable video quality [39][40] - The strategic support for domestic AI ecosystems is emphasized, marking a significant step for China's AI industry in achieving core technological independence [42]