Core Viewpoint - The article discusses the advancements in AI video generation, particularly focusing on the limitations of current models in achieving precise control without sacrificing quality. It introduces the World Forge framework developed by the West Lake University AGI Lab, which allows for enhanced control in video generation without retraining the model [2][3][28]. Group 1: World Forge Framework - World Forge is a new framework that enables "plug-and-play" guidance for video diffusion models, allowing for 360° world generation and cinematic video trajectory re-framing without altering model weights [3][11]. - The framework's core idea is to intervene and calibrate during the generation process rather than modifying the model during training, ensuring adherence to spatiotemporal consistency while allowing creative freedom [11][28]. Group 2: Key Innovations - The framework includes three key innovations: 1. Intra-step Recursive Refinement (IRR): This mechanism ensures that AI-generated movements strictly follow predefined camera trajectories by incrementally correcting predictions with real content [13]. 2. Flow-Gated Latent Fusion (FLF): This module separates motion and appearance channels in the latent space, allowing precise injection of motion commands without disturbing visual details [14]. 3. Dual-Path Self-Correcting Guidance (DSG): This strategy balances trajectory accuracy and visual quality by dynamically adjusting the guidance based on the differences between guided and non-guided paths [15]. Group 3: Practical Applications - World Forge can generate a clear and stable 360° surround video from a single image, making it suitable for complex scenes centered around a target [19]. - It allows users to specify complex camera movements for any video, enabling stable re-shooting and automatic content completion from new perspectives [20]. - The framework supports video editing capabilities, such as removing unwanted objects, adding new elements, and facilitating virtual try-ons, all while maintaining geometric consistency [25]. Group 4: Advantages of World Forge - One of the main advantages of World Forge is its training-free nature, which significantly reduces costs and barriers to high-quality 3D/4D content creation [27][29]. - The framework is flexible and can be integrated into various mainstream video models without the need for targeted training, showcasing strong generalization capabilities across different domains [29].
无需训练,即插即用:西湖大学发布世界模型WorldForge,让普通视频模型秒变「世界引擎」
机器之心·2025-09-23 03:16