Workflow
FlashWorld
icon
Search documents
世界模型可单GPU秒级生成了?腾讯开源FlashWorld,效果惊艳、免费体验
机器之心· 2025-10-30 08:52
Core Insights - The collaboration between Xiamen University and Tencent has produced a highly regarded paper titled "FlashWorld: High-quality 3D Scene Generation within Seconds," which has gained significant attention both domestically and internationally, ranking first on the Huggingface Daily Paper list and receiving endorsements from prominent AI figures [2][4]. Group 1: FlashWorld's Performance - FlashWorld achieves 3D scene generation in 5 to 10 seconds on a single GPU, representing a speed increase of up to 100 times compared to previous methods [4]. - The generated scenes can be rendered in real-time on web user interfaces, surpassing the quality of other closed-source models [4]. - In comparative tests, FlashWorld produced stable, complete, and high-quality rendering results, being five times faster than the quick mode of Marble and eliminating the need for backend GPU connections like RTFM [6][10]. Group 2: Technical Approach - FlashWorld utilizes a technology route based on 3DGS for scene output, allowing for local web rendering, which is a significant advantage over video models that require heavy loads [8]. - The method combines a multi-view diffusion model with a three-dimensional focus, enhancing visual quality through a distillation process that ensures multi-view consistency and reduces denoising steps [10][12]. - The training process includes dual-mode pre-training and cross-mode post-training, which enhances the model's ability to generalize across various scenes, styles, and trajectories without needing ground truth data [13][16]. Group 3: Experimental Results - FlashWorld has demonstrated superior performance in generating structured scenes, such as fences, which were previously challenging to achieve [18]. - The model excels in generating fine details, such as hair, from text inputs, showcasing its capability in dense perspective reconstructions [21]. - In benchmark tests, FlashWorld outperformed other methods in speed and quality, achieving the highest average scores in various qualitative metrics [23][24].