Workflow
模态编码
icon
Search documents
具身空间数据技术的路线之争:合成重建VS全端生成
量子位· 2025-04-20 13:24
Core Viewpoint - The breakthrough in embodied intelligence relies heavily on high-quality data, with a significant focus on synthetic data generation due to the high costs of real data collection [1][2]. Group 1: Data Challenges - The current state of embodied intelligence data is characterized by scarcity and inadequacy, with existing sources being limited and not sufficiently diverse [16][18]. - Three main categories of existing data sources are identified: real scan data, game engine environments, and open-source synthetic datasets, each with its limitations [17]. - The indoor embodied intelligence scenarios require structured, semantic, and interactive 3D scene data, which is challenging to collect due to the unique layouts and usage patterns of individual households [18][19]. Group 2: Technical Approaches - There are two primary technical routes for synthetic data generation: "video synthesis + 3D reconstruction" and "end-to-end 3D generation" [3][24]. - The "video synthesis + 3D reconstruction" approach involves generating video or images first, which can lead to cumulative errors and limited structural accuracy [24][39]. - The "end-to-end 3D generation" method aims for direct synthesis of structured spatial data but faces challenges such as low generation quality and lack of common sense [67][68]. Group 3: Innovations in Data Generation - A new technical solution called "modal encoding" is proposed to address the common sense gap in end-to-end 3D generation, allowing for the digital encoding and implicit learning of spatial solutions [5][91]. - The Sengine SimHub is introduced as a system that integrates design knowledge into the generation process, enhancing the stability and adaptability of the generated data [75][78]. - The focus is on creating a data generation system that not only produces space but also generates "understandable and usable" environments, incorporating design logic and user preferences [91][96]. Group 4: Future Directions - The industry is at a critical juncture where the need for a new approach to data generation is evident, moving beyond mere data accumulation to creating "useful data" [95][96]. - The future of embodied intelligence may hinge on how space is defined and understood, emphasizing the importance of integrating rules and preferences into spatial data generation [96][100].