Workflow
3D Gaussian Splatting
icon
Search documents
清华团队开源DISCOVERSE框架:用3D高斯渲染打通机器人仿真到现实的“最后一公里”!
机器人大讲堂· 2025-11-10 04:07
Core Insights - The article discusses the challenges in end-to-end robot learning, particularly focusing on the "Sim2Real" gap, which is primarily caused by the inadequacy of simulation environments to accurately replicate real-world scenarios [1][6][10]. Group 1: Challenges in Robot Simulation - Current simulation environments struggle with three main issues: insufficient realism in replicating real-world scenarios, high costs in scene asset acquisition and system configuration, and time-consuming data collection processes [1][5]. - The core obstacle is the performance drop during the Sim2Real transfer, which stems from the fundamental differences between simulated and real-world environments, such as object appearance, lighting effects, and spatial geometry [1][6]. Group 2: Existing Simulation Frameworks - Various simulation frameworks have been developed, but none meet the three critical requirements: high visual fidelity, accurate physical interaction, and efficient parallel scalability [3][6]. - Traditional simulators often compromise on either visual realism or physical accuracy, leading to ineffective training for robots [6][7]. Group 3: DISCOVERSE Framework - DISCOVERSE is an open-source simulation framework developed by Tsinghua University in collaboration with other institutions, integrating 3D Gaussian Splatting (3DGS), MuJoCo physics engine, and control interfaces into a unified architecture [5][10]. - The framework aims to bridge the Sim2Real gap by enhancing the realism of simulations through a three-layer innovation approach, focusing on accurate digital representation of real-world scenes and objects [10][12]. Group 4: Performance and Efficiency - DISCOVERSE significantly improves simulation speed, achieving rendering speeds up to 650 FPS on high-end hardware, which is three times faster than competing solutions [19][20]. - The framework supports a wide range of asset formats and robot models, enhancing compatibility and reducing the need for extensive configuration [21][22]. Group 5: Testing and Results - In comparative tests, DISCOVERSE outperformed other mainstream simulators in zero-shot transfer success rates across various tasks, demonstrating its effectiveness in real-world applications [24][27]. - The framework also enhances data collection efficiency, reducing the time required to gather demonstration data from 146 minutes to just 1.5 minutes, thus accelerating algorithm iteration [29]. Group 6: Future Implications - DISCOVERSE is positioned as a versatile robot simulation framework capable of supporting various complex tasks, with potential applications in robotics, drones, and autonomous driving sensors [30]. - The release of the framework's code and API aims to facilitate adoption by developers and enterprises, marking a significant milestone in the robotics industry [30].
ICCV 2025 | RobustSplat: 解耦致密化与动态的抗瞬态3DGS三维重建
具身智能之心· 2025-08-20 00:03
Core Viewpoint - The article discusses the RobustSplat method, which addresses the challenges of 3D Gaussian Splatting (3DGS) in rendering dynamic objects while maintaining high-quality static scene reconstruction [1][4][19]. Research Motivation - The motivation stems from understanding the dual role of Gaussian densification in 3DGS, which enhances scene detail but can lead to overfitting in dynamic areas, resulting in artifacts and scene distortion [4][6]. Methodology - **Transient Mask Estimation**: Utilizes a Mask MLP architecture to output pixel-wise transient masks, distinguishing between transient and static regions [9]. - **Feature Selection**: DINOv2 features are chosen for their balance of semantic consistency, noise resistance, and computational efficiency, outperforming other feature sets [10]. - **Supervision Design**: Combines image residual loss and feature cosine similarity loss for mask optimization, enhancing dynamic area recognition [10]. - **Delayed Gaussian Growth Strategy**: This core strategy postpones the densification process to prioritize static scene structure optimization, reducing the risk of misclassifying static areas as transient [12]. - **Mask Regularization**: Aims to minimize the misclassification of static regions during early optimization stages [12]. - **Scale Cascade Mask Guidance**: Initially estimates transient masks using low-resolution features, transitioning to high-resolution supervision for improved accuracy [14]. Experimental Results - Experiments on NeRF On-the-go and RobustNeRF datasets show that RobustSplat outperforms baseline methods like 3DGS, SpotLessSplats, and WildGaussians in PSNR, SSIM, and LPIPS metrics [16][20]. Conclusion - The RobustSplat method effectively reduces rendering artifacts caused by transient objects while preserving scene details, demonstrating its robustness in complex scenarios [18][19].