3D Gaussian Splatting (3DGS)
Challenging WorldLabs: Visionary, a WebGPU Rendering Platform That Comprehensively Surpasses Marble's Underlying Renderer
机器之心· 2025-12-21 04:21
Core Insights
- The article discusses the development of Visionary, a new rendering platform that uses WebGPU and ONNX to enhance the visualization of, and interaction with, World Models in web environments, overcoming limitations of earlier technologies such as SparkJS [2][10][27].

Group 1: Challenges in Current Technologies
- Existing World Model visualization methods, particularly those relying on WebGL, face significant limitations when rendering dynamic and complex scenes because of CPU sorting bottlenecks [6][7][8].
- Current solutions such as SparkJS are designed primarily for static or pre-computed Gaussian rendering, making them inadequate for real-time inference of dynamic 3D Gaussian Splatting (3DGS) and Neural Avatars [7][8].

Group 2: Visionary's Innovations
- Visionary is positioned as a native web rendering substrate that integrates GPU computation and rendering directly into the browser, replacing the older WebGL framework [10][25].
- It introduces a Gaussian Generator Contract that standardizes the output of various 3DGS and 4DGS methods into the ONNX format, allowing Gaussian attributes to be generated and updated dynamically in real time [11][13].

Group 3: Performance and Quality Improvements
- Experimental data indicate that Visionary significantly outperforms SparkJS in rendering efficiency, particularly in scenes with millions of Gaussian points, by shifting sorting and preprocessing tasks to the GPU [18][21].
- Visionary employs frame-by-frame GPU global sorting to eliminate the visual artifacts seen in other solutions, ensuring accurate rendering of transparency even in complex multi-model scenarios [21][24].

Group 4: Applications and Future Directions
- Visionary serves as a unified platform for researchers, creators, and industry, enabling quick reproduction and comparison of 3DGS variants as well as editing and rendering directly in the browser [24][25].
- The development team views Visionary as a foundational step towards a comprehensive World Model framework, with future explorations planned in areas such as physical interaction enhancement and spatial intelligence [26][28].
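The CPU-sorting bottleneck described above arises because Gaussian splats must be depth-ordered every frame before alpha compositing. Visionary's actual WebGPU kernels are not shown in the summary, so the following is only a minimal NumPy sketch of the per-frame global sort and front-to-back compositing that such renderers move onto the GPU; the function name and data layout are illustrative assumptions.

```python
import numpy as np

def composite_sorted_gaussians(centers, colors, alphas, cam_pos, view_dir):
    """Sort Gaussians by view-space depth, then alpha-composite front to back.

    CPU sketch of what a GPU renderer does per frame with a parallel sort;
    not Visionary's implementation.
    """
    # Depth of each Gaussian center along the camera's viewing direction.
    depths = (centers - cam_pos) @ view_dir
    order = np.argsort(depths)              # nearest first (front-to-back)
    color = np.zeros(3)
    transmittance = 1.0
    for i in order:
        weight = alphas[i] * transmittance  # contribution of this splat
        color += weight * colors[i]
        transmittance *= 1.0 - alphas[i]
        if transmittance < 1e-4:            # early termination, as in 3DGS
            break
    return color, transmittance
```

A per-scene CPU sort like this is exactly what becomes the bottleneck at millions of Gaussians, which is why moving the global sort into a GPU compute pass matters.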
Embedding 3DGS into Diffusion: A Fast, High-Resolution 3D Generation Framework (ICCV'25)
自动驾驶之心· 2025-11-01 16:04
Core Viewpoint
- The article introduces DiffusionGS, a novel pixel-level 3D diffusion model for the image-to-3D generation task that maintains 3D view consistency and applies to both object-centric and larger-scale scene-level generation [2][17].

Group 1: Methodology
- DiffusionGS predicts a 3D Gaussian point cloud at each timestep to ensure consistency among generated views, enhancing the quality of both object and scene generation [2][30].
- The model operates in pixel space rather than latent space, allowing better preservation of 3D representations and higher spatial resolution [26][30].
- A scene-object mixed training strategy is proposed to generalize 3D priors across various datasets, improving the model's performance [32][34].

Group 2: Performance Metrics
- DiffusionGS achieves a PSNR of 25.89 and an SSIM of 0.8880, outperforming current state-of-the-art methods by 2.20 dB in PSNR and 23.25 in FID [40].
- The model generates images in 6 seconds at 256x256 resolution and 24 seconds at 512x512 resolution, 7.5 times faster than Hunyuan-v2.5 [16][40].
- The method demonstrates superior clarity and 3D consistency in generated images, with fewer artifacts and less blurriness than existing techniques [44].

Group 3: Technical Contributions
- The Reference-Point Plucker Coordinate (RPPC) enhances spatial perception by incorporating camera pose information into the model [32][37].
- The architecture includes two different MLPs for decoding Gaussian primitives, tailored to object-level and scene-level generation respectively [39].
- A point distribution loss is designed to improve object-level training, ensuring better convergence and performance [39].
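RPPC's exact formulation is given in the paper; as a rough illustration of the underlying idea of a Plücker-style per-pixel ray embedding whose moment is taken about a reference point, here is a NumPy sketch. The function name and the choice of reference point are assumptions for illustration, not the paper's definition.

```python
import numpy as np

def reference_point_plucker(ray_origin, ray_dir, ref_point):
    """Per-pixel ray embedding: unit direction plus moment about a reference point.

    Shifting the moment's origin to `ref_point` makes the 6-D embedding aware of
    camera pose relative to that point (e.g., the scene center). Illustrative only.
    """
    d = ray_dir / np.linalg.norm(ray_dir)      # unit ray direction
    m = np.cross(ray_origin - ref_point, d)    # moment of the ray about ref_point
    return np.concatenate([d, m])              # 6-D per-pixel feature
```

A useful property of this embedding is that it depends only on the ray itself, not on where along the ray the origin is placed: sliding the origin by any multiple of the direction leaves the moment unchanged.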
A Roundup of ICCV 2025 Autonomous Driving Scene Reconstruction Work: A Very Promising Direction
自动驾驶之心· 2025-07-29 00:52
Core Viewpoint
- The article emphasizes advances in autonomous driving scene reconstruction, highlighting the integration of multiple technologies and the collaboration among top universities and research institutions in this field [2][12].

Summary by Sections

Section 1: Overview of Autonomous Driving Scene Reconstruction
- The article discusses the importance of dynamic and static scene reconstruction in autonomous driving, focusing on the need for precise color and geometric information obtained by fusing lidar and visual data [2].

Section 2: Research Contributions
- Notable research from institutions such as Tsinghua University, Nankai University, Fudan University, and the University of Illinois Urbana-Champaign is mentioned, showcasing their contributions to the field [5][6][10][11].

Section 3: Educational Initiatives
- The article promotes a comprehensive course on 3D Gaussian Splatting (3DGS), designed in collaboration with leading experts and aimed at providing in-depth knowledge and practical skills in autonomous driving scene reconstruction [15][19].

Section 4: Course Structure
- The course comprises eight chapters covering foundational algorithms, the technical details of 3DGS, static and dynamic scene reconstruction, surface reconstruction, and practical applications in autonomous driving [19][21][23][25][27][29][31][33].

Section 5: Target Audience
- The course targets researchers, students, and professionals interested in 3D reconstruction, and requires a foundational understanding of 3DGS and related technologies [36][37].
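As a taste of the "technical details of 3DGS" that such courses cover, the standard EWA-style projection of a 3D Gaussian's covariance into screen space, Sigma' = J W Sigma W^T J^T, can be sketched as follows. This is the generic textbook formulation, not material from the course itself; names are illustrative.

```python
import numpy as np

def project_covariance(cov3d, W, t, fx, fy):
    """Project a 3D Gaussian covariance to a 2x2 screen-space covariance.

    Sigma' = J W Sigma W^T J^T, where W is the world-to-camera rotation and J is
    the Jacobian of the perspective projection at the Gaussian's camera-space
    mean t = (x, y, z).
    """
    x, y, z = t
    # Jacobian of (fx * x / z, fy * y / z) with respect to (x, y, z).
    J = np.array([[fx / z, 0.0, -fx * x / z**2],
                  [0.0, fy / z, -fy * y / z**2]])
    cov_cam = W @ cov3d @ W.T          # rotate covariance into camera space
    return J @ cov_cam @ J.T           # 2x2 screen-space covariance
```

The resulting 2x2 covariance is what determines each splat's elliptical footprint during rasterization; note how the 1/z terms shrink the footprint of distant Gaussians.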
A Diverse, Large-Scale Dataset! SceneSplat++: The First Comprehensive 3DGS-Based Benchmark
自动驾驶之心· 2025-06-20 14:06
Core Insights
- The article introduces SceneSplat-Bench, a comprehensive benchmark for evaluating visual-language scene understanding methods based on 3D Gaussian Splatting (3DGS) [11][30].
- It presents SceneSplat-49K, a large-scale dataset containing approximately 49,000 raw scenes and 46,000 filtered 3DGS scenes, the most extensive open-source dataset for complex, high-quality scene-level 3DGS reconstruction [9][30].
- The evaluation indicates that generalizable methods consistently outperform per-scene optimization methods, establishing a new paradigm for scalable scene understanding through pre-trained models [30].

Evaluation Protocols
- The benchmark evaluates methods on two key metrics in 3D space, foreground mean intersection over union (f-mIoU) and foreground mean accuracy (f-mAcc), which address object-size imbalance and reduce viewpoint dependency compared with 2D evaluations [22][30].
- The evaluation datasets include ScanNet, ScanNet++, and Matterport3D for indoor scenes and HoliCity for outdoor scenes, probing the methods' capabilities across varied object scales and complex environments [22][30].

Dataset Contributions
- SceneSplat-49K is compiled from multiple sources, including SceneSplat-7K, DL3DV-10K, HoliCity, and Aria Synthetic Environments, ensuring a diverse range of indoor and outdoor environments [9][10].
- Dataset preparation took approximately 891 GPU-days and extensive human effort, highlighting the significant resources invested in creating a high-quality dataset [7][9].

Methodological Insights
- The article categorizes methods into three types: per-scene optimization methods, per-scene optimization-free methods, and generalizable methods, with SceneSplat representing the last category [23][30].
- Generalizable methods eliminate the need for extensive single-scene computation during inference, processing a 3D scene efficiently in a single forward pass [24][30].
Performance Results
- The results on SceneSplat-Bench demonstrate that SceneSplat excels in both performance and efficiency, often surpassing the pseudo-label methods used for its pre-training [24][30].
- The performance of various methods varies significantly with dataset complexity, indicating the importance of challenging benchmarks for revealing the limitations of competing methods [28][30].
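The f-mIoU metric used throughout the benchmark can be illustrated with a small sketch. The exact benchmark implementation may differ (for instance, in how classes absent from a scene are handled), so treat this as a plausible reading of the metric, not SceneSplat-Bench's code.

```python
import numpy as np

def foreground_mean_iou(pred, gt, foreground_ids):
    """Mean IoU over foreground classes only (an f-mIoU-style sketch).

    `pred` and `gt` are per-point integer label arrays. Restricting the mean to
    foreground classes counters the object-size imbalance that background-heavy
    classes (walls, floors) would otherwise introduce.
    """
    ious = []
    for c in foreground_ids:
        gt_c, pred_c = (gt == c), (pred == c)
        if gt_c.sum() == 0:
            continue                                  # class absent from this scene
        inter = np.logical_and(gt_c, pred_c).sum()
        union = np.logical_or(gt_c, pred_c).sum()
        ious.append(inter / union)
    return float(np.mean(ious)) if ious else 0.0
```

Because the metric is computed directly on 3D points rather than rendered 2D views, it also avoids the viewpoint dependency the article attributes to 2D evaluation.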