3D Gaussian Splatting
Qunhe Technology Releases 3D Gaussian Semantic Dataset to Help Robots "Understand" Physical Space
IPO早知道 · 2025-07-25 13:15
Core Viewpoint - The article discusses the launch of the InteriorGS dataset by Qunhe Technology, aimed at enhancing spatial perception capabilities for robots and AI agents, marking a significant advancement in AI training methodologies [2][4].

Group 1: InteriorGS Dataset
- The InteriorGS dataset includes 1,000 3D Gaussian semantic scenes covering over 80 types of indoor environments, providing AI agents with a "spatial brain" to improve their environmental understanding and interaction capabilities [2][4].
- This dataset is claimed to be the world's first large-scale 3D dataset suitable for the free movement of intelligent agents [2][4].

Group 2: 3D Gaussian Technology
- In recent years, 3D Gaussian technology has gained traction in fields such as cultural heritage preservation and spatial design due to its ability to "scan and reconstruct scenes" [3].
- Qunhe Technology engineers have utilized this technology to recreate a 60-year-old photo studio in Hangzhou, which has garnered significant attention [3].

Group 3: Training Data Generation Pathway
- The dataset is part of a new training data generation pathway that combines 3D Gaussian reconstruction, spatial large model capabilities, and physical simulation [6].
- This process allows for rapid scene reconstruction from videos or images, enriching the data with semantic logic and simulating physical characteristics for intelligent agents to learn spatial understanding and interaction [6].

Group 4: Spatial Intelligence Platform
- Qunhe Technology's SpatialVerse is positioned as a leading spatial intelligence training platform, having accumulated vast amounts of interactive 3D data and a suite of physical simulation tools [9].
- The goal of SpatialVerse is to become the "ImageNet" of the spatial intelligence field, similar to how ImageNet catalyzed the explosion of computer vision [9].

Group 5: Industry Collaboration
- Qunhe Technology has formed partnerships with several embodied intelligence companies, including Zhiyuan Robotics and Galaxy General, to leverage its high-quality 3D scene data [9].
- The company aims to advance the "Sim2Real" paradigm, which is currently the most efficient training method for embodied intelligence, by collaborating with industry players [9].
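Fei-Fei Li's Spatial Intelligence Unicorn Open-Sources Its Core Technology! AI-Generated 3D Worlds Run Smoothly on All Devices: The "Shader" of Spatial Intelligence Is Here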
量子位 (QbitAI) · 2025-06-03 04:26
Core Viewpoint - World Labs, co-founded by Fei-Fei Li, has open-sourced a core technology called Forge, a real-time 3D Gaussian Splatting renderer that operates seamlessly across various devices, including desktops, low-power mobile devices, and XR [1][6].

Group 1: Technology Overview
- Forge is a web-based 3D Gaussian Splatting renderer that integrates with three.js, enabling fully dynamic and programmable Gaussian splatting [2].
- The underlying design of Forge is optimized for GPU, serving a role similar to traditional 3D graphics components known as "shaders" [3].
- The technology allows developers to handle AI-generated 3D worlds as easily as manipulating triangle meshes, according to Ben Mildenhall, co-founder of World Labs [5].

Group 2: Features and Capabilities
- Forge requires minimal code to start and run, supporting multiple splat objects, cameras, and real-time animations/edits [4].
- It is designed as a programmable 3D Gaussian Splatting engine, providing unprecedented control over the generation, animation, and rendering of 3D Gaussian splats [8].
- The renderer employs a painter's algorithm for sorting splats, which is a core aspect of its design [13].

Group 3: Rendering Process
- The key component managing the rendering process is ForgeRenderer, which compiles a complete list of splats in a three.js scene and determines the drawing order using an efficient bucket sort algorithm [14].
- Forge supports multi-view rendering by generating additional ForgeViewpoint objects, allowing for simultaneous rendering from different perspectives [15].

Group 4: Future Plans
- World Labs aims to elevate multimodal AI from 2D pixel planes to full 3D worlds, with plans to launch its first product in 2025 [17].
- The company intends to develop tools beneficial for professionals such as artists, designers, developers, filmmakers, and engineers, targeting a wide range of customers from video game developers to film studios [17].
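
The summary above says Forge orders splats with a painter's algorithm and a bucket sort. As a rough illustration of that general technique, and not of Forge's actual code, the following Python sketch quantizes each splat's depth along the camera's forward axis into a fixed number of buckets and then emits indices from the farthest bucket to the nearest. The function names, the `num_buckets` parameter, and the plain-tuple data layout are all hypothetical choices made for this example.

```python
from typing import List, Sequence, Tuple

Vec3 = Tuple[float, float, float]


def view_depth(center: Vec3, cam_pos: Vec3, cam_forward: Vec3) -> float:
    """Distance of a splat center along the camera's forward axis."""
    return sum((c - p) * f for c, p, f in zip(center, cam_pos, cam_forward))


def back_to_front_order(
    centers: Sequence[Vec3],
    cam_pos: Vec3,
    cam_forward: Vec3,
    num_buckets: int = 1024,  # hypothetical default, not taken from Forge
) -> List[int]:
    """Return splat indices in painter's-algorithm order (farthest first).

    Depths are quantized into `num_buckets` bins, so the ordering is only
    approximate: splats that share a bucket keep their input order.
    """
    if not centers:
        return []

    depths = [view_depth(c, cam_pos, cam_forward) for c in centers]
    near, far = min(depths), max(depths)
    span = (far - near) or 1.0  # avoid division by zero when all depths match

    # Distribute indices into depth buckets in a single linear pass.
    buckets: List[List[int]] = [[] for _ in range(num_buckets)]
    for index, depth in enumerate(depths):
        slot = min(int((depth - near) / span * num_buckets), num_buckets - 1)
        buckets[slot].append(index)

    # Walk buckets from far to near so nearer splats are drawn last.
    order: List[int] = []
    for bucket in reversed(buckets):
        order.extend(bucket)
    return order


# Three splats on the z axis, camera at the origin looking down +z.
centers = [(0.0, 0.0, 1.0), (0.0, 0.0, 5.0), (0.0, 0.0, 3.0)]
print(back_to_front_order(centers, (0.0, 0.0, 0.0), (0.0, 0.0, 1.0)))  # [1, 2, 0]
```

A bucket sort runs in time linear in the number of splats, which is one plausible reason a renderer would prefer it over a comparison sort when the order must be recomputed as the camera moves. The trade-off is that ordering within a bucket is not resolved, so a real renderer would need enough buckets for the resulting blending error to be negligible.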