Workflow
3D重建
icon
Search documents
李飞飞空间智能独角兽开源底层技术!AI生成3D世界在所有设备流畅运行空间智能的“着色器”来了
量子位· 2025-06-03 04:26
Core Viewpoint - World Labs, co-founded by Fei-Fei Li, has open-sourced a core technology called Forge, a real-time 3D Gaussian Splatting renderer that operates seamlessly across various devices, including desktops, low-power mobile devices, and XR [1][6]. Group 1: Technology Overview - Forge is a web-based 3D Gaussian Splatting renderer that integrates with three.js, enabling fully dynamic and programmable Gaussian splatting [2]. - The underlying design of Forge is optimized for GPU, serving a role similar to traditional 3D graphics components known as "shaders" [3]. - The technology allows developers to handle AI-generated 3D worlds as easily as manipulating triangle meshes, according to Ben Mildenhall, co-founder of World Labs [5]. Group 2: Features and Capabilities - Forge requires minimal code to start and run, supporting multiple splat objects, cameras, and real-time animations/edits [4]. - It is designed as a programmable 3D Gaussian Splatting engine, providing unprecedented control over the generation, animation, and rendering of 3D Gaussian splats [8]. - The renderer employs a painter's algorithm for sorting splats, which is a core aspect of its design [13]. Group 3: Rendering Process - The key component managing the rendering process is ForgeRenderer, which compiles a complete list of splats in a three.js scene and determines the drawing order using an efficient bucket sort algorithm [14]. - Forge supports multi-view rendering by generating additional ForgeViewpoint objects, allowing for simultaneous rendering from different perspectives [15]. Group 4: Future Plans - World Labs aims to elevate multimodal AI from 2D pixel planes to full 3D worlds, with plans to launch its first product in 2025 [17]. - The company intends to develop tools beneficial for professionals such as artists, designers, developers, filmmakers, and engineers, targeting a wide range of customers from video game developers to film studios [17].
美图公司AI视觉领域竞争力升级:七项图像编辑成果出炉
Zheng Quan Ri Bao· 2025-04-09 08:40
Core Insights - Meitu's MT Lab has achieved significant recognition with five research outcomes selected for the prestigious CVPR 2025 conference, which received over 13,000 submissions and has a low acceptance rate of 22.1% [2] - The lab also had two projects accepted at the AAAI 2025 conference, which had an acceptance rate of 23.4% from 12,957 submissions [2] - The seven research outcomes focus on image editing, including three generative AI technologies, three segmentation technologies, and one 3D reconstruction technology [2] Generative AI Technologies - GlyphMastero has been implemented in Meitu's app Meitu Xiuxiu, providing users with a seamless text modification experience [3] - MTADiffusion is integrated into Meitu's AI material generator WHEE, allowing for efficient image editing with simple commands [3] - StyO is utilized in Meitu Xiuxiu's AI creative and beauty camera features, enabling users to explore different dimensions easily [4] Segmentation and 3D Reconstruction Technologies - The segmentation breakthroughs include interactive segmentation and cutout technologies, which are applied in e-commerce design, image editing, and portrait beautification [4] - EVPGS represents advancements in 3D reconstruction, with increasing demand in new perspective generation, augmented reality (AR), 3D content generation, and virtual digital humans [4] Industry Position and Future Potential - Meitu's long-term investment in AI capabilities has allowed the company to integrate cutting-edge technologies into practical applications, enhancing its competitive edge in the core visual field [4] - The continuous iteration of product capabilities has led to increased user engagement and willingness to pay, indicating promising growth potential and expansion opportunities for the company [4]