Realsee Releases Spatial Foundation Model Argus 1.0, an Industry First With Support for Panoramic Images and Other Diverse Inputs
机器之心·2025-11-19 04:07

Core Viewpoint - The article covers Argus 1.0, a spatial model from Realsee that aims to recreate the real world as an interactive 3D environment, in contrast to AI-generated virtual worlds [2][4].

Group 1: Introduction of Argus 1.0
- Argus 1.0 is presented as the world's first spatial model that accepts panoramic images as input and infers spatial depth, marking a shift from virtual generation to real-world replication [2][6].
- The model takes one or more panoramic images and derives camera poses, depth maps, and point clouds at millisecond-level speed [2][6].

Group 2: Foundation of Argus 1.0
- Argus 1.0 builds on Realsee's experience in spatial digitization since the company was founded in 2017, driven by a flywheel that links digital space, algorithms, and industry applications [6][14].
- Realsee has accumulated more than 53 million sets of digital space data covering over 4.4 billion square meters worldwide, which the article calls the largest database of real spaces [7][8].

Group 3: Technical Innovations
- Argus 1.0 moves from single-view depth estimation to multi-view consistency, using a Transformer architecture trained on nearly one million sets of real high-definition spatial data [16][24].
- The model is the first in the industry to support panoramic images as input, which significantly improves the efficiency of VR content production [17][21].

Group 4: Quality and Performance
- Argus 1.0 owes its output quality to a high-precision, scale-aware, pixel-aligned database of real spaces, which lets it handle difficult scenes such as glass and mirrors [24][29].
- Inference runs at the millisecond level, making it the first real-time panoramic global reconstruction system [22][23].

Group 5: Future Directions and Industry Impact
- Argus 1.0 is a key component of Realsee's "spatial intelligence" framework, a four-layer theory that runs from digitization to intelligence [30][34].
- The company plans to release Argus 2.0 and later versions to improve real-time rendering and support advanced applications across industries [36][38].
- Realsee intends to open up a dataset of 10,000 sets of indoor housing data to encourage innovation in spatial intelligence and help close the shortage of high-quality spatial data [39][40].
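
To make the Group 1 pipeline more concrete, the sketch below shows how a panoramic depth map and a camera pose combine into a point cloud using standard equirectangular geometry. It is not Argus 1.0's implementation, which the article does not describe. The function name `panorama_depth_to_points`, the y-up axis convention, and the choice to store depth as distance along each viewing ray are all assumptions made for illustration.

```python
import math


def panorama_depth_to_points(depth, pose=None):
    """Back-project an equirectangular depth map into 3D points.

    depth: H x W nested list of distances along each viewing ray
           (assumed convention; values <= 0 are treated as invalid).
    pose:  optional (R, t) camera-to-world transform, where R is a 3x3
           nested list and t is a length-3 list (hypothetical format).
    """
    height = len(depth)
    width = len(depth[0])
    points = []
    for v, row in enumerate(depth):
        # Latitude of the pixel-row centre: +pi/2 at the top, -pi/2 at the bottom.
        lat = math.pi / 2 - (v + 0.5) / height * math.pi
        for u, d in enumerate(row):
            if d <= 0:
                continue  # skip pixels without a valid depth
            # Longitude of the pixel-column centre, in [-pi, pi).
            lon = (u + 0.5) / width * 2 * math.pi - math.pi
            # Unit ray direction (y up, z forward) scaled by the depth.
            p = (
                d * math.cos(lat) * math.sin(lon),
                d * math.sin(lat),
                d * math.cos(lat) * math.cos(lon),
            )
            if pose is not None:
                rot, trans = pose
                # Move the point from the camera frame into the world frame.
                p = tuple(
                    sum(rot[i][j] * p[j] for j in range(3)) + trans[i]
                    for i in range(3)
                )
            points.append(p)
    return points
```

Calling the function once per panorama with that panorama's estimated pose and concatenating the results gives a merged point cloud in a shared world frame, which is one simple way to picture the multi-view consistency mentioned in Group 3.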