Helping Robots "See" the 3D World Clearly: Ant Lingbo Open-Sources a Spatial Perception Model
Core Insights
- Ant Group's Lingbo Technology has open-sourced LingBot-Depth, a high-precision spatial perception model that improves depth perception and 3D spatial understanding for robots and autonomous vehicles [1]

Group 1: Model Performance
- In benchmark evaluations, LingBot-Depth shows a generational lead: compared with mainstream models such as PromptDA and PriorDA, it reduces relative error (REL) in indoor scenes by more than 70% and reduces RMSE by about 47% on the challenging sparse SfM task [1]
- The model handles transparent and reflective objects well. These are common in household and industrial environments and are a known weak point of traditional depth cameras [1][2]

Group 2: Technology and Innovation
- Lingbo Technology's "Masked Depth Modeling" (MDM) technique lets the model infer and fill in missing depth data by drawing on texture, contours, and contextual information from the RGB image, producing clearer and more complete 3D depth maps [2]
- LingBot-Depth has been certified by the Orbbec Depth Vision Laboratory as reaching industry-leading levels of accuracy, stability, and adaptability to complex scenes [2]

Group 3: Data and Collaboration
- The model's performance is attributed to a large dataset: about 10 million raw samples were collected and 2 million high-value paired depth samples were used for training. The data will soon be open-sourced to help the community tackle complex spatial perception problems [3]
- Lingbo Technology and Orbbec have agreed in principle on a strategic partnership to launch a new generation of depth cameras based on LingBot-Depth's capabilities [3]
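The REL and RMSE figures cited above are standard depth-estimation error metrics. The source does not spell out the evaluation protocol, so the sketch below assumes the common definitions: REL as the mean absolute relative error and RMSE as the root-mean-square error, both computed only over pixels with valid ground truth. The function names `depth_errors` and `relative_reduction` and the sample values are hypothetical and are not taken from LingBot-Depth.

```python
import math


def depth_errors(pred, gt):
    """Return (REL, RMSE) over pixels whose ground-truth depth is valid (> 0).

    Assumed definitions (not confirmed by the source):
      REL  = mean(|pred - gt| / gt)
      RMSE = sqrt(mean((pred - gt) ** 2))
    """
    # Pixels with gt <= 0 are treated as missing depth and skipped.
    pairs = [(p, g) for p, g in zip(pred, gt) if g > 0]
    if not pairs:
        raise ValueError("no valid ground-truth depth values")
    rel = sum(abs(p - g) / g for p, g in pairs) / len(pairs)
    rmse = math.sqrt(sum((p - g) ** 2 for p, g in pairs) / len(pairs))
    return rel, rmse


def relative_reduction(baseline, improved):
    """Fractional error reduction of `improved` relative to `baseline`."""
    return (baseline - improved) / baseline


# Made-up depths in meters, flattened to 1D; the last pixel has no ground truth.
gt = [1.0, 2.0, 4.0, 0.0]
pred = [1.1, 1.8, 4.4, 3.0]
rel, rmse = depth_errors(pred, gt)
print(f"REL={rel:.3f} RMSE={rmse:.3f}")
```

Under these definitions, a statement such as "REL reduced by over 70%" corresponds to `relative_reduction(baseline_rel, model_rel) > 0.7`, where both errors are measured on the same benchmark.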
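LingBot-Depth's strong performance stems from a large volume of real-world scene data. Lingbo Technology collected about 10 million raw samples and distilled them into 2 million high-value paired depth samples for training, which underpins the model's ability to generalize in extreme environments. This core data asset (including 2M real-world depth samples and 1M simulated samples) will be open-sourced soon to help the community tackle spatial perception in complex scenes more quickly.

Spatial intelligence has seen a significant open-source development. On January 27, Lingbo Technology, the embodied-intelligence company under Ant Group, announced that it is open-sourcing the high-precision spatial perception model LingBot-Depth.

The model builds on chip-level raw data from Orbbec's Gemini 330 series stereo 3D cameras. It focuses on improving environmental depth perception and 3D spatial understanding, with the aim of giving robots, autonomous vehicles, and other intelligent devices more accurate and more reliable 3D vision. It marks an important step on a key industry problem: "seeing" the 3D world clearly. It is also the first major result in foundational embodied-intelligence technology that Ant Lingbo Technology has announced in the six months since its debut at the 2025 INCLUSION Conference on the Bund.

On established benchmarks such as NYUv2 and ETH3D, LingBot-Depth shows a generational lead: compared with the widely used PromptDA and PriorDA, it reduces relative error (REL) in indoor scenes by more than 70% and reduces RMSE by about 47% on the challenging sparse SfM task.

In ...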