OptiTrack Optical Motion Capture Technology
A Conversation with Liu Yaodong: Paving a Data "Highway" for Embodied Intelligence
机器人大讲堂 · 2026-01-23 09:04
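As the wave of embodied intelligence sweeps the globe, it is more than a technical concept: it is a profound industrial transformation. Companies play different roles in this transformation. Some are pioneers focused on developing and manufacturing the robot body itself, such as Wang Xingxing of Unitree Robotics (宇树科技). Others are "bridge builders" who work quietly at the foundation of the industry to solve the core data bottleneck, such as Liu Yaodong, vice president of Leyard Group (利亚德集团) and chairman and CEO of 虚拟动点. In this interview, we speak with Liu Yaodong about how his team uses spatial computing to build a key engine that connects the digital and the physical and accelerates the real-world deployment of embodied intelligence.

▍ From two-dimensional depiction to three-dimensional perception: data is the key to crossing the gap

In Liu Yaodong's view, the core logic of the internet over the past thirty years has been to digitize and flatten the physical world so that it fits on a two-dimensional screen. The next wave, represented by embodied intelligence, requires digital systems to genuinely perceive, understand, and interact with the real world in three dimensions. Behind this lies a fundamental shift from "describing the world" to "acting on the world."

"When technology leaps from two dimensions to three, the real challenge often lies not in compute or model parameters themselves, but at a more basic and easily overlooked level: data," Liu Yaodong points out. The disconnect between industrial simulation and real scenes, and the last hundred meters that separate a robot's "lab demonstration" from "stable real-world application," often come down to a lack of high-quality, multimodal real-world data that obeys the laws of physics. "If embodied intelligence is a long expedition, high-quality data is the 'provisions' that sustain the expedition ...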
Can Motion Capture Equipment Become the Next Blue Ocean for Embodied Large Models?
机器人大讲堂 · 2025-08-21 10:11
Group 1: Development of Embodied Intelligence
- The concept of embodied intelligence dates back to the 1950s, with Turing laying the groundwork for its potential development [1]
- Significant theoretical support was provided by researchers like Rodney Brooks and Rolf Pfeifer in the 1980s and 1990s, marking the early exploration and theoretical development phase [1]
- The early 2000s saw the integration of interdisciplinary methods and technologies, leading to a more complete academic branch of embodied intelligence [1]
- The rapid advancement of deep learning technology in the mid-2010s injected new momentum into the field, leading to increased industrial application since 2020 [1]

Group 2: Large Models and Their Evolution
- Large models refer to machine learning models with vast parameter counts, widely applied in NLP, computer vision, and multimodal fields [2]
- The development of large models can be traced back to early AI research focused on logical reasoning and expert systems, which were limited by hard-coded knowledge [2]
- The introduction of the Transformer model by Google in 2017 significantly enhanced sequence modeling capabilities, leading to the mainstream adoption of pre-trained language models [2]
- The emergence of ChatGPT in late 2022 propelled advancements in the NLP field, with GPT-4 introducing multimodal capabilities in March 2023 [2]

Group 3: Embodied Large Models
- Embodied large models evolved from non-embodied large models, initially focusing on single-modal language models before expanding to multimodal inputs and outputs [4]
- Google's RT series exemplifies embodied large models, with RT-1 integrating vision, language, and robotic actions for the first time in 2022, and RT-2 enhancing multimodal fusion and generalization capabilities in 2023 [4]
- The future of embodied large models is expected to move towards more general applications, driven by foundational models like RFM-1 [4]

Group 4: Data as a Core Barrier
- The competition between real data and synthetic data is crucial for embodied robots, which often face challenges such as data scarcity and high collection costs [15]
- The scale of embodied robot datasets is significantly smaller compared to text and image datasets, with only 2.4 million data points available [15]
- Various organizations are expected to release high-quality embodied intelligence datasets in 2024, such as AgiBotWorld and Open X-Embodiment [15]

Group 5: Motion Capture Systems
- Motion capture technology records and analyzes real-world actions, evolving from manual keyframe drawing to modern high-precision methods [23]
- The motion capture system consists of hardware (sensors, cameras) and software (data processing modules), generating three-dimensional motion data [23]
- Different types of motion capture systems include mechanical, acoustic, electromagnetic, inertial, and optical systems, each with its own advantages and limitations [25]

Group 6: Key Companies in Motion Capture Industry
- Beijing Duliang Technology specializes in optical 3D motion capture systems, offering high-resolution and high-precision solutions [28]
- Lingyun Technology is a professional supplier of configurable vision systems, providing optical motion capture systems with real-time tracking capabilities [29]
- Aofei Entertainment focuses on motion capture solutions through investments in companies like Nuoyiteng (Noitom), which offers high-precision products based on MEMS inertial sensors [30]
- Liyade (Leyard) is a leading company in audiovisual technology, utilizing optical motion capture technology for various applications [31]
- Zhouming Technology has developed a non-wearable human posture motion capture system that leverages computer vision and AI [32]
- Xindong Lianke focuses on high-performance MEMS inertial sensors, expanding its business into motion capture hardware for robots [33]