刚刚,智元提出SOP,让VLA模型在真实世界实现可扩展的在线进化
机器之心·2026-01-06 09:38

Core Viewpoint - The article emphasizes the need for a paradigm shift in the development of general-purpose robots, advocating for continuous evolution and learning in real-world environments rather than being limited to factory settings [2][3][44]. Group 1: Challenges in Current Robotics - Current AI robots often fail to perform tasks in real-world scenarios despite being trained on vast amounts of data, highlighting the gap between understanding and execution [8][9]. - Traditional post-training methods are slow and inefficient, leading to challenges in learning new tasks without forgetting previous skills [9][10]. Group 2: Introduction of SOP Framework - The SOP (Scalable Online Post-training) framework is introduced as a revolutionary approach that integrates online, distributed, and multi-task mechanisms for robot learning [4][6]. - SOP creates a closed-loop system that allows robots to evolve continuously beyond their initial deployment, breaking the time constraints of cognitive development [6][13]. Group 3: Mechanisms of SOP - SOP enables distributed continuous learning by allowing multiple robots to operate in parallel, sharing strategies and experiences in real-time [14][19]. - The system utilizes a cloud-based architecture for rapid updates and learning, significantly enhancing the speed of evolution [19][20]. - A dynamic sampler within SOP optimizes learning by focusing on weak areas in real-time, allowing robots to quickly adapt and improve [23]. Group 4: Performance Validation - Experiments demonstrate that SOP significantly outperforms traditional single-machine or offline methods, particularly in complex tasks like folding clothes [31][34]. - The system shows remarkable robustness, allowing robots to recover from minor errors without task failure, achieving over 36 hours of continuous operation without performance degradation [34]. Group 5: Scalability and Efficiency - Increasing the number of robots in a distributed system leads to linear improvements in performance, confirming the effectiveness of scaling in real-world applications [36][38]. - SOP allows for substantial reductions in training time, achieving performance benchmarks much faster than traditional methods [37][41]. Group 6: Implications for Robotics Industry - The SOP framework signifies a shift in the lifecycle of robotic systems, where deployment is not the end but the beginning of continuous learning and improvement [43][44]. - This approach lowers the barriers for real-world deployment, enabling robots to learn and evolve through practical experience rather than waiting for perfect models [44][45].

刚刚,智元提出SOP,让VLA模型在真实世界实现可扩展的在线进化 - Reportify