Outperforming π0 and π0.5 Across the Board! Lumo-1, an End-to-End Whole-Body VLA Model
自动驾驶之心 (Heart of Autonomous Driving) · 2025-12-12 03:02

Core Insights
- The article discusses the advancements in robotics, particularly focusing on the Lumo-1 model developed by Stardust Intelligence, which aims to enhance robots' reasoning and action capabilities, allowing them to perform complex tasks without explicit programming [9][11][12].

Group 1: Lumo-1 Model Overview
- Lumo-1 is an end-to-end VLA model designed to enable robots to understand and execute tasks through reasoning, rather than just mimicking actions [9].
- The model demonstrates superior operational intelligence and generalization capabilities, outperforming previous models like π0 and π0.5 in multi-step tasks and handling unseen objects and instructions [11][13].

Group 2: Training Phases
- The training of Lumo-1 consists of three stages:
  1. Embodied VLM pre-training on visual-language data to develop spatial understanding and trajectory inference [17].
  2. Cross-domain joint training to enhance instruction following and spatial reasoning [18].
  3. Real-world reasoning-action training using the Astribot S1 robot to learn executable action patterns [18][20].

Group 3: Technical Innovations
- Lumo-1 employs a Spatial Action Tokenizer (SAT) to model action spaces, allowing for the combination and reuse of actions in a structured manner [21].
- The model integrates structured reasoning to form a chain of explanations for actions, enabling it to understand the "why" behind tasks before executing the "how" [25].

Group 4: Performance and Validation
- Lumo-1 has shown significant improvements in various multimodal benchmarks, outperforming specialized models like RoboBrain-7B and Robix-7B [31].
- The model's ability to adapt to different environments and instructions demonstrates its robust generalization capabilities, such as adjusting arm positions for varying container heights [31].

Group 5: Implications for the Industry
- The findings suggest that data diversity in training is more impactful for generalization than merely increasing data volume, indicating a shift in focus towards data quality [30].
- The advancements in Lumo-1 highlight the potential for robots to perform complex tasks autonomously, which could revolutionize industries reliant on automation and robotics [9][11].
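
The summary does not describe how the Spatial Action Tokenizer mentioned under Group 3 works internally. As a rough illustration of the general idea of turning continuous robot motion into discrete, reusable tokens, the sketch below uniformly bins small end-effector displacements. It is not Lumo-1's actual method: the bin count, the displacement limit, and all function names are hypothetical choices made for this example.

```python
from typing import List, Sequence

# Hypothetical settings, chosen only for illustration (not from the article).
BINS = 16      # number of discrete bins per axis
LIMIT = 0.05   # largest displacement per step, in metres


def encode_delta(delta: Sequence[float]) -> List[int]:
    """Map each axis of a displacement to a bin index in [0, BINS - 1]."""
    tokens = []
    for value in delta:
        # Clip so out-of-range motions land in the first or last bin.
        clipped = max(-LIMIT, min(LIMIT, value))
        # Rescale [-LIMIT, LIMIT] to [0, BINS - 1] and round to the nearest bin.
        tokens.append(round((clipped + LIMIT) / (2 * LIMIT) * (BINS - 1)))
    return tokens


def decode_tokens(tokens: Sequence[int]) -> List[float]:
    """Invert encode_delta, returning the value each bin index represents."""
    return [index / (BINS - 1) * 2 * LIMIT - LIMIT for index in tokens]


def encode_trajectory(deltas: Sequence[Sequence[float]]) -> List[int]:
    """Flatten a sequence of per-step displacements into one token stream."""
    return [token for delta in deltas for token in encode_delta(delta)]
```

Under these assumptions, a trajectory becomes a flat sequence of integers that a language-model-style decoder could predict one token at a time, and decoding recovers each displacement to within half a bin width.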