Workflow
EVAC模型
icon
Search documents
智元主办AGIBOT WORLD CHALLENGE @ICRA2026,两大赛道服务器正式开启
IPO早知道· 2026-03-03 10:36
Core Insights - The AGIBOT WORLD CHALLENGE @ICRA2026, hosted by Zhiyuan Robotics, features a total prize pool of $530,000, highlighting the significance of the event in the robotics field [3][9]. Group 1: Reasoning to Action Track - This track aims to evaluate the reasoning and action execution capabilities of models through both online simulations and real-world tasks [5]. - Participants will train models using the AGIBOT WORLD open dataset to solve complex tasks, focusing on bridging the Sim2Real gap [6]. - The competition includes various scenarios such as logistics, industrial, supermarket, dining, and home environments, with multiple tasks of varying difficulty [6]. - Zhiyuan has curated high-quality data for the competition, with each task containing hundreds of complete operation trajectories, available on hugging face and modelscope [6]. - A baseline model, ACoT-VLA, will be provided to assist participants in mastering the training, testing, and submission processes [6]. - The Genie Sim 3.0 platform, a first-of-its-kind large language model-driven open-source simulation platform, will be used for comprehensive model evaluation [6]. Group 2: World Model Track - The World Model track focuses on the core ability of embodied world models to accurately model the dynamics of physical environments based on robot actions [8]. - Participants will train video generation models using the AGIBOT WORLD dataset to generate interaction videos in ten real-world operational scenarios [8]. - The dataset consists of over 30,000 real trajectories covering diverse robot-environment interactions, including actions like grasping, placing, pushing, and pulling [9]. - The competition will utilize the EVAC model, an open-source embodied world model driven by robot actions, as the baseline model [9]. - Evaluation will be conducted using the EWMBench benchmark, assessing image quality, scene consistency, and trajectory adherence to provide reliable performance feedback [9].