Generalist发现具身智能的Scaling Law,还让模型能同时思考与行动
3 6 Ke·2025-11-21 01:52

Core Insights - Generalist, a company founded by Pete Florence, has released a new embodied foundation model called GEN-0, which can scale predictably with the growth of physical interaction data [1][4] - The company aims to create universal robots, focusing initially on the dexterity of robots [4][5] Company Overview - Generalist was co-founded by Pete Florence, Andrew Barry, and Andy Zeng, with a team that includes experts from OpenAI, Waymo, and Boston Dynamics [4] - Early investors include Spark Capital, NVIDIA, and Bezos Expeditions, although the investment amounts remain undisclosed [3] Model Features - GEN-0 is based on high-fidelity raw physical interaction data and employs a multi-modal training approach [5] - A key feature of GEN-0 is "Harmonic Reasoning," allowing the model to think and act simultaneously, which is crucial for real-world applications [6][7] Scaling and Performance - The model exhibits a "phase transition" point in its intelligence capacity, indicating that larger models are necessary to absorb complex sensory-motor data [8][10] - Models with 1 billion parameters struggle to absorb diverse data, while those with 6 billion parameters show strong multi-task capabilities [10][11] - Models with over 7 billion parameters can internalize large-scale pre-training data and quickly adapt to downstream tasks [12] Scaling Law - GEN-0 demonstrates a clear Scaling Law, where increased pre-training data and computational resources lead to predictable improvements in downstream performance [15] - The company has developed a predictive formula to determine the optimal data allocation for specific tasks [15][16] Data Quality and Diversity - The training dataset for GEN-0 consists of 270,000 hours of real-world manipulation trajectories collected from diverse environments, significantly larger than existing datasets [16][18] - The quality and diversity of data are more critical than sheer volume, allowing for the creation of models with different characteristics [18] Industry Context - The field of embodied intelligence is still in its early stages, with various companies exploring foundational models [19] - Despite the presence of numerous top-tier companies, the technology landscape remains fragmented, and commercial applications are limited [19][20] Future Prospects - The advancements in Scaling Law and model capabilities suggest a promising future for the commercialization of embodied intelligence [20] - Chinese entrepreneurs have a competitive advantage in this field due to a mature hardware supply chain and rich data sources [21]