Workflow
机器人学习
icon
Search documents
影响市场重大事件:时隔10年,A股两融余额重回2万亿;全国一体化算力网算力池化、算网安全相关技术文件公开征求意见
Mei Ri Jing Ji Xin Wen· 2025-08-07 00:05
Group 1 - The National Data Standardization Technical Committee has publicly solicited opinions on two technical documents related to the National Integrated Computing Network, marking the transition from planning to implementation [1] - The Ministry of Industry and Information Technology expressed willingness to collaborate with APEC member economies to promote digital and AI innovation applications [2] - The A-share market's margin trading balance has reached 2 trillion yuan, the highest since July 2015, indicating increased trading activity [3] Group 2 - The Ministry of Transport, Ministry of Finance, and Ministry of Natural Resources have issued a new rural road enhancement action plan, focusing on innovative financing models and encouraging participation from financial institutions [4][9] - The National Development and Reform Commission has introduced a management method for central budget investment in training bases, emphasizing support for emerging fields with talent shortages and traditional industries with strong employment absorption [5] - The China Photovoltaic Industry Association is collecting opinions on the draft amendment to the Price Law, aiming to reflect the demands of the photovoltaic industry [6] Group 3 - Heilongjiang Province has implemented 20 policy measures to support the high-quality development of the high-end intelligent agricultural machinery industry [7] - Shanghai's financial regulatory authorities have introduced measures to promote the development of commercial health insurance, including tax deductions and optimized financing [8]
10%训练数据超越100%表现,机器人学习领域迎来重要突破
机器之心· 2025-06-11 03:54
Core Viewpoint - The ViSA-Flow framework represents a revolutionary approach to robot skill learning, significantly enhancing learning efficiency in data-scarce situations by extracting semantic action flows from large-scale human videos [4][36]. Group 1: Research Background and Challenges - Traditional robot imitation learning methods require extensive, meticulously curated datasets, which are costly to collect, creating a bottleneck for developing robots capable of diverse real-world tasks [7]. - Humans exhibit remarkable abilities to learn new skills through observation, focusing on semantically relevant components while filtering out irrelevant background information [8]. Group 2: Key Innovations - The core innovation of the ViSA-Flow framework is the introduction of Semantic Action Flow as an intermediate representation, capturing the essential spatiotemporal features of operator-object interactions, unaffected by surface visual differences [11]. - Key components of the framework include: 1. Semantic entity localization using pre-trained visual language models to describe and locate operators and task-related objects [11]. 2. Hand-object interaction tracking to maintain stable segmentation across frames [12]. 3. Flow-conditioned feature encoding to generate rich feature vectors while preserving visual context [13]. Group 3: Experimental Evaluation - In the CALVIN benchmark tests, ViSA-Flow outperformed all baseline methods using only 10% of annotated robot trajectories (1,768), achieving a success rate of 31.4% in completing five consecutive tasks, nearly double that of the next best method [19]. - The average sequence length of 2.96 further demonstrates ViSA-Flow's effectiveness in handling long-duration operational tasks [20]. Group 4: Ablation Studies - Ablation studies indicate that removing semantic entity localization significantly reduces performance, while omitting the time tracking phase decreases the average success length [26]. - The full ViSA-Flow model achieved a success rate of 89.0% in task completion, showcasing its robustness [21]. Group 5: Real-World Experiments - Real-world evaluations of ViSA-Flow included single-stage and long-duration operational tasks, demonstrating its ability to maintain performance across varying task complexities [23][30]. - The model's focus on operator and task-related objects allows for smooth transitions in spatial support as scenes change [31]. Group 6: Technical Advantages and Limitations - Advantages include data efficiency, cross-domain generalization, long-duration stability, and semantic consistency in task execution [40]. - Limitations involve the absence of explicit 3D geometric modeling, reliance on pre-trained components, and potential challenges in tasks requiring precise physical interactions [40]. Group 7: Future Directions - Future developments may include integrating physical modeling, reducing reliance on pre-trained components, combining with reinforcement learning algorithms, and expanding pre-training datasets [40]. Group 8: Significance and Outlook - ViSA-Flow represents a significant breakthrough in robot learning, proving the feasibility of extracting semantic representations from large-scale human videos for skill acquisition [36]. - The framework bridges the gap between human demonstration observation and robot execution, paving the way for more intelligent and efficient robotic learning systems [37].
马斯克:Optimus人形机器人2027年将在火星表面行走;阿里云发布通义灵码AI IDE,可调用3000多款工具丨AIGC日报
创业邦· 2025-05-31 00:57
Group 1 - Elon Musk announced that the Optimus humanoid robot will walk on the surface of Mars by 2027, with SpaceX planning to launch the robot aboard a Starship by the end of next year [1] - Alibaba Cloud released its first AI-native development environment tool, Tongyi Lingma AI IDE, which supports over 3,000 tools and has seen over 15 million plugin downloads, with numerous enterprises already integrating it [1] - Figure's CEO Brett Adcock revealed a major restructuring, merging three independent teams into the AI team "Helix" to accelerate robot learning and market expansion [1] Group 2 - The U.S. Department of Energy announced a partnership with NVIDIA and Dell to develop a next-generation flagship supercomputer, expected to be operational by 2026, named after Nobel Prize-winning biochemist Jennifer Doudna [1]