EMMA

Search documents
具身领域LLM结合强化学习与世界模型工作汇总
具身智能之心· 2025-07-30 00:02
Core Insights - The article discusses recent advancements in embodied intelligence, particularly focusing on the integration of large language models (LLMs) with reinforcement learning and world models for various applications in artificial intelligence [2][3]. Group 1: UniSim and Real-World Simulators - UniSim aims to learn general real-world interactive simulators through generative modeling, revealing that diverse natural datasets can enhance the learning of realistic simulations [3]. - The research demonstrates that high-level visual language strategies and low-level reinforcement learning strategies can be trained in a simulated environment and applied directly to real-world scenarios without additional training [3]. Group 2: Causal World Models - The study from Google DeepMind asserts that robust agents must learn causal models to generalize across varying distributions, providing a clear answer to a long-standing question in the field [5]. Group 3: MAMBA Framework - MAMBA introduces an efficient world model approach for meta-reinforcement learning, achieving up to 15 times improvement in sample efficiency while performing well in high-dimensional tasks [8]. Group 4: EMMA and Multimodal Agents - EMMA leverages LLMs trained in text-based worlds to guide visual world training, resulting in a significant performance boost of 20%-70% in task success rates compared to existing visual language models [10]. Group 5: Text2Reward Framework - The Text2Reward framework allows for the automatic generation and optimization of dense reward functions using LLMs, achieving over 94% success rates in new motion behaviors and enhancing strategy performance through human feedback [13][14]. Group 6: Online Continual Learning - The proposed online continual learning frameworks (Behavior-IL and Environment-IL) enable agents to learn continuously in real-world settings without relying on task boundary information, significantly outperforming existing methods [17][18]. Group 7: AMAGO Framework - AMAGO addresses challenges in generalization and long-term memory in reinforcement learning, demonstrating superior scalability and performance in complex tasks [21]. Group 8: PDDL and Planning with LLMs - The research presents a novel paradigm for task planning using pre-trained LLMs, effectively integrating human feedback and reducing the need for extensive manual corrections in planning tasks [22][23].
寻找下一个泡泡玛特 东莞石排镇掀起潮玩淘金热
Zheng Quan Shi Bao· 2025-07-29 22:15
因为LABUBU在海外销售火爆,东莞石排镇被推到了聚光灯下。不少海外和国内的商人们顺藤摸瓜地 找到了这里,让这个在东莞存在感并不太强的小镇,引来全球的目光。 事实上,石排镇拥有国内顶尖的玩具制造产业链,泡泡玛特的发展离不开石排镇众多工厂的支撑。随 着"潮玩"成为热门词汇,商人们追逐着潮玩带来的机会,正在重塑东莞的玩具产业。 原创IP聚集地 陈守丽是一位资深销售员,为了把握潮玩产业发展机会,去年她所在的公司将办公场地从广州搬到东莞 市石排镇,她连家也跟着搬了过来。 在石排镇"中国潮玩之都 潮玩中心"的一间写字楼里,陈守丽介绍,公司之所以搬迁到这里,是因为这 里产业集中,方便客户前来考察。以前广州也是对外展示窗口,交通等配套条件也很好,但石排镇的潮 玩中心建好后,更有集群效应。 "我们2019年就推出自己的潮玩品牌,是最早做自有IP的公司之一。"陈守丽介绍道。她拿起一个玩偶, 这是"EMMA"——公司自主设计的品牌。店里大大小小的EMMA形态各异:有的坐在地上,脚朝前; 有的戴着硕大且样式繁复的帽子;有的站着,穿着各式好看的衣服。无论什么样式,它们的眼睛的形状 和表情大致相同。 陈守丽所在的公司为东莞衍创文化发展 ...
寻找下一个泡泡玛特东莞石排镇掀起潮玩淘金热
Zheng Quan Shi Bao· 2025-07-29 18:31
Core Insights - The surge in overseas sales of LABUBU has spotlighted Dongguan's Shipaizhen, attracting both domestic and international businesses to the area [1] - Shipaizhen boasts a top-tier toy manufacturing industry chain in China, which supports the growth of brands like Pop Mart [1] - The rise of "trendy toys" is reshaping Dongguan's toy industry, with businesses pursuing opportunities in this sector [1] Industry Overview - Shipaizhen is recognized as the "Trendy Toy Capital of China," with over 400 toy and trendy toy production enterprises [11] - The total output value of the trendy toy industry cluster in Shipaizhen is projected to reach 13.218 billion yuan in 2024, reflecting a year-on-year growth of 12.3% [11] - The town accounts for approximately 30% of Dongguan's trendy toy industry output value, making it the largest in the city [11] Company Developments - Dongguan Yanchuang Cultural Development Co., Ltd. was established in 2024 and has developed its own trendy toy brand, EMMA, which has gained popularity [2][3] - The company transitioned from producing complex figurines to focusing on trendy toys due to market demand and operational challenges with previous partners [3] - Dapiaoqiang Toy Co., Ltd. has capitalized on the popularity of plush toys, achieving significant sales growth due to the demand for alternatives to high-priced limited-edition products from LABUBU [4][5] Market Dynamics - The trend of live-streaming sales has significantly influenced the growth of companies like Dapiaoqiang, which has amassed a large following and expanded its product offerings [5][6] - The competitive landscape is shifting as traditional toy manufacturers face pressure to adapt to the higher profit margins associated with trendy toys [9][10] - The entry of non-industry players into the trendy toy market has led to increased competition, with many struggling to sell their products [10] Technological and Operational Advancements - Companies like Wenbo Craft have upgraded their production capabilities by introducing advanced machinery and establishing a professional product laboratory [7][8] - The local industry benefits from a well-established supply chain, allowing for quick assembly and production of trendy toys [6] Consumer Behavior - Trendy toys differ from traditional toys in that they often rely on aesthetic appeal rather than established character narratives, leading to a stronger emotional connection with consumers [10] - The lower production costs associated with plush and rubber toys have made them more accessible for manufacturers, although this has also intensified competition [10]
Waymo's EMMA: Teaching Cars to Think - Jyh Jing Hwang, Waymo
AI Engineer· 2025-07-26 17:00
Autonomous Driving History and Challenges - Autonomous driving research started in the 1980s with simple neural networks and evolved to end-to-end driving models by 2020 [2] - Scaling autonomous driving presents challenges, requiring solutions for long-tail events and rare scenarios [5][7] - Foundation models, like Gemini, show promise in generalizing to rare driving events and providing appropriate responses [8][9][10][11] Emma: A Multimodal Large Language Model for Autonomous Driving - The company is exploring Emma, a driving system leveraging Gemini, which uses routing text and camera input to predict future waypoints [11][12][13][14] - Emma is self-supervised, camera-only, and high-dimension map-free, achieving state-of-the-art quality on the nuScenes benchmark [15][16][17] - Channel reasoning is incorporated into Emma, allowing the model to explain its driving decisions and improve performance on a 100k dataset [17] Evaluation and Validation - Evaluation is crucial for the success of autonomous driving models, including open loop evaluation, simulations, and real-world testing [25] - Generative models are being explored for sensor simulation to evaluate the planner under various conditions like rain and different times of day [26][27][28] Future Directions - The company aims to improve generalization and scale autonomous driving by leveraging foundation models [30] - Training on larger datasets improves the quality of the planner [19][20] - The company is exploring training on various tasks, such as 3D detection and rograph estimation, to create a more generalizable model [21][22][23][24]
自动驾驶端到端VLA落地,算法如何设计?
自动驾驶之心· 2025-06-22 14:09
Core Insights - The article discusses the rapid advancements in end-to-end autonomous driving, particularly focusing on Vision-Language-Action (VLA) models and their applications in the industry [2][3]. Group 1: VLA Model Developments - The introduction of AutoVLA, a new VLA model that integrates reasoning and action generation for end-to-end autonomous driving, shows promising results in semantic reasoning and trajectory planning [3][4]. - ReCogDrive, another VLA model, addresses performance issues in rare and long-tail scenarios by utilizing a three-stage training framework that combines visual language models with diffusion planners [7][9]. - Impromptu VLA introduces a dataset aimed at improving VLA models' performance in unstructured extreme conditions, demonstrating significant performance improvements in established benchmarks [14][24]. Group 2: Experimental Results - AutoVLA achieved competitive performance metrics in various scenarios, with the best-of-N method reaching a PDMS score of 92.12, indicating its effectiveness in planning and execution [5]. - ReCogDrive set a new state-of-the-art PDMS score of 89.6 on the NAVSIM benchmark, showcasing its robustness and safety in driving trajectories [9][10]. - The OpenDriveVLA model demonstrated superior results in open-loop trajectory planning and driving-related question-answering tasks, outperforming previous methods on the nuScenes dataset [28][32]. Group 3: Industry Trends - The article highlights a trend among major automotive manufacturers, such as Li Auto, Xiaomi, and XPeng, to invest heavily in VLA model research and development, indicating a competitive landscape in autonomous driving technology [2][3]. - The integration of large language models (LLMs) with VLA frameworks is becoming a focal point for enhancing decision-making capabilities in autonomous vehicles, as seen in models like ORION and VLM-RL [33][39].