具身多模态大模型
Search documents
每周十大股权投资:银河通用吞下3亿美元热钱,复杂世界交互进入倒计时;这家公司狂吸谷歌英伟达,程序员高薪饭碗还能端稳几年?
3 6 Ke· 2025-12-22 10:46
1. 银河通用机器人获3亿美元B+轮融资 银河通用机器人 公司完成B+轮融资,融资金额高达3亿美元,本次融资由中移创新产业基金(中国移 动)、中金资本、苏创投等知名机构联合投资,融资用于推进"具身多模态大模型"与通用机器人本体的 深度融合。其产品核心不是单一功能机器人,而是一个能理解物理世界、具备通用智能的机器人平台。 这笔巨额融资将用于把大规模AI模型的能力"注入"机器人身体,解决机器人适应复杂开放世界的根本难 题,目标是造出真正实用的通用型智能体。 银河通用机器人(北京)是一家专注于具身多模态大模型通用机器人研发的创新企业。公司致力于为全 球用户提供智能机器人产品,其技术涵盖人形机器人、工业机器人等多个前沿领域,是当前人工智能与 机器人交叉领域的明星公司。 5. 栈略数据完成C+轮融资,深耕保险科技 栈略数据 公司完成C+轮融资,投资方为香港创科创投基金,投后估值约36亿人民币。栈略数据为完善 健康险智能风控与理赔自动化平台。其核心产品是一个能自动识别医疗票据欺诈、过度医疗等风险的 2. Momenta获Grab战略投资,加速全球化 Momenta 自动驾驶头部公司获得战略投资,投资方为东南亚出行巨头Gr ...
星源智T5域控制器亮相百度大会 赋能智元精灵G2开启机器人新纪元
Zheng Quan Ri Bao Wang· 2025-11-13 06:11
Core Insights - Baidu World Conference 2025 showcased the T5 domain controller developed by Beijing Xingyuan Intelligent Robot Technology Co., Ltd, highlighting its advanced capabilities in robotics [1] Company Overview - Xingyuan Intelligent focuses on multi-modal spatial intelligence and aims to create a universal embodied brain for the physical world [1] - The company was incubated by the Beijing Academy of Artificial Intelligence and possesses leading capabilities in embodied multi-modal large models and spatial intelligence [1] Product Details - The T5 controller features high computational power of 2070 TFLOPS, low power consumption, and high performance, supporting advanced algorithms like deep learning and computer vision [1] - The T5 is equipped with NVIDIA's latest Jetson Thor processor, enhancing its ability to meet the demands of real-time perception, intelligent decision-making, and precise control in robotics [1] Collaboration and Industry Impact - A deep collaboration has been established between Zhiyuan Robotics and Xingyuan Intelligent, showcasing the new generation industrial interactive embodied robot, Zhiyuan Spirit G2, at the conference [1] - The exhibition demonstrated the potential industry transformation brought by the technological breakthroughs represented by the T5 controller [1]
从近1000篇工作中,看具身智能的技术发展路线!
具身智能之心· 2025-09-05 00:45
Core Insights - The article discusses the evolution and challenges of embodied intelligence, emphasizing the need for a comprehensive understanding of its development, issues faced, and future directions [3][4]. Group 1: Robotic Manipulation - The survey on robotic manipulation highlights the transition from mechanical programming to embodied intelligence, focusing on the evolution from simple grippers to dexterous multi-fingered hands [5][6]. - Key challenges in dexterous manipulation include data collection methods such as simulation, human demonstration, and teleoperation, as well as skill learning frameworks like imitation learning and reinforcement learning [5][6]. Group 2: Navigation and Manipulation - The discussion on robotic navigation emphasizes the importance of physics simulators in addressing high costs and data scarcity in real-world training, with a focus on the Sim-to-Real transfer challenges [9][15]. - The evolution of navigation techniques is outlined, transitioning from explicit memory to implicit memory, and the role of various simulators in narrowing the Sim-to-Real gap is analyzed [15][16]. Group 3: Multimodal Large Models - The exploration of embodied multimodal large models (EMLMs) reveals their potential to bridge perception, cognition, and action gaps, driven by advancements in large model technologies [17][19]. - Challenges identified include cross-modal alignment difficulties, high computational resource demands, and weak domain generalization [19]. Group 4: Teleoperation and Data Collection - The survey on teleoperation of humanoid robots discusses the integration of human cognition with robotic capabilities, particularly in hazardous environments, while addressing challenges such as high degrees of freedom and communication limitations [29][30]. - Key components of teleoperation systems include human state measurement, motion retargeting, and multimodal feedback mechanisms [30][33]. Group 5: Vision-Language-Action Models - The analysis of Vision-Language-Action (VLA) models covers their evolution from cross-modal learning architectures to the integration of visual language models and action planners [33][36]. - The article identifies core challenges in real-time control, multimodal action representation, and system scalability, while proposing future directions for adaptive AI and cross-entity generalization [36][41].
申万宏源银河通用投资项目突破融资新纪录
申万宏源证券上海北京西路营业部· 2025-07-09 02:45
Group 1 - The core viewpoint of the article highlights the successful financing of Beijing Galaxy General Robot Co., Ltd., which raised 1.1 billion RMB, setting records in the field of embodied large model robots [1] - The financing round was led by CATL and Puxuan Capital, attracting major domestic state-owned investment platforms, strategic and industrial investors, and internationally renowned investment institutions [1] - Since its establishment in May 2023, Galaxy General has accumulated over 2.4 billion RMB in financing, receiving high recognition from market-oriented investment institutions, industrial capital, research institution funds, and state-owned investment platforms [1] Group 2 - Galaxy General focuses on the research and innovation of embodied multimodal large model general robots [1] - The company launched the world's first humanoid robot smart pharmacy solution in March 2024, achieving full automation of drug inventory, replenishment, delivery, and packaging processes, with 100 store orders already received [1] - In the industrial sector, Galaxy General has collaborated with internationally renowned automotive companies to execute tasks such as sunroof glass handling and real-time anomaly processing, all based on visual guidance without relying on QR codes [1][2]