机器人大讲堂

Search documents
6大基准全面碾压!TW-GRPO刷新视频推理天花板,CLEVRER准确率突破50.4%!
机器人大讲堂· 2025-07-06 05:23
Core Viewpoint - The rapid development of multi-modal large language models (MLLMs) is significantly enhancing video reasoning capabilities, driven by reinforcement learning (RL) as a key engine for this technological revolution [1] Group 1: TW-GRPO Framework Introduction - The TW-GRPO framework is proposed to address challenges in reasoning quality and reward granularity in video reasoning tasks, inspired by the traditional GRPO framework [2] - TW-GRPO integrates focused thinking and multi-level soft reward mechanisms for multi-choice QA tasks [3] Group 2: Key Improvements in TW-GRPO - The framework enhances information weighting and reward mechanism design, applying a soft reward mechanism from video localization to video reasoning tasks [4] - A dynamic weighting mechanism prioritizes high information density tokens, improving reasoning accuracy and efficiency by focusing on key content [4] - The multi-level reward mechanism redefines rewards, allowing for partial correctness in answers, thus improving training stability and efficiency [5] Group 3: Data Augmentation and Training Efficiency - TW-GRPO introduces a question-answer inversion (QAI) data augmentation technique to convert single-choice tasks into multi-choice formats, effectively expanding the training data pool [6] - This approach disrupts traditional equal treatment of tokens, enhancing training efficiency and reasoning performance through differentiated information processing [6] Group 4: Experimental Validation - Extensive experiments demonstrate TW-GRPO's effectiveness in video reasoning and general understanding tasks, outperforming Video-R1 by 18.8%, 1.8%, and 1.6% in various benchmarks [12][15] - The framework shows faster convergence and more stable learning processes compared to traditional GRPO, with shorter output sequences indicating more efficient reasoning [11][17] Group 5: Qualitative Analysis of Reasoning Paths - A qualitative comparison of reasoning paths between T-GRPO and TW-GRPO illustrates significant improvements in accuracy and efficiency in dynamic visual cue reasoning tasks [22]
东大造出"活体皮肤"机器人手指!内置循环系统,7天不会干,还能自我修复
机器人大讲堂· 2025-07-06 05:23
东京大学的实验室里,一根覆盖着 淡黄色 皮肤的机器人手指正在缓缓弯曲。这不是硅胶,不是乳胶,而是 真正的活体人类皮肤组织 。更神奇的是,这层皮肤内部还有一套 "血管系统" 在源源不断地输送营养液。 7天 过去了,暴露在空气中的皮肤 依然保持着湿润和活性 。要知道,此前所有的活体皮肤机器人实验中,皮 肤组织在空气中几个小时就会干燥死亡。 这是不久前 发表在《 Advanced Intelligent Systems》 期刊 上的研究 。研究 团队通过模仿人体的血液循 环系统,为机器人皮肤设计了一套双层渗透性皮下支撑结构, 成功解决了活体组织机器人最大的技术难题 ——如何让皮肤在空气中长时间存活 。 "就像人体皮肤通过血管获得营养一样,我们的机器人皮肤也能通过内部的循环系统持续获得水分和营养。"研 究团队在论文中写道。这 或许 意味着,科幻电影中那些拥有真实触感的机器人,正在一步步成为现实。 ▍ 双层渗透性皮下支撑:机器人皮肤的 "血管系统" 研究团队的核心创新在于设计了一种 双层渗透性皮下支撑结构 。这个结构由两层组成:一层是密集穿孔的 3D打印骨架层,另一层是海绵状的聚乙烯醇(PVA)水凝胶层。 先说 骨架层 ...
早鸟64480元半价购!限量100台!强调美国制造的人形机器人终究还是用了中国零部件!
机器人大讲堂· 2025-07-05 04:09
Core Viewpoint - K-Scale Labs has launched the K-Bot, a low-cost open-source bipedal humanoid robot, emphasizing American manufacturing and aiming to democratize humanoid robotics through an open-source platform [1][23][36]. Group 1: Product Overview - The K-Bot is priced at $16,000 (approximately 114,644 RMB), with an early bird price of $8,999 (approximately 64,480 RMB) for the first 100 pre-orders, with shipments expected by November 2025 [1]. - K-Bot stands 1.4 meters tall, weighs 34 kg, and has a maximum payload of 10 kg, with a battery life of up to 4 hours [11]. - The robot features dual system architecture, utilizing reinforcement learning for movement control and a visual-language action strategy for advanced task execution [13]. Group 2: Company Background - K-Scale Labs, founded in early 2024 and headquartered in New York, has developed six humanoid robot models within a year, starting from a small garage-sized workspace [2][14]. - The company has received multiple rounds of funding, including $500,000 from Y Combinator and $4 million in seed funding, achieving a valuation of $50 million [14][16]. Group 3: Technical Features - The K-Bot is designed with open-source software and hardware, allowing users to access design documents and code [4]. - The upcoming product, Stompy, aims to lower the barrier to entry for robotics, featuring a fully open architecture that can be assembled using 3D-printed parts [6][8]. Group 4: Team Expertise - The team comprises engineers and robotics experts with experience from companies like Tesla and Meta AI, enhancing their capability in hardware and algorithm integration [18][22]. - The CTO, Paweł Budzianowski, has a PhD from Cambridge and has designed the core algorithm architecture for K-Bot, contributing to its advanced capabilities [21]. Group 5: Manufacturing and Supply Chain - K-Scale Labs emphasizes American manufacturing, although it sources components from Chinese suppliers, which has raised questions about the authenticity of its "Made in America" claim [23][27][33]. - The company collaborates with various Chinese firms for components, including Suzhou Maita Intelligent and Beijing Lingzu Times, to optimize costs and ensure production flexibility [22][33].
2025第三届全球手术机器人大会定档9月,汇聚全球智慧,共绘医疗科技革新蓝图
机器人大讲堂· 2025-07-05 04:09
Core Insights - The medical robotics industry has entered a complex phase by 2025, with evolving product forms, clearer clinical pathways, and a more cautious capital environment [1] - The Third Global Surgical Robotics Conference focuses on the systematic upgrade of intelligent surgical systems, the construction of a full-chain ecosystem, and global pathways [1][2] Event Details - The conference will take place on September 5-6, 2025, at the Beijing Zhongguancun National Independent Innovation Demonstration Zone Exhibition and Trading Center [4] - The agenda includes a keynote session, a visit to a robot leasing hospital, and a gala dinner for marketing leaders [5][6] Key Topics - The conference will cover various topics, including the core architecture trends of next-generation surgical robots, AI integration, commercialization, hospital system implementation, global strategies, and supply chain innovations [7][8][9][11] - Discussions will also address challenges in integrating surgical robots with hospital systems, the construction of a global R&D system, and the commercialization pathways for surgical robots [12] Industry Impact - The event serves as a platform for hospitals, governments, and investors to redefine surgical environments and the future of healthcare [14] - For innovative companies, participation signifies recognition as industry leaders and an opportunity to share insights on the redefinition of medical systems through technology [17]
UC Berkeley池宇峰: 采用3D打印技术制造 人形机器人成本不超35000元!
机器人大讲堂· 2025-07-05 04:09
Group 1 - The main factors limiting the large-scale deployment of humanoid robots are their limited generalization capabilities and high manufacturing costs, with mainstream humanoid robots priced around 500,000 yuan, and some high-end models reaching over 1 million yuan [1] - High development costs, closed-source design architectures, and limited customization capabilities are major bottlenecks for rapid development in this field [2] - The Berkeley Humanoid Lite, developed by a team from the University of California, Berkeley, is a low-cost open-source humanoid robot that utilizes desktop 3D printing for manufacturing, with a BOM cost under $5,000, approximately 35,000 yuan [2][8] Group 2 - Current humanoid robot platforms are divided into commercial products and research prototypes, with commercial products like Agility Robotics' Digit and Fourier Intelligence's GR1 being too expensive for most research institutions and individuals [3] - Open-source hardware and software have significant potential in driving technological innovation and collaboration, allowing researchers to learn from each other and accelerate the iterative process [5] - The design of Berkeley Humanoid Lite emphasizes modularity and ease of manufacturing, with all structural components printable on standard desktop 3D printers, significantly reducing manufacturing complexity [9] Group 3 - Berkeley Humanoid Lite is designed as a medium-sized humanoid robot platform, standing at 0.8 meters tall and weighing 16 kilograms, featuring joint actuators and an integrated IMU for position sensing [10] - The robot's joint actuators utilize a 3D-printed cycloidal gearbox design, enhancing load capacity and lifespan through optimized gear parameters and manufacturing processes [12] - The robot's structure is built using aluminum extrusions for high rigidity and lightweight characteristics, with real-time monitoring of posture and motion state through the integrated IMU [14] Group 4 - The electronic system of Berkeley Humanoid Lite consists of a control computer, joint actuator controllers, an IMU, and a power management module, ensuring stable power supply and precise control [16] - Performance tests showed that the joint actuators maintained a mechanical efficiency of around 90%, indicating effective energy conversion and reduced energy loss [17] - The robot demonstrated excellent walking capabilities across various terrains, maintaining balance and adapting to slopes and steps, showcasing its dynamic balance control [20] Group 5 - Remote operation experiments validated the robot's precision and responsiveness in performing tasks like writing and object manipulation, indicating its suitability for various practical applications [21][24] - Berkeley Humanoid Lite represents an open-source, customizable, and cost-effective humanoid robot platform, significantly lowering development costs and simplifying manufacturing processes [25] - Future optimizations will focus on enhancing system stability and adaptability, further exploring the platform's potential application value [25]
Mech. Mach. Theory发表!南京航空航天大学团队打造新一代踝关节康复“机器人助手”,性能更强更安全!
机器人大讲堂· 2025-07-05 04:09
Core Viewpoint - The article discusses the development and advantages of a new type of ankle rehabilitation robot that utilizes redundant drive mechanisms to enhance rehabilitation outcomes and address limitations of existing devices [1][2]. Group 1: Ankle Rehabilitation Robot Overview - The ankle joint is crucial for weight-bearing but is prone to injuries, making rehabilitation essential for recovery [1]. - Traditional rehabilitation methods are lengthy and inconsistent, leading to the exploration of robotic assistance for more effective and continuous treatment [1]. - Current ankle rehabilitation robots are categorized into platform-type and wearable-type, with platform-type robots being the primary choice for functional rehabilitation due to their ability to perform complex movements [1]. Group 2: Challenges in Existing Designs - Existing designs of ankle rehabilitation robots often have complex structures that are difficult to manufacture and assemble, impacting their effectiveness [1][2]. - Many traditional robots do not adequately consider the stretching movements necessary for rehabilitation, which can limit their effectiveness [1][2]. Group 3: New Robot Design and Features - The new ankle rehabilitation mechanism (PARM-N) and its redundant drive form (PARM-R) are designed to accurately simulate three core movements required for rehabilitation: dorsiflexion/plantarflexion, inversion/eversion, and axial traction [2]. - The design simplifies the structure while enhancing performance, reducing manufacturing costs and assembly complexity [3][5]. Group 4: Performance Analysis - The kinematic analysis of the mechanism is crucial for performance evaluation and optimization, with the study employing methods like the Newton-Raphson method for solving position equations [6][10]. - The redundant drive mechanism avoids singular configurations that can hinder rehabilitation effectiveness, while the non-redundant configuration has multiple singular forms [10][19]. Group 5: Stiffness and Optimization - Stiffness performance is analyzed using virtual spring methods, with results showing that the redundant drive mechanism exhibits better stiffness characteristics in specific ranges compared to the non-redundant design [22][29]. - A multi-objective size optimization approach is applied to enhance the overall performance of the mechanism, resulting in significant improvements in both motion/force transmission and stiffness metrics [28][29]. Group 6: Research Publication - The findings of this research have been published in the journal "Mechanism and Machine Theory," highlighting the contributions of the team from Nanjing University of Aeronautics and Astronautics [18].
软体机器人新突破!用剪纸原理造出的充气爬行机器人可穿越各种复杂地形
机器人大讲堂· 2025-07-04 02:26
Core Insights - The article discusses a groundbreaking soft robotic crawler developed by a research team at the University of Southern Denmark, which utilizes the principles of Kirigami art for its design [1][3]. Design Innovation - The core innovation lies in integrating Kirigami metamaterials into the design of inflatable soft actuators, allowing for coordinated local deformation when inflated [5][7]. - At a pressure of 100 kPa, the actuator can achieve a uniform contraction of up to 32%, nearly double that of traditional inflatable structures without cuts [7][9]. Motion Mechanism - The research team analyzed the deformation mechanism through experiments and finite element simulations, revealing a nonlinear pressure-contraction relationship that allows for precise control [9]. - The actuator's surface features, resembling scales, create anisotropic friction properties essential for snake-like movement, with forward friction coefficients significantly lower than backward ones [9]. Modular Design - A dual-channel Kirigami actuator was developed, enabling complex motion control by independently inflating two parallel airways [11]. - By connecting multiple dual-channel modules, a three-module crawling robot was created, capable of various movement patterns such as linear crawling and serpentine motion [11][13]. Real-World Performance - The robot was tested in real-world scenarios, demonstrating remarkable adaptability across different terrains, including asphalt, narrow passages, and slopes [15][17]. - Friction tests showed that the inflatable Kirigami actuator exhibited directional friction anisotropy, ensuring effective forward movement during inflation cycles [15][16]. Future Potential - The design requires external air sources and control systems, but the compactness of the micro-pumps and low-power solenoids lays the groundwork for future autonomous designs [19]. - The research highlights the potential applications of this soft robotic technology in fields such as search and rescue, pipeline inspection, and environmental monitoring [16][19].
快讯|Meta官宣研发AI聊天机器人;谷歌推出全新AI工具套件;宝马自动充电机器人完成概念验证
机器人大讲堂· 2025-07-04 02:26
1、 Meta官宣研发AI聊天机器人Project Omni 近日,Meta正式官宣正在研发"Project Omni"项目,致力打造一批能主动与用户互动的AI聊天机器人,为 用户提供更多情绪价值,提升使用黏性与互动频率。据悉,这些AI机器人可在用户未主动开启对话时,依 据历史聊天记录主动发起交流。比如用户曾询问音乐见解,AI后续可能主动询问是否需要音乐推荐,若用 户未回复则不再继续发消息。另据Business Insider爆料,该项目上线尚需时日。其AI模型训练数据来自A lignerr,机器人支持多样化人设,能生成不同风格对话,还可接入第三方,充当提醒消息类机器人使用。 2、 谷歌推出全新AI工具套件Gemini for Education 近日,谷歌正式发布面向全球教育领域的全新AI工具套件Gemini for Education,并推出免费版本,助力 高效学习与教学。据悉,该产品专为教育场景定制,集成Gemini 1.5 Pro大模型,有文本生成、理解及编 程辅助等强大能力。教师可借此快速准备材料、设计考题,学生能获得个性化学习建议与作业辅导。它分 免费版和付费版(Gemini Education ...
从抓薯片到玩手机,这款传感器凭什么让灵巧手通吃全场景抓取?
机器人大讲堂· 2025-07-04 02:26
Core Viewpoint - The article emphasizes the evolution of robots from "programmed execution" to "environmental adaptability" in the context of Industry 4.0 and AI integration, highlighting the importance of tactile sensors for enhancing robotic dexterity and interaction capabilities [1]. Group 1: Advantages of Multi-modal Tactile Sensors - Traditional tactile sensors are limited by issues such as single modality, hysteresis, poor non-linearity, low dynamic response speed, inconsistent batch quality, and insufficient durability, which do not meet the market's demand for high-performance sensors [2]. - The multi-modal tactile sensor developed by Suzhou Hangkai Microelectronics utilizes advanced algorithms for three-dimensional force perception, instantaneous detection of tangential components, and tactile fusion, overcoming the limitations of traditional sensors and providing precise force analysis for complex control [2]. Group 2: Features of Multi-modal Tactile Sensors - High precision measurement is achieved with a minimum discernible force of 0.01N within a 20N range, allowing for accurate control of gripping force when handling delicate or fragile items [3]. - The sensor's multi-modal perception capability includes the ability to sense normal pressure and tangential force, as well as temperature sensing upon contact with objects, providing comprehensive tactile feedback for better environmental understanding [4]. - The sensor features rapid response capabilities, with a maximum output frequency of 300Hz and precision better than 2%, ensuring real-time and accurate force measurement in dynamic applications [5]. - The compact design of the sensor facilitates integration into various devices and systems, accommodating different application scenarios and spatial constraints [6]. - The sensor demonstrates high stability and reliability, maintaining performance after over 1 million presses at 10N, ensuring long-term operational stability for robotic dexterity [7]. Group 3: Application and Patent - The integration of the multi-modal tactile sensor with adaptive grasping algorithms significantly enhances the performance of robotic hands, enabling them to automatically adjust grip strength and posture for various objects [8]. - The technology has been patented, showcasing its capabilities in three-dimensional tactile perception, tangential force detection, and impact detection, making it suitable for industrial grasping and precision operations [9].
1ms,埃斯顿ERI实时控制解决方案,打造机器人高端应用壁垒!
机器人大讲堂· 2025-07-04 02:26
Core Viewpoint - The article highlights Estun's advancements in high-end robotics, particularly through the introduction of the ERI (Estun Robot Real-Time Interface), which signifies a leap in the capabilities of domestic robots, enabling them to meet complex application demands and improve performance in various intelligent scenarios [1][4][12]. Group 1: High-End Robotics Development - In recent years, high-end robotics in research has predominantly relied on foreign brands due to their advanced external control interfaces, which allow for real-time adjustments and flexible path planning [2][3]. - Domestic robots often struggle with real-time performance, leading to lost orders in applications requiring high precision and responsiveness, such as welding and coating [2][3]. Group 2: ERI Solution Overview - ERI is designed to address the limitations of domestic robots by providing a real-time, universal external control interface, enabling users to guide robot actions and output status information for process control [4][5]. - The ERI solution supports high-precision control and simplifies system integration, reducing development time and costs associated with adapting to different sensors [5][7]. Group 3: Advantages of ERI - ERI offers robust secondary development capabilities with a rich SDK, supporting multiple programming languages and real-time control features, which cater to complex application needs [7][9]. - The solution enables real-time external control for various applications, including academic research and advanced manufacturing processes, enhancing the flexibility and efficiency of robotic systems [7][9]. Group 4: Practical Applications of ERI - ERI facilitates high-precision data flow and real-time path correction, making it suitable for demanding production environments that require strict control over precision and speed [9][11]. - The interface allows for the integration of multiple external sensors, enhancing the robot's ability to perform complex automated processes, such as online welding and precise coating control [11][12]. Group 5: Future Prospects - Estun is actively expanding its market presence internationally, targeting regions such as Europe, America, the Middle East, and Southeast Asia, with a focus on developing high-end, differentiated products [12].