Wanda 2.0

Search documents
直击WRC:消费级机器人登场,平台级较量升温
Di Yi Cai Jing· 2025-08-09 02:52
人山人海的会场之外,机器人离我们更近了吗? 机器人离我们更近了吗?在2025世界机器人大会(WRC)现场,第一财经记者发现,丢掉遥控器的消费级机器人已经登场,各种形态的机 器人产品正在以更智能、更日常化的姿态贴近用户的生活。 不少厂商也押注平台化路线,开放开发套件、采集多模态数据、优化软硬件协同,试图在"鹿死谁手"的竞争到来之前,夯实技术底座。资 本与产业链的结合也在加速,行业正在从单点技术突破走向系统化交付,产业生态的协同迭代正进入快车道。 会场的人山人海之外,这场由算法、硬件与资本共同驱动的竞赛,正在重绘未来机器人的边界。 丢掉遥控器,消费级机器人更近了 要成为消费级机器人的第一步是什么?可能是丢掉遥控器。 在WRC现场,第一财经记者在Vbot维他动力展台体验了全场唯一一个能够全程不用遥控器操作的机器人,一只名叫"大头"的四足机器狗。 维他动力的团队把这个产品定义为"伴随机器人",并且明确其打向消费市场的定位。 1 2000 100 12 und d 1 14552 / 不用遥控器,意味着"大头"需要高度自主的环境感知能力和一定的理解和决策能力。在这些能力背后,硬件的打磨和更智能的算法是必选 项。除了可运 ...
2025WRC:众擎、优必选、乐聚等15家企业展台情况一览
机器人大讲堂· 2025-08-08 16:23
Core Viewpoint - The 2025 World Robot Conference (WRC) showcased advancements in humanoid robots, emphasizing the integration of 5G-A technology and embodied intelligence to enhance operational capabilities and adaptability in complex environments [1][3]. Group 1: Humanoid Robots - The "Kua Fu" humanoid robot by Leju demonstrated significant breakthroughs in communication and decision-making through 5G-A technology, enabling real-time control over distances of 1200 kilometers [1][3]. - The "Kua Fu" robot features a dual-brain system, enhancing human-robot interaction and stability in flexible environments, showcasing its potential in high-risk scenarios like firefighting and chemical industries [3][6]. - The "T800" humanoid robot by Zhongqing is designed for heavy-duty tasks, featuring 41 degrees of freedom and advanced sensor integration for real-time environmental adaptation [7]. Group 2: Industrial Robots - Cyborg Robotics presented the Cyborg-H01 dexterous hand and Cyborg-R01 humanoid robot, focusing on high precision and energy efficiency in industrial applications [4][6]. - The CW series robots by Chuan Robotics combine autonomous mobile platforms with collaborative robotic arms, enhancing flexibility in production environments [30]. Group 3: Perception and Sensing Technologies - Aobi Zhongguang introduced the Pulsar ME450 3D LiDAR and Gemini 345Lg dual 3D camera, enhancing visual perception capabilities for outdoor robots [12][14]. - Pasini launched the third-generation multi-dimensional tactile sensor matrix PX-6AX-GEN3, providing advanced sensing solutions for embodied intelligent systems [10][25]. Group 4: General-Purpose Robots - Zhi Ping Fang showcased the "Ai Bao" robot, capable of performing various tasks in simulated environments, highlighting its adaptability and learning capabilities [17][19]. - The "Wanda" platform by Youliqi demonstrated its application in multiple real-life scenarios, emphasizing its interactive capabilities and service-oriented design [23][24]. Group 5: Collaborative and Ecosystem Development - Xinghai Tu introduced the G-0 VLA model, which enhances human-robot collaboration and opens up new possibilities for developers through a comprehensive toolkit [20][22]. - The collaboration among various companies at WRC indicates a growing ecosystem focused on advancing embodied intelligence and robotics technology across multiple sectors [41].
腾讯研究院AI速递 20250530
腾讯研究院· 2025-05-29 15:55
Group 1: DeepSeek-R1 and AI Developments - The new version of DeepSeek-R1 has been officially open-sourced, surpassing Claude 4 Sonnet in programming capabilities and performing comparably to o4-mini (Medium) [1] - DeepSeek-R1's core advantages include deep reasoning capabilities, natural text generation, and support for long-duration thinking of 30-60 minutes, allowing for the execution of complex code in a single run [1] - Tencent has integrated multiple products with the latest DeepSeek R1 model within a day, offering users free and unlimited access to the model [3] Group 2: Keling 2.1 Launch - Keling 2.1 has been launched with a price reduction of 65%, featuring improved performance and speed, categorized into standard, high-quality, and master versions [2] - The high-quality version (35 inspiration points) matches the old master version in quality, supporting 1080P video but only for image-to-video generation [2] - The new version significantly enhances cost-effectiveness, making AI video creation more accessible for ordinary users [2] Group 3: Opera Neon Browser - Opera has introduced Opera Neon, the first "AI Agent" browser, aiming to redefine the role of browsers in the network [4] - Opera Neon consists of three main features: Neon Chat (chatting), Neon Do (executing web tasks), and Neon Make (complex creation), which can understand user intent and convert it into actions [4] - The Neon Make feature utilizes cloud technology to execute complex tasks, such as generating reports and designing game prototypes, even while the user is offline [4] Group 4: VAST's Tripo Studio Upgrade - VAST has upgraded Tripo Studio with four core functionalities: intelligent component segmentation, texture magic brush, intelligent low-poly generation, and automatic rigging for all objects [5] - Intelligent component segmentation allows for one-click disassembly, accurately identifying different parts of a model [5] - The automatic rigging feature can recognize various biomechanical characteristics and quickly allocate skeletal weights, enabling non-professionals to complete the entire 3D creation process with over a tenfold efficiency increase [5] Group 5: Odyssey's World Model - Odyssey, founded by autonomous driving experts, has launched a world model capable of real-time video generation at 40 milliseconds per frame, supporting real-time interaction [6] - This technology differs from traditional video models by learning pixel and motion data from real-life videos, using a narrow distribution model architecture to address autoregressive modeling challenges [6] - Odyssey has secured $27 million in funding, with the current preview version supported by H100 GPU clusters, outputting 30 FPS for 5-minute coherent interactive videos [6] Group 6: AI Scientist Zochi - The AI scientist Zochi's paper has been accepted by the top-tier conference ACL, marking it as the first AI system to independently pass peer review at an A* level conference [7] - Zochi's paper demonstrates a multi-round attack method with a success rate of 100% on GPT-3.5 and 97% on GPT-4 [7] - Zochi can autonomously complete the scientific research process from literature analysis to peer review, although its company has faced criticism regarding the misuse of the scientific peer review process [7] Group 7: Wanda 2.0 Robot - Youliqi has launched the Wanda 2.0 wheeled dual-arm robot, priced from 88,000 yuan, capable of autonomously completing complex long-sequence tasks [8] - Wanda 2.0 is equipped with a pre-trained multimodal large model UniTouch and a long-sequence task planning model UniCortex, learning new actions with only 5-10 demonstrations [8] - Youliqi has reduced costs by 70% through full-stack self-research, targeting the C-end and small B customer market, and has completed several hundred million yuan in financing [8] Group 8: Boston Dynamics Atlas Robot - Boston Dynamics has upgraded the Atlas robot, which now features 3D spatial perception and real-time object tracking capabilities, allowing it to perform complex industrial tasks in automotive factories [9] - The core technology includes a 2D object detection system, 3D spatial positioning based on key points, and a SuperTracker object pose tracking system, capable of handling object occlusion and positional changes [9] - The system integrates kinematic data, visual data, and force feedback to estimate poses accurately, with the team working on building a unified foundational model to enhance perception and action integration [9] Group 9: Google CEO's Perspective on AI - Google CEO Pichai believes AI represents a platform-level transformation larger than the internet, entering a phase where research is becoming reality [10] - AI is transitioning into the second stage of building usable products, with search evolving into an agent that can execute tasks on behalf of users, potentially creating Web 2.0-level killer applications [10] - The key transformation brought by AI lies in the change of interaction methods and the lowering of creative barriers, with the third stage involving the integration of AI with the physical world to form universal robotic systems [10]