Workflow
悟能具身智能平台
icon
Search documents
从“能动”到“能想”再到“有温度”,这些企业让机器人“活过来”
Hua Xia Shi Bao· 2025-07-30 05:45
人工智能企业让机器人真正"活过来"。 7月26日至29日,2025世界人工智能大会(WAIC)在上海举行。3000余项前沿成果集中亮相,上百款 形态各异的机器人同台竞技:有的挥拳格斗,有的跳街舞、打麻将,还有的成为长跑冠军、叠衣小助 手,甚至"售货员"。 大会期间,《华夏时报》记者穿梭于具身智能展区,记录下数十家产业链上下游企业如何让机器人 从"能动"到"能想",再到"有温度"。这些企业赋予机器人记忆力、物理感知力、语言能力、行动力与视 触觉,让钢铁之躯逐渐拥有鲜活的"生命感"。 赋予机器人"记忆力" 7月28日,位于H3展馆的国家地方共建人形机器人创新中心(下称"上海国地中心"),携全新系列人形 机器人矩阵、国产化核心组件、通用具身智能开发平台等8款产品亮相。 本届大会上,上海国地中心发布"青龙"全产品体系,包括青龙Pro、青龙Lite、青龙Wheel等,构建出 以"青龙矩阵"为核心的具身智能机器人生态。 《华夏时报》记者在现场看到,身高185cm、体重85kg的全尺寸人形机器人青龙Pro在展台供参展人员 观摩。工作人员介绍称,它搭载了全新一代的操控系统,集成五感感知、双臂协同、能源动力、灵巧 手、通讯交互 ...
WAIC|商汤首席科学家林达华:多模态是通向AGI的必经之路
Core Insights - The essence of artificial intelligence (AI) is to create a form of genuine intelligence that can autonomously interact with the real world, which is the ultimate goal of intelligence [1] - The rapid evolution of large models, particularly language models, is seen as a stepping stone towards achieving AGI (Artificial General Intelligence), with a necessary focus on multimodal capabilities for real-world applications [1][2] Company Developments - SenseTime has officially launched the "Riri Xin" V6.5 "Awakening" world model and the "Wuneng" embodied intelligence platform during the WAIC [1] - The company has been a pioneer in multimodal integration, demonstrating that multimodal models outperform pure language models in language tasks after effective training [2] - The latest version, "Riri Xin" 6.5, has achieved advanced performance in both pure language and text tasks, showcasing the maturity of SenseTime's technology in this area [2] Industry Trends - The rise of ChatGPT has highlighted a new era in AI technology, presenting opportunities for companies like SenseTime to leverage this wave of transformation to create significant impact [3] - The shift from AI 1.0, which focused on specialized tasks, to general AI models that are more autonomous and versatile is a key development in the industry [3] - The future of software development is expected to become more accessible, allowing non-experts to create software simply by expressing their needs, which could reshape industry dynamics [3][4] Technological Advancements - The development of multimodal models is progressing through three critical stages, with the final goal being the connection between digital and physical spaces to achieve AGI [5] - SenseTime's experience in computer vision and collaboration with hardware companies has positioned it well to enhance its embodied intelligence platform [6] - The integration of world models with multimodal training data has proven effective in training autonomous driving modules, significantly improving efficiency compared to relying solely on real-world data [6] Strategic Focus - SenseTime emphasizes aligning research and development with its commercial vision, ensuring that scientific advancements translate into business value [6] - The company prioritizes projects that can achieve commercial viability, avoiding areas that do not align with its business goals [6] - Investments in embodied intelligence and foundational models are interconnected, allowing for a more efficient allocation of resources [6]
2025世界人工智能大会这些新品最值得关注!一文看懂→
第一财经· 2025-07-29 10:35
Core Viewpoint - The article highlights the significant advancements in robotics showcased at the World Artificial Intelligence Conference (WAIC) 2025, emphasizing the shift from remote-controlled to autonomous robots, driven by new perception-action models and world models developed by various companies [3][4][5]. Group 1: Robotics Developments - Nearly all humanoid robot companies, including Zhiyuan, Yushu Technology, and Galaxy General, showcased their progress at WAIC 2025, with a focus on software advancements rather than hardware changes [4]. - Companies like Tencent and SenseTime introduced perception-action models aimed at improving robot interactions with their environments, marking a paradigm shift in robotics [4][5]. - Zhiyuan's "Genie Envisioner" world model allows robots to pre-visualize actions before execution, enhancing their operational capabilities [10][12][14]. Group 2: Major Product Releases - SenseTime launched the "Wuneng" embodied intelligence platform, enabling robots to understand and interact with their environments effectively [17][18]. - Alibaba announced the development of its first self-developed AI glasses, integrating various functionalities and aiming to enhance user experience [19]. - Tencent released the "Hunyuan 3D World Model," which simplifies 3D scene construction and allows users to generate 360-degree scenes from text or images [20][21]. Group 3: Competitive Landscape - MiniMax and Yuezhi Anmian are competing for dominance in the open-source model community, with both claiming significant achievements in their respective model rankings [8][9]. - The focus of major model companies has shifted towards professional developers rather than general consumers, indicating a strategic pivot in their market approach [8][9]. Group 4: Industry Insights - Industry leaders emphasize the importance of high-precision actuators and sensor integration for the successful deployment of robots in real-world applications [26][27]. - The distinction between world models and multimodal models is highlighted, with world models aiming for deeper environmental understanding and proactive interaction capabilities [28]. - The current investment climate in AI is robust, with a notable increase in funding and interest in AI applications, reminiscent of the mobile internet boom from 2009 to 2014 [42].
恒指收涨173点,科指连跌三日
国都港股操作导航 | 海外市场重要指数 | 收市 | 幅度 | | --- | --- | --- | | 道琼斯工业指数 | 44,837.56 | -0.14% | | 标普 500 指数 | 6,389.77 | 0.02% | | 纳斯达克综合指数 | 21,178.58 | 0.33% | | 英国富时 100 指数 | 9,081.44 | -0.43% | | 德国 DAX 指数 | 23,970.36 | -1.02% | | 日经 225 指数 | 40,689.97 | -0.75% | | 台湾加权指数 | 23,412.98 | 0.21% | | 内地股市 | | | | 上证指数 | 3,597.94 | 0.12% | | 深证成指 | 11,217.58 | 0.44% | | 香港股市 | | | | 恒生指数 | 25,562.13 | 0.68% | | 国企指数 | 9,177.15 | 0.29% | | 红筹指数 | 4,327.68 | 0.13% | | 恒生科技指数 | 5,664.02 | -0.24% | | AH 股溢价指数 | 123.44 | -0. ...
腾讯研究院AI速递 20250729
腾讯研究院· 2025-07-28 15:36
生成式AI 一、 智谱 发 布 GLM-4.5, 面向 推理、代码与智能体 的基础模型 1. GLM-4.5是专为智能体打造的开源模型,在推理、代码、智能体方面表现优异,国内实测 效果 领先 ; 2. 采用混合专家架构,提供两种模式,具有高参数效率,性能 可 达 参数量更大的竞争对 手; 3. 具备低成本(输入0.8元/百万tokens)、高速度(最高100tokens/秒)特性,支持全栈开发任 务。 https://mp.weixin.qq.com/s/Psb5TJSFszReCQ8SwnjFyA 二、 云天励飞宣布全面聚焦AI推理芯片!要支撑万亿参数大模型 1. 云天励飞全面聚焦AI推理芯片,计划至2028年将单芯片算力提升至数千TOPS,支撑万亿 参数大模型; 2. 采用创新"算力积木"架构的纯国产工艺AI芯片,已适配DeepSeek、QwQ等主流开源模型 和鸿蒙系统; 3. 端边云"三栖"布局,形成四大业务板块,重点面向边缘计算、云端大模型推理和智能机器 三大市场。 https://mp.weixin.qq.com/s/8_LKJtkNayUR_JEhtSup1Q 三、 Coze宣布开源两款核心产品: ...
商汤发布新平台布局具身智能赛道,或将成立独立公司
Nan Fang Du Shi Bao· 2025-07-28 13:12
Core Insights - SenseTime is establishing an independent embodied intelligence company, led by key figures including Chief Scientist Wang Xiaogang and Tao Dacheng, indicating a strategic shift towards this emerging sector [1][3] - The launch of the "Wuneng" embodied intelligence platform at the 2025 World Artificial Intelligence Conference (WAIC) showcases SenseTime's commitment to integrating its technology in multimodal models and computer vision into embodied intelligence [1][4] Group 1: Strategic Developments - The "Wuneng" platform is designed to provide perception, visual navigation, and multimodal interaction capabilities for hardware like robots, reflecting SenseTime's ambition to facilitate real-world interactions for various embodied intelligence enterprises [1][4] - SenseTime's organizational restructuring, termed the "1+X" strategy, aims to focus on core businesses while allowing ecosystem companies to operate independently, with the new embodied intelligence company potentially becoming another "X" business [3][4] Group 2: Technological Pathways - SenseTime's entry into embodied intelligence is seen as a necessary step towards achieving AGI (Artificial General Intelligence), emphasizing the importance of transitioning intelligence from digital to physical spaces [4][5] - The company plans to utilize a combination of internet-sourced data for pre-training multimodal models, simulation data generated through a "world model," and limited real-world operation data for model alignment, enhancing training efficiency [5]
21对话|商汤科技林达华:具身智能需数字空间与物理空间连接
Core Insights - The rise of large language models (LLMs) marks a significant leap in AI technology, but achieving Artificial General Intelligence (AGI) requires more than just text understanding and generation [2] - The development of AI is transitioning from single language models to a new stage of multimodal integration, which is essential for reaching AGI [2][3] - The future of AI lies in the fusion of multimodal information and interaction with the physical world, with a full-scale adoption of multimodal models expected by the second half of 2025 [2][3] Multimodal Development - The evolution of large models is moving towards deeper cross-modal understanding, transitioning from mere comprehension to cognitive processing [4][6] - Early multimodal architectures had limitations, but advancements like the Gemini model are integrating image and video information into pre-training processes, enhancing cross-modal modeling capabilities [6] - Effective training of multimodal models can lead to superior performance in pure language tasks compared to single language models [6] Embodied Intelligence - Embodied intelligence is viewed as one of the ultimate forms of AGI, with significant attention in 2025 [3] - The development of agents is crucial for the practical application of large model capabilities, but current agents still face challenges in complex real-world scenarios [7] - The reliability and success rate of agents in real-world applications are critical for their perceived value [7] Key Challenges - A major challenge for achieving AGI is the ability to generalize reasoning from narrow domains to complex real-life scenarios [8] - Current multimodal models exhibit insufficient spatial understanding, which is a significant barrier to the realization of embodied intelligence [8] - The data acquisition methods for embodied intelligence are limited, primarily relying on robotic operations, which results in lower data throughput compared to digital models [10]
对话商汤联创林达华:多模态是AGI的必经之路,是不可缺少的部分
Xin Lang Ke Ji· 2025-07-28 04:24
Core Insights - SenseTime launched the "Wuneng" embodied intelligence platform during the WAIC 2025, which aims to enhance the autonomy and intelligence of smart devices and robots through advanced perception, visual navigation, and multimodal interaction capabilities [1] Company Developments - The platform is built on SenseTime's embodied world model and leverages both edge and cloud computing power from its large-scale infrastructure [1] - SenseTime's co-founder and chief scientist, Lin Dahua, emphasized the importance of multimodality in achieving Artificial General Intelligence (AGI) and highlighted the company's extensive experience in computer vision and collaboration with hardware companies [1] Market Opportunities - The embodied intelligence market is rapidly growing, and SenseTime aims to capture commercial opportunities within this space, leveraging its multimodal capabilities and accumulated knowledge in world models [1] - SenseTime's investment arm, Guoxiang Capital, has invested in several companies within the embodied intelligence sector, including Galaxy General, Zhongqing Robotics, and Titanium Tiger Robotics [1] - Recent funding rounds in the sector include Galaxy General securing 1.1 billion yuan from CATL and Zhongqing Robotics completing a financing round close to 1 billion yuan [1]
我国大模型数量居全球首位;AI投资联盟正式成立;清华大学开发出新型EUV光刻胶
Guan Cha Zhe Wang· 2025-07-28 01:45
Group 1: AI Industry Developments - China leads the world with 1,509 large models out of a total of 3,755 globally, showcasing rapid iteration in foundational AI models across various industries such as electronics and consumer goods [1] - The "AI Investment Alliance" was officially established during the 2025 World Artificial Intelligence Conference, aiming to create a collaborative platform for investors and integrate resources for high-quality development in China's AI industry [2] - Keling AI announced over 45 million global users and has generated more than 2 billion videos and 400 million images since its product launch, indicating significant user engagement and content creation [2] Group 2: Technological Innovations - Tencent released and open-sourced the "Hunyuan 3D World Model 1.0," along with various other models, enhancing capabilities in 3D modeling and multi-modal understanding [3] - SenseTime launched the "Wuneng" embodied intelligence platform, which utilizes its world model to provide advanced perception and interaction capabilities for robots and smart devices [4] - Tesla's third-generation robot is set to enter the Chinese consumer market by 2025, with plans for mass production of 1 million units within five years [4] Group 3: Automotive Industry Insights - In June 2025, China's automotive market reached a global share of 36%, a 4 percentage point increase from the previous year, with total sales of 1,565 million vehicles in the first half of the year [6] - Shanghai plans to deploy 500 data-collecting ride-hailing vehicles in 2024, aiming to gather over 10 million data clips to support the development of high-level autonomous driving [6]
8点1氪:少林寺住持释永信涉嫌刑事犯罪;北京大学将全面取消绩点;警方调查“上千万元金饰被洪水冲走”
36氪· 2025-07-27 23:58
Group 1 - The abbot of Shaolin Temple, Shi Yongxin, is under investigation for criminal activities, including misappropriation of project funds and maintaining improper relationships with multiple women [2][3] - Shi Yongxin's social media account has been inactive since July 24, 2023, after posting 4,005 updates over 2,683 days, with a follower count of 878,000 [2][3] Group 2 - Peking University announced the cancellation of GPA for students starting from the 2025 cohort, allowing grades to be recorded in percentage or letter format instead [3] - The change aims to encourage students to explore interdisciplinary studies and take on challenging courses [3] Group 3 - A severe flood in Shaanxi province led to the loss of nearly 20 kilograms of gold and silver jewelry from a local jewelry store, with an estimated value exceeding 10 million yuan [3] - The police are involved in the investigation, and some items have been recovered [3] Group 4 - Several universities, including Huazhong Normal University, have announced an extension of the master's program duration to three years, citing the need for improved training quality [7] - The extension is expected to enhance the overall educational experience and increase the employability of graduates [7] Group 5 - The outbreak of Chikungunya fever in Foshan, Guangdong, has resulted in over 4,000 confirmed cases, with the majority being mild [8] - The World Health Organization has warned of the virus's global spread, affecting 119 countries [8] Group 6 - Amazon founder Jeff Bezos has sold a significant amount of Amazon stock, cashing out approximately 57 billion USD, while still retaining a substantial shareholding [11] - The stock sale was part of a pre-established trading plan, and Amazon's stock has seen a 38% increase since April [11] Group 7 - NASA is facing a significant budget cut, leading to an expected reduction of approximately 3,870 employees, which is about 20% of its workforce [15] - The budget for NASA's scientific projects is set to be reduced by 47%, impacting several key missions [15] Group 8 - Volkswagen reported a 33% drop in operating profit for the first half of 2025, primarily due to increased import tariffs in the U.S., resulting in a loss of 1.3 billion euros [16] - The company has adjusted its annual revenue forecast, expecting no growth compared to the previous year [16] Group 9 - The summer box office in China has surpassed 5 billion yuan, with a total of 129 million admissions recorded [12] - This indicates a strong recovery in the film industry following previous downturns [12] Group 10 - The electric bicycle industry in China has seen a steady increase in the number of registered companies, with over 1 million currently operating [22][23] - A new national standard for electric bicycles will be implemented on September 1, 2025, aimed at improving safety [22][23]