SENSETIME(00020)
SenseTime's Xu Li: Combining world models with embodied AI will open the next stage of AI development
Jing Ji Guan Cha Wang· 2025-07-29 11:25
Core Insights
- SenseTime has launched the "Wuneng" embodied intelligence platform, which draws on technologies such as Ark Vision perception and large models to enhance the capabilities of robots and smart devices [1]
- SenseTime CEO Xu Li stated that the integration of world models and embodied AI will usher in the next phase of AI development, moving AI from "tools" to "humans" and accelerating progress towards AGI [1]
- SenseTime aims for the "Wuneng" platform to help embodied intelligence enterprises achieve their goal of interacting with the real world [1]

Technology and Capabilities
- The "Wuneng" platform uses SenseTime's "KAIWU" world model as its core engine and is supported by edge and cloud computing capabilities [1]
- The platform provides perception, visual navigation, and multimodal interaction abilities for robots and smart devices [1]

Industry Impact
- The launch of the "Wuneng" platform signals a potential shift in the AI landscape towards more sophisticated, human-like interaction [1]
- SenseTime's advances may position the company as a leader in the embodied AI sector and influence the development of related technologies and applications [1]
SenseTime's Lin Dahua: Embodied intelligence requires connecting digital space and physical space
Core Insights
- The rise of large language models (LLMs) marks a significant leap in AI technology, but achieving Artificial General Intelligence (AGI) requires more than text understanding and generation [1]
- The future of AI development lies in integrating multimodal information and interacting with the physical world, and the shift towards multimodal models is expected to accelerate [1][2]
- Realizing AGI requires long-term technological accumulation and iterative scenario development, overcoming key bottlenecks such as spatial perception and data scarcity [2][8]

Multimodal Development
- Large models are evolving from single-language models to native multimodal architectures, which integrate various types of information during pre-training [4][5]
- Current multimodal models need to extend from understanding to thinking, incorporating both logical and visual thinking processes [4][5]
- Domestic companies are expected to adopt multimodal models comprehensively by the second half of 2025, moving away from standalone language models [5]

Challenges in Achieving AGI
- Key challenges include generalizing reasoning capabilities from narrow domains to complex real-life scenarios, as well as the current limits of spatial perception in multimodal models [2][7]
- Agents, seen as crucial for AI's real-world application, still show significant gaps in understanding complex conditions and specific industry needs [6][7]
- The ability of agents to solve problems effectively in real scenarios is essential to their perceived value and reliability [6]

Bottlenecks in Embodied Intelligence
- Embodied intelligence must bridge the gap between digital and physical spaces, yet current data acquisition relies heavily on limited robotic operations [8]
- The data throughput available to embodied intelligence is significantly lower than that available from the internet, which hampers effective development [8]
- Advancing embodied intelligence requires leveraging prior knowledge and multimodal data from the internet, because real-world data alone is insufficient [8]
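Research Report Highlights | Zhongtai Securities initiates coverage of SenseTime with an "Overweight" rating, saying its large model capabilities are in the first tier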
Ge Long Hui· 2025-07-29 11:17
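The report says SenseTime draws on its advantages in the underlying technology of generative AI and relies on four major industry solutions to build differentiated competitive barriers. The SenseCore AI infrastructure (大装置), a high-efficiency, low-cost, scalable new type of AI infrastructure, supports a full-chain, batch process from data labeling and algorithm design through model training and deployment, and its computing power is now scheduled uniformly across a nationwide network.

Zhongtai Securities published a report initiating coverage of SenseTime with an "Overweight" rating. The firm says SenseTime has deep accumulated foundational technology and that its large model capabilities are in the first tier and are scarce. At the same time, the company's loss-making business lines are being spun off and financed one after another, and its core generative AI business is growing well.

The firm forecasts total revenue of RMB 4.872 billion, RMB 6.279 billion, and RMB 8.093 billion for 2025, 2026, and 2027 respectively, each representing year-on-year growth of 29%. ...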
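With the launch of the "Wuneng" embodied intelligence platform, SenseTime lets robots interact with the real world the way people do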
Ge Long Hui· 2025-07-29 10:58
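SenseTime's embodied world model can also generate multi-view video with good temporal and spatial consistency. This lets machines understand, generate, and edit the real world and interact with it at the spatial level, making it possible to "play Need for Speed in a real street scene."

The embodied world model can also construct a 4D real world oriented to people, objects, and scenes. Users only need to enter simple prompts, such as "look for something on the shelf in the kitchen area" or "enter the recreation room, turn right, then open the door to the yard," and the model autonomously generates poses, action skeletons, and instructions.

Xu Li said, "SenseTime hopes the Wuneng embodied intelligence platform can help all kinds of embodied intelligence companies realize their dream of interacting with the real world."

The Wuneng embodied intelligence platform can empower robots and other terminal hardware, giving them the ability to perceive and understand the things around them. It also supports embedding in edge-side chips, which gives it strong adaptability across scenarios.

On site, SenseTime Chairman and CEO Xu Li demonstrated a humanoid robot equipped with the embodied world engine giving a lively presentation of a PPT on "长安的荔枝" (Lychees of Chang'an). Its language was natural and humorous; it turned pages automatically, answered a wide range of questions, and gave periodic summaries.

On July 27, at the "大爱无疆·模塑未来" WAIC 2025 Large Model Forum, SenseTime officially launched the Wuneng embodied intelligence platform.

According to TMT星球 (TMT Planet), the Wuneng embodied intelligence platform uses SenseTime's embodied world model as its core engine and relies on SenseTime's AI infrastructure (大装置) to provide edge-side ...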
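These new products at the 2025 World Artificial Intelligence Conference deserve the most attention! Everything you need to know in one article →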
Di Yi Cai Jing· 2025-07-29 10:47
Core Insights
- WAIC 2025 highlighted the prominence of robotics, with the field's focus shifting from hardware to software advances [3][4]
- Companies such as Zhiyuan, Tencent, and SenseTime showcased perception-action models and world models that enhance robot autonomy and interaction with the environment [3][5]
- Large-model companies such as MiniMax and Moonshot AI have recently released models that compete with DeepSeek, pointing to an increasingly competitive AI model sector [5][8]

Robotics Developments
- Almost all humanoid robot companies took part in WAIC 2025, showing limited hardware changes but significant software advances [3][4]
- Zhiyuan introduced the "Genie Envisioner" world model, which lets robots predict and plan actions before executing them, marking a shift from passive to active operation [9][11]
- SenseTime launched the "Wuneng" embodied intelligence platform, which allows robots to understand and interact with their environment [13]

AI Model Innovations
- MiniMax and Moonshot AI are competing for dominance in the open-source model community, with MiniMax's M1 model ranking second and Moonshot AI's K2 model taking the top spot in different rankings [8]
- Tencent released the "Hunyuan 3D World Model," which simplifies 3D scene construction and enables user interaction [15][16]
- Step 3, a new multimodal reasoning model from Jieyue Star, is designed to optimize performance on domestic chips, improving the cost-effectiveness of AI applications [17]

Industry Insights
- The robotics industry is expected to see significant commercialization within the next two years, with companies such as Yushun targeting specific market segments [21]
- Competition among AI models is shifting towards professional developers rather than general consumers, indicating a strategic focus on specialized applications [8][20]
- AI investment in China has rebounded, with a 45.3% increase in funding and a 59.9% rise in investment events compared to the previous year [34]
Embodied intelligence platform lets robots see, move, and interact
Xin Hua She· 2025-07-29 10:36
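The 2025 World Artificial Intelligence Conference exhibition runs through July 29. Humanoid robots, one of the typical application forms of embodied intelligence, are a market hot spot. At the SenseTime booth, a robot presenter walks visitors through a PPT, answers questions in real time, and controls its body movements fluidly to match the presentation. Behind it is the "Wuneng" embodied intelligence platform, which SenseTime released for the first time during this conference. The platform helps robots see, move, and interact, and gives robot makers a robot "training ground" oriented to the real world of people, objects, and scenes.

Reporter: Zhang Mengjie. Produced by the Xinhua News Agency Audio and Video Department ...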
The 27th High-Tech Fair will be held in Shenzhen in November, with intended investment exceeding 1 billion yuan
Mei Ri Jing Ji Xin Wen· 2025-07-29 09:57
Core Viewpoint
- The China International High-Tech Achievements Fair (referred to as the High-Tech Fair), known as "China's First Technology Exhibition," will be held from November 14 to 16, 2025, in Shenzhen, showcasing advances in high-tech industries and facilitating global technology transactions [1].

Group 1: Event Overview
- The High-Tech Fair has been held 26 times since its inception in 1999 and has become a significant platform for China's high-tech sector and innovation [1].
- The 27th High-Tech Fair will follow a market-oriented exhibition concept, with a planned exhibition area of 400,000 square meters, and will invite renowned technology companies from over 100 countries and regions [1].

Group 2: Exhibition Focus
- The fair will cover cutting-edge fields including national key projects, artificial intelligence, smart manufacturing, robotics, semiconductors, and clean energy, showcasing the latest technological achievements and innovative solutions [1].
- Specific exhibition areas will include national key equipment, technology giants' industrial chains, international technology achievements, and sectors such as the low-altitude economy and the data industry [1].

Group 3: Participation and Investment
- Over 5,000 well-known companies, including China National Petroleum, China Petroleum & Chemical, Huawei, BYD, and Xiaomi, are expected to participate with major technological equipment and innovative products [2].
- Total intended investment from participating investment institutions has exceeded 1 billion yuan, covering capital investment stages from angel rounds to Pre-A rounds [2].
"AI has lowered the barrier to creation, letting the general public take part in artistic creation"
Guan Cha Zhe Wang· 2025-07-29 07:27
Core Insights
- The 2025 World Artificial Intelligence Conference, held in Shanghai, showcased over 3,000 cutting-edge exhibits, a record scale for the event [1]
- The rapid development of AI-generated content (AIGC) is transforming various fields, with companies such as SenseTime and Kuaishou demonstrating innovative AIGC applications [1][5]
- Despite these advances, challenges such as content quality, copyright issues, and misinformation have emerged, prompting discussion of the need for regulation and ethical guidelines [1][11]

AIGC Applications
- Kuaishou introduced the "Ling Animation Canvas" feature, which enables users to turn fragmented ideas into cohesive visual works [2]
- SenseTime launched the Seko short film creation agent, which automates the entire video production process from scriptwriting to final editing, with a focus on high-quality output [5][7]
- The conference featured themed spaces, including an "AI Painting Laboratory" and an "AI Music Workshop," that encouraged creative exploration with AI tools [7]

Regulatory Landscape
- The need for AI regulation is becoming more urgent as AI-generated content grows more realistic, raising concerns about ethical use and potential illegal activity [13]
- The European Union's AI Act, approved by the European Parliament in March 2024, is a significant step in regulating AI applications, with most rules set to take effect by August 2026 [13][14]
- China is actively advancing AI legislation and has introduced several regulations and ethical guidelines to ensure the healthy development of AI technologies [14]
Building a bridge for AI to interact with the real world: SenseTime's "Jueying Awakening" (绝影开悟) world model is upgraded again
Tai Mei Ti APP· 2025-07-29 06:02
Core Insights
- The value of the world model lies in expanding the physical boundaries of AI rather than replacing human cognition, according to SenseTime CTO Wang Xiaogang [2]
- SenseTime showcased its upgraded "Jueying Awakening" world model at the World Artificial Intelligence Conference (WAIC 2025), emphasizing its ambition to extend into embodied intelligence by constructing a 4D real-world model [2][3]

Group 1: Product Development and Capabilities
- The "Jueying Awakening" model is the first generative world model in the industry to reach mass production, demonstrating its technical value in practical applications [3]
- SenseTime has collaborated with SAIC's Zhiji Auto to generate critical driving scenarios such as cut-ins and collisions, allowing high-risk, low-probability driving scenarios to be generated in batches without relying on real road tests [3]
- The newly launched product platform for the generative world model is open for trial use by B-end enterprises and C-end developers [4]

Group 2: Data Generation and Efficiency
- The platform offers flexible scene customization, with adjustments for weather, lighting, and road type, and generates scene videos from prompts [5]
- The "WorldSim-Drive" dataset, the largest generative driving dataset in the industry, contains over 1 million clips of production-level data covering various weather and lighting conditions [5]
- The model can generate data equivalent to the collection capacity of 10 real test vehicles or 100 road test vehicles daily, achieving efficiency comparable to 500 production vehicles [5]

Group 3: 4D Interactive Training Environment
- The 4D interactive training environment integrates 3DGS reconstruction technology with world model generation capabilities, enabling high-precision digital reconstruction of real spaces [6][8]
- Users can trigger a closed-loop process that quickly generates complex scenarios from text descriptions or scene layouts [8]
- The training environment has been implemented in collaboration with Zhiji Auto, covering typical scenarios, with the aim of expanding to millions of scenarios that encompass nearly all driving possibilities [8]

Group 4: Extension to Embodied Intelligence
- SenseTime aims to transfer the "virtual-real fusion" data approach from the autonomous driving sector to embodied intelligence, addressing the challenges of data dimensional explosion and the Sim2Real gap [10]
- The model uses multimodal spatiotemporal alignment capabilities and generates high-fidelity 4D environments to predict object movement trajectories in real time [10]
- The model can generate both first-person and third-person perspectives, giving robotic applications more complete data views [11]

Conclusion
- The evolution of "Jueying Awakening" represents a shift of AI from the digital world to the physical world, with the core value being the transformation of AI creativity into productivity [11]
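Total intended investment already exceeds 1 billion yuan; this year's High-Tech Fair sets up specialized exhibitions to "link" the globe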
Nan Fang Du Shi Bao· 2025-07-29 05:41
Group 1
- The 27th China International High-Tech Achievements Fair (referred to as the High-Tech Fair) will take place from November 14 to 16, 2025, at the Shenzhen International Convention and Exhibition Center, marking a significant event in the global technology landscape [1][11]
- The High-Tech Fair has been held 26 times since its establishment in 1999 and has become a crucial platform for showcasing cutting-edge technology and promoting the brand visibility of Chinese tech companies [1][3]
- Over 5,000 renowned enterprises, including major players such as Huawei, BYD, and Xiaomi, are expected to showcase their latest technological innovations at this year's fair [5][9]

Group 2
- The exhibition area for the 27th High-Tech Fair is planned to cover 400,000 square meters, with participation from over 100 countries and regions, focusing on the latest high-tech developments and facilitating technology transactions [3][6]
- Specialized exhibition areas will highlight advances in fields including artificial intelligence, robotics, semiconductor technology, and clean energy, with the aim of driving innovation and collaboration [6][8]
- The fair aims to attract significant investment, with intended investment exceeding 1 billion yuan across sectors including biomedical and hard technology, indicating strong market interest and potential for growth [11][9]

Group 3
- The previous High-Tech Fair achieved record results, with over 40,000 professional visitors and a transaction amount exceeding 120 billion yuan, underscoring its status as a leading global technology event [9][11]
- The fair serves as a vital platform for technology transfer and enterprise financing, fostering deep integration between technology and the economy and stimulating innovation [11][9]
- The event will also feature numerous forums and procurement meetings that connect technology providers with investors, further strengthening collaboration [9][11]