3D Digital Humans
SIGGRAPH Asia 2025 | Creating and Rendering High-Quality 3D Digital Humans with Just a Smartphone
机器之心· 2025-12-18 10:15
Core Insights
- The article covers advances in 3D digital human reconstruction and rendering, focusing on HRM²Avatar, a system developed by the Taobao Technology - Meta team that creates high-fidelity, real-time 3D digital humans using only a smartphone [4][5][6].
Group 1: Technology Overview
- HRM²Avatar is a system for high-fidelity, real-time 3D digital human reconstruction and rendering. It uses a two-stage capture method and combines an explicit clothing mesh representation with Gaussian-based modeling of dynamic detail [12][36].
- The system reconstructs the human figure, clothing structure, and detailed appearance under ordinary smartphone capture conditions, balancing visual realism, cross-pose consistency, and real-time rendering on mobile devices [6][12].
Group 2: Methodology
- Capture has a static and a dynamic scanning phase: users hold a fixed pose for the static scan and perform natural movements for the dynamic scan, which gives the system the signals it needs for reconstruction and dynamic modeling [18][28].
- The system uses a mixed representation, attaching Gaussian points to the clothing mesh to provide controllable parameters for pose-dependent deformation and lighting modeling [40][46].
Group 3: Performance Evaluation
- HRM²Avatar has been tested on mobile devices, achieving stable real-time performance with approximately 530,000 Gaussian points: 2K resolution at 120 FPS on the iPhone 15 Pro Max, and 2K at 90 FPS on Apple Vision Pro [87][89].
- Comparative evaluations show that HRM²Avatar outperforms existing methods in static reconstruction quality and in appearance consistency under pose variation, as evidenced by higher PSNR and SSIM scores [76][80].
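For readers unfamiliar with the PSNR metric cited in the evaluation, the following is a minimal sketch of how it is commonly computed. It is not taken from the HRM²Avatar evaluation code: the function name `psnr`, the flat-list image layout, and the 8-bit peak value of 255 are all assumptions made for illustration.

```python
import math


def psnr(reference, rendered, max_value=255.0):
    """Peak signal-to-noise ratio (in dB) between two images.

    Both images are flat sequences of pixel intensities of equal length.
    `max_value` is the largest possible intensity (255 assumed for 8-bit data).
    """
    if len(reference) != len(rendered):
        raise ValueError("images must have the same number of pixels")
    if len(reference) == 0:
        raise ValueError("images must not be empty")

    # Mean squared error over all pixels.
    mse = sum((a - b) ** 2 for a, b in zip(reference, rendered)) / len(reference)

    # Identical images have zero error, so PSNR is unbounded.
    if mse == 0:
        return math.inf
    return 10.0 * math.log10(max_value ** 2 / mse)


# Hypothetical pixel values: a render closer to the reference scores higher.
ground_truth = [100, 120, 140, 160]
close_render = [101, 119, 141, 159]
rough_render = [110, 110, 150, 150]
print(psnr(ground_truth, close_render) > psnr(ground_truth, rough_render))
```

A higher PSNR means the rendered image deviates less from the captured ground truth, which is the sense in which the summary reports "higher PSNR" as better reconstruction quality.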
Group 4: Future Directions
- The article notes that further optimization is still needed, particularly for complex clothing structures and extreme lighting conditions, while presenting HRM²Avatar as a significant milestone in making high-quality digital humans accessible to ordinary users [90].
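The methodology summary for HRM²Avatar says that Gaussian points are attached to the clothing mesh, but it does not say how. One common way to attach a point to a mesh is to store barycentric weights on a triangle, so the point follows the triangle as it deforms. The sketch below illustrates that general idea only; the function name `point_on_triangle` and the barycentric scheme are assumptions, not a description of the actual HRM²Avatar implementation, and the rotation, scale, and pose-dependent offsets of real Gaussians are ignored.

```python
def point_on_triangle(weights, triangle):
    """Return the 3D point at barycentric `weights` on `triangle`.

    `weights` is (w0, w1, w2) summing to 1; `triangle` is three (x, y, z) vertices.
    Assumed binding scheme for illustration, not the paper's confirmed method.
    """
    w0, w1, w2 = weights
    if abs(w0 + w1 + w2 - 1.0) > 1e-9:
        raise ValueError("barycentric weights must sum to 1")

    # zip(*triangle) groups the x, y, and z coordinates of the three vertices.
    return tuple(w0 * a + w1 * b + w2 * c for a, b, c in zip(*triangle))


# Hypothetical rest-pose triangle and the same triangle after the mesh moves.
rest_triangle = ((0.0, 0.0, 0.0), (1.0, 0.0, 0.0), (0.0, 1.0, 0.0))
posed_triangle = tuple((x, y, z + 2.0) for x, y, z in rest_triangle)

# A point bound to the triangle centroid keeps the same weights in both poses.
centroid_weights = (1.0 / 3.0, 1.0 / 3.0, 1.0 / 3.0)
print(point_on_triangle(centroid_weights, rest_triangle))
print(point_on_triangle(centroid_weights, posed_triangle))
```

Because the weights are fixed and only the triangle vertices change, the bound point moves with the mesh from pose to pose, which is one way an explicit mesh can give controllable structure to otherwise free-floating points.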
Giving AI a "Body": Are 3D Digital Humans the Answer to Embodied Intelligence? | Robotics Series
硅谷101· 2025-11-07 23:38
Key Concepts & Industry Focus
- 3D digital humans are seen as a crucial link between the virtual and real worlds, evolving beyond AI avatars into an "embodied intelligence driving layer" [1].
- The industry must balance high quality, low latency, and low cost in 3D digital human creation while also achieving "generalization" across diverse scenarios [1].
- The report explores a possible future in which screens can interact, NPCs evolve, and robots empathize, and asks how the relationship between humans and AI is changing and how close a "digital life" era may be [1].
Technical Aspects & Challenges
- Developing 3D digital humans involves fusing large models with robotics, which makes them a bridge to embodied intelligence [1].
- Two main technical paths exist: 2D "speaking" and 3D "expressing." The latter builds an embodied intelligence driving layer through five key stages [1].
- Key challenges include high modeling costs, a scarcity of high-quality data, and achieving generalization in interaction, motion, and emotion [1].
Applications & Future Implications
- 3D digital humans could revitalize screens by enabling more natural human-computer interaction through "natural dialogue" [1].
- Virtual IPs can evolve into "digital idols," and NPCs and players can engage in collaborative adventures [1].
- 3D digital humans are accelerating the evolution of robots and the arrival of the embodied intelligence era [1].
At the Intersection of Content Creators and Robots: On the Evolution of 3D Digital Humans
36Kr · 2025-10-29 11:24
Core Insights
- The rise of 3D digital humans is transforming content creation and interaction, moving from rigid, scripted avatars to dynamic, responsive entities capable of real-time expression and movement [1][2].
- The underlying technology is evolving, with significant advances in AI and rendering techniques that reduce costs and improve quality [7][36].
- Adoption of 3D digital humans in industries such as gaming and film is expected to grow, with potential applications in customer service and interactive experiences [1][12].
Group 1: Technological Advancements
- 3D digital humans have progressed from basic, scripted models to sophisticated entities that can generate voice, expressions, and movements in real time [1][2].
- Models such as Sora2 demonstrate the potential for generating human-like actions and interactions, although challenges remain in error correction and precise control [3][5].
- Combining 2D and 3D training techniques is being explored to make digital humans more expressive and accurate [5][7].
Group 2: Cost Efficiency
- Creating and deploying 3D digital humans costs significantly less than traditional methods, with estimates suggesting costs are a fraction of those associated with large models [7][36].
- AI-based rendering and computation techniques allow complex 3D models to run on inexpensive hardware, making the technology more accessible [36][38].
- The ability to generate high-quality 3D content without expensive graphics engines or hardware is described as a game-changer for the industry [36][39].
Group 3: Industry Applications
- 3D digital humans are being positioned as the next generation of content producers, with applications in live streaming, customer service, and entertainment [2][12].
- The technology is expected to bridge the gap between virtual and physical interactions, enhancing user experiences in various sectors [1][12].
- Integrating 3D digital humans into VR and AR environments opens new avenues for immersive experiences [5][12].
Group 4: Data and Model Development
- Accumulating high-quality 3D animation data is crucial for training effective AI models; the company claims to have over 1,000 hours of such data [24][25].
- Video data is being integrated with 3D data to improve model training and make digital human interactions more realistic [25][26].
- A "3D action model" is under development that aims to automate the generation of 3D motion data for robots, further bridging the digital and physical realms [46][48].
Group 5: Future Prospects
- The company aims to move from being a 3D digital human provider to being a platform on which other developers build applications using its technology [12][13].
- 3D digital humans could affect the film industry, although reaching Hollywood's quality standards remains a challenge [33][34].
- The ongoing evolution of AI and robotics suggests a promising future for 3D digital humans across applications, with significant advances expected in the coming years [57][59].