Workflow
又是浙大校友!AI眼镜“隔空取物”,戴上即可随心选中现实世界任意物体
量子位·2025-08-07 10:13

Core Viewpoint - The article discusses the introduction of Reality Proxy, a technology that enhances mixed reality (XR) interactions by allowing users to manipulate real-world objects through digital proxies, thereby overcoming limitations of traditional XR devices [11][12][14]. Group 1: Technology Overview - Reality Proxy enables seamless interaction with real-world objects by creating abstract digital representations of them, allowing users to select and manipulate these proxies instead of the physical objects directly [9][12]. - The technology aims to improve user experience by addressing challenges such as small object sizes, unstable line of sight, and hand tremors that often hinder traditional XR interactions [11][14]. - The process involves three main steps: activating the proxy, generating the proxy while retaining spatial relationships, and interacting with the proxy while maintaining focus on the real-world object [15][16][21]. Group 2: Interaction Features - Users can browse object previews by sliding their fingers over multiple proxies, select multiple objects by defining an area with hand gestures, and filter objects by attributes [27][28][29]. - The system supports intuitive interactions by utilizing physical surfaces as touchpads, allowing users to perform familiar gestures to select and manipulate objects [30]. - Additional features include semantic grouping, spatial scaling, and custom grouping, enhancing the flexibility of user interactions with real-world objects [32][35][37]. Group 3: Practical Applications - Reality Proxy can be applied in various scenarios, such as quickly locating specific books in an office, interacting with kitchen appliances, and navigating large buildings efficiently [38][40][42]. - The technology also facilitates the control of dynamic real-world objects, exemplified by a mixed reality application for drone control [44][45]. Group 4: User Feedback and Evaluation - The evaluation of Reality Proxy indicated positive feedback regarding its practicality, ease of learning, and usability, particularly in addressing interaction challenges with distant or hard-to-reach objects [48][50]. - Participants noted that the system is faster than traditional methods, reduces physical fatigue, and is more accommodating for users with mobility or visual stability issues [51]. - Some concerns were raised about the learning curve for new users and the need for improved accuracy in proxy positioning [52].