Workflow
深度强化学习
icon
Search documents
致敬钱学森,我国学者开发AI虚拟现实运动系统——灵境,解决青少年肥胖难题,揭示VR运动的减肥及促进大脑认知作用机制
生物世界· 2025-06-24 03:56
Core Viewpoint - Adolescent obesity is a global public health crisis with rising prevalence, leading to increased risks of cardiovascular and metabolic diseases, as well as cognitive impairments [2] Group 1: Research and Development - A research team from Shanghai Jiao Tong University and other institutions developed the world's first VR-based exercise intervention system, REVERIE, aimed at overweight adolescents [4][8] - The REVERIE system utilizes deep reinforcement learning and a Transformer-based virtual coach to provide safe, effective, and empathetic exercise guidance [4][8] Group 2: Study Design and Methodology - The study included a randomized controlled trial with 227 overweight adolescents, comparing outcomes between VR exercise, real-world exercise, and a control group [11] - Participants were assigned to different groups, including VR and real-world sports, with all groups receiving uniform dietary management over an eight-week intervention [11] Group 3: Results and Findings - After eight weeks, the VR exercise group lost an average of 4.28 kg of body fat, while the real-world exercise group lost 5.06 kg, showing comparable results [13] - Both VR and real-world exercise groups showed improvements in liver enzyme levels, LDL cholesterol, physical fitness, mental health, and exercise willingness [13] - VR exercise demonstrated superior cognitive function enhancement compared to real-world exercise, supported by fMRI findings indicating increased neural efficiency and plasticity [14] Group 4: Safety and Implications - The injury rate in the VR exercise group was 7.69%, lower than the 13.48% in the real-world exercise group, with no severe adverse events reported [15] - The REVERIE system is positioned as a promising solution for addressing adolescent obesity and promoting overall health improvements beyond weight loss [16][17]
字节跳动ByteBrain团队提出秒级推理强化学习VMR系统
news flash· 2025-06-05 06:49
Core Insights - ByteDance's ByteBrain team, in collaboration with UC Merced and UC Berkeley, has developed a VMR system based on deep reinforcement learning, achieving a significant reduction in inference time to 1.1 seconds while maintaining near-optimal performance [1] Group 1 - The VMR system addresses the long-neglected but critical issue of virtual machine re-scheduling (VMR) [1] - The research has been presented at the prestigious EuroSys25 conference, highlighting its academic significance [1] - The two co-first authors of the paper are interns from ByteDance's ByteBrain team, indicating the company's investment in nurturing talent [1]
深度强化学习赋能城市消防优化,中科院团队提出DRL新方法破解设施配置难题
3 6 Ke· 2025-06-03 07:27
Core Viewpoint - The presentation by Dr. Liang Haojian focuses on optimizing urban emergency fire facility allocation using a hierarchical deep reinforcement learning (DRL) approach, highlighting the advantages and potential of deep learning in geographic spatial optimization [1][4][17]. Geographic Spatial Optimization - Geographic spatial optimization combines mathematical combinatorial optimization with geographic information science, addressing practical issues such as spatial layout and resource allocation in urban development [4][5]. - Traditional methods for solving spatial optimization problems include exact algorithms, approximate algorithms, and heuristic algorithms, each with its limitations [4][5]. Deep Learning in Geographic Spatial Optimization - The exploration of neural spatial optimization (NeurSPO) aims to utilize deep learning to solve spatial optimization problems, motivated by the need for faster heuristic methods and the automatic design of new algorithms [6]. - Two main constructs of NeurSPO are deep construction and deep improvement, focusing on stepwise solution construction and local search enhancements, respectively [6]. SpoNet Model - The SpoNet model integrates dynamic coverage information and attention mechanisms to address location selection challenges, allowing the model to focus on specific input sequences during decoding [7][11]. - In a case study involving emergency facilities in Beijing's Chaoyang District, the model selected 20 out of 132 candidate facilities to maximize coverage [11]. AIAM Model - The adaptive interaction attention model (AIAM) is designed to solve the p-median problem by incorporating user-facility interaction, enhancing local search capabilities [12][16]. - The model demonstrated feasibility by retaining 15 hospitals from 80 candidates to minimize total distance to residents [16]. Hierarchical DRL for Fire Facility Allocation - The hierarchical DRL approach addresses the challenges of urban emergency fire facility allocation by improving fire risk prediction accuracy, optimizing resource allocation, and enhancing response timeliness [17][21]. - The model incorporates multi-dimensional spatiotemporal feature extraction, uncertainty considerations, and a hierarchical strategy for facility layout optimization [18][21][22]. Future Outlook - The research team plans to enhance geographic spatial optimization by integrating geographic computing mechanisms, expanding to large-scale emergency response issues, and designing more efficient DRL frameworks [23][24][25]. - The proposed hierarchical DRL method aims to address inefficiencies in traditional fire facility layouts and improve emergency management through innovative solutions [25].