Core Insights - The AICon Global Artificial Intelligence Development and Application Conference will take place in Beijing, featuring over 50 experts from leading companies like Tencent, Alibaba, Baidu, and ByteDance, focusing on AI Agent, multimodal applications, and optimization of reasoning performance [1][4]. Group 1: Conference Highlights - The conference will cover various topics including AI Agent construction, multimodal practices, large model support for development, and AI's deep integration into business operations [4]. - A notable presentation will be given by Han Ai, the Algorithm Director of JD Group, discussing the JDAgents-R1 framework, which addresses challenges in multi-agent reinforcement learning (MARL) [2][3]. Group 2: JDAgents-R1 Framework - JDAgents-R1 introduces a joint evolution algorithm framework for heterogeneous multi-agents, utilizing Group Relative Policy Optimization (GRPO) to enhance training efficiency and stability [2]. - The framework balances decision-making and memory capabilities, reducing redundant reasoning and accelerating training convergence, achieving performance comparable to large-scale language models with smaller open-source models [2]. Group 3: Expert Contributions - Han Ai has extensive academic and professional credentials, including a PhD from a joint program between the Chinese Academy of Sciences and Cornell University, and has published numerous papers in top-tier journals [3]. - The presentation will include insights on multi-agent training technologies, application cases, and the evolution of decision-making and memory in multi-agent systems [3].
京东集团算法总监韩艾将在 AICon 北京站分享基于强化学习的异构多智能体联合进化算法