联邦学习
Search documents
Meta又一AI大将跟LeCun跑了
量子位· 2026-03-22 06:28
Core Viewpoint - The departure of John Nguyen from Meta to join AMI, a company founded by Yann LeCun, highlights the ongoing challenges and internal turmoil at Meta, particularly within its FAIR team, as it struggles with technological advancements and employee retention [1][5][30]. Group 1: John Nguyen's Background and Contributions - John Nguyen, a key figure at Meta's FAIR, has a strong academic background with dual degrees in statistics and computer science from the University of California, Davis, and has been with Meta for over six years [12][15]. - His research trajectory at Meta included significant contributions to federated learning, large-scale deep learning training, and multi-modal systems, aligning with Meta's technological evolution [16][18][20]. - Nguyen's expertise in both foundational training and practical system implementation positions him as a valuable asset in the AI industry, particularly as the focus shifts from language modeling to real-world modeling [20][28]. Group 2: Meta's Current Challenges - Meta is experiencing significant internal challenges, including rumors of leadership changes and difficulties in model development, particularly with the delayed release of its new model "Avocado," originally expected by late last year [30][34]. - The company has faced public relations issues, including a recent incident involving unauthorized data leaks, contributing to a negative perception of its operational stability [36][37]. - The contrast between Meta's struggles and the rapid growth of AMI, which secured $1.03 billion in seed funding, suggests a potential trend of further departures from Meta's FAIR team to join LeCun's new venture [28][38].
ICLR 2026 | 从「聚合」到「引导」:FedDRM开启客户端智能路由新范式
机器之心· 2026-03-17 03:58
Core Viewpoint - The article presents FedDRM, a novel approach in federated learning that utilizes data heterogeneity as information rather than noise, enabling the server to route requests to the most suitable client for processing [2][5]. Group 1: Traditional Federated Learning Limitations - Existing federated learning methods do not address the "client routing" issue, focusing instead on training multiple local models without the ability to select the appropriate model for external requests [10]. - Current federated learning systems typically use coarse strategies like averaging or voting for decision-making, lacking a mechanism for matching and distributing requests effectively [13]. Group 2: FedDRM's Innovative Approach - FedDRM models the client routing problem as a density ratio estimation problem, allowing for a unified objective function that learns both model prediction capabilities and client routing abilities [12]. - The framework incorporates empirical likelihood to ensure that the underlying distribution is driven by data rather than strong parametric assumptions, thus avoiding model misspecification bias [15]. Group 3: Evaluation of Routing Capability - The evaluation of a federated system's routing capability is based on system accuracy, which measures whether the server can correctly predict the most suitable client for a new external query [20]. - This approach contrasts with traditional methods that focus solely on local accuracy, providing a more realistic assessment of system performance [20]. Group 4: Experimental Results - Experiments on CIFAR-10/100 and RETINA datasets show that FedDRM consistently improves system-level accuracy compared to existing personalized federated learning methods, with enhancements of approximately 1.41% to 7.67% on the RETINA dataset [24]. - The training process remains stable without the need for complex generative models, indicating a practical advantage of the FedDRM approach [24]. Group 5: Implications for Federated Learning - FedDRM transforms the service model of federated learning systems, enabling them to provide structured decision-making capabilities and serve as intelligent systems rather than merely aggregating models [26]. - This advancement opens new possibilities for real-world applications, such as medical collaboration, financial risk management, and IoT, where systems can intelligently match cases to the most appropriate models [27].
不共享数据,也能联合训练,UCL团队用联邦学习重塑血液形态学检查
3 6 Ke· 2026-02-13 09:55
Core Insights - A research team from University College London (UCL) has developed a federated learning framework for white blood cell morphology analysis, enabling collaborative training without data exchange among institutions, thus ensuring data privacy while learning robust and domain-invariant feature representations [1][2]. Group 1: Federated Learning Framework - The federated model utilizes blood smear data from multiple clinical sites, demonstrating superior cross-site performance and generalization capabilities compared to centralized training [1][2]. - The framework addresses the critical issue of data privacy in healthcare, allowing institutions to collaborate on model training without sharing sensitive medical data [2][20]. Group 2: Clinical Relevance and Data Heterogeneity - Blood morphology examination is vital for diagnosing blood diseases, but it is labor-intensive and heavily reliant on skilled professionals, particularly in low- and middle-income countries where expertise is scarce [1]. - The study employed blood smear datasets from two centers, ensuring coverage of various cell types and reflecting real-world clinical heterogeneity, which is crucial for testing the federated learning model's generalization ability [5][8]. Group 3: Model Architecture and Training - The research utilized two deep learning architectures: ResNet-34 and DINOv2-Small, with a unified training protocol involving five rounds of global communication and local training cycles [9][11]. - Four federated aggregation strategies were implemented: FedAvg, FedMedian, FedProx, and FedOpt, each with distinct characteristics and performance implications [12]. Group 4: Performance Evaluation - The federated learning framework showed significant performance improvements, with models achieving a balanced accuracy of 58% compared to 52% for models trained on single institution data, highlighting the advantages of collaborative training without data sharing [16]. - External validation on data from a clinical hospital in Barcelona demonstrated that federated methods outperformed centralized training in generalization capabilities, achieving a balanced accuracy of 67% versus 64% [17][19]. Group 5: Implications for Healthcare - Federated learning is positioned as a key solution to overcome the "data silo" problem in healthcare, enabling collaborative model training while maintaining data privacy and compliance with regulations [20][22]. - The approach is expected to facilitate the transition of AI applications in blood morphology analysis from single-institution settings to cross-regional, clinical-grade intelligent diagnostic services, supporting precision medicine and digital healthcare [22].
中移取得联邦学习方法系统和存储介质专利
Sou Hu Cai Jing· 2026-02-03 07:00
Group 1 - The State Intellectual Property Office of China has granted a patent titled "Federated Learning Method, System, and Storage Medium" to China Mobile (Shanghai) Information Communication Technology Co., Ltd., China Mobile Intelligent Transportation Network Technology Co., Ltd., and China Mobile Communications Group Co., Ltd. The patent was applied for on November 2021 [1] - China Mobile (Shanghai) Information Communication Technology Co., Ltd. was established in 2018 with a registered capital of 2 billion RMB. The company has participated in 2,020 bidding projects and holds 953 patents [1] - China Mobile Intelligent Transportation Network Technology Co., Ltd. was founded in 2015 with a registered capital of 1 billion RMB. The company has engaged in 705 bidding projects and possesses 861 patents [1] Group 2 - China Mobile Communications Group Co., Ltd. was established in 1999 with a registered capital of 30 billion RMB. The company has invested in 55 enterprises and participated in 5,000 bidding projects, holding 5,000 patents [2] - The company has 2,211 trademark registrations and 50 administrative licenses [2]
爱尔眼科参与起草《优化消费环境 放心消费品牌评价规范》
Sou Hu Cai Jing· 2026-01-27 02:53
Group 1 - The 2025 Brand Strong Country Economic Forum was held in Beijing, where the group standard "Optimizing Consumption Environment and Reassuring Consumption Brand Evaluation Norm" was officially launched [1] - The standard was drafted by the National Business Newspaper Association in collaboration with authoritative institutions, establishing a comprehensive evaluation index system for industry standardization and reassuring consumption brand construction [1] - Aier Eye Hospital Group, as one of the drafting units, integrates its development experience into the standard to assist in optimizing the consumption environment and building a reassuring consumption ecosystem [1] Group 2 - Aier Eye Hospital has created a tiered chain model to address the uneven distribution of medical resources, promoting a three-dimensional eye care service network [2] - The company aims to synchronize medical technology, equipment, and pharmaceuticals with international standards, ensuring that innovative technologies benefit Chinese eye care patients sooner [2] - Aier Eye Hospital has achieved steady growth in outpatient volume, surgical volume, and discharge numbers, with a postoperative infection rate of 0.0156% for high-level surgeries, outperforming international averages [2] Group 3 - Aier Eye Hospital is advancing its digital transformation in eye care by launching a "Digital Eye Care" model, utilizing cutting-edge technologies like AI and federated learning [3] - The company aims to create an intelligent closed-loop management system for eye health services, enhancing accessibility and efficiency for grassroots patients [3] - Aier Eye Hospital plans to use the new standard as a guide to deepen technological innovation and upgrade service quality, contributing to the high-quality development of the Chinese eye care industry [3]
医渡科技宫如璟达沃斯之行:密集开展国际对话,释放AI医疗价值与全球合作战略
Sou Hu Cai Jing· 2026-01-26 08:01
Core Insights - The founder and chairwoman of the company participated in the World Economic Forum's 56th Annual Meeting, engaging in high-level discussions on the intelligent transformation of the healthcare industry and the role of AI in addressing global health challenges [1][3][5]. Group 1: AI and Healthcare Resilience - The company emphasized the importance of AI and big data in enhancing public health system resilience and addressing new health challenges driven by climate change [1]. - AI healthcare enterprises can leverage technology to monitor climate-sensitive disease trends and innovate chronic disease management, thereby improving societal health resilience [1]. Group 2: AI in Health Security - The company highlighted the critical role of AI in infectious disease monitoring, drug development, and health management during discussions on reconstructing health security [3]. - The use of federated learning and privacy computing technologies is being implemented to enhance data security while improving model training efficiency across regions [3]. Group 3: China's AI Healthcare Development - The company discussed China's unique advantages in AI healthcare, including rich application scenarios, an improving data ecosystem, and strong policy support [5]. - The global value of China's experience lies not only in technological breakthroughs but also in systematic implementation capabilities and efficient localization models [5]. Group 4: Global Collaboration and Innovation - The company is actively building and expanding its collaboration network through bilateral meetings with key stakeholders, including government officials and industry leaders [7]. - A significant partnership was announced with Novartis for a cardiovascular disease prevention project, showcasing the company's commitment to AI-driven healthcare solutions [7]. Group 5: Contribution to Global AI Healthcare Ecosystem - The company's presence at the forum underscored the role of new-generation Chinese tech entrepreneurs in contributing to global wisdom and fostering cross-sector collaboration for a resilient future [8].
【全网无错版】上周末,唐杰、杨强、林俊旸、姚顺雨真正说了什么?
机器人圈· 2026-01-13 09:41
Core Viewpoint - The article discusses the vibrant developments in China's AI sector at the beginning of 2026, highlighting key figures in the field and their contributions to the evolution of large models and AI applications. Group 1: Event Highlights - The event featured prominent figures in AI, including Professor Tang Jie, Yang Zhilin, Lin Junyang, and Yao Shunyu, marking a significant gathering in Beijing [1]. - The presence of foundational figures like Zhang Bo and Yang Qiang indicates the event's importance in shaping the future of the large model industry [1]. Group 2: Observations on AI Development - The year 2025 was noted as a breakthrough year for open-source models in China, with a 10 to 20 times increase in coding activities [6]. - The discussion emphasized the differentiation of AI models, with a focus on enterprise applications and coding, inspired by developments in Silicon Valley [7][8]. Group 3: Model Differentiation - Yao Shunyu pointed out the clear division between To C (consumer) and To B (business) models, with a growing trend towards vertical integration and layered applications [9][12]. - The article highlights that while consumer applications may not require the highest intelligence, business applications benefit significantly from stronger models, leading to a willingness to pay for superior performance [10][11]. Group 4: Future Paradigms in AI - The conversation shifted to the next paradigm in AI, focusing on autonomous learning and self-improvement, with various interpretations of what this entails [23][24]. - Yao Shunyu mentioned that the bottleneck for autonomous learning is not methodology but rather the data and tasks involved, indicating a need for context and environment to enhance AI capabilities [23][25]. Group 5: Agent Strategy - The potential for agents to automate human tasks significantly was discussed, with expectations that by 2026, agents could handle workloads equivalent to one or two weeks of human effort [39][40]. - The article suggests that the development of agents is closely tied to advancements in model capabilities and the complexity of interaction environments [45][46].
联邦学习不再安全?港大TPAMI新作:深挖梯度反转攻击的内幕
机器之心· 2026-01-11 04:00
Core Viewpoint - Federated Learning (FL) is not as secure as previously thought, as Gradient Inversion Attacks (GIA) can potentially compromise privacy by reconstructing private training data from shared gradient information [3][5]. Group 1: Background and Importance of the Study - Federated Learning allows clients to collaboratively train models without sharing raw data, but recent studies indicate that "not sharing data" does not equate to "absolute security" [5]. - Attackers can utilize GIA to reconstruct private data such as facial images and medical records, highlighting the need for a systematic classification and analysis of these attacks [5][6]. Group 2: Classification of GIA Methods - The research categorizes existing GIA methods into three main types: 1. Optimization-based attacks (OP-GIA) 2. Generation-based attacks (GEN-GIA) 3. Analysis-based attacks (ANA-GIA) [9]. Group 3: Theoretical Contributions - The study presents significant theoretical advancements, including: - Theorem 1: Establishes a linear relationship between the reconstruction error of OP-GIA and the square root of Batch Size and image resolution, indicating that larger batch sizes and higher resolutions make attacks more difficult [11]. - Proposition 1: Reveals that the similarity of gradients during model training affects the difficulty of data recovery, with more similar gradients making recovery harder [13]. Group 4: Experimental Findings - Extensive experiments were conducted on datasets like CIFAR-10/100, ImageNet, and CelebA, covering various attack types and model architectures [15]. - Key findings indicate that: - OP-GIA is practical but limited by batch size and resolution, with its threat significantly reduced in Practical FedAvg scenarios. - GEN-GIA can generate high-quality images but relies heavily on pre-trained generators and specific activation functions, making it less effective if those conditions are not met. - ANA-GIA can achieve precise data recovery but is easily detectable by clients, limiting its practical application [25]. Group 5: Defense Guidelines - The authors propose a three-phase defense pipeline to enhance security without complex encryption: 1. Network design phase 2. Training protocol phase 3. Client verification phase, where clients should validate model architecture and parameters to prevent malicious modifications [22]. Group 6: Summary and Practical Implications - This research serves as a comprehensive examination of existing GIA methods and provides practical guidelines for enhancing the security of federated learning systems, emphasizing that while privacy risks are real, they can be effectively managed through thoughtful design and protocols [24].
海尔消费金融2025年“特征英雄”落下帷幕,数智化风控质效显著
Sou Hu Cai Jing· 2026-01-06 07:50
Core Insights - Haier Consumer Finance successfully concluded its 2025 "Feature Hero" initiative, aimed at enhancing data-driven value in financial services and expanding multi-dimensional data samples [1][6] - The initiative emphasizes the importance of data and features in risk control, with advanced models and algorithms striving to approach the risk identification "ceiling" determined by data [1] Group 1: Feature Hero Competition - The first prize of the "Feature Hero" competition was awarded to the Risk Management Center, which innovatively utilized large models to replace manual processing of voice data, aiding in credit risk control strategies [5] - The competition attracted 32 employees, resulting in the extraction of 2,023 high-quality features from vast data, significantly enhancing the risk control system [5] Group 2: Intelligent Risk Control System - By 2025, Haier Consumer Finance's intelligent risk control system had launched a total of 10,427 real-time features, a 70% increase year-on-year [6] - The company emphasizes the importance of continuous competitions like "Feature Hero" to foster an AI-driven culture and enhance data asset exploration [6] Group 3: AI Integration and Industry Trends - The integration of deep learning technologies such as large models, graph learning, and natural language processing is transforming credit risk control models, showcasing a trend of multi-technology application in the field [6] - Haier Consumer Finance's AI-driven risk control system significantly reduces fraud risk and improves credit approval efficiency, achieving a dual advantage of controllable risk and efficient service [6] Group 4: Future Developments - Future advancements in technologies like federated learning, reinforcement learning, and AGI are expected to further enhance risk control models in areas such as data privacy protection and dynamic strategy optimization [7] - The company plans to deepen its AI First strategy, continuously strengthening data governance and technical application capabilities for high-quality development in credit business [7]
港理工成立人工智能高等研究院 聚焦去中心化AI与成果转化
2 1 Shi Ji Jing Ji Bao Dao· 2025-12-10 12:07
Core Viewpoint - The establishment of the PolyU Artificial Intelligence Advanced Research Institute (PAAI) aims to address challenges in AI development, focusing on decentralized AI architectures and the transformation of research outcomes into practical applications [1][2]. Group 1: PAAI Establishment and Goals - PAAI is one of the five key research institutes at Hong Kong Polytechnic University, dedicated to interdisciplinary research and the conversion of cutting-edge research into products and technologies [1]. - The institute aims to leverage PolyU's research network in mainland China to facilitate the application of AI technologies for societal development [1]. Group 2: Government Support and AI Ecosystem - The Hong Kong government is advancing AI development through talent acquisition, data management, and application initiatives, with plans to establish the Hong Kong AI Research Institute (AIRDI) by 2026 [1]. - Over 1,000 experts in AI and robotics have been gathered in Hong Kong, highlighting the region's commitment to fostering AI growth [1]. Group 3: Addressing AI Challenges - PAAI aims to tackle issues such as data centralization and privacy concerns, promoting a shift from centralized to decentralized AI models [2]. - The concept of "Federated Learning" is introduced as a secure, decentralized approach to AI development, enabling data flow across sectors like healthcare and finance without compromising privacy [2]. Group 4: Technological Innovations and Collaborations - PAAI is focusing on collaborative generative AI, federated learning, and edge-based models, with applications in healthcare, education, finance, and robotics [2]. - The institute is collaborating with medical institutions on the "Cancer GenAI" project and exploring AI applications in infectious disease control and financial sectors [2][3]. Group 5: Co-GenAI Project and Funding - The "Co-GenAI" project aims to develop decentralized AI technologies, addressing the limitations of centralized AI in accessing private data in high-end fields like healthcare [3]. - PAAI has received dual funding from the Hong Kong government's RAISE+ and TRS programs to support the "Cancer GenAI" project and has launched a blockchain-based global research collaboration platform [3].