WizardLM

Search documents
微软华人AI团队核心成员被曝加入腾讯混元,知情人称与裁员无关|独家
AI前线· 2025-05-14 08:12
Core Viewpoint - The WizardLM team, including key member Can Xu, has left Microsoft to join Tencent's Hunyuan division, amidst speculation regarding the timing of their departure coinciding with Microsoft's global layoffs [1][2]. Group 1: Team Departure and Background - Can Xu announced his departure from Microsoft, clarifying that it was his personal decision and not the entire WizardLM team [1]. - Most core members of the WizardLM team have reportedly already left Microsoft prior to the announcement, and their departure is not directly related to the layoffs affecting approximately 6,000 employees [2]. - The WizardLM team was established in early 2023, focusing on the development of advanced large language models (LLMs) [4]. Group 2: Team Members and Contributions - Key members of the WizardLM team include Qingfeng Sun and Can Xu, both of whom have significant backgrounds in AI research and have contributed to various projects at Microsoft [5]. - Can Xu has led the development of several models under the WizardLM series, with over 40 papers published in top international conferences and more than 3,300 citations on Google Scholar [5]. Group 3: Model Development and Achievements - The WizardLM team introduced the Evol-Instruct method, which generates diverse instruction data using LLMs, outperforming human-created datasets in evaluations [6][9]. - The WizardLM model has achieved notable performance metrics, including a 97.8% score compared to ChatGPT on the Evol-Instruct test set [10]. - In a ranking of large language models, WizardLM was placed fourth globally, marking it as the top open-source model from a Chinese team [13][14]. Group 4: Tencent's AI Strategy - Tencent has restructured its AI model development framework, focusing on "computing power, algorithms, and data," and plans to invest approximately 124.9 billion USD in AI development this year [24][26]. - The company has established new technical departments dedicated to large language models and multimodal models to enhance its AI capabilities [24][25]. Group 5: Challenges and Community Impact - Following the release of the WizardLM-2 models, Microsoft retracted them due to missing toxicity testing, which has raised concerns within the AI community [19][21]. - The CEO of Hugging Face expressed that Microsoft's actions have negatively impacted various open-source projects and the community at large [21][23].
原微软WizardLM项目团队加入腾讯混元
news flash· 2025-05-14 06:27
Core Insights - The creator of the WizardLM project, Xu Can, announced the team's departure from Microsoft to join Tencent's AI development organization, Hunyuan, with a focus on advancing LLM training technology and building better AI models [1] Company Developments - The WizardLM team consists of six key members, most of whom have left Microsoft to pursue their mission under Tencent [1]
微软这支神秘的华人AI团队加入腾讯混元,曝与裁员无关|独家
AI前线· 2025-05-14 05:47
Core Viewpoint - The WizardLM team, creators of advanced large language models, has transitioned from Microsoft to Tencent's AI development organization, Hunyuan, aiming to enhance LLM training technology and develop superior AI models [1][3][31]. Group 1: Team Transition and Background - The WizardLM team, consisting of six key members, has left Microsoft amid speculation regarding layoffs affecting 3% of the workforce, although their departure is reportedly unrelated to these layoffs [4][6]. - The team was established in early 2023, focusing on the development of advanced large language models, with notable members including Qingfeng Sun and Can Xu, both of whom have significant experience in AI research [7][9][10]. - The team has previously contributed to the development of models such as WizardLM, WizardCoder, and WizardMath, and has published over 40 papers in top international conferences [10][13]. Group 2: Model Development and Achievements - WizardLM has released models that outperform Google's Gemma 3 series and have ranked among the top four global large language models in competitions [3][16]. - The core algorithm, Evol-Instruct, allows for the efficient generation of complex instruction data, leading to superior performance in human evaluations compared to traditional methods [13][14][17]. - The WizardLM-30B model achieved a 97.8% score compared to ChatGPT in specific tests, showcasing its advanced capabilities [14]. Group 3: Tencent's AI Strategy - Tencent has restructured its AI development framework, focusing on "computing power, algorithms, and data," and plans to invest approximately 124.9 billion USD in AI development [28][30]. - The company has established new technical departments dedicated to large language models and multimodal models, aiming to enhance AI capabilities in natural language processing and data integration [28][29]. - Following the acquisition of the WizardLM team, Tencent's ambition in the AI sector is expected to grow, with the team continuing to develop and release AI models [31].