小米MiMo
Search documents
全网都在扒的小米MiMo团队,几乎被“北大学子”承包了
量子位· 2026-03-20 00:18
Core Insights - Xiaomi's MiMo team has rapidly ascended to the forefront of global large model development, achieving significant milestones in less than a year since the launch of its first inference model, MiMo-7B [5][40] - The team's success is attributed to a combination of strong academic backgrounds, particularly from Peking University, and a product-driven approach that emphasizes cost-effectiveness and an internet ecosystem mindset [48][46] Team Dynamics - Xiaomi's MiMo team operates with a high-performance culture, where team members are expected to engage in a minimum of 100 dialogues daily, reflecting a commitment to productivity [1] - The team has garnered attention not only for its model performance but also for its rapid pace of product and research output, which has kept the public and industry stakeholders engaged [12][3] Key Contributors - The core contributors to the MiMo-7B model include notable figures such as Bingquan Xia, Bowen Shen, and Dawei Zhu, many of whom have strong ties to Peking University [14][40] - The team is characterized by a high concentration of members with academic backgrounds from Peking University, which fosters a collaborative environment and facilitates the transition from research to practical applications [41][42] Technical Philosophy - The MiMo team's technical philosophy is heavily influenced by Xiaomi's corporate culture, focusing on delivering high-performance models with a clear strategy for open-source deployment and edge computing [46][47] - The emphasis on a 7 billion parameter model and a commitment to open-source strategies reflect Xiaomi's strategic positioning in the AI landscape [47] Industry Context - In contrast to Xiaomi's rapid advancements, competitors like Meta's superintelligence lab have faced challenges, including delays and underperformance of their models, highlighting the competitive dynamics in the AI model development space [7][8] - The emergence of Xiaomi's MiMo team as a key player in the industry raises questions about the factors contributing to its swift rise and the potential implications for the broader AI ecosystem [8][40]
DeepSeek前骨干罗福莉C位亮相小米,曾网传雷军千万年薪挖她
程序员的那些事· 2025-11-13 11:24
Core Insights - Luo Fuli has officially joined Xiaomi as the head of the MiMo team, marking a significant step in the company's AI ambitions [1][3] - The evolution of intelligence is transitioning from the language domain to the physical world, aiming to unlock multi-modal spatial intelligence, which is crucial for achieving true Artificial General Intelligence (AGI) [4] Timeline and Background - Rumors about Luo Fuli joining Xiaomi surfaced last year, with reports indicating she was recruited by Lei Jun with a salary of tens of millions [5][10] - Key dates include the launch of DeepSeek-V3 on December 25, followed by media reports of Xiaomi assembling a GPU cluster [6][7] - On December 31, Lei Jun publicly shared Xiaomi's ambitions in AI during a New Year's live stream [8] Luo Fuli's Profile - Luo Fuli holds a Bachelor's degree in Computer Science from Beijing Normal University and a Master's in Computational Linguistics from Peking University, with numerous publications in top NLP conferences [15] - She has worked at Alibaba's DAMO Academy and DeepSeek, contributing to the development of various deep learning models [17] - Her academic work has garnered over 11,000 citations, with approximately 8,000 citations added in the past year alone [18] Xiaomi's AI Strategy - The MiMo project is central to Xiaomi's efforts in advancing large model research, with a focus on spatial intelligence [24] - Spatial intelligence aims to bridge the gap between information AI and physical AI, aligning with Xiaomi's ecosystem of people, vehicles, and homes [26]
罗福莉C位亮相小米,离职DeepSeek后首次官宣
36氪· 2025-11-13 10:26
Core Viewpoint - The article highlights the appointment of Luo Fuli as the head of Xiaomi's MiMo team, focusing on advancing spatial intelligence as a key step towards achieving Artificial General Intelligence (AGI) [1][3][23]. Group 1: Appointment and Background - Luo Fuli officially announced her role at Xiaomi on November 12, leading the MiMo team [1]. - She previously worked at DeepSeek and was reportedly recruited by Lei Jun with a salary of tens of millions [4][7]. - Luo has a strong academic background, with over 11,000 citations for her research papers, indicating her prominence in the field [17][18]. Group 2: MiMo Team and Objectives - The MiMo team aims to unlock multi-modal spatial intelligence, which includes perception, reasoning, generation, and action capabilities [4][23]. - Luo's vision aligns with the broader goal of integrating information AI with physical AI, creating a seamless connection between the digital and physical worlds [25]. Group 3: Industry Context and Implications - The concept of spatial intelligence has gained attention, with AI experts like Fei-Fei Li discussing its significance for embodied intelligence and AGI [24]. - Xiaomi's focus on spatial intelligence is seen as a strategic move, leveraging its ecosystem that includes people, vehicles, and homes [25].
中国电子:国产开源模型千帆竞发,阿里 Qwen-3、小米 MiMo、DeepSeek Prover 集中发布
Haitong Securities International· 2025-04-30 15:15
Investment Rating - The report indicates that Alibaba's Qwen currently ranks at the top of the open-source model rankings, with expectations for continued leadership in model capability and ecosystem monetization [2]. Core Insights - The report highlights a surge in domestic open-source models, with significant releases from Alibaba, Xiaomi, and DeepSeek, showcasing advancements in large language models (LLMs) [1][8]. - Alibaba's Qwen-3 series demonstrates substantial performance improvements, achieving 10-30% accuracy gains on various benchmarks and enhancing inference speed by 20-40% [9][12]. - Xiaomi's MiMo model, with 7 billion parameters, excels in reasoning and code generation tasks, outperforming larger proprietary models through innovative training strategies [10][12]. - DeepSeek's Prover-V2-671B model shows strong performance in formal logic reasoning, indicating a strategic focus on specialized AI applications [11][12]. - The report anticipates that as more domestic models are released, the industry may face challenges related to homogenization and competition, pushing for more customized solutions in vertical industries [5]. Summary by Sections Alibaba Qwen-3 - The Qwen-3 series includes models ranging from 1.5 billion to 72 billion parameters, designed for various inference needs, with notable performance enhancements over previous generations [9]. - Deployment costs are significantly lower, requiring only 4 H20 GPUs for full-capacity operation, which is advantageous compared to similar models from OpenAI and Grok [2][12]. Xiaomi MiMo - MiMo's training involved 25 trillion tokens and innovative mechanisms to improve training efficiency, achieving a 2.29x increase in training speed and a 1.96x acceleration in verification processes [10]. DeepSeek-Prover-V2-671B - This model excels in mathematical theorem proving, particularly in formal logic, and serves as a precursor to DeepSeek's upcoming models, reflecting the company's commitment to advancing AI capabilities [11]. Industry Trends - The report suggests that the next phase for open-source models will involve customization based on user data and feedback, aiming to establish long-term barriers and user loyalty in specific industries [5].