MiMo
Search documents
鏖战2025年,大模型围着开源转
3 6 Ke· 2025-12-25 10:29
Core Viewpoint - By 2025, open-source will dominate the landscape of large models, with a significant increase in the number of users adopting open-source models globally, marking a shift in the competitive dynamics between open and closed-source approaches [1][20]. Group 1: Open-Source vs Closed-Source Dynamics - The debate between open-source and closed-source large models has been ongoing, with both sides presenting strong arguments, but the trend is shifting towards open-source as more major internet companies adopt this approach [1][5]. - Closed-source models, initially seen as the only viable path due to advantages in data security and commercial monetization, are now facing challenges in areas like AI accessibility and ecosystem development [3][10]. - The emergence of open-source models has created a new competitive landscape, with companies like Meta and Alibaba leading the charge in open-source initiatives [5][10]. Group 2: Impact of DeepSeek - The introduction of DeepSeek has significantly altered the competitive balance, demonstrating that open-source models can achieve high performance at lower costs, thus attracting more companies to switch to open-source strategies [7][20]. - DeepSeek's training cost was approximately $294,000, with a training duration of about 80 hours, showcasing a more efficient approach compared to traditional methods [7]. - Open-source models like DeepSeek and Qwen have reportedly matched or even surpassed the performance of leading international products, shifting the focus of competition from pure performance to cost, efficiency, and commercialization capabilities [8][20]. Group 3: Market Trends and User Engagement - The AI application market is rapidly evolving, with mobile and PC active user numbers reaching 729 million and 200 million respectively by September 2025, indicating a shift towards more specialized and efficient applications [11][13]. - Open-source models are seen as the quickest path to market, fostering a collaborative ecosystem that enhances user engagement and accelerates innovation [13][14]. - Companies are increasingly recognizing the long-term commercial value of high user engagement within open-source ecosystems, leading to a competitive race among internet giants to provide comprehensive open-source solutions [15][19]. Group 4: Commercialization of Open-Source - Open-source does not equate to free; companies are exploring various monetization strategies, including enterprise versions, commercial APIs, and cloud services, to sustain their open-source initiatives [18][19]. - Alibaba has open-sourced over 300 models, generating more than 170,000 derivative models, positioning itself as a leader in the global open-source model landscape [16]. - Baidu is integrating its self-developed Kunlun chips with open-source models, adopting a full-stack autonomous approach to enhance its competitive edge [17].
“天才少女”罗福莉亮相背后:曾被雷军亲自点将,能成小米新王牌?
Sou Hu Cai Jing· 2025-12-18 12:26
Core Insights - The core focus of the article is on the public debut of Xiaomi's MiMo model leader, Luo Fuli, who emphasizes the need for next-generation intelligent agents to transition from merely answering questions to completing tasks and interacting with the world [2][4]. Group 1: Luo Fuli's Background and Role - Luo Fuli, born in 1995 in Yibin, Sichuan, has a strong academic background in computer science and computational linguistics, having published eight papers at the ACL conference in 2019 [1]. - After working at Alibaba's Damo Academy and DeepSeek, Luo Fuli joined Xiaomi, reportedly at a salary of tens of millions, to lead large model research [1][2]. Group 2: Xiaomi's AI Developments - Xiaomi has been rapidly advancing its model releases throughout the year, including MiMo, MiMo-VL, MiMo-Audio, and Miloco, with the AIoT platform connecting over 1.04 billion devices globally by Q3 [2][5]. - The introduction of the MiMo-V2-Flash model, with 309 billion total parameters and 15 billion active parameters, aims to achieve low-cost, high-speed performance and initial world simulation capabilities [5][6]. Group 3: Focus on Physical AI - Luo Fuli's research direction at Xiaomi includes a strong interest in "Physical AI," which involves models that understand and interact with the real world through movement skills, as seen in robotics and autonomous vehicles [4][6]. - The MiMo-Embodied model aims to bridge the fields of autonomous driving and embodied intelligence, indicating a strategic focus on integrating AI into physical environments [4][5]. Group 4: Smart Home Ecosystem - Xiaomi's Miloco initiative seeks to transform smart home devices from passive responders to proactive service providers, enhancing user interaction through natural language processing and real-time visual data [6][11]. - The Miloco system aims to eliminate the need for complex manual settings by allowing users to express needs verbally, thus shifting from a fragmented app-based control to a more intuitive, user-intent-driven experience [7][12].
“天才少女”罗福莉走向台前
Hua Er Jie Jian Wen· 2025-12-17 12:35
Core Insights - The article highlights the ambitious plans of Xiaomi in the AI era, particularly through the introduction of the MiMo model led by the young scientist Luo Fuli, who emphasizes a shift from traditional hardware to intelligent services [2][10] - Xiaomi's strategy involves a significant investment of 200 billion yuan over the next five years to enhance its research and development capabilities, aiming to secure its position in the evolving tech landscape [2][10] Group 1: Xiaomi's AI Strategy - Luo Fuli's presence at the Xiaomi Partner Conference signifies a strategic shift towards AI, with a focus on developing the MiMo-V2-Flash model, which aims to integrate AI more closely with physical interactions rather than just language processing [2][5] - The MiMo-V2-Flash model utilizes a unique architecture that activates only a fraction of its total parameters during operation, allowing it to be lightweight enough for mobile and automotive applications, achieving three times the inference speed of competitors while being significantly more cost-effective [5][10] - Xiaomi's approach is to create a "virtual universe" that interacts with the physical world, moving beyond traditional chatbots to develop AI that understands and responds to real-world conditions [5][10] Group 2: Industry Context and Challenges - The AI industry is experiencing a shift from a focus on scaling models to a more research-oriented approach, as the marginal returns from simply increasing computational power are diminishing [8][9] - Competitors in the AI space are increasingly seeking hardware integration to enhance their models, indicating a trend where software giants are looking to establish a physical presence to interact with the real world [9][10] - Xiaomi's existing infrastructure, with its vast IoT ecosystem connecting 1.04 billion devices, positions it uniquely to leverage AI for smart services, but it must ensure that its models are competitive to retain user loyalty [10][11]
China's Xiaomi says returns from AI investments 'far exceed expectations'
Yahoo Finance· 2025-12-05 09:30
Core Insights - Xiaomi's investment in artificial intelligence (AI) has yielded returns in 2025 that significantly exceeded expectations, according to the company's president Lu Weibing [1] - The company is shifting focus towards embodied AI after substantial investments in general AI over recent quarters, paralleling strategies seen in companies like Tesla [2] - Xiaomi's advancements in AI large models and applications have surpassed initial expectations, with a belief that the integration of AI with the physical world represents the next generation of intelligent technology [3] AI Developments - Xiaomi launched its first AI model, MiMo, in April and has recently open-sourced MiMo-Embodied, which showcases advanced performance in autonomous driving and embodied AI tasks [4] - The company has seen increased interest in AI applications within the electric vehicle (EV) sector, highlighted by the introduction of its premium SU7 Ultra, which features Hyper-Autonomous Driving technology [5] - There is a growing interest in embodied AI, leading Xiaomi to enhance its investments in robotics, following the introduction of its robot dog in 2021 and a humanoid prototype in 2022 [6] Talent Acquisition - Xiaomi is actively seeking talent to bolster its AI initiatives, recently hiring Luo Fuli, a former researcher from DeepSeek, to lead the MiMo team [7] Financial Performance - Xiaomi's smart EV and AI initiatives turned a profit for the first time in Q3, generating a record revenue of 29 billion yuan (approximately US$4.1 billion), marking a 199% year-on-year increase [8] - The company's total revenue for the quarter rose by 22% year-on-year to 113.1 billion yuan [8]
罗福莉官宣后,小米放出首个AI大招,10亿IoT设备一键接入大模型
3 6 Ke· 2025-11-14 11:16
Core Insights - Xiaomi has launched its first "large model + smart home" solution called Xiaomi Miloco, which is a local AI assistant designed to enhance smart home interactions [1][3]. Product Overview - Xiaomi Miloco utilizes the MiMo-VL-Miloco-7B model, which is based on the previously released MiMo model, and connects to various IoT devices in the home [2][11]. - The solution aims to provide a new interaction paradigm through natural language processing, allowing users to communicate with their smart home systems [5][6]. Features and Capabilities - Miloco features a new interaction paradigm that allows users to set rules and control devices using natural language [5]. - It leverages visual data from Xiaomi's cameras to analyze home events and respond to user queries [5]. - The model operates on edge devices, ensuring privacy and security by processing data locally without sending it to external servers [14]. Ecosystem Integration - Miloco connects with the Xiaomi ecosystem and supports integration with third-party IoT platforms, enhancing its functionality [6][9]. - The hardware requirements for deploying Miloco are minimal, needing only x64 architecture and a GPU from NVIDIA's 30 series or higher [6]. Market Context - The launch of Miloco is seen as a significant moment for smart home technology, comparable to the impact of ChatGPT in the AI space [3][14]. - Xiaomi's move follows similar advancements from competitors, indicating a competitive landscape in the smart home sector [14].
罗福莉C位亮相小米,离职DeepSeek后首次官宣
猿大侠· 2025-11-14 04:11
Core Viewpoint - Luo Fuli has officially joined Xiaomi as the head of the MiMo team, focusing on advancing multi-modal spatial intelligence, which is a crucial step towards achieving true Artificial General Intelligence (AGI) [4][24]. Timeline of Events - Rumors about Luo Fuli joining Xiaomi surfaced at the end of last year, with reports indicating that Lei Jun offered her a salary in the millions to lead Xiaomi's AI efforts [5][10]. - Significant milestones include the launch of DeepSeek-V3 on December 25, followed by media reports of Xiaomi assembling a GPU cluster the next day [6][7]. - On December 31, 2024, Lei Jun publicly shared Xiaomi's ambitions in AI during a New Year's live stream [8]. Background of Luo Fuli - Luo Fuli holds a Bachelor's degree in Computer Science from Beijing Normal University and a Master's degree in Computational Linguistics from Peking University, where she has published papers in top NLP conferences [15]. - She has worked at Alibaba's DAMO Academy and later at DeepSeek, contributing to the development of various deep learning models [17]. - Her academic work has garnered over 11,000 citations, with approximately 8,000 citations added in the past year alone [18]. Xiaomi's AI Strategy - The MiMo initiative is central to Xiaomi's efforts in developing large models, with a focus on "spatial intelligence," which aims to bridge the gap between information AI and physical AI [24][26]. - Luo Fuli's role is seen as pivotal in connecting Xiaomi's AI research with academic institutions, particularly with her former mentor from Peking University [22]. Concept of Spatial Intelligence - Spatial intelligence is described as the ultimate goal of integrating information AI with physical AI, facilitating a seamless connection between the digital and physical worlds [26]. - This concept aligns with Xiaomi's broader ecosystem strategy, which encompasses people, vehicles, and home integration [26].
罗福莉C位亮相小米,离职DeepSeek后首次官宣
量子位· 2025-11-12 08:01
Core Insights - Luo Fuli has officially announced her position at Xiaomi, leading the MiMo team to advance the development of multi-modal spatial intelligence, a key step towards achieving Artificial General Intelligence (AGI) [1][3][7] Group 1: Background and Context - Rumors about Luo Fuli joining Xiaomi surfaced at the end of last year, with reports indicating that she was recruited by Lei Jun with a salary of tens of millions [4][10] - Significant events include the launch of DeepSeek-V3 on December 25, followed by media reports of Xiaomi assembling a GPU cluster [5][6] - Luo Fuli's name appeared in Xiaomi's AI team papers as an independent researcher prior to her official announcement [11][20] Group 2: Luo Fuli's Profile - Luo Fuli holds a Bachelor's degree in Computer Science from Beijing Normal University and a Master's degree in Computational Linguistics from Peking University, with numerous publications in top NLP conferences [15][17] - She has over 11,000 citations for her academic papers, with approximately 8,000 citations added in the current year alone [18] - Luo previously worked at Alibaba's DAMO Academy and DeepSeek, contributing to the development of various deep learning models [17] Group 3: Xiaomi's AI Ambitions - Xiaomi aims to enter the deep waters of AI following the establishment of its automotive business, with a focus on spatial intelligence [9][24] - The concept of spatial intelligence, as articulated by Luo Fuli, involves bridging the gap between information AI and physical AI, which aligns with Xiaomi's ecosystem of people, vehicles, and homes [23][25]
官宣!95后「AI天才少女」罗福莉加入小米,雷军终于“挖人”成功
Sou Hu Cai Jing· 2025-11-12 07:43
Core Insights - The article highlights the rise of Luo Fuli, a talented AI researcher, who gained significant attention after her involvement in the successful development of the DeepSeek-V2 model, which is recognized as a leading Chinese AI model [2][3]. Group 1: Luo Fuli's Background and Achievements - Luo Fuli, known as a "genius girl," began her journey in AI while studying computational linguistics at Peking University, where she published eight papers at the prestigious ACL conference in 2019 [2]. - Her notable contributions to the DeepSeek-V2 model, which offers high cost-effectiveness at 1 yuan per million input tokens, positioned her as a key figure in the AI community [2]. Group 2: Transition to Xiaomi - Reports indicate that Luo Fuli left DeepSeek in February 2025, and her name appeared in a paper co-authored by Xiaomi's AI team and Peking University in October 2025, suggesting her transition to Xiaomi [5]. - Xiaomi's acquisition of Luo Fuli is seen as a strategic move, as the company is building a robust AI ecosystem, including the MiMo model and a GPU cluster, which can leverage her expertise [6]. Group 3: Talent Competition in AI - The article emphasizes the intense competition for top AI talent, with companies vying for individuals capable of developing practical AI products [6]. - Luo Fuli's rise to prominence reflects the scarcity of elite AI professionals, making her a highly sought-after asset in the industry [6]. Group 4: Personal Attributes and Work Ethic - Despite her accolades, Luo Fuli maintains a humble approach, focusing on technical challenges and expressing a desire to work quietly on meaningful projects [8].
监督学习未死,一题训练五小时起飞!华人学者新方法20倍训练效率释放大模型推理能力
量子位· 2025-08-04 07:00
Core Viewpoint - The article discusses the breakthrough of One-Shot Critique Fine-Tuning (One-Shot CFT) in enhancing reasoning capabilities of large language models (LLMs) with minimal data and computational resources, outperforming traditional reinforcement learning (RL) methods and small-scale supervised fine-tuning (SFT) approaches [1][3][14]. Group 1: One-Shot CFT Methodology - One-Shot CFT is a new method that allows models to learn reasoning by analyzing the quality of answers rather than merely imitating them, thus providing a deeper learning signal [3][12]. - The process involves selecting a representative task, generating multiple answers using various models, and then having a more powerful model critique these answers, which serves as the supervision signal for training [4][5]. - The entire training process requires only one question, multiple answers, and critiques, taking approximately 5 GPU hours, significantly less than RL methods [5][14]. Group 2: Performance and Results - In experiments, Qwen2.5-Math-7B achieved a 15% accuracy increase after One-Shot CFT fine-tuning on a single question, surpassing both RL and full supervised fine-tuning models that used tens of thousands of training samples [9][10]. - The method demonstrated strong performance across various mathematical and logical reasoning tasks, with accuracy improvements ranging from 10% to 16% in specific sub-tasks [10][11]. - One-Shot CFT showed stability and reproducibility across different tasks and model configurations, indicating its robustness [11][13]. Group 3: Advantages of One-Shot CFT - The method emphasizes critical learning, allowing models to understand why answers are correct or incorrect, which enhances the depth of learning compared to traditional SFT [12]. - It introduces multi-perspective inputs by generating multiple answers and critiques for a single task, closely mimicking human learning processes [12]. - The training signals from critiques are highly generalizable, reducing the risk of overfitting and allowing for easier transfer to new tasks [12]. Group 4: Accessibility and Practical Implications - One-Shot CFT's low computational cost makes it accessible for individual researchers, resource-limited labs, and startups, providing a cost-effective solution for enhancing reasoning capabilities [14][15]. - The entire process is open-source, including training scripts, model parameters, and datasets, which significantly lowers the barrier for replication and experimentation [17].
苹果Meta狂抓AI,抢人并购
Hu Xiu· 2025-06-23 23:27
Core Insights - Apple and Meta are intensifying their efforts in AI, realizing its potential to disrupt device experiences and advertising models [1][2] - Both companies face challenges in talent acquisition and strategic direction, risking marginalization in the AI landscape [3][12] Group 1: AI Competition and Acquisitions - Apple and Meta are competing against AI giants like Microsoft, Amazon, Google, and OpenAI, with significant valuations for potential acquisition targets such as Perplexity at $14 billion and Thinking Machines Lab at $10 billion [2][23] - Meta has acquired nearly half of Scale AI for $14.3 billion and is considering other acquisitions like SSI, valued at $32 billion, and several other AI companies with valuations ranging from $4.5 billion to $62 billion [2][21] Group 2: Strategic Challenges - Both companies are struggling with a lack of direction and talent, leading to confusion in strategic execution [3][12] - Apple has not delivered substantial AI innovations at its recent developer conference, raising concerns about its future in the AI ecosystem [6][13] Group 3: Market Position and Threats - Apple is losing its dominance in the smartphone market, with competitors like Huawei and Xiaomi advancing rapidly in AI capabilities [8][22] - Google is solidifying its position in AI search and video, posing a direct threat to Meta's advertising market, particularly in short videos [7][10] Group 4: Talent Acquisition Efforts - Zuckerberg is actively recruiting top talent in AI, emphasizing the importance of building a strong team to drive Meta's AI initiatives [15][18] - Apple is also seeking to enhance its AI capabilities by potentially acquiring or collaborating with companies like Mistral and Thinking Machines Lab [19][21] Group 5: Future Outlook - The competition for AI talent and technology is intensifying, with both Apple and Meta needing to adapt quickly to avoid being left behind [12][23] - The ongoing mergers and acquisitions in Silicon Valley signal a new wave of consolidation in the AI sector, with both companies needing to act decisively [23]