Workflow
端侧模型
icon
Search documents
全国首个行政复议垂直大模型亮相,面壁智能“掘金”政务数智化市场|聚焦2025服贸会
Hua Xia Shi Bao· 2025-09-12 00:45
9月10日,本报记者来到首钢园3号馆,这里也是2025服贸会"电信、计算机和信息服务专题展"所在场 馆,而北京方案展是这次"电信、计算机和信息服务专题展"的主要看点之一。 本报记者从现场工作人员处了解到,此次北京方案展主要围绕"促进城市精细化治理、服务人民美好生 活"这一主线,创新打造"1+1+9"的一体化展示架构,即1个数智化算力底座,1个北京市政务云,联动9 大应用方案,全面呈现数字技术赋能城市发展的创新实践,重点突出"示范引领""沉浸体验""交易促 进"三个特点,通过真实场景模拟,深度还原数字技术应用的标杆价值,推动优秀数字服务"北京方 案"走向全国乃至全球。 (石飞月 摄影) 本报(chinatimes.net.cn)记者石飞月 北京报道 9月10日-14日,2025年中国国际服务贸易交易会(下称"服贸会")在北京首钢园举办,北京方案展也在 同期展陈。此次方案展的核心亮点之一,是面壁智能基于北京市行政复议实际办案场景、自主研发的全 国首个深度融合业务全流程的行政复议垂直大模型。 面壁智能自成立以来便一直走差异化路线,如端侧模型、小模型,也在不同领域迎来了商业化落地,此 次展示的行政复议垂直大模型也属于 ...
苹果看上的公司,靠量子“邪修”给模型“瘦身”
虎嗅APP· 2025-09-02 14:00
Core Viewpoint - The article discusses the rise of Multiverse Computing, a Spanish AI startup that has developed a compression technology called CompactifAI, which significantly reduces the size of AI models while maintaining performance, positioning itself as a leader in the AI efficiency race amidst growing competition from tech giants and startups alike [6][10][22]. Summary by Sections Company Background - Multiverse Computing was founded in 2019, initially focusing on quantum computing software for financial applications. It quickly gained recognition and funding, being named a "Cool Vendor" by Gartner, which is a prestigious acknowledgment in the tech innovation space [9]. - The company has completed five rounds of financing, with its valuation increasing from $108 million in 2024 to $500 million in 2025, making it one of the largest AI startups in Spain [6][8]. Technology Development - The company pivoted to AI model compression in 2023, leveraging its expertise in quantum tensor networks to address the rising computational costs associated with large AI models. This led to the development of CompactifAI, which can compress model sizes by 80-95% with minimal performance loss [10][13]. - The newly launched models, "SuperFly" and "ChickBrain," are touted as the smallest and highest-performing models, with SuperFly having 94 million parameters and ChickBrain having 3.2 billion parameters [15][17]. Market Position and Competition - Multiverse's technology has attracted interest from major hardware companies like Apple, Samsung, and Sony, aiming to integrate its small models into next-generation devices. This aligns with Apple's strategy to prioritize lightweight local models over large, general-purpose models [22]. - The competitive landscape is intensifying, with tech giants like Meta, Google, and Microsoft also entering the small model space, alongside startups like Neural Magic and Deci, all targeting improved AI performance and cost efficiency [21][23]. Business Model and Applications - Multiverse offers three commercial service models: API access to compressed models, private deployment licenses, and model compression services for clients. Its primary customers include large internet and software companies utilizing AI for various applications [17][18]. - The CompactifAI technology allows for significant cost savings in AI deployment, reducing inference costs by 50-80% and enabling models to run on less powerful hardware, thus broadening accessibility [20][17].
1年涨五倍,被苹果看上的“模型瘦身”公司靠谱吗?
Hu Xiu· 2025-09-02 05:21
Core Insights - Multiverse Computing has developed a technology called CompactifAI that can compress large AI models by 80-95% while maintaining performance, allowing these models to run on devices like smartphones and cars [1][6][11] - The company has seen significant financial growth, with its valuation increasing from $108 million in 2024 to $500 million, making it one of the largest AI startups in Spain [2][4] - The rise of generative AI has led to increased demand for efficient model compression solutions, positioning Multiverse favorably in a competitive landscape [6][19] Company Overview - Founded in 2019, Multiverse initially focused on quantum computing software for financial applications before pivoting to AI model compression [5][6] - The team consists of highly qualified individuals, with 40% holding PhDs and expertise spanning finance, quantum physics, and technology entrepreneurship [5] Technology and Innovation - CompactifAI utilizes quantum tensor network techniques to efficiently compress model parameters, which is distinct from traditional methods like quantization and distillation [8][10] - The compressed models, such as "SuperFly" and "ChickBrain," have significantly reduced parameter counts while retaining performance, making them suitable for various applications [12][13][16] Market Position and Competition - Multiverse's technology has attracted interest from major hardware companies like Apple and Samsung, aiming to integrate their models into next-generation devices [19] - The competitive landscape is intensifying, with tech giants and startups alike entering the AI efficiency space, focusing on model acceleration and optimization [20][21] Business Model and Services - Multiverse offers three commercial service models: API access to compressed models, private deployment licenses, and model compression services for clients [16][17] - The cost savings from using CompactifAI are substantial, with reduced inference costs and improved processing speeds, making it appealing to enterprises using large models [16][18]
面壁智能成立汽车业务线,与吉利、长安等车企合作AI座舱
Nan Fang Du Shi Bao· 2025-08-16 13:22
Core Insights - The commercialization of large models is a key focus for 2023, with significant investments in automotive, mobile, and robotics sectors [1] - The automotive sector is emerging as a primary battleground for edge intelligence, with multi-modal large models redefining smart vehicle interactions [5] Company Developments - Mianbi Intelligent has elevated the importance of automotive applications by establishing a dedicated automotive business line to enhance the deployment of its MiniCPM edge models [1] - Mianbi Intelligent, founded in August 2022, has developed a complete series of MiniCPM edge models, including the influential V2.5 and V2.6 models, which have gained global recognition [1] - The company has recently open-sourced its fastest MiniCPM 4.0 models, with plans for additional edge models to be released in the second half of the year [1] Industry Trends - The consensus in the industry is shifting towards the advantages of edge models and "edge-cloud collaboration," prompting more large model manufacturers to focus on edge solutions [2] - The integration of edge models in vehicles allows for full functionality and rapid response even in offline environments, enhancing user privacy [5] - The automotive industry is witnessing a surge in collaborations, with Mianbi Intelligent partnering with major automakers like Geely, Volkswagen, Changan, Great Wall, and GAC to develop next-generation AI cockpit systems [5] Product Launches - The Changan Mazda strategic new energy vehicle MAZDA EZ-60, equipped with Mianbi's edge models, is set to launch by the end of the month [4][5]
面壁智能CEO发全员信:成立汽车业务线、让端侧模型更多上“车”
Core Insights - The company, Mianbi Intelligent, has undergone an organizational restructuring to enhance its automotive business line, aiming for a "pressure-type" breakthrough in applying its MiniCPM edge model to more vehicles [1][3] - The CEO, Li Dahai, emphasized that the commercialization of large models is a key focus for 2023, with significant partnerships established with major automotive brands [1][2] - Mianbi's MiniCPM edge model has seen over 13 million downloads since its launch, indicating strong market interest [2] Company Developments - Mianbi Intelligent was founded on the technological achievements of Tsinghua University's NLP lab and has strategically focused on small parameter and edge models [2] - The company plans to launch its first mass-produced vehicle, the Changan Mazda EZ-60, by the end of August [1] - The restructuring elevates the automotive business line's importance within the organization, reflecting a commitment to this sector [3] Industry Context - Major tech companies like Baidu, Tencent, and Alibaba are also entering the automotive edge model space, indicating a competitive landscape [4] - The automotive sector is seen as a critical battleground for large models, with AI models expected to enhance user interaction and potentially integrate with autonomous driving technologies [4] - The establishment of a dedicated automotive business line by Mianbi is viewed as a proactive move, with expectations that other AI model companies will follow suit [4]
面壁李大海谈端侧模型竞争:元年开启,巨头涌入印证前景无限可能
Huan Qiu Wang· 2025-08-15 07:48
Core Insights - The CEO of Mianbi Intelligent, Li Dahai, announced that 2025 will mark the "Year of Edge Intelligence," indicating a significant opportunity in the market as it is still in its formative stages [1] - The industry consensus is shifting towards the advantages of edge models and "edge-cloud collaboration," with major players increasingly focusing on edge technology [1] - Mianbi Intelligent aims to establish commercial advantages quickly while maintaining a balance between technology and user value, emphasizing the need for differentiated user experiences that cloud models cannot replicate [1] Company Strategy - Mianbi Intelligent's core competitive advantage lies in efficiency, striving for the best performance with minimal resources, which leads to faster and more cost-effective edge model solutions [1] - The company introduced the MiniCPM edge model in early 2024, which has 2.4 billion parameters, surpassing the Mistral 7B model, and has achieved over 13 million downloads [2] - The MiniCPM model has been successfully integrated with major chip manufacturers like Qualcomm, NVIDIA, MTK, Intel, Huawei, and Rockchip, and is particularly noted for its application in smart automotive human-machine interaction [2] Market Dynamics - The influx of new entrants into the market is seen as validation of Mianbi Intelligent's strategic choices and the potential for accelerated market growth [1] - The company has established a dedicated automotive business line to promote the widespread adoption of the MiniCPM model in vehicles [2]
面壁智能成立汽车业务线,首款MiniCPM车型月底上市
Mei Ri Jing Ji Xin Wen· 2025-08-15 07:45
Core Viewpoint - The company, Mianbi Intelligent, has undergone an organizational upgrade to establish a dedicated automotive business line, indicating a strategic focus on the automotive sector [1] Group 1: Organizational Changes - In late July, Mianbi Intelligent initiated a new round of organizational upgrades, creating a primary organization specifically for the automotive business line [1] - The CEO, Li Dahai, communicated these changes through a company-wide letter [1] Group 2: Partnerships and Collaborations - Mianbi Intelligent has formed partnerships with major automotive manufacturers including Geely, Volkswagen, Changan, Great Wall, and GAC [1] Group 3: Product Launch - The first mass-produced vehicle equipped with Mianbi's MiniCPM edge model, the Changan Mazda strategic new energy vehicle MAZDA EZ-60, is expected to be launched by the end of this month [1]
Qwen紧追OpenAI开源4B端侧大模型,AIME25得分超越Claude 4 Opus
量子位· 2025-08-07 00:56
Core Insights - The Qwen team has released two new models, Qwen3-4B-Instruct-2507 and Qwen3-4B-Thinking-2507, which are designed to enhance performance on various tasks, particularly in reasoning and general capabilities [2][3][5]. Model Performance - Qwen3-4B-Thinking-2507 achieved a score of 81.3 in the AIME25 assessment, outperforming competitors like Gemini 2.5 Pro and Claude 4 Opus [4][5][23]. - The new models support a context length of 256k, significantly improving context awareness and understanding [3][17]. Model Specifications - Qwen3-4B-Instruct-2507 is a non-reasoning model that enhances general capabilities and multi-language support, while Qwen3-4B-Thinking-2507 is a reasoning model tailored for expert-level tasks [7][16]. - The 4B parameter size is particularly friendly for edge devices, allowing for deployment on small hardware like Raspberry Pi [2][8][26]. Comparative Analysis - In various tests, Qwen3-4B-Instruct-2507 outperformed smaller closed-source models like GPT-4.1-nano and showed comparable performance to larger models like Qwen3-30B-A3B [13][15]. - The models exhibit significant improvements in areas such as instruction following, logical reasoning, and text generation, with enhanced alignment to user preferences [17][24]. Deployment Recommendations - The Qwen team has provided deployment suggestions for local use, including applications like Ollama and MLX-LM, and recommended using a quantized version for very small devices [27][28]. - For optimal performance, especially in reasoning tasks, it is advised to use a context length greater than 131,072 tokens [29]. Community Engagement - The Qwen team has encouraged community feedback and interaction, with links provided for accessing the new models on platforms like Hugging Face and ModelScope [26][36].
华泰证券|机器人产业跟踪
2025-06-30 01:02
Summary of Key Points from the Conference Call Industry Overview - The conference call primarily discusses the **robotics industry** and **Xpeng Motors**' advancements in this sector, particularly in AI robotics and related technologies [1][2][15]. Core Insights and Arguments - **Xpeng Motors** is rapidly advancing in the robotics field, with self-developed software and leading autonomous driving technology. The company is expected to mass-produce ToB (business-to-business) robots by 2026, utilizing innovative hardware technologies such as screw drives, high degrees of freedom in hands, and axial flux motors [1][2]. - The **2025 Shanghai Auto Show** saw a decrease in foot traffic and vehicle models compared to previous years. Domestic brands are significantly iterating in new energy and intelligence, surpassing joint venture brands. Traditional domestic automakers are outperforming new entrants in terms of new model quantity and quality, with a recovery expected in the mid-to-large SUV and MPV markets [1][5]. - There is an increasing market focus on the softer segments of the robotics industry, including operating systems, SoC chips, and large model advancements. Progress has been noted in end-side models based on the DeepSeek open-source model [1][6][7]. - **SoC companies** in the robotics sector reported impressive Q1 2025 results, with revenue and net profit significantly increasing, driven by AI-driven demand for system-level chips. Companies like Rockchip are launching new products and planning next-generation releases, indicating substantial profit elasticity [1][8]. - The **MCU analog chip market** is showing signs of recovery, with increased demand from industrial sectors and potential growth driven by robotics. The domestic market is accelerating the localization replacement cycle, which is expected to enhance traditional demand growth [1][9]. Additional Important Insights - **Tesla** has made significant moves, including the release of new products and a visit to domestic suppliers, indicating a commitment to advancing its localization replacement chain, which could positively impact related companies [1][11]. - The **T-chain industry** is witnessing notable changes, with companies like Rongtai showing advantages in lightweight structural components and micro-screw technology. This sector is becoming clearer as demand for micro-screw products increases [1][12]. - The **demand for humanoid robot screw equipment** is robust, with domestic machine tool companies receiving substantial orders, although supply is currently insufficient to meet demand [1][17]. - There are significant differences in pricing and technology between domestic and international humanoid robot machining equipment, with domestic prices generally lower, leading to a preference for local machines for rapid prototyping [1][18]. - The **production efficiency** of specialized machining methods is improving, with new techniques reducing production time significantly compared to traditional grinding methods [1][19][20]. - The future development trends for humanoid robot screw equipment indicate a strong commitment to improving machining processes, although challenges remain in fully replacing traditional methods [1][21].
长文本推理 5 倍提速!面壁MiniCPM4 端侧模型发布,0.5B模型效果秒杀同级
AI前线· 2025-06-12 06:07
Core Viewpoint - The newly released MiniCPM4.0 model series, featuring 8B and 0.5B parameter scales, significantly enhances edge-side performance and adaptability for various terminal scenarios [1][6]. Model Performance - MiniCPM4.0-8B is the first native sparse model with a 5% sparsity, achieving performance comparable to Qwen-3-8B while using only 22% of the training cost [2][4]. - In benchmark tests like MMLU, CEval, and HumanEval, MiniCPM4.0-0.5B outperforms similar models such as Qwen-3-0.6B and Llama 3.2, achieving a rapid inference speed of 600 Token/s [4][6]. Technological Innovations - The model employs a new context-sparse architecture that allows for a 5x speed increase in long text inference and up to 220x in memory-constrained scenarios [6][8]. - MiniCPM4.0 reduces long text cache requirements to just 1/4 of that needed by Qwen3-8B, achieving a 90% model size reduction while maintaining robust performance [8][10]. Model Architecture - The InfLLMv2 sparse attention architecture allows for efficient "sampling" of relevant text segments, reducing computational costs by 90% compared to traditional models [14][15]. - The model features a dual-frequency switching mechanism that optimizes attention modes for long and short texts, enhancing efficiency and accuracy [17]. Deployment and Adaptation - MiniCPM4.0 has been adapted for major chip platforms including Intel, Qualcomm, and Huawei Ascend, and supports various open-source frameworks [10][24]. - The ArkInfer cross-platform deployment framework addresses the challenges of chip fragmentation, providing a versatile solution for model deployment [25]. Data and Training Innovations - The company utilizes a high-density data selection mechanism to construct high-quality datasets, achieving a 90% reduction in validation costs [28][29]. - The training strategy incorporates advanced techniques like FP8 training and chunk-wise rollout to optimize GPU resource utilization [30].