MiniCPM - filings, earnings calls, financial reports, news

MiniCPM

Search documents

Sou Hu Cai Jing· 2025-12-23 18:43

12月17日，北京市新一批生成式人工智能大模型完成国家备案，大模型备案数量突破200款，北京市模型备案数量、产业应用规模两项指标持续保持全国首位。在中央经济工作会议明确"深化拓展'人工智能+'"的指引下，以人工智能大模型为代表的北京AI产业正以技术突破与商业化落地双轮驱动，率先完成从"模型热"到"应用热"的叙事升级，成为"十五五"规划中培育壮大新兴产业和未来产业，推动科技创新和产业创新深度融合的生动实践。纵观北京的科技企业，初创"独角兽"中，12月19日，智谱率先通过港交所聆讯并正式递交招股书，有望以"全球大模型第一股"身份上市，标志着资本市场将首次迎来以AGI基座模型为核心业务的上市公司；月之暗面的K2模型被硅谷投资人誉为"比OpenAI更便宜更强"。面壁智能的端侧模型以24亿参数实现百亿级性能，登顶国际开源榜单。行业"巨头"更是引领风潮，截至9月，百度AI搜索月活用户达3.82亿，在今年连续三个季度登上国内AI搜索行业月活榜首；截至12月，豆包大模型的日均tokens调用量已经超过50万亿，相比去年12月，实现了超过十倍的增速。这些场景，正是北京AI产业蓬勃发展的生动缩影。最新数据显 ...

Artificial Intelligence

Generative AI

Artificial Intelligence

文心大模型

豆包大模型

MiniCPM

Artificial Intelligence

Generative AI

Artificial Intelligence

文心大模型

豆包大模型

MiniCPM

从「密度法则」来看Scaling Law撞墙、模型密度的上限、豆包手机之后端侧想象力......｜DeepTalk回顾

锦秋集· 2025-12-15 04:09

Core Insights - The article discusses the transition from the "Scaling Law" to the "Densing Law," emphasizing the need for sustainable development in AI models as data growth slows and computational costs rise [2][3][15]. - The "Densing Law" indicates that model capability density increases exponentially, with capability density doubling approximately every 3.5 months, while the parameter count and inference costs decrease significantly [11][28]. Group 1: Scaling Law and Its Limitations - The "Scaling Law" has faced challenges due to bottlenecks in training data and computational resources, making it unsustainable to continue increasing model size [15][16]. - The current training data is limited to around 20 trillion tokens, which is insufficient for the expanding needs of model scaling [15]. - The computational resource requirement for larger models is becoming prohibitive, as seen with LLaMA 3, which required 16,000 H100 GPUs for a 405 billion parameter model [16]. Group 2: Introduction of Densing Law - The "Densing Law" proposes that as data, computation, and algorithms evolve together, the density of model capabilities grows exponentially, allowing for more efficient models with fewer parameters [11][28]. - For instance, GPT-3 required over 175 billion parameters, while MiniCPM achieved similar capabilities with only 2.4 billion parameters [24]. Group 3: Implications of Densing Law - The implications of the Densing Law suggest that achieving specific AI capabilities will require exponentially fewer parameters over time, with a notable case being Mistral, which achieved its intelligence level with only 35% of the parameters in four months [32][33]. - Inference costs are also expected to decrease exponentially due to advancements in hardware and algorithms, with costs for similar capabilities dropping significantly over time [36][39]. Group 4: Future Directions and Challenges - The future of AI models will focus on enhancing capability density through a "four-dimensional preparation system," which includes efficient architecture, computation, data quality, and learning processes [49][50]. - The article highlights the importance of high-quality training data and stable environments for post-training data, which are critical for the performance of models in complex tasks [68][70]. Group 5: End-User Applications and Market Trends - By 2026, significant advancements in edge intelligence are anticipated, driven by the need for local processing of private data and the development of high-capacity edge chips [11][45][76]. - The article predicts a surge in edge applications, emphasizing the importance of privacy and personalized experiences in AI deployment [76][77].