MaaS(模型即服务)
Search documents
盘点2025:模型服务,成为基础设施
第一财经· 2025-12-30 10:15
Core Insights - The article emphasizes the rapid growth of the Model as a Service (MaaS) market, with major players like OpenAI, Google Cloud, and Volcano Engine capturing significant market shares by 2025 [1][3] - Volcano Engine has achieved a remarkable daily token call volume of 63 trillion, positioning itself as a leading Chinese player in the AI cloud market [3][6] - The introduction of the Doubao model has led to exponential growth in token usage, highlighting the increasing importance of MaaS as a foundational infrastructure in AI [4][11] Market Dynamics - By October 2025, OpenAI, Google Cloud, and Volcano Engine are projected to hold 65% of the global MaaS market, with respective shares of 31%, 19%, and 15% [1] - Volcano Engine's daily token call volume of 30 trillion places it third globally, following OpenAI and Google Cloud [3] - The MaaS market is still perceived as "thin" and "narrow," indicating potential for further growth and competition [3] Company Performance - Volcano Engine has reported a 100% year-on-year revenue growth, exceeding 20 billion, and has revised its revenue target for 2030 upwards by several percentage points [6] - The company has prioritized MaaS as its strategic focus, leading to significant investments in resources and technology [6][16] - The introduction of the Doubao model API service has drastically reduced pricing, marking a shift from "per count" to "per milligram" pricing, with a reduction of up to 99.3% [6] Technological Advancements - The launch of the DeepSeek-R1 model has further enhanced Volcano Engine's capabilities, allowing it to capitalize on the growing demand for model inference services [7][10] - Continuous iterations of the Doubao model have led to increased token call volumes, with new models being released every three months [10][11] - The company is focusing on optimizing AI application accessibility and cost-effectiveness through advanced tools like Prompt Pilot and Model Router [27][28] Future Outlook - Volcano Engine aims to maintain its leadership in the MaaS market while expanding into deeper industry applications, particularly in sectors like smart manufacturing and consumer electronics [27] - The company is developing a new architecture centered around agents, which will enhance the integration of models into existing workflows [28][30] - The potential market for agents is vast, with estimates suggesting it could significantly expand beyond traditional IT budgets into areas like global customer service and programming [30]
代表字节跳动上春晚的,怎么是火山引擎?
财联社· 2025-12-30 07:24
Core Viewpoint - ByteDance's decision to have Volcano Engine as the exclusive AI cloud partner for the 2026 CCTV Spring Festival Gala marks a strategic shift towards promoting its B2B services rather than its traditional consumer-facing products [1][6][12]. Group 1: Event and Market Positioning - The announcement of Volcano Engine as the partner for the Spring Festival Gala highlights its rapid growth, with daily token usage surpassing 63 trillion, maintaining its position as the leader in AI service usage in China [3][9]. - The event attracted significant attention, with over 5,000 attendees and many more unable to enter, indicating strong interest in AI applications and services [3][4]. - The collaboration signifies a departure from the norm where consumer apps like Douyin (TikTok) were typically featured, showcasing a new focus on B2B capabilities [4][6]. Group 2: Strategic Implications - ByteDance aims to position Volcano Engine as a foundational technology brand that can influence consumer perceptions, similar to how Intel and CATL have established themselves in their respective markets [16][17]. - The partnership is intended to communicate the advanced capabilities of Volcano Engine, emphasizing its role in supporting high traffic and AI applications during major events like the Spring Festival Gala [9][17]. - This strategic move reflects a broader trend in the AI industry where the competition is shifting towards the performance of underlying models and infrastructure rather than just consumer applications [8][10]. Group 3: Future Outlook - The collaboration is expected to enhance user trust in Volcano Engine as a reliable AI service provider, potentially influencing developers' choices for model infrastructure in the future [17]. - ByteDance's internal discussions indicate a recognition of the changing landscape in AI application competition, necessitating a focus on foundational technologies [13][14]. - The growth of Volcano Engine in the MaaS (Model as a Service) sector, with nearly 50% market share in China's public cloud large model market, positions it as a key revenue contributor for ByteDance moving forward [14][16].
一边亏一边冲!智谱MiniMax抢IPO,大模型赚钱难为何还扎堆上市?
Sou Hu Cai Jing· 2025-12-24 08:21
Core Viewpoint - The competition between Zhipu and MiniMax for IPO in Hong Kong reflects a shift in the large model industry from a technical race to a capital test, with both companies aiming to become the first in the market and capitalize on the financial benefits [3][13]. Group 1: Company Performance - Zhipu's revenue is projected to grow from 57.4 million in 2022 to 312.4 million in 2024, representing a compound annual growth rate (CAGR) of 130%, with expectations to double again by 2025 [5]. - The company has a strong backing from prestigious investors, including Hillhouse, Sequoia, Tencent, Alibaba, and Meituan, enhancing its market position [5]. - Zhipu is transitioning from a "heavy asset" model to a "light asset" model, moving towards a Model as a Service (MaaS) approach, which is expected to drive exponential growth [7]. Group 2: Competitive Landscape - MiniMax, another competitor in the same space, is also preparing for its IPO, expected to be listed in January 2026, creating a competitive race for market leadership [9]. - The competition is likened to a "tortoise and hare" scenario, emphasizing the urgency and stakes involved in the IPO process [9]. Group 3: Challenges and Risks - The high cost of computing power is a significant concern, with over 70% of research and development expenses allocated to GPU services, limiting funds for technological upgrades and talent acquisition [11]. - Global supply chain issues for high-end chips and U.S. sanctions pose risks to model iteration and development, impacting the company's operational capabilities [11]. - Despite rapid revenue growth, Zhipu is facing substantial losses, projected at 2.958 billion in 2024 and 2.358 billion in the first half of 2025, with research expenses exceeding eight times the revenue during the same period [11].
全球大模型第一股之争:中国AI,到底该先造底座还是卖爆款?
3 6 Ke· 2025-12-23 10:23
Core Viewpoint - The competition between two Chinese AI unicorns, Zhipu and MiniMax, represents a broader struggle in the AI industry regarding the ultimate value attribution, as they adopt fundamentally different approaches to their business models and market strategies [1][27]. Group 1: Company Strategies - Zhipu aims to establish itself as the "power plant" of the AI era by focusing on foundational infrastructure, developing its own GLM architecture, and providing stable, continuous AI capabilities to enterprises through a model-as-a-service (MaaS) approach [2][12]. - MiniMax, in contrast, adopts a consumer-oriented strategy, prioritizing the rapid development of popular applications to capture market attention and generate revenue quickly, resembling a typical internet company [4][6]. Group 2: Financial Performance - Zhipu's revenue model is primarily B2B and B2G, with projected revenues of 263 million RMB in 2024 and over 718 million RMB in 2025, benefiting from a high customer retention rate of 79% and maintaining a gross margin above 50% [9][12]. - MiniMax's revenue heavily relies on consumer products, with 71.1% of its income coming from applications, but it faces challenges with a low gross margin, which has not exceeded 25% since 2023, and high marketing expenses that are 2.8 times its revenue [15][18]. Group 3: Market Positioning - Zhipu's approach is characterized by high stickiness and sustainability, as its enterprise clients face significant switching costs once integrated into its foundational models, positioning it as a long-term player in the AI infrastructure space [12][21]. - MiniMax's rapid consumer market entry strategy, while effective for quick user acquisition, exposes it to intense competition from larger tech giants, making its long-term viability uncertain [25][26]. Group 4: Industry Implications - The ongoing evolution of the AI industry suggests that foundational technologies will ultimately dictate the success of applications, as seen in historical tech trends where infrastructure providers maintain long-term value [24][27]. - The current landscape indicates a high cost of customer acquisition and operational expenses, with both companies needing to ensure sustainable revenue and profit margins to survive in a competitive market [23][22].
AI大模型独角兽招股书深度拆解:MiniMax to C,智谱 to B
Hua Er Jie Jian Wen· 2025-12-22 03:57
Core Insights - Two Chinese AI unicorns, Zhiyu and MiniMax, submitted their IPO applications to the Hong Kong Stock Exchange within 48 hours, showcasing different commercialization paths in the AI large model sector [1] - The IPO documents reveal distinct business models: MiniMax focuses on consumer-driven applications, while Zhiyu emphasizes enterprise-level services [8][9] Group 1: Business Models - MiniMax's business model is centered around "AI Native Apps," with significant revenue growth projected from $758,000 in 2023 to $21.8 million in 2024, and reaching $38 million in the first nine months of 2025, accounting for 71.1% of total revenue [3][4] - MiniMax's average monthly active users (MAU) are expected to surge from 3.1 million in 2023 to 27.6 million by September 2025, with paid user numbers reaching 1.77 million and average revenue per paid user (ARPPU) increasing from $6 to $15 [4][5] - Zhiyu AI focuses on enterprise services, with local deployment revenue reaching 162 million RMB by June 2025, constituting 84.8% of total revenue [10][11] Group 2: Financial Performance - MiniMax's gross margin improved from -24.7% in 2023 to 12.2% in 2024, and further to 23.3% in the first nine months of 2025, indicating effective cost control and revenue growth [18][27] - Zhiyu AI reported a gross margin of 50%, with local deployment services achieving a high margin of 59.1% in the first half of 2025, although its cloud deployment business faced declining margins [20][25] Group 3: R&D Investments - Both companies are heavily investing in R&D, with Zhiyu AI's R&D expenses reaching 1.595 billion RMB in the first half of 2025, resulting in a staggering R&D expense ratio of 835.4% [25][26] - MiniMax's R&D expense ratio decreased from over 2000% in 2023 to 337.4% in the first nine months of 2025, reflecting improved operational efficiency as revenue grows [27] Group 4: Market Focus - MiniMax is highly globalized, with only 26.9% of its revenue coming from mainland China in 2025, while Zhiyu AI primarily targets the domestic market, focusing on state-owned enterprises and large institutions [30][32] - The shareholder structures of both companies include major tech players, with MiniMax backed by Alibaba and Tencent, while Zhiyu AI has a more diversified investor base including state-owned funds [32][33] Group 5: Future Outlook - Both companies have sufficient cash reserves to continue their technological advancements, with MiniMax holding approximately $1 billion and Zhiyu AI having 2.55 billion RMB in cash equivalents [34] - The IPO submissions signal a shift in the Chinese AI industry from a focus on technology to a more commercialized approach, with both companies vying to become leaders in the large model sector [34]
全球大模型第一股智谱的“冰火两重天”
Guan Cha Zhe Wang· 2025-12-20 08:11
(文/陈济深 编辑/张广凯) 12月19日,北京智谱华章科技股份有限公司(智谱)正式向港交所递交招股书,打响了中国大模型企业上市的第一 枪。 这不仅意味着智谱有望成为"全球大模型第一股",也标志着中国大模型赛道正式迈入资本化新阶段。 作为全球AI新势力中的重要玩家,招股书也第一次揭示了这家明星独角兽商业化的"冰火两重天"。 过去三年,公司收入分别为0.57亿元、1.25亿元、3.12亿元。2025年上半年持续加速,短短6个月就入账1.9亿元,同比 增长325%。 收入增长的背后是智谱惊人的烧钱速度:过去三年,智谱经调整净亏损分别为0.97亿、6.21亿以及24.66亿元。 不过值得注意的是,智谱在B端业务占据主导的情况下,面对传统B端大厂和互联网巨头的竞争,长期维持住了超过 50%的毛利率。 随着2025年字节火山引擎、阿里云、百度等互联网巨头纷纷下场加码AI大战,智谱的这份招股书提供了一个观察中国 大模型企业在全球AI淘汰赛下真实状态的绝佳切片。 巨头夹缝中的"独立样本" 智谱的核心团队成员,基本都来自清华大学计算机系知识工程实验室,长期从事大规模预训练模型、语义大数据分 析、智能问答等领域的研究。 有这样 ...
日耗50万亿Token,火山引擎的AI消费品战事
3 6 Ke· 2025-12-19 10:55
Core Insights - The AI market in China is rapidly evolving, with Huoshan Engine emerging as a leading player, particularly in the model-as-a-service (MaaS) sector, where it holds the largest market share domestically and ranks third globally [2][3] - The daily token usage of the Doubao model has surged to over 50 trillion, marking a tenfold increase compared to the previous year [1][4] - The focus for 2025 in the AI market will be on multimodal capabilities and agents, with Huoshan Engine launching several new products centered around these themes [3][6] Market Position and Growth - Huoshan Engine has established itself as a significant force in the AI sector, with projected revenues exceeding 200 billion in 2024, reflecting a growth rate of over 60% [6][23] - The company aims to simplify model usage by integrating multiple capabilities into a single API, contrasting with competitors who offer separate models for different functions [26][27] Product Innovations - The newly launched Seedance 1.5 pro video generation model emphasizes immediate usability, capable of producing synchronized audio-visual content without extensive post-production [8][15] - The model's advancements include improved lip-sync accuracy and enhanced immersion, making it particularly suitable for diverse content creation [13][21] Competitive Landscape - The AI video model market is characterized by rapid iteration, with companies focusing on producing fully publishable works rather than just raw video segments [7][9] - Huoshan Engine's approach to model training and optimization has led to a tenfold increase in inference speed, significantly reducing costs and enhancing performance [31][30] Future Directions - The company is exploring innovative billing models, such as the "AI Savings Plan," which offers tiered discounts to help businesses reduce costs by up to 47% [32][33] - Huoshan Engine is committed to building a comprehensive AI infrastructure that enables businesses to easily adopt advanced AI capabilities, aiming to make AI assistants as ubiquitous as websites and apps [38][39]
日耗50万亿Token,火山引擎的AI消费品战事
36氪· 2025-12-19 10:31
Core Viewpoint - The AI market is rapidly evolving, with major players like Volcano Engine leading the way in model consumption and innovation, particularly in the areas of multi-modal capabilities and AI agents [3][5][51]. Group 1: Market Growth and Trends - As of December, the daily token usage of Doubao model has surpassed 50 trillion, representing a growth of over 10 times compared to the same period last year [3]. - By 2025, the token usage is projected to reach 16.4 trillion, indicating significant growth potential in the AI market [4]. - The competition among cloud vendors for "AI cloud supremacy" is intensifying, with major updates from companies like Google and OpenAI [4]. Group 2: Product Innovations - Volcano Engine has released key products focusing on multi-modal capabilities and AI agents, including the Doubao flagship model 1.8 and the video generation model Seedance 1.5 pro [5][6]. - The Seedance 1.5 pro model emphasizes the ability to produce "publishable complete works," showcasing advancements in video generation technology [10][11]. - The model's improvements in voice and image synchronization have made it a standout in the market, achieving high levels of usability with minimal input [11][18]. Group 3: Business Model and Strategy - Volcano Engine aims to simplify model usage by integrating multiple capabilities into a single API, reducing complexity for clients [38][39]. - The company is focusing on enhancing the efficiency of model training and deployment, with the Seedance 1.5 pro achieving over a 10-fold increase in inference speed [46]. - A new billing model, "AI Savings Plan," has been introduced to help enterprises save up to 47% on costs, reflecting a shift towards value-based pricing [47][48]. Group 4: System Engineering and Infrastructure - The competition in AI infrastructure has shifted from merely comparing model capabilities to a broader system engineering challenge [51]. - Volcano Engine is developing a comprehensive AI infrastructure that includes both the core model (Doubao) and operational tools (AgentKit) to facilitate easier deployment for enterprises [53]. - The goal is to enable every enterprise to have its own AI assistant, akin to having a website or app, supported by a complete ecosystem [54].
豆包升级1.8 对话谭待:模型间最重要的不是竞争而是做大市场
Bei Ke Cai Jing· 2025-12-19 09:20
Core Insights - The core message of the news is the significant growth and advancements of the Doubao model by Huoshan Engine, which has seen a tenfold increase in daily token usage, reaching over 50 trillion tokens by December 2023, alongside the launch of the Doubao 1.8 model optimized for various applications [1][2]. Group 1: Model Performance and Upgrades - Doubao model's daily token usage has surpassed 50 trillion, marking a tenfold increase compared to December of the previous year [1]. - The Doubao 1.8 model has been upgraded with enhancements in multi-modal capabilities and Agent functionalities, including improved tool usage and complex instruction adherence [2][3]. - The Doubao model demonstrated advanced capabilities such as long video understanding, which can be applied in various sectors like online education and product quality inspection [3]. Group 2: Market Position and Competition - Doubao mobile has gained significant attention, sparking discussions about AI assistant safety and its potential to disrupt the market [1][2]. - The competition in the AI sector is intensifying, with Alibaba's products like Qianwen and Lingguang rapidly gaining user traction, indicating a shift in the competitive landscape [4][5]. - Huoshan Engine claims a 46.4% market share in the Chinese public cloud model market, while Alibaba asserts a leading position that exceeds the combined share of its competitors [5][6]. Group 3: Industry Outlook and Collaboration - The industry outlook suggests a potential tenfold growth in the AI market next year, emphasizing the importance of collaboration rather than zero-sum competition [2][6]. - Huoshan Engine has significantly reduced model calling costs by up to 99%, which has been validated as a viable strategy as other cloud providers follow suit [6]. - The increasing investment from various cloud vendors in the MaaS (Model as a Service) sector is seen as a positive sign of market maturation, which will facilitate the broader application of AI across industries [6].
MaaS定义AI下半场:一场对大模型生产力的投票
华尔街见闻· 2025-11-21 11:19
Core Insights - The AI sector is experiencing a significant capital surge in 2025, with companies like Zhipu and MiniMax vying for the title of "first stock of large models," highlighting the industry's growing prominence [1] - A value gap exists where companies invest heavily in AI but many remain stuck in pilot phases without generating tangible financial impacts [1] - The market is shifting towards the "second half" of model value realization, with companies facing the dilemma of high investment costs versus the fear of missing out on technological advancements [1] Group 1: Market Dynamics - The transition from "selling model parameters" to "delivering MaaS (Model as a Service)" allows companies to focus on business value rather than the risks of model iteration [2] - The competition in the AI "second half" is characterized by a shift from demo showcases to a battle of foundational models as the basis for enterprise AI deployment [4] - A dramatic market reshuffle is occurring, with Anthropic's Claude series leading the enterprise-level LLM API market with a 32% usage share, while OpenAI's share has dropped from 50% to 25% [4][9] Group 2: Financial Growth and Strategy - Anthropic's "enterprise-first" strategy has led to a remarkable increase in annual recurring revenue (ARR), soaring from $1 billion to $5 billion within months [9] - Traditional cloud giants like Alibaba Cloud are adopting a "build kitchen" strategy, offering a full-stack solution from IaaS to MaaS, while engaging in price wars to attract customers [10][11] - Smaller firms are finding opportunities by focusing on niche markets and differentiating their offerings rather than competing directly with giants [12][14] Group 3: Performance and Efficiency - As of 2025, companies are prioritizing model performance and efficiency over mere token price reductions, indicating a shift in focus towards effective AI solutions [13] - Zhipu's new models, GLM-4.5 and GLM-4.6, have seen a rapid increase in token usage, particularly in coding tasks, attracting significant developer interest [14][27] - The demand for high-performance models in critical applications, such as coding and financial analysis, is driving companies to pay premiums for improved accuracy and reliability [18][21] Group 4: Future Trends and Implications - The emergence of MaaS is not just a commercial choice but a technological necessity, as companies must navigate the complexities of AI deployment strategies [17] - The market is witnessing a shift where foundational models are becoming the primary applications, with the potential for models to evolve into autonomous agents [22][24] - The valuation of AI companies is changing, with a growing recognition that foundational models represent a new form of labor rather than just software, leading to a potential revaluation of independent firms in the sector [26][28]