Core Insights - In February, China's AI model API call volume surged, surpassing that of the United States for the first time, with 41.2 trillion tokens compared to the U.S.'s 29.4 trillion tokens during the week of February 9-15 [1][7] - The following week, China's model call volume increased to 51.6 trillion tokens, marking a 127% growth over three weeks, while U.S. model calls dropped to 27 trillion tokens [1][7] - Four out of the top five models in global API call volume are from Chinese companies, indicating a collective rise of Chinese AI manufacturers rather than reliance on a single product [1][10] Token Call Volume Growth - The global model token call volume has experienced explosive growth, increasing from 12.4 trillion tokens in the week of March 3-9, 2025, to 139.5 trillion tokens by mid-February 2026, a tenfold increase in less than a year [6] - In early February 2026, China's model call volume reached 22.7 trillion tokens, signaling a strong competitive push against U.S. models [6][7] Leading Models and Their Performance - The top five models by call volume during the week of February 16-22, 2026, included four from Chinese manufacturers, contributing 85.7% of the total call volume [10] - MiniMax's M2.5 model, launched on February 13, 2026, quickly became the top model, contributing 14.4 trillion tokens to the total call volume of 32.1 trillion tokens during the week of February 9-15 [10] Cost Competitiveness - Chinese models are significantly cheaper than their U.S. counterparts, with MiniMax's M2.5 and Zhiyu's GLM-5 priced at $0.3 per million tokens for input, compared to $5 for Claude Opus 4.6, making Chinese models approximately 16.7 times cheaper [15][16] - For output, MiniMax's M2.5 costs $1.1 per million tokens, while Claude Opus 4.6 costs $25, representing a cost difference of about 22.7 times [16][17] Technological Innovations - The "Mixture-of-Experts" (MoE) architecture is a key factor in reducing inference costs for Chinese models, allowing for significant reductions in memory usage and increases in throughput [18] - Chinese AI companies are also pursuing vertical integration to optimize costs further, combining model algorithms, cloud infrastructure, and AI chips for better efficiency [19] Market Trends and Future Projections - The demand for tokens is expected to grow exponentially, with a projected compound annual growth rate of 330% from 2025 to 2030 in China [19] - The concept of "Token inflation" reflects a structural increase in token consumption per user, driven by deeper engagement with AI tools for complex tasks [20] - Future AI service pricing is anticipated to become highly customized and flexible, influenced by task complexity and resource consumption [22]
2月井喷!中国AI调用量首超美国 四款大模型霸榜全球前五 国产算力需求正经历指数级增长
Mei Ri Jing Ji Xin Wen·2026-02-26 11:40