OpenAI and Chinese Models Quietly Start a "Price War"
Yicai (第一财经) · 2026-03-18 10:27
Core Viewpoint
- The article discusses OpenAI's launch of two new small models, GPT-5.4 mini and nano, which are optimized for high-frequency workloads, offering lower latency and higher cost-effectiveness compared to flagship models [3][10].

Group 1: Model Performance and Features
- OpenAI's GPT-5.4 mini is designed for a balance of speed and performance, operating at over twice the speed of its predecessor and achieving scores close to the flagship model in various benchmarks [7][8].
- The GPT-5.4 mini scored 54.4% on the SWE-bench Pro programming benchmark, 72.1% on the OSWorld-Verified benchmark, and 88.0% on the GPQA Diamond test, indicating strong performance in programming and multi-modal tasks [8].
- The GPT-5.4 nano is the smallest and cheapest version, suitable for lighter tasks, and performs slightly below the mini model [7][9].

Group 2: Pricing and Cost Efficiency
- The pricing for GPT-5.4 mini is set at $0.75 per million tokens for input and $4.50 per million tokens for output, consuming only 30% of the GPT-5.4 quota, making it a cost-effective option for simple programming tasks [9][10].
- GPT-5.4 nano is priced at $0.20 per million tokens for input and $1.25 per million tokens for output, approximately one-fourth the cost of the mini model [10][16].
- Comparatively, other models like DeepSeek V3.2 and Kimi-K2.5 offer lower prices, raising questions about the competitiveness of OpenAI's new models in terms of cost [16][18].

Group 3: Industry Implications and Strategic Positioning
- The release of these small models signifies a strategic shift towards model layering, where developers will utilize a combination of models based on task complexity and cost [10][12].
- OpenAI emphasizes the importance of a system where larger models handle complex planning while smaller models execute simpler tasks efficiently [12][13].
- Competition is intensifying, particularly as Chinese models lead on cost-performance, prompting debate over how competitive OpenAI's new offerings really are [5][14].
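
To make the pricing and layering points concrete, the following is a minimal sketch of how a per-request cost could be computed from the per-million-token prices quoted above, and how a layered setup might choose between the two small tiers. The tier labels `mini` and `nano`, the `complexity` score, the `threshold` value, and the `pick_tier` function are all hypothetical illustrations, not OpenAI's API or routing method. The flagship model is left out because its price is not stated in this summary.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Price:
    input_per_m: float   # USD per million input tokens
    output_per_m: float  # USD per million output tokens


# Prices as quoted in the summary. The dictionary keys are illustrative
# labels, not confirmed API model identifiers.
PRICES = {
    "mini": Price(input_per_m=0.75, output_per_m=4.50),
    "nano": Price(input_per_m=0.20, output_per_m=1.25),
}


def request_cost(tier: str, input_tokens: int, output_tokens: int) -> float:
    """Return the USD cost of one request on the given tier."""
    price = PRICES[tier]
    total = input_tokens * price.input_per_m + output_tokens * price.output_per_m
    return total / 1_000_000


def pick_tier(complexity: float, threshold: float = 0.5) -> str:
    """Hypothetical router: harder tasks go to mini, lighter ones to nano.

    `complexity` is an assumed score in [0, 1]; how it would be produced
    is outside the scope of this sketch.
    """
    return "mini" if complexity >= threshold else "nano"


if __name__ == "__main__":
    for complexity in (0.2, 0.8):
        tier = pick_tier(complexity)
        cost = request_cost(tier, input_tokens=10_000, output_tokens=2_000)
        print(f"complexity={complexity} -> {tier}: ${cost:.4f}")
```

For a request with 10,000 input tokens and 2,000 output tokens, this gives about $0.0165 on mini and $0.0045 on nano. That is a ratio of roughly 0.27, in line with the "approximately one-fourth" figure above.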