刚刚,OpenAI开源2个推理模型:笔记本/手机就能跑,性能接近o4-mini
量子位·2025-08-05 21:09

Core Insights - OpenAI has released two open-source inference models: gpt-oss-120b and gpt-oss-20b, marking its first open-source model release since GPT-2 in 2019 [3][4][5] - These models are designed to be commercially usable under the Apache 2.0 license, allowing for free use without licensing fees [5] - The gpt-oss models demonstrate strong performance in reasoning tasks, although they still lag behind proprietary models in code generation and complex reasoning [5][25] Model Specifications - gpt-oss-120b has 117 billion parameters and can run on a single 80GB GPU, achieving performance close to the proprietary o4-mini model [6][7] - gpt-oss-20b has 21 billion parameters and can operate on consumer-grade devices with 16GB memory, performing similarly to o3-mini [6][7] - Both models utilize a mixture of experts (MoE) architecture to optimize active parameters during inference [29][30] Performance Evaluation - In various benchmarks, gpt-oss-120b outperformed OpenAI's o3-mini and matched or exceeded the performance of o4-mini in programming, general problem-solving, and tool usage [41][42] - gpt-oss-20b also achieved comparable results to o3-mini, particularly excelling in competition math and health-related questions [47] - The models support three inference strengths—low, medium, and high—allowing developers to balance latency and performance [38] Technical Features - The models employ advanced pre-training and post-training techniques, focusing on reasoning and efficiency for broad deployment [27][35] - They support a maximum context length of 128k tokens and utilize a unique attention mechanism to enhance memory efficiency [31][33] - OpenAI has made the tokenizer used for these models available as well, further supporting the open-source initiative [34] Strategic Importance - OpenAI views the release of these models as a significant step forward in open-source weight models, enhancing accessibility for developers and researchers [60] - The open-source nature of gpt-oss models lowers barriers for emerging markets and smaller organizations, promoting democratization of AI technology [61][62] - The initiative aims to foster a healthy open-source model ecosystem, contributing to the broader adoption of AI for societal benefits [62]