凌晨，Qwen又更新了，3090就能跑，3B激活媲美GPT-4o

Core Insights - The article discusses the release of the new AI model Qwen3-30B-A3B-Instruct-2507, which showcases significant improvements in performance and efficiency compared to its predecessor and other industry models [1][2]. Performance Improvements - The new model operates in a non-thinking mode, activating only 3 billion parameters while achieving performance comparable to leading closed-source models like Google's Gemini 2.5-Flash and OpenAI's GPT-4o [2][4]. - Performance metrics show substantial improvements, with AIME25 scores increasing from 21.6 to 61.3 and Arena-Hard v2 scores rising from 24.8 to 69.0 [3][4]. Benchmark Comparisons - In benchmark tests, the Qwen3-30B-A3B-Instruct-2507 model performs on par or surpasses models such as DeepSeek-V3-0324 across various categories [4][10]. - The model's average performance in knowledge benchmarks is 62.8, with specific scores like MMLU-Pro at 78.4 and GPQA at 70.4 [10]. Enhanced Capabilities - The model exhibits significant advancements in general capabilities, including instruction following, logical reasoning, text comprehension, mathematics, science, programming, and tool usage [13][27]. - It has improved multilingual knowledge coverage and can generate higher quality text aligned with user preferences [13][27]. Open Source and Accessibility - The model has been made open-source and is available on platforms like HuggingFace and QwenChat, allowing broader access and community support [16][17]. - Users have reported successful implementations of the model on consumer-grade GPUs, such as the RTX 3090, highlighting its accessibility [24][23]. Contextual Understanding - The model's long-context understanding has been enhanced, now supporting up to 256K tokens, which is a significant improvement for complex tasks [28][27]. Industry Impact - The rapid advancements in model efficiency and performance are noted as a significant trend in the AI industry, with the Qwen team setting a high bar for future developments [7][35].