Core Insights
- Alibaba Cloud's Tongyi Qwen has launched Qwen3-Max, its largest and most powerful model to date, following the release of the Qwen3-2507 series [1]
- The preview version of Qwen3-Max-Instruct ranks third on the LMArena text leaderboard, ahead of GPT-5-Chat [1]
- The official version of Qwen3-Max has stronger coding and agent capabilities and achieves industry-leading performance across various benchmarks [1]

Model Specifications
- Qwen3-Max has over 1 trillion parameters and was pre-trained on 36 trillion tokens [1]
- The model architecture follows the design paradigm of the Qwen3 series and uses a global-batch load balancing loss proposed by Tongyi [1]

Enhanced Version
- The reasoning-enhanced version, Qwen3-Max-Thinking, has demonstrated exceptional potential, achieving 100% accuracy on high-difficulty reasoning benchmarks such as AIME 25 and HMMT [1]
- This version integrates a code interpreter and employs parallel test-time compute [1]
Alibaba (09988) has officially launched Qwen3-Max, its largest and most capable model to date