Core Viewpoint - The article discusses the release and features of the Qwen3-Coder model by Alibaba Cloud, highlighting its advanced capabilities in coding and agentic tasks, as well as its competitive performance against other models in the market [3][4][5]. Group 1: Model Features - Qwen3-Coder series includes various versions, with Qwen3-Coder-480B-A35B-Instruct being the most powerful, featuring 480 billion parameters and supporting 256K tokens natively, expandable to 1 million tokens [4]. - The model has achieved state-of-the-art (SOTA) results in areas such as Agentic Coding, Browser Use, and Tool Use, comparable to Claude Sonnet4 [5][6]. - The training data for Qwen3-Coder amounts to 7.5 terabytes, with 70% being code, enhancing its programming capabilities while maintaining general and mathematical skills [12]. Group 2: Technical Details - The model utilizes a unique approach to reinforcement learning (RL) by focusing on real-world software engineering tasks, allowing for extensive interaction and decision-making [16]. - A scalable environment for RL has been established, enabling the simultaneous operation of 20,000 independent environments, which enhances feedback and evaluation processes [16]. Group 3: Tools and Integration - Qwen Code, a command-line tool for agentic programming, has been developed to maximize the performance of Qwen3-Coder in coding tasks [17]. - The integration of Qwen3-Coder with Claude Code is also highlighted, allowing users to leverage both models for enhanced coding experiences [22][26]. Group 4: User Experience - Users can access Qwen3-Coder through the Qwen Chat web version for free, providing an opportunity to experience its capabilities firsthand [6][7]. - Various demos showcasing the model's capabilities, such as simulating a solar system and creating visual effects in coding environments, are available for users [8][9][10].
阿里开源最强编码模型 Qwen3-Coder:1M上下文,性能媲美 Claude Sonnet 4