Workflow
Step 4
icon
Search documents
阶跃新模型快到“没推理”!印奇上任,果然气势一新
量子位· 2026-02-03 07:45
Core Insights - The article discusses the launch of the new open-source agent model Step 3.5 Flash, which features a total of 196 billion parameters and 11 billion active parameters, supporting a context window of 256K [2][36]. Model Performance - The model achieves a peak inference rate of 350 TPS, comparable to closed-source models in agent scenarios and mathematical tasks, capable of handling complex, long-chain tasks [5][41]. - In benchmark tests, Step 3.5 Flash scored 97.3 in the AIME 2025 benchmark, 74.4% in the SWE-bench Verified coding tasks, and 88.2 in the τ²-Bench for agent tasks, indicating strong performance across various applications [7][6]. Technical Architecture - Step 3.5 Flash employs a MoE sparse mixture of experts architecture, activating approximately 11 billion parameters during inference to control computational and deployment costs effectively [36]. - The model incorporates a 3:1 sliding window attention mechanism to address long context issues, enhancing its ability to manage lengthy texts [37]. - It features a self-developed MIS-PO reinforcement learning framework to improve inference and agent execution capabilities, reducing data noise and gradient variance for stable optimization in long-sequence tasks [42]. Ecosystem Integration - The model is designed to work seamlessly with major AI acceleration chip platforms from various manufacturers, including Ascend, Mu Xi, and Alibaba's T-head, ensuring compatibility with current mainstream domestic AI hardware [4]. - Step 3.5 Flash emphasizes a cloud-edge collaboration approach, where the cloud handles complex planning and reasoning while the edge focuses on secure data retrieval and local execution [30][32]. Future Developments - The development team is already working on Step 4, indicating ongoing advancements in the model's capabilities [43].
阶跃星辰开源Agent 基模 Step 3.5 Flash 下一代模型已开启训练
Xin Lang Cai Jing· 2026-02-02 11:58
Core Insights - The company, Jieyue Xingchen, has initiated the training of its next-generation model, Step 4, and is inviting developers to participate in its co-development [1][3] - Jieyue Xingchen has released and open-sourced its latest base model, Step 3.5 Flash, which utilizes a sparse mixture of experts (MoE) architecture with a total parameter count of 196 billion, activating approximately 11 billion parameters per token [1][3] - Step 3.5 Flash has been optimized for high inference speed, achieving up to 350 tokens per second for single-request code tasks, addressing the need for low-latency responses in multi-step reasoning scenarios [1][3] Model Performance - Step 3.5 Flash has demonstrated strong performance in various tests, including τ²-Bench, BrowseComp, LiveCodeBench V6, and AIME 2025 mathematics competition, showcasing its competitiveness in agent scenarios and mathematical tasks [2][4] - The model exhibits robust capabilities in automatic programming and "edge-cloud collaboration," enabling the construction of complex visual geographic systems based on natural language instructions and efficiently breaking down complex user requirements into sub-tasks for cross-platform data analysis and decision support [2][4] Industry Trends - There is a noticeable shift among developers from simple prompt engineering to building agents and workflows, indicating a growing interest in more complex AI applications [1][3] - Despite the strength of underlying models, challenges remain in stability, response speed, and cost in scaled scenarios, which Step 3.5 Flash aims to address through its specialized optimizations [1][3]