Model and Computing Power Collaboration
StepFun Releases Open-Source Base Model Step 3.5 Flash
Zheng Quan Ri Bao Wang · 2026-02-02 08:11
Group 1
- Shanghai Jieyue Xingchen Intelligent Technology Co., Ltd. (StepFun) has launched Step 3.5 Flash, a new-generation open-source Agent base model designed for real-time Agent workflows that balances inference speed, intelligence, and cost [1]
- Step 3.5 Flash reaches a peak inference speed of 350 tokens per second on single-request code tasks, offering a "faster, stronger, and more stable" option among Agent base models [1]
- The model uses a sparse Mixture-of-Experts (MoE) architecture that activates approximately 11 billion of its 196 billion total parameters per token, significantly improving inference efficiency while maintaining model capability [1]

Group 2
- Nearly 10 chip and infrastructure vendors, including Huawei Ascend and Alibaba Pingtouge (T-Head), have completed adaptations for Step 3.5 Flash, improving model compatibility and computing efficiency through collaborative innovation [1]
- The "MoXin Ecological Innovation Alliance", established in July 2025, aims to break down technical barriers between chips, models, and platforms, optimize performance, and accelerate the application of large models across industries [2]
- Industry participants view deep collaboration between models and computing power as a crucial path to the large-scale application of inference models [2]
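To put the Group 1 figures in proportion, the short Python sketch below works through the arithmetic behind the sparse MoE numbers. Only the three constants come from the report (196 billion total parameters, roughly 11 billion active per token, and a 350 tokens-per-second peak). The function names and the 2,000-token reply length are hypothetical, chosen purely for illustration. The active share is a rough proxy for per-token compute, not a measured speedup.

```python
# Figures quoted in the report; everything else here is illustrative.
TOTAL_PARAMS = 196e9        # total parameters
ACTIVE_PARAMS = 11e9        # approximate parameters activated per token
PEAK_TOKENS_PER_SEC = 350   # quoted peak speed, single-request code tasks


def active_fraction(active: float, total: float) -> float:
    """Share of the model's parameters used to process one token."""
    return active / total


def min_generation_seconds(num_tokens: int, tokens_per_sec: float) -> float:
    """Lower bound on decode time if the peak rate were sustained throughout."""
    return num_tokens / tokens_per_sec


if __name__ == "__main__":
    frac = active_fraction(ACTIVE_PARAMS, TOTAL_PARAMS)
    print(f"active share per token: {frac:.1%}")  # prints 5.6%

    # Hypothetical 2,000-token reply; real throughput may be below the peak.
    secs = min_generation_seconds(2000, PEAK_TOKENS_PER_SEC)
    print(f"2,000 tokens at peak speed: {secs:.1f} s")  # prints 5.7 s
```

In other words, each token touches only about one-eighteenth of the model's weights, which is the basis for the efficiency claim. The reported 350 tokens per second is a peak figure, so the computed time is best read as a floor rather than an expected latency.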