Core Insights
- The release of the new open-source Agent base model Step 3.5 Flash aims to enhance real-time Agent workflow scenarios by balancing inference speed, intelligence level, and cost [1]
- Step 3.5 Flash achieves a maximum inference speed of 350 tokens per second for single-request code tasks, positioning itself as a preferred choice for Agent applications [1]

Group 1: Model Features
- Step 3.5 Flash utilizes a sparse MoE architecture, activating approximately 11 billion parameters per token out of a total of 196 billion parameters, significantly improving inference efficiency while maintaining model capability [1]
- The model is designed to provide a more efficient and affordable foundational model option for Agent applications [1]

Group 2: Industry Collaboration
- In July 2025, the company initiated the "MoCore Ecological Innovation Alliance" with nearly 10 chip and infrastructure manufacturers to eliminate technical barriers between chips, models, and platforms [2]
- The alliance aims to enhance computing power utilization efficiency through joint optimization, accelerating the application of large models across various industry scenarios [2]
- The industry recognizes that deep collaboration between models and computing power will be a crucial pathway for the large-scale application of inference models [2]
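To put the sparse MoE and speed figures quoted above in perspective, the following is a minimal back-of-the-envelope sketch, not a benchmark. It uses only the rounded numbers from the summary (about 11 billion active parameters out of 196 billion, and a peak of 350 tokens per second). The 2,000-token response length and the function names are hypothetical, chosen purely for illustration, and the timing assumes the peak rate is sustained for the whole response, which real workloads may not achieve.

```python
# Rounded figures taken from the summary above.
TOTAL_PARAMS_B = 196.0        # total parameters, in billions
ACTIVE_PARAMS_B = 11.0        # parameters activated per token, in billions (approximate)
PEAK_TOKENS_PER_SEC = 350.0   # quoted peak speed for single-request code tasks


def active_fraction(active_b: float, total_b: float) -> float:
    """Share of the model's parameters that take part in computing one token."""
    return active_b / total_b


def min_generation_seconds(num_tokens: int, tokens_per_sec: float) -> float:
    """Lower bound on generation time if the peak rate were sustained throughout."""
    return num_tokens / tokens_per_sec


if __name__ == "__main__":
    frac = active_fraction(ACTIVE_PARAMS_B, TOTAL_PARAMS_B)
    print(f"Active share per token: {frac:.1%}")

    # Hypothetical response length, for illustration only.
    response_tokens = 2000
    seconds = min_generation_seconds(response_tokens, PEAK_TOKENS_PER_SEC)
    print(f"{response_tokens} tokens at peak speed: at least {seconds:.1f} s")
```

Under these assumptions, each token touches roughly 5.6% of the model's parameters, which is the sense in which the sparse architecture reduces per-token compute relative to a dense model of the same total size.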
StepFun Releases Open-Source Base Model Step 3.5 Flash; Several Leading Chip Makers Have Completed Adaptation
Yang Zi Wan Bao Wang·2026-02-02 05:37