华为云联合工商银行落地金融业首个Serverless NPU弹性算力调度技术
Xin Lang Cai Jing·2025-12-02 12:56

Core Insights - Huawei has successfully implemented a Serverless NPU elastic computing scheduling technology in collaboration with Industrial and Commercial Bank of China, significantly enhancing computational efficiency [1][3] Group 1: Technology Implementation - The Serverless NPU elastic computing scheduling technology can reduce the startup time for inference services of a trillion MoE large model to a level of seconds, achieving over a 10-fold increase in startup efficiency compared to traditional scheduling methods [1][3] - This technology represents a shift in computing supply models from "long-term binding" to "on-demand usage," indicating a more flexible approach to resource allocation [1][3]