华为联合三大高校发布并开源AI容器技术Flex:ai，助力破解算力资源利用难题

Core Insights - Huawei officially launched the AI container technology Flex:ai at the 2025 AI Container Application Landing and Development Forum, in collaboration with Shanghai Jiao Tong University, Xi'an Jiaotong University, and Xiamen University, to address the low utilization of computing resources in the AI industry [1][2] Group 1: Technology Overview - The Flex:ai technology aims to tackle the issue of "computing resource waste" in the AI industry, where small model tasks monopolize entire cards, leading to resource idleness, while large model tasks face insufficient computing power [1] - Flex:ai is built on the Kubernetes container orchestration platform, enabling precise matching of AI workloads with computing resources through refined management and intelligent scheduling of GPU and NPU resources, significantly improving computing resource utilization [1] Group 2: Key Technological Breakthroughs - A collaboration with Shanghai Jiao Tong University led to the development of the XPU pooling framework, which allows a single GPU or NPU card to be divided into multiple virtual computing units, increasing overall computing utilization by 30% in small model training and inference scenarios [2] - A partnership with Xiamen University resulted in cross-node remote virtualization technology, which aggregates idle XPU computing resources within a cluster to form a "shared computing pool," facilitating the integration of general-purpose and intelligent computing resources [2] - The Hi Scheduler intelligent scheduler, developed in collaboration with Xi'an Jiaotong University, addresses the challenge of unified scheduling of heterogeneous computing resources, ensuring stable operation of AI workloads even under fluctuating loads [2]