Workflow
异构AI系统
icon
Search documents
异构AI系统正在成为主流,业内呼吁构建“混合算力”技术护城河
Di Yi Cai Jing· 2025-12-17 10:12
Core Insights - The hybrid computing cluster has become an essential technology option for the industry in pursuit of optimal cost-performance by 2025, shifting from a previous stance of caution regarding mixed computing resources [1][3] - The potential re-entry of NVIDIA's H200 into the Chinese market has garnered significant attention, emphasizing the necessity for domestic computing capabilities in China [1] - The establishment of a "heterogeneous computing scheduling" technology moat is currently a hot topic in the industry [1] Group 1: Technological Trends - The consensus has shifted towards hybrid computing, with Intel combining its Gaudi 3 accelerator with NVIDIA's B200 GPU to enhance the inference limits of the NVIDIA B200 cluster by up to 70% [3] - Software-hardware collaboration is identified as a major trend in addressing computing challenges, with NVIDIA's CUDA software platform being a critical technology moat [3] - The development of intelligent computing is viewed as a comprehensive competition involving technology, ecology, and applications, with the establishment of an open, unified, and cooperative ecosystem being key to overcoming challenges [3] Group 2: Market Dynamics - The ability to solve the "mixed computing" challenge will determine who holds pricing power in the market, with a clear business model emerging from standardizing computing resources and achieving economies of scale [4] - The daily average token call volume for Wunwen AI Cloud has increased fivefold over the past five months, indicating a surge in demand for computing resources [5] - The rapid iteration of models presents new challenges for computing resources, with significant increases in token calls observed during specific high-demand periods [5] Group 3: Future Directions - The infrastructure for AI creation must evolve from focusing solely on inference efficiency to supporting long-term tasks, context management, and multi-modal resource scheduling [6] - The trend towards heterogeneous computing is expected to grow, with industry experts emphasizing the need for systematic methodologies and tools to address the technical challenges of mixed computing [6][7] - The rapid expansion of computing demands will lead to increased energy costs, with projections indicating that global GPU computing cluster electricity consumption could exceed 1000 TWh by 2030, accounting for approximately 2.5% of global electricity consumption [7]