大模型适配

Search documents
寒武纪、华为昇腾适配DeepSeek最新模型
财联社· 2025-09-30 00:59
Core Viewpoint - The release of DeepSeek-V3.2-Exp model on Hugging Face platform introduces a sparse Attention architecture that reduces computational resource consumption and enhances inference efficiency [1] Group 1: Model Deployment and Adaptation - Huawei's Ascend has quickly adapted and deployed the DeepSeek-V3.2-Exp model based on vLLM/SGLang inference frameworks, providing open-source inference code and operator implementations for developers [1] - Cambricon announced the adaptation of the latest DeepSeek-V3.2-Exp model and has open-sourced the vLLM-MLU inference engine source code, leveraging the new DeepSeek Sparse Attention mechanism to significantly reduce training and inference costs in long-sequence scenarios [1] - Haiguang Information announced seamless adaptation and deep optimization of its DCU, achieving "zero-wait" deployment for large model computing power, showcasing excellent performance of DeepSeek-V3.2-Exp on Haiguang DCU [1]
填补空白!第四范式发布「信创模盒」ModelHub XC,连接国产GPU和国产大模型
Ge Long Hui· 2025-09-22 11:12
Core Viewpoint - The emergence of compatibility issues between deployed AI models and chip architectures is becoming a hidden ceiling that restricts the practical application of AI, which Fourth Paradigm aims to address with its new solutions [1][7]. Group 1: Product Launch - Fourth Paradigm officially launched the "ModelHub XC" platform, the "Xinchang Community," and the "Xinchang Model Adaptation Value-Added Service" to tackle industry pain points and bridge gaps between customers, computing power, and developers [3]. - The "ModelHub XC" features an innovative AI engine system, EngineX, specifically designed to adapt to domestic computing power, fundamentally addressing the long-standing compatibility and support issues of domestic AI models [7]. Group 2: Market Context - Many existing ModelHubs primarily optimize foreign models and software for their hardware (e.g., NVIDIA GPUs), leading to compatibility issues with domestic hardware (e.g., Cambricon), resulting in time-consuming and repetitive adaptation processes [8]. - The platform has already certified and adapted over a hundred models upon launch, with plans to increase this number to thousands within six months and to reach tens of thousands within a year [10]. Group 3: Services and Support - Fourth Paradigm introduced a value-added service for model adaptation, providing tailored adjustments for users unfamiliar with which models are compatible with domestic computing power, ensuring a "safety net" for model compatibility [12]. - The platform also offers clear labeling of compatible domestic chip brands for each model, simplifying the process for users to determine which chips to purchase based on the models they wish to download [10].