Workflow
国产算力适配
icon
Search documents
智谱发布GLM-5技术细节:工程级智能,适配国产算力
Hua Er Jie Jian Wen· 2026-02-22 11:20
Core Insights - The release of GLM-5 marks a significant advancement in AI model capabilities, shifting the focus from mere parameter size to system engineering capabilities [2][15] - GLM-5 demonstrates the ability to perform complex tasks, improve training efficiency, and fully adapt to domestic chip architectures, indicating a move towards an independent technological ecosystem in China [2][14] Group 1: Model Capabilities - GLM-5 can handle complex tasks beyond simple code generation, showcasing "engineering-level intelligence" [4][5] - The model supports a context length of 200K tokens, enabling it to manage long-term planning and multi-round interactions effectively [4][6] - The introduction of DSA (DeepSeek Sparse Attention) reduces computational complexity by 1.5-2 times without loss of performance, allowing for more efficient processing [6][7][9] Group 2: Training and Efficiency Innovations - GLM-5 features a restructured reinforcement learning (RL) architecture that decouples model generation from training, significantly enhancing throughput [13] - The model's training efficiency is optimized through asynchronous RL algorithms, allowing for stable learning in complex environments [13] - The overall design emphasizes efficiency innovations over sheer computational power, which is crucial for the Chinese AI landscape [10] Group 3: Hardware Adaptation - GLM-5 is natively compatible with various domestic GPU ecosystems, including Huawei Ascend and others, marking a shift towards system-level adaptation rather than reliance on foreign hardware [14] - The model's performance on a single domestic computing node is comparable to that of a cluster of two international GPUs, with deployment costs reduced by 50% in long-sequence processing scenarios [14] Group 4: Comprehensive AI Engineering - The development of GLM-5 represents a complete closed-loop system that integrates model architecture innovation, training efficiency optimization, and deep adaptation to domestic chips [15] - This signifies a transition for Chinese AI from application-level advantages to full-stack optimization, including architecture, algorithms, training systems, and inference frameworks [15][18] - The report emphasizes a mature approach to AI development, focusing on practical engineering metrics rather than competitive benchmarking [18]
MinerU全面深度适配主流国产算力,10余家国产AI芯片在列
Huan Qiu Wang· 2026-02-12 08:45
【环球网科技综合报道】2月12消息,上海人工智能实验室 OpenDataLab 团队、 DeepLink 团队及国产 芯片厂家携手,于日前先后完成了昇腾、平头哥、沐曦、海光、燧原、摩尔线程、天数智芯、寒武纪、 昆仑芯、太初元碁、壁仞等 10 余家主流国产算力的适配。此举旨在通过软硬件协同的全栈优化策略, 深度适配各类算力,全面提升 MinerU 项目的生态兼容性与适应力,赋能更多开发者与企业高效构建大 模型语料基石。 近期,国内不少主流AI大模型相继推出更新版本,国产AI芯片企业也紧随其后适配新版本大模型。 对此,太初元碁相关负责人表示,截至目前其已完成包括DeepSeek、千问、智谱、MinerU、文心一言 等在内的30多个AI大模型的国产算力适配工作,涵盖了Qwen3 Dense/MoE 系列模型、BAAI Embedding / Reranker系列模型、Qwen-VL、LLaVA等多模态理解系列模型;Stable-Diffusion、FLUX、Wan系列等 多模态生成类模型;GLM、Seed-OSS、文心一言等大语言模型;以及MinerU、DeepSeek-OCR 2、 Paddle-OCR等主流OC ...
新品密集发布+国产算力适配 商汤科技股价早盘拉升8.5%
Zhi Tong Cai Jing· 2025-12-22 05:45
Core Insights - SenseTime's stock price experienced a significant increase, with a peak rise of over 8.5%, reaching HKD 2.04, driven by recent technological advancements and product innovations in AI [1] - The company launched several new AI-based products during its product release week, including the first integrated creative multi-episode generation AI agent "Seko 2.0" and the first native AI office assistant "Xiao Huan Xiong 3.0" [1] - SenseTime achieved a milestone in adapting its self-developed models to domestic chips, with a strategic partnership established with companies like Zhongke Shuguang and Daxiao for deep integration of its large models [1] Product Innovations - New AI products introduced include the "Ruying" marketing AI engine, "Daxiao Robot" for embodied intelligence, "Kapi Camera" AI photo assistant, and a new generation AI financial assistant "Kapi Accounting" [1] - The SenseCore device has completed full adaptation with mainstream domestic AI chips, including Huawei Ascend, Cambricon, and others, enhancing operational efficiency [2] Strategic Collaborations - SenseTime has established a joint optimization mechanism with chip manufacturers to significantly improve operational efficiency on domestic chips [2] - The company is positioned for potential valuation recovery as product deployment progresses and industry demand increases [2]