355 Billion Parameters! Zhipu Releases the GLM-4.5 Model, Best Among Chinese Models on 12 Benchmarks
Xin Lang Ke Ji·2025-07-28 14:32

Core Insights
- Zhipu has launched GLM-4.5, a new flagship model designed specifically for intelligent agent applications. It is open-sourced on the Hugging Face and ModelScope platforms under the MIT License [2]
- GLM-4.5 achieves state-of-the-art (SOTA) performance in reasoning, coding, and intelligent agent capabilities. Across 12 key evaluation benchmarks, it ranks third globally among all models and first among both domestic (Chinese) and open-source models [2]
- The model is more parameter-efficient than its peers: its total parameter count of 355 billion is about half that of DeepSeek-R1 and about one-third that of Kimi-K2, and it achieves the best performance-to-parameter ratio on the SWE-bench Verified leaderboard [2][3]

Model Architecture
- The model uses a mixture-of-experts (MoE) architecture. GLM-4.5 has 355 billion total parameters and 32 billion active parameters, while GLM-4.5-Air has 106 billion total parameters and 12 billion active parameters [3]
- It offers a thinking mode for complex reasoning and tool use, and a non-thinking mode for immediate responses [3]

Pricing and Performance
- API pricing is 0.8 yuan per million input tokens and 2 yuan per million output tokens. A high-speed version can generate up to 100 tokens per second [3]
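The figures quoted above lend themselves to quick back-of-the-envelope arithmetic. The following is a minimal Python sketch, not an official Zhipu tool: the function names and the `MODELS` table are hypothetical helpers introduced here for illustration, and the only inputs are the prices, speed, and parameter counts reported in the summary. It assumes flat per-token pricing with no tiers or discounts, and treats 100 tokens per second as an upper bound on generation speed.

```python
# Figures taken from the summary above; all names here are illustrative.
INPUT_YUAN_PER_MILLION = 0.8    # reported price per million input tokens
OUTPUT_YUAN_PER_MILLION = 2.0   # reported price per million output tokens
MAX_TOKENS_PER_SECOND = 100     # reported upper bound for the high-speed version

# (total, active) parameter counts in billions, as reported.
MODELS = {
    "GLM-4.5": (355, 32),
    "GLM-4.5-Air": (106, 12),
}


def estimate_cost_yuan(input_tokens: int, output_tokens: int) -> float:
    """Estimated API cost in yuan, assuming flat per-token pricing."""
    total = (input_tokens * INPUT_YUAN_PER_MILLION
             + output_tokens * OUTPUT_YUAN_PER_MILLION)
    return total / 1_000_000


def min_generation_seconds(output_tokens: int) -> float:
    """Lower bound on generation time at the quoted maximum speed."""
    return output_tokens / MAX_TOKENS_PER_SECOND


def active_fraction(model: str) -> float:
    """Share of total parameters that are active per token in the MoE model."""
    total, active = MODELS[model]
    return active / total


if __name__ == "__main__":
    print(f"Cost for 10,000 in / 2,000 out: {estimate_cost_yuan(10_000, 2_000):.4f} yuan")
    print(f"Minimum time for 2,000 output tokens: {min_generation_seconds(2_000):.0f} s")
    for name in MODELS:
        print(f"{name}: {active_fraction(name):.1%} of parameters active")
```

Under these assumptions, a request with 10,000 input tokens and 2,000 output tokens costs about 0.012 yuan, and in both models only around a tenth of the total parameters are active for any given token, which is the sense in which the MoE design keeps inference cost well below what the headline parameter count suggests.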