Core Viewpoint - The article highlights significant advancements in two leading Chinese large model companies, DeepSeek and Zhiyu, with new model releases expected to enhance their capabilities and market position [2][3]. Group 1: DeepSeek Developments - DeepSeek announced the upload of its new model, DeepSeek-V3.2, to the HuggingFace community platform on September 29 [2]. - The previous version, DeepSeek-V3.1, released in August, introduced a mixed inference architecture that supports both thinking and non-thinking modes, improved thinking efficiency, and enhanced agent capabilities through post-training optimization [3]. Group 2: Zhiyu Developments - Zhiyu is set to release its new model, GLM-4.6, with some users already able to access it via API [2]. - The flagship model GLM-4.5, launched in July, integrates reasoning, coding, and agent capabilities into a single model to meet complex application needs [3]. - In August, Zhiyu also released the GLM-4.5V, a high-performance open-source visual reasoning model with a total of 106 billion parameters and 12 billion active parameters [3].
DeepSeek和智谱都将于近日发布新模型,或将迎来重大突破
IPO早知道·2025-09-29 09:45