腾讯混元小尺寸开源模型

Search documents
腾讯,最新发布!
中国基金报· 2025-08-04 11:30
Core Viewpoint - Tencent Hunyuan has launched four small-sized open-source models, with the smallest being 0.5B parameters, emphasizing the importance of open-source in the global large model landscape, particularly in China [2][9]. Group 1: Model Specifications - The four models have parameters of 0.5B, 1.8B, 4B, and 7B, and can run on consumer-grade graphics cards, making them suitable for low-power scenarios such as laptops, smartphones, smart cockpits, and smart homes [4]. - The models feature enhanced Agent and long-text capabilities, allowing for complex tasks such as deep search, Excel operations, and travel planning [6]. - The models have a native long context window of 256k, enabling them to process up to 400,000 Chinese characters or 500,000 English words in one go, equivalent to reading three full "Harry Potter" novels [6]. Group 2: Deployment and Support - The models are available on open-source platforms like GitHub and Hugging Face, with support from various consumer-grade chip platforms including Arm, Qualcomm, Intel, and MediaTek [7]. - Deployment requires only a single card, and they can be directly integrated into various devices such as PCs, smartphones, and tablets [6]. Group 3: Industry Trends - The open-source trend in large models is gaining momentum in China, with Tencent's models covering multiple modalities including text, image, video, and 3D generation [9]. - Other tech giants like Alibaba, ByteDance, and Xiaomi are also actively releasing their own open-source models, contributing to a competitive landscape aimed at accelerating AI adoption and innovation [10][11].