腾讯混元推出首款开源混合推理模型:擅长Agent工具调用和长文理解
TENCENTTENCENT(HK:00700) AI前线·2025-06-28 05:13

Core Viewpoint - Tencent Hunyuan has launched the first open-source hybrid inference MoE model, Hunyuan-A13B, featuring 80 billion parameters with only 13 billion activated parameters, demonstrating superior performance and cost-effectiveness compared to leading open-source models in the same architecture [1][2]. Model Performance - Hunyuan-A13B has shown strong general capabilities across various authoritative industry datasets, achieving high scores in multiple categories such as Mathematics, Science, Coding, Reasoning, and Instruction [3]. - In Mathematics, Hunyuan-A13B scored 87.3 on AIME2024, outperforming competitors like OpenAI's model and Qwen3-A22B [3]. - The model supports a native context window of 256K, excelling in long-text datasets [4]. Agent Capabilities - Tencent has developed a multi-Agent data synthesis framework for Hunyuan-A13B, enhancing its performance in diverse environments through reinforcement learning [3]. - The model can switch between fast and slow thinking modes, optimizing resource allocation for different tasks [5]. Deployment and Accessibility - Hunyuan-A13B is user-friendly for individual developers, requiring only a single mid-range GPU for deployment, and supports various quantization formats [6]. - The model has been integrated into mainstream open-source inference frameworks, achieving over twice the throughput of leading open-source models [6]. Training Innovations - The model was pre-trained on 20 trillion tokens, significantly improving its general capabilities [6]. - Tencent's team has developed a Scaling Law joint formula for MoE architecture, enhancing the pre-training effectiveness [6]. New Datasets - Tencent has released two new datasets, ArtifactsBench and C3-Bench, to address gaps in industry evaluation standards for large language models [7]. - ArtifactsBench includes 1,825 tasks across nine domains, while C3-Bench features 1,024 test data points focusing on complex tool relationships and dynamic decision-making [7]. Upcoming Events - The first AICon Global AI Development and Application Conference will be held on August 22-23, focusing on AI applications, including Agent and multimodal technologies [8].