Workflow
智能体Agent
icon
Search documents
阿里Qwen3发布,超越DeepSeek-R1等登顶全球最强开源模型
Investment Rating - The report rates the industry as "Outperform" [1] Core Insights - The release of Alibaba's Qwen3 confirms that leading AI companies in China are at the forefront of global technology, with open-source models expected to significantly boost the AI industry [2][9] - Qwen3 achieved a new high in the BFCL evaluation, indicating strong support for the upcoming AI Agent era [2][12] - The report maintains a positive outlook on the computer sector and suggests monitoring specific companies such as Guangzhou Sie Consulting, ArcSoft Corporation, Hygon Information Technology Co., Ltd., and others [2][9] Summary by Sections Qwen3 Model Performance - Alibaba launched Qwen3, the world's strongest open-source model, with the flagship model Qwen3-235B-A22B surpassing top competitors like DeepSeek-R1 and OpenAI's models [10][12] - Qwen3's dataset has expanded to approximately 36 trillion tokens, nearly double that of its predecessor Qwen2.5, covering 119 languages [11] - Qwen3 supports two thinking modes: a thoughtful mode for complex problems and a quick mode for simpler queries, enhancing its operational efficiency [11] Agent Capabilities - Qwen3 excels in the Agent domain, achieving a score of 70.8 in the BFCL evaluation, surpassing other leading models [12] - The introduction of Qwen-Agent simplifies the integration of tools, enhancing the model's capabilities in real-world applications [12] Investment Recommendations - The report highlights several companies to watch, including 合合信息 (Hehe Information), 赛意信息 (Saiyi Information), 鼎捷数智 (Dingjie Smart), and others, with detailed earnings forecasts provided [6][9]
刚刚,Qwen3 终于发布!混合推理模式、支持MCP,成本仅DeepSeek R1三分之一,网友喊话小扎:工程师要赶紧加班了
AI前线· 2025-04-28 23:57
Qwen3 在推理、指令遵循、工具调用、多语言能力等方面均大幅增强。在官方的测评中,Qwen3 创下所有国产模型及全球开源模型的性能新高:在奥 数水平的 AIME25 测评中,Qwen3 斩获 81.5 分,刷新开源纪录;在考察代码能力的 LiveCodeBench 评测中,Qwen3 突破 70 分大关,表现甚至超过 Grok3;在评估模型人类偏好对齐的 ArenaHard 测评中,Qwen3 以 95.6 分超越 OpenAI-o1 及 DeepSeek-R1。 | | Qwen3-235B-A22B | Qwen3-32B | OpenAl-o1 | Deepseek-R1 | Grok 3 Beta | Gemini2.5-Pro | Open Al-o 3-mini | | --- | --- | --- | --- | --- | --- | --- | --- | | | MoE | Dense | 2024-12-17 | | Think | | Medium | | ArenaHard | 95.6 | 93.8 | 92.1 | 93.2 | - | 96.4 | 89.0 | | AIM ...