Open Source

"AI Is a Young Person's Business": Shanghai, the "Model Capital," Gathers One-Third of China's AI Talent
Di Yi Cai Jing Zi Xun· 2025-04-29 16:25
Group 1
- Shanghai has entered a new phase with an economic scale exceeding 5 trillion yuan, becoming a leading hub for artificial intelligence innovation, application, industry aggregation, and talent [1]
- The city is focusing on enhancing its technological innovation capabilities and high-end industry leadership to build a globally influential technology innovation center [1]
- The AI talent pool in Shanghai accounts for one-third of the national total, with 250,000 AI professionals concentrated in the city [2]
Group 2
- Shanghai has established a development pattern with significant AI innovation communities, such as "Mosu Space" in Xuhui and "Moli Community" in Pudong, attracting numerous innovative enterprises and young talents [2]
- The "Mosu Space" has gathered nearly 400 AI companies and facilitated the implementation of 43 registered large models, accounting for approximately 61% of the city's total [8]
- The city aims to build a world-class AI industry ecosystem by 2025, covering computing power, data, models, and applications [10]
Group 3
- The local government is committed to creating a supportive environment for AI development through policy guidance, comprehensive industry chain layout, and strong data and computing power support [5][6]
- Shanghai's AI industry is projected to exceed 450 billion yuan in 2024, with a year-on-year growth of over 7.8%, having already met its "14th Five-Year Plan" goals ahead of schedule [7]
- The city is fostering an open-source ecosystem to enhance community cohesion and trust in AI technology, which is crucial for its development [7][11]
Qwen 3 Released: Open Source Is Becoming the "Optimal Solution" for Chinese LLM Companies Seeking a Breakthrough
Founder Park· 2025-04-29 12:33
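Alibaba's new-generation large model, Qwen 3, was released this morning. The new flagship, Qwen3-235B-A22B, posts benchmark results on par with DeepSeek R1, Grok-3, and Gemini-2.5-Pro. Every model in this generation supports hybrid reasoning, and Agent support has also reached a new level. With the release of Qwen 2.5 and 3, the global open-source model ecosystem has taken a new shape: the Chinese open-source pairing of DeepSeek and Qwen has replaced the earlier ecosystem led by Llama with Mistral in a supporting role. Models derived from the Qwen series are now the most popular open-source models on HuggingFace, and they outnumber those derived from the Llama series. DeepSeek's impact on and contribution to the open-source model ecosystem are equally plain to see. Compared with the "six little dragons" of large models, the open-source-first Qwen and DeepSeek have clearly won more attention from developers and founders in international markets. Code contributions from the open-source community and the emergence of more high-quality fine-tuned versions are also advancing model capability in a different way. It is fair to say that open source is becoming the best path for Chinese large-model companies to enter the global market. For Alibaba Cloud, pairing Qwen with Alibaba Cloud in a "model, cloud, industry application" playbook has opened a new direction for the MaaS model in China, and has also greatly lowered ...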
Qwen3 Makes a Late-Night Splash: Alibaba Releases Eight Large Models at Once, Outperforming DeepSeek R1 and Taking the Open-Source Crown
3 6 Ke· 2025-04-29 09:53
Core Insights
- The release of Qwen3 marks a significant advancement in open-source AI models, featuring eight hybrid reasoning models that rival proprietary models from OpenAI and Google, and surpass the open-source DeepSeek R1 model [4][24].
- Qwen3-235B-A22B is the flagship model with 235 billion parameters, demonstrating superior performance in various benchmarks, particularly in software engineering and mathematics [2][4].
- The Qwen3 series introduces a unique dual reasoning mode, allowing the model to switch between deep reasoning for complex problems and quick responses for simpler queries [8][21].
Model Performance
- Qwen3-235B-A22B achieved a score of 95.6 in the ArenaHard test, outperforming OpenAI's o1 (92.1) and DeepSeek's R1 (93.2) [3].
- Qwen3-30B-A3B, with 30 billion parameters, also shows strong performance, scoring 91.0 in ArenaHard, indicating that smaller models can still achieve competitive results [6][20].
- The models have been trained on approximately 36 trillion tokens, nearly double the data used for the previous Qwen2.5 model, enhancing their capabilities across various domains [17][18].
Model Architecture and Features
- Qwen3 employs a mixture of experts (MoE) architecture, activating only about 10% of its parameters during inference, which significantly reduces computational costs while maintaining high performance [20][24].
- The series includes six dense models ranging from 0.6 billion to 32 billion parameters, catering to different user needs and computational resources [5][6].
- The models support 119 languages and dialects, broadening their applicability in global contexts [12][25].
User Experience and Accessibility
- Qwen3 is open-sourced under the Apache 2.0 license, making it accessible for developers and researchers [7][24].
- Users can easily switch between reasoning modes via a dedicated button on the Qwen Chat website or through commands in local deployments [10][14].
- The model has received positive feedback from users for its quick response times and deep reasoning capabilities, with notable comparisons to other models like Llama [25][28].
Future Developments
- The Qwen team plans to focus on training models capable of long-term reasoning and executing real-world tasks, indicating a commitment to advancing AI capabilities [32].
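The "about 10% of its parameters" figure for the MoE models can be checked with simple arithmetic. The sketch below is illustrative only: the function name `active_fraction` is hypothetical, and the activated-parameter counts (22 billion and 3 billion) are assumed from the "A22B" and "A3B" suffixes in the model names rather than from any official specification.

```python
def active_fraction(total_b: float, active_b: float) -> float:
    """Return the share of parameters activated per inference step.

    Both arguments are parameter counts in billions.
    """
    if total_b <= 0 or not 0 <= active_b <= total_b:
        raise ValueError("expected total_b > 0 and 0 <= active_b <= total_b")
    return active_b / total_b


# (total, activated) parameters in billions.
# Assumption: activated counts are read from the "A22B" / "A3B" name suffixes.
MODELS = {
    "Qwen3-235B-A22B": (235, 22),
    "Qwen3-30B-A3B": (30, 3),
}

for name, (total_b, active_b) in MODELS.items():
    share = active_fraction(total_b, active_b)
    print(f"{name}: {share:.1%} of parameters active")
```

Under these assumptions the two MoE models come out at roughly 9.4% and 10.0%, which is consistent with the "about 10%" description above.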
Tongyi Qianwen Qwen3 Released: A Conversation with Alibaba's Zhou Jingren
晚点LatePost· 2025-04-29 08:43
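The following article is from 晚点对话, by Cheng Manqi (程曼祺). Zhou Jingren (周靖人), Alibaba Cloud CTO and head of Tongyi Lab: "Large models have moved from the beginning of the early stage into the middle of the early stage; improving only isolated capabilities is no longer an option." The Qwen3 flagship, the MoE (mixture-of-experts) model Qwen3-235B-A22B, has 235 billion total parameters and 22 billion activated parameters, and it surpasses the full-size DeepSeek-R1, which has 671 billion total and 37 billion activated parameters, on several major benchmarks. The smaller MoE model Qwen3-30B-A3B activates only 3 billion parameters in use, less than one-tenth of the parameters of QwQ-32B, the earlier pure-reasoning dense model in the Qwen series, yet it performs better. Fewer parameters with better performance mean that developers can get better results at lower deployment and usage cost. Image from the official Tongyi Qianwen blog. (Note: an MoE model activates only part of its parameters on each use, which makes it more efficient, so it has two parameter figures: total parameters and activated parameters.) Before the Qwen3 release, we interviewed Zhou Jingren, the top leader of Alibaba's large-model R&D, Alibaba Cloud CTO, and head of Tongyi Lab. He is also the main decision-maker behind Alibaba's open-source large models. To date, the Qwen series of large models has been downloaded a cumulative 3 ...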
Another Li Auto EV Is Coming: Suspected i6 Spy Shots Surface, with a More SUV-Like Rear
Xin Lang Cai Jing· 2025-04-29 07:21
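By Guo Yue (郭月), 车东西 | Editor: Zhihao (志豪). Spy shots of a second all-electric SUV have surfaced, and Li Auto is playing its cards faster. 车东西 reported on April 28 that a set of camouflaged photos, suspected to show the Li Auto i6, has recently been circulating online. Judging by its styling, the new car uses a family design language similar to that of the previously announced Li Auto i8; based on its positioning and timing, it is very likely the Li Auto i6 that is due to launch in the second half of the year. In mid-March, Li Auto CEO Li Xiang laid out this year's new-model plan on a conference call: "There are two all-electric SUVs in the second half: the i8, launching in July, and the i6, which will also launch in the second half. The release cadence of these two products will be similar to that of the L9 and L8 in 2022." Going by Li Auto's 2022 release cadence, the i6 may arrive around October this year. As Li Auto's second all-electric SUV moves closer to mass production, the company has also announced that the code for its Li Auto Halo OS (理想星环OS) operating system is open for download, making Li Auto the world's first automaker to fully open-source a vehicle-level operating system. Whether through new models or an open-sourced system, Li Auto is playing its cards faster. 01. The second all-electric SUV arrives, with less of an MPV feel. Although camouflage wrap hides some details in these spy shots, they still reveal some key information. The headlights and taillights may be flanked by slim, full-width LED light strips, and the lamp clusters appear to incorporate dot-matrix light sources, setting them apart from the Halo (星环) lamp clusters on the Li Auto L series. The suspected Li Auto ...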
Unable to Make China Back Down and Unable to Cover $36 Trillion in US Debt, Trump Takes Aim at the Biggest Creditor!
Sou Hu Cai Jing· 2025-04-29 07:10
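Why is Trump so frenzied in his second stint in office, pulling so many wild moves just over three months after taking office? The one real issue is US debt. By February this year the debt snowball had grown to $36.2 trillion, and $9.2 trillion of it matures this year. Maturity means the US has to pay interest; last year the US already paid more than $1 trillion in interest, and this year's bill is bound to be higher. Trump does not want to pay that money, and his answer is to raise revenue and cut spending (开源节流): using tariffs to increase income is the revenue side, and taking aim at the biggest creditor is the spending side. The $36 trillion debt hole. There was a time when everyone treated US Treasuries as a safe investment: the US was a superpower with a strong economy, so buying its debt meant steady interest, and China was once the largest holder of US Treasuries. After recognizing certain problems, however, China reduced its holdings, keeping them below $1 trillion and remaining the second-largest holder. Who is the largest holder? Japan, America's most loyal sidekick. In recent years, though, Japan has grown uneasy too. America's various moves are hard to read: during the pandemic it kept raising interest rates to attract countries to buy US debt, then used the "shareholders'" money to control the pandemic, pay out social security benefits, and so on. This infuriated Musk, who lashed out on social media. His Department of Government Efficiency then went on to slim down the Agency for International Development, because that agency also had plenty of shady dealings. By "raising interest rates" the US easily gained considerable wealth, but ...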
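[Entire Ascend Lineup Supports Qwen3] April 29: According to the Huawei Computing official account, Qwen3 was released and open-sourced on April 29, 2025. Ascend MindSpeed and MindIE have supported the Qwen series models in step with their releases, and the Qwen3 series works out of the box in MindSpeed and MindIE as soon as it was open-sourced, achieving day-0 adaptation for Qwen3.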
news flash· 2025-04-29 06:27
Core Insights
- Huawei's Ascend series fully supports the Qwen3 model, which was released and open-sourced on April 29, 2025 [1]
- The Ascend MindSpeed and MindIE have been consistently supporting the Qwen series models, ensuring immediate compatibility with Qwen3 upon its release [1]
Tongyi App Fully Rolls Out Qwen3
news flash· 2025-04-29 03:13
Core Insights
- The article highlights the launch of Alibaba's new generation open-source model Qwen3, available on the Tongyi App and website, enhancing user experience with advanced AI capabilities [1]
Company Developments
- The Tongyi App and Tongyi website (tongyi.com) have fully launched the Qwen3 model, which is described as the world's strongest open-source model [1]
- Users can access the dedicated intelligent agent "Qwen Large Model" and experience its top-tier intelligent capabilities on both platforms [1]
Alibaba Tops the Global Open-Source Model Rankings!
Zheng Quan Shi Bao· 2025-04-29 02:41
Core Insights
- Alibaba has released the highly anticipated Qwen3 model, which has outperformed top global models in various benchmark tests, establishing itself as a leading open-source model [1][2][3]
Model Performance
- Qwen3 achieved a score of 81.5 in the AIME25 assessment, setting a new open-source record, and scored over 70 in the LiveCodeBench test, surpassing Grok3 [1][2]
- In the Arena Hard evaluation, Qwen3 scored 95.6, outperforming OpenAI-o1 and DeepSeek-R1 [1][2]
Model Architecture
- Qwen3 utilizes a mixture-of-experts (MoE) architecture with a total parameter count of 235 billion, activating only 22 billion parameters, significantly enhancing capabilities in reasoning, instruction following, tool usage, and multilingual abilities [2][3]
Key Features
- The model integrates "fast thinking" and "slow thinking," allowing seamless transitions between simple and complex tasks, thus optimizing computational efficiency [3][4]
- Qwen3 offers eight different model sizes, including two mixture-of-experts models (30B and 235B) and six dense models (ranging from 0.6B to 32B), catering to various applications and balancing performance with cost [3][4]
Cost Efficiency
- Deployment costs for Qwen3 are significantly lower compared to competitors, with the flagship model requiring only three H20 units (approximately 360,000 yuan) for deployment, which is 25%-35% of the cost of similar models [5][6]
Open Source and Accessibility
- Qwen3 is open-sourced under the Apache 2.0 license and supports over 119 languages, making it accessible for global developers and researchers [6][7]
- The model is available on platforms like ModelScope, Hugging Face, and GitHub, with personal users able to experience it through the Tongyi app [6][7]
Industry Impact
- The release of Qwen3 is expected to significantly advance research and development in large foundational models, enhancing the AI industry's focus on intelligent applications [6][7]
- Alibaba has established itself as a leader in the open-source AI ecosystem, with over 200 models released and more than 300 million downloads globally, surpassing Meta's Llama [7]
Geely Opens Its Battery Underbody Safety Patent Portfolio as Domestic Brands Launch a Technology Open-Source Battle
Di Yi Cai Jing· 2025-04-29 02:10
Group 1
- Leading domestic brands are shifting from following standards to making the rules in the automotive industry [1][2]
- Geely announced that it is opening its battery underbody safety patent portfolio to the industry, emphasizing safety as a core value [1][3]
- The new national standard for electric vehicle battery safety was approved, with Geely being a participant in the development of the bottom impact testing standard [2]
Group 2
- The automotive industry is witnessing a technology open-source battle among leading brands, with BYD and Chery also announcing their technology sharing initiatives [1][2]
- Geely's full-domain safety testing center, which cost 2 billion yuan to build, will support safety validation under extreme conditions [3]
- The focus on safety in vehicle manufacturing has intensified, with Geely prioritizing safety and quality in its production processes [3]