Workflow
妙刷
icon
Search documents
王兴一鸣惊人!美团首个开源大模型追平DeepSeek-V3.1
猿大侠· 2025-09-02 04:20
克雷西 明敏 发自 凹非寺 量子位 | 公众号 QbitAI 没想到啊,最新SOTA的开源大模型…… 来自一个送外卖 ( Waimai ) 的——有两个AI,确实不一样。 要知道,这可是一家"外卖公司"啊(手动狗头),做的模型都比Meta好了。 这个最新开源模型叫: Longcat-Flash-Chat ,美团第一个开源大模型,发布即开源,已经在海内外的技术圈子里火爆热议了。 一方面是因为成绩亮眼: 它在部分benchmark上,比如Agent工具调用、指令遵循的表现 超过DeepSeek-V3.1、Qwen3 MoE-2507 ,甚至比闭源的Claude4 Sonnet还要好。 编程能力也值得关注,在TerminalBench上, 和公认的"编程之王"Claude4 Sonnet不相上下 。 比如非常流行的小球氛围编程测试,LongCat编写的程序,运行起来效果是这样的: 另一方面是技术报告中透露出不少美团对于大模型的理解,包括DSMoE、MLA、动态计算、Infra等等。 我觉得这是中国大模型里最讲得详细的论文了,甚至超过Kimi、GLM,特别是在建模和infra方面。 而且不光是模型性能好,技术报告里还 ...
王兴一鸣惊人!美团首个开源大模型追平DeepSeek-V3.1
量子位· 2025-09-01 04:39
Core Viewpoint - The article discusses the launch of Meituan's open-source large model, Longcat-Flash-Chat, highlighting its impressive performance and technical innovations, which have sparked significant interest in the tech community both domestically and internationally [2][70]. Group 1: Model Performance - Longcat-Flash-Chat has outperformed several established models, including DeepSeek-V3.1 and Claude4 Sonnet, in various benchmarks, particularly in agent tool invocation and instruction adherence [3][18]. - The model's programming capabilities are noteworthy, showing comparable performance to Claude4 Sonnet in programming tasks [5]. - Longcat-Flash-Chat achieved a throughput improvement due to its unique architecture, which includes a "zero-computation expert" design, allowing it to dynamically activate parameters based on context [12][19]. Group 2: Technical Innovations - The model employs a dual design of "zero-computation experts" and Shortcut-connected MoE, which enhances training and inference throughput by allowing parallel execution of computations [12][16]. - Longcat-Flash-Chat has a total parameter count of 560 billion, which is lower than that of its competitors like DeepSeek-V3.1 and Kimi-K2, while still maintaining high performance [11][19]. - The model's training utilized over 20 trillion tokens in just 30 days, with a utilization rate of 98.48%, demonstrating its efficiency [19]. Group 3: Company Background and Strategy - Meituan's foray into large models is seen as a surprising development given its reputation as a food delivery company, but it has been building a foundation in AI through previous investments and projects [70][71]. - The establishment of the independent AI team GN06 and the launch of various AI applications indicate Meituan's commitment to integrating AI into its business model [73][74]. - Meituan's AI strategy focuses on practical applications, aiming to enhance employee efficiency and innovate existing products through AI technologies [87][85].