闭源模型
Search documents
技术先行:阿里千问APP为何跑出更快的C端加速度?
Sou Hu Cai Jing· 2025-11-24 18:24
Core Insights - The article discusses the emerging narrative of "catching up" in the AI large model sector between China and the US, highlighting the competitive dynamics between Google and Alibaba [2][6] - Both companies are pursuing a "full-stack" approach, integrating cloud computing, chips, large models, and applications to create a comprehensive ecosystem [4][6] Group 1: Company Strategies - Google was initially perceived as lagging in AI, but the release of Gemini 3 has garnered positive feedback from industry leaders [3][6] - Alibaba's Qwen series models have achieved significant success, with the Qwen app surpassing 10 million downloads in its first week, breaking previous records [4][7] - Both companies are focusing on building robust foundational technologies before launching consumer-facing applications, demonstrating strategic patience [8][10] Group 2: Market Dynamics - The AI landscape is characterized by instability, with user engagement fluctuating significantly among competing applications [10][11] - Alibaba's Qwen model has become the most widely downloaded open-source large model globally, indicating a shift in developer preferences towards open-source solutions [12][13] - The competition between open-source and closed-source models is highlighted, with Alibaba favoring an open-source approach to foster a developer ecosystem, while Google maintains a closed-source strategy to protect its core assets [11][12] Group 3: Future Outlook - The article suggests that the ultimate goal for AI applications is to create a "business closed loop" that continuously generates value for users [19][21] - Alibaba's strategy includes leveraging its AI capabilities to enhance existing business operations, creating a seamless integration of AI across its services [22][23] - The full-stack approach adopted by both companies is expected to yield higher value elasticity and resilience in the face of market fluctuations [23]
中美大模型分歧下,企业们也站在选择路口
财富FORTUNE· 2025-11-22 13:09
祥峰投资东南亚与印度区执行董事Chan Yip Pang认为,公司选择路线时要基于使用目的——是将它用 于内部生产力的提升,还是用于原生AI应用程序的构建? 如果是前者,企业要测试AI解决方案是否真的能够提高生产力,那么通常会采用闭源模型,这样可以 迅速获取投资回报率。但随着时间推移,费用会逐渐增加,在一个时间点公司会为了降低成本转向开 源。 如果是为了开发AI应用并将其作为服务销售的初创公司,选择开源模型是更好的选择,因为开源模式 能够让公司完全掌控技术栈,成本可控,且不必依赖大模型背后的巨头。相比之下,闭源模型随时可能 涨价,甚至改变模型特征,而用户公司对此毫无还手之力。 来自金融科技领域的Dyna.AI总经理兼投资者关系负责人Cynthia Siantar指出,她所在的领域受到严格监 管,监管者不会问公司的大模型是开源还是闭源,而是会问如何做出决策的?公司需要对此给出解释, 这时开源模型的优势就会凸显。 Amplify AI Group首席执行官Will Liang的客户大多来自金融服务行业,他表示,如果AI是用于关乎公司 竞争优势和机密的事项,大多情况下开源模式是更安全的选择,因为公司可以亲自部署并严 ...
谷歌前CEO公开发声,英伟达黄仁勋果然没说错,美国不愿看到的局面出现了!
Sou Hu Cai Jing· 2025-11-14 19:45
Core Viewpoint - The article discusses the growing influence of Chinese open-source AI models on the U.S. AI industry, highlighting a shift in competitive dynamics where U.S. companies are increasingly challenged by China's free and open-source offerings [1][3][19]. Group 1: U.S. AI Industry Challenges - U.S. tech giants have adopted a closed-source model, believing that maintaining control over advanced technology is essential for market position and profit [3][4]. - This closed-source strategy has led to high usage costs, limiting access for developers and hindering global adoption [5][6]. - The regulatory environment in the U.S. is becoming a burden, with numerous state-level regulations increasing operational costs and complicating compliance for AI companies [10][12]. Group 2: Chinese AI Industry Advantages - Chinese AI companies are taking a different approach by offering open-source models that are free and powerful, gaining popularity among global developers [7][9]. - The cumulative download of Alibaba's Qwen has surpassed Meta's Llama, indicating its growing acceptance in the global market [9]. - Chinese firms benefit from government support and lower operational costs, allowing them to maintain competitive pricing and foster innovation [12][18]. Group 3: Future Implications - The article suggests that the U.S. AI industry is at a crossroads, needing to reconsider its closed-source strategy to remain competitive [18][19]. - The shift towards open-source models in China is creating a robust ecosystem that could redefine industry standards and market dynamics [14][15]. - Warnings from industry leaders like Eric Schmidt and Jensen Huang highlight the urgency for U.S. companies to adapt or risk losing market share [19].
谷歌前CEO施密特:大多数国家最终可能使用中国AI模型
Feng Huang Wang· 2025-11-14 09:05
Core Insights - Eric Schmidt, former CEO of Google, expressed concerns that many countries may ultimately adopt Chinese AI models due to cost issues, leading to a geopolitical divide where the best models in the U.S. are closed-source while those in China are open-source [2] - Open-source AI models are free and publicly available for anyone to use and share, which may attract governments with less funding compared to Western nations, regardless of the quality of the models [2] - The debate between open-source and closed-source advocates centers on the rapid development and democratization of technology versus the higher security associated with closed-source models [2] Industry Context - Chinese AI models, such as DeepSeek and Alibaba's Tongyi Qwen 3, have gained significant attention this year, raising concerns about the competitive advantage of the U.S. in the AI sector [2] - Schmidt's background includes leading Google through its IPO in 2004 and currently being a founding partner at venture capital firm Innovation Endeavours, with a net worth close to $50 billion according to Bloomberg [3] - Other supporters of open-source models include Jensen Huang, CEO of Nvidia, and Arthur Mensch, CEO of French AI startup Mistral, both advocating for the development of sovereign AI, which refers to a nation's control over AI technology, data, and infrastructure [3]
全球都用上中国免费大模型后,美国AI该怎么办?
Guan Cha Zhe Wang· 2025-11-13 13:00
Core Viewpoint - Eric Schmidt, former CEO of Google, expressed concerns that due to cost issues, most countries may ultimately adopt Chinese AI models, following Nvidia CEO Jensen Huang's statement that "China will win the AI race" [1][3]. Group 1: AI Model Landscape - Schmidt highlighted a "strange paradox" in the global AI landscape, where the largest AI models in the U.S. are closed-source and paid, while China's largest models are open-source and free [3]. - Open-source AI models allow free and public use and sharing, making them attractive to governments and countries lacking substantial funding, leading them to adopt Chinese models not necessarily because they are superior, but because they are free [3][4]. Group 2: Open Source vs. Closed Source - The early development of large models favored open-source as the mainstream choice, with even OpenAI initially releasing GPT-1 and GPT-2 as open-source [4]. - Supporters of open-source argue it promotes rapid technological development and offers significant cost advantages, while proponents of closed-source models claim higher security and advanced capabilities [5]. - The rise of Chinese open-source models has diminished the perceived security advantages of closed-source models, as open-source can be deployed locally, and performance gaps are closing [5]. Group 3: Chinese AI Model Advancements - Chinese models like DeepSeek, Alibaba's Qwen, and others have embraced open-source and consistently updated their large models, gaining popularity and raising concerns about the U.S. AI competitive edge [5][6]. - MiniMax's new open-source model, MiniMax-M2, ranked in the top five globally, while Kimi's K2 Thinking model reportedly surpassed GPT-5 in performance with a development cost of only $4.6 million [6]. - Chinese models are increasingly being adopted globally, with reports of Japanese companies using Qwen as a foundational technology [6][7]. Group 4: Global Implications - The cumulative download of Alibaba's Qwen surpassed that of Meta's Llama, indicating its popularity as an open-source model [7]. - The choice of a U.S. company to use a Chinese open-source model instead of its parent company's offerings reflects a shift in preference towards quality and cost-effectiveness [7]. - Concerns have been raised about the U.S. AI industry's reliance on closed-source strategies, which may pose significant risks if they fail [7][8]. - The rapid development of Chinese open-source models is reshaping the global AI competitive landscape, prompting fears that more countries may turn to Chinese models due to their advantages in openness, security, and cost [8].
杨植麟回复:Kimi K2训练用的H800!但“只花了460万美元”嘛…
量子位· 2025-11-11 11:11
Core Insights - The Kimi K2 Thinking model reportedly cost only $4.6 million to train, which is lower than the $5.6 million for DeepSeek V3, raising questions about the valuation of closed-source giants in Silicon Valley [13][14]. - The Kimi K2 model is causing a migration trend in Silicon Valley as it offers superior performance at a lower cost compared to existing models [5][6]. - The Kimi K2 model utilizes innovative engineering techniques, including a self-developed MuonClip optimizer, which allows for stable gradient training without human intervention [18]. Training Cost and Performance - The training cost of Kimi K2 is claimed to be $4.6 million, significantly lower than other models, prompting reflection within the industry [13][14]. - Investors and companies are migrating to Kimi K2 due to its strong performance and cost-effectiveness, with reports of it being five times faster and 50% more accurate than closed-source models [8][6]. Technical Innovations - Kimi K2 has optimized its architecture by increasing the number of experts in the MoE layer from 256 to 384 while reducing the number of active parameters during inference from approximately 37 billion to 32 billion [16]. - The model employs Quantization-Aware Training (QAT) to achieve native INT4 precision inference, which enhances speed and reduces resource consumption by about 2 times [21]. Community Engagement and Future Developments - The team behind Kimi K2 engaged with the developer community through a three-hour AMA session, discussing future architectures and the potential for a next-generation K3 model [22][24]. - The team revealed that the unique writing style of Kimi K2 results from a combination of pre-training and post-training processes, and they are exploring longer context windows for future models [26][27].
Kimi K2 Thinking突袭,智能体&推理能力超GPT-5,网友:再次缩小开源闭源差距
3 6 Ke· 2025-11-07 03:07
Core Insights - Kimi K2 Thinking has been released and is now open-source, featuring a "model as agent" approach that allows for 200-300 consecutive tool calls without human intervention [1][3] - The model significantly narrows the gap between open-source and closed-source models, becoming a hot topic upon its launch [3][4] Technical Details - Kimi K2 Thinking has 1TB of parameters, with 32 billion activated parameters, and utilizes INT4 precision instead of FP8 [5][26] - It features a context window of 256K tokens, enhancing its reasoning and agent capabilities [5][8] - The model demonstrates improved performance in various benchmarks, achieving a state-of-the-art (SOTA) score of 44.9% in the Human Last Exam (HLE) [9][10] Performance Metrics - Kimi K2 Thinking outperformed closed-source models like GPT-5 and Claude Sonnet 4.5 in multiple benchmarks, including HLE and BrowseComp [10][18] - In the BrowseComp benchmark, where human average intelligence scored 29.2%, Kimi K2 Thinking achieved a score of 60.2%, showcasing its advanced search and browsing capabilities [18][20] - The model's agent programming capabilities have also improved, achieving a SOTA score of 93% in the ²-Bench Telecom benchmark [15] Enhanced Capabilities - The model exhibits enhanced creative writing abilities, producing clear and engaging narratives while maintaining stylistic coherence [25] - In academic and research contexts, Kimi K2 Thinking shows significant improvements in analytical depth and logical structure [25] - The model's responses to personal and emotional queries are more empathetic and nuanced, providing actionable insights [25] Quantization and Performance - Kimi K2 Thinking employs native INT4 quantization, which enhances compatibility with various hardware and improves inference speed by approximately 2 times [26][27] - The model's design allows for dynamic cycles of "thinking → searching → browsing → thinking → programming," enabling it to tackle complex, open-ended problems effectively [20] Practical Applications - The model has demonstrated its ability to solve complex problems, such as a doctoral-level math problem, through a series of reasoning and tool calls [13] - In programming tasks, Kimi K2 Thinking quickly engages in coding challenges, showcasing its practical utility in software development [36]
硅谷大佬带头弃用 OpenAI、“倒戈”Kimi K2,直呼“太便宜了”,白宫首位 AI 主管也劝不住
3 6 Ke· 2025-11-04 10:50
Core Insights - Silicon Valley is shifting from expensive closed-source models to cheaper open-source alternatives, driven by cost considerations and performance improvements [1][2][5] - The Kimi K2 model, developed by a Chinese startup, has gained traction due to its superior performance and lower costs compared to models from OpenAI and Anthropic [1][5] - The emergence of open-source models like DeepSeek is putting pressure on the U.S. AI industry, as these models offer significant cost savings [3][8] Cost Considerations - Chamath Palihapitiya highlighted that the decision to switch to open-source models is primarily based on cost, as existing systems like Anthropic's are too expensive [2][5] - The DeepSeek 3.2 EXP model can reduce API costs by up to 50%, charging $0.28 per million inputs and $0.42 per million outputs, compared to Anthropic's Claude model, which costs around $3.15 [3][8] Model Performance and Transition Challenges - Transitioning to new models requires significant time for fine-tuning and engineering adjustments, complicating the switch despite the lower costs of alternatives like DeepSeek [2][6] - The Kimi K2 model has been adopted by major users, indicating a trend towards prioritizing performance and cost efficiency in AI model selection [1][5] Open-Source vs. Closed-Source Dynamics - The discussion emphasizes a growing divide where high-performance closed-source models are predominantly American, while high-performance open-source models are primarily Chinese [10][12] - The U.S. is facing challenges in the open-source model space, with significant investments in closed-source models, while China is leading in open-source developments [8][10] Security and Operational Concerns - Concerns about the security of using Chinese models in the U.S. are addressed, with assurances that running these models on local infrastructure mitigates risks of data leakage [12][16] - The competitive landscape is fostering a culture of scrutiny, where companies are actively testing models for vulnerabilities, contributing to a responsible development environment [16]
硅谷大佬带头弃用OpenAI、“倒戈”Kimi K2,直呼“太便宜了”,白宫首位AI主管也劝不住
3 6 Ke· 2025-10-28 10:39
Core Insights - Silicon Valley is shifting from expensive closed-source models to cheaper open-source alternatives, driven by cost considerations and performance improvements [1][2][14] - The Kimi K2 model, developed by a Chinese startup, has gained traction due to its superior performance and significantly lower costs compared to models from OpenAI and Anthropic [1][5][14] - The introduction of the DeepSeek model, which offers a 50% reduction in API costs, is putting pressure on the U.S. AI industry to adapt [3][8] Cost Considerations - Chamath Palihapitiya highlighted that the decision to switch to open-source models is primarily based on cost, as existing systems like Anthropic's are too expensive [2][5] - The DeepSeek model charges $0.28 per million inputs and $0.42 per million outputs, while Anthropic's Claude model costs approximately $3.15 for similar services, making DeepSeek 10 to 35 times cheaper [3][8] Model Performance and Transition Challenges - Transitioning to new models like DeepSeek requires significant time for adjustments and fine-tuning, complicating the switch despite the cost benefits [2][6] - Companies are facing a dilemma on whether to switch to cheaper models or wait for existing models to catch up in performance [6][10] Open-Source vs. Closed-Source Dynamics - The current landscape shows that high-performance closed-source models are predominantly from the U.S., while high-performance open-source models are emerging from China [10][12] - The open-source movement is seen as a way to counterbalance the power of large tech companies, but the leading open-source models are currently from China [8][10] Security and Ownership Concerns - There are concerns regarding the ownership and potential security risks associated with using Chinese models, but deploying them on U.S. infrastructure mitigates some of these risks [12][16] - The competitive landscape encourages rigorous testing for vulnerabilities, which is seen as a positive development for model safety [16][17] Future Implications - The ongoing shift towards open-source models may lead to significant changes in the AI industry, particularly in terms of cost and energy consumption [5][10] - Companies are exploring solutions to manage rising energy costs associated with AI operations, indicating a need for sustainable practices in the industry [11][12]
硅谷大佬带头弃用 OpenAI、“倒戈”Kimi K2!直呼“太便宜了”,白宫首位 AI 主管也劝不住
AI前线· 2025-10-28 09:02
Core Insights - The article discusses a significant shift in Silicon Valley from expensive closed-source AI models to more affordable open-source alternatives, particularly highlighting the Kimi K2 model developed by a Chinese startup [2][3] - Chamath Palihapitiya, a prominent investor, emphasizes the cost advantages of using the Kimi K2 model over models from OpenAI and Anthropic, which he describes as significantly more expensive [3][5] - The conversation also touches on the competitive landscape of AI, where open-source models from China are putting pressure on the U.S. AI industry [5][10] Cost Considerations - Palihapitiya states that the decision to switch to open-source models is primarily driven by cost considerations, as the existing systems from Anthropic are too expensive [3][5] - The new DeepSeek 3.2 EXP model from China offers a substantial reduction in API costs, with charges of $0.28 per million inputs and $0.42 per million outputs, compared to Anthropic's Claude model, which costs approximately $3.15 per million [5][10] Model Performance and Transition Challenges - The Kimi K2 model boasts a total parameter count of 1 trillion, with 32 billion active parameters, and has been integrated by various applications, indicating its strong performance [2][5] - Transitioning to new models like DeepSeek is complex and time-consuming, often requiring weeks or months for fine-tuning and engineering adjustments [3][7] Open-Source vs. Closed-Source Dynamics - The article highlights a structural shift in the AI landscape, where open-source models from China are gaining traction, while U.S. companies are primarily focused on closed-source models [10][12] - There is a growing concern that the U.S. is lagging in the open-source AI model space, with significant investments from Chinese companies leading to advancements that challenge U.S. dominance [10][12] Security and Ownership Issues - Palihapitiya explains that Groq's approach involves obtaining the source code of models like Kimi K2, deploying them in the U.S., and ensuring that data does not return to China, addressing concerns about data security [15][18] - The discussion raises questions about the potential risks of using Chinese models, including the possibility of backdoors or vulnerabilities, but emphasizes that open-source nature allows for community scrutiny [18][19] Future Implications - The article suggests that the ongoing competition between U.S. and Chinese AI models could lead to significant changes in the industry, particularly in terms of cost and energy consumption [6][12] - There is a recognition that the future of AI will be decentralized, with numerous players in both the U.S. and China contributing to the landscape, making it essential to address national security concerns [19][20]