GLM4.6
Search documents
Kimi杨植麟称“训练成本很难量化”,仍将坚持开源策略
第一财经· 2025-11-11 12:04
Core Viewpoint - Kimi, an AI startup, is focusing on open-source model development, with the recent release of Kimi K2 Thinking, which has a training cost of $4.6 million, significantly lower than competitors like DeepSeek V3 and OpenAI's GPT-3 [3][4][6] Summary by Sections Model Development and Costs - Kimi has invested heavily in open-source model research and updates over the past six months, releasing Kimi K2 Thinking on November 6, with a reported training cost of $4.6 million, lower than DeepSeek V3's $5.6 million and OpenAI GPT-3's billions [3][4] - CEO Yang Zhilin clarified that the $4.6 million figure is not official, as most expenses are on research and experimentation, making it difficult to quantify training costs [4][6] Model Performance and Challenges - Users raised concerns about the reasoning length of Kimi K2 Thinking and discrepancies between leaderboard scores and actual performance. Yang stated that the model currently prioritizes absolute performance, with plans to improve token efficiency in the future [4][7] - The gap between leaderboard performance and real-world experience is expected to diminish as the model's general capabilities improve [7] Market Position and Strategy - Chinese open-source models are increasingly being utilized in the international market, with five Chinese models appearing in the top twenty of the OpenRouter model usage rankings [7] - Kimi currently can only be accessed via API due to interface issues with the OpenRouter platform [7] - Kimi plans to maintain its open-source strategy, focusing on the application and optimization of Kimi K2 Thinking while balancing text and multimodal model development, avoiding direct competition with leading firms like OpenAI [6][8]
Kimi杨植麟称“训练成本很难量化” 仍将坚持开源策略
Di Yi Cai Jing· 2025-11-11 10:45
Core Insights - Kimi, an AI startup, has released its latest open-source model, Kimi K2 Thinking, with a reported training cost of $4.6 million, significantly lower than competitors like DeepSeek V3 at $5.6 million and OpenAI's GPT-3, which costs billions to train [2][3] - The company emphasizes ongoing model updates and improvements, focusing on absolute performance while addressing user concerns regarding inference length and performance discrepancies [2][3] - Kimi's models are gaining traction in the international market, with five Chinese open-source models listed among the top twenty on the OpenRouter platform [3][5] Company Strategy - Kimi plans to maintain its open-source strategy and prioritize the application and optimization of the Kimi K2 Thinking model, while also developing multimodal models [5] - The company aims to differentiate itself from leading competitors like OpenAI by focusing on architectural innovation, open-source strategies, and cost control, avoiding direct competition in specific AI browser markets [5] Technical Aspects - Kimi utilizes H800 GPUs with InfiniBand technology for high-performance computing and AI training, despite having fewer and less powerful chips compared to U.S. counterparts [3] - The training cost and resource allocation for Kimi K2 Thinking are primarily directed towards research and experimentation, making precise cost quantification challenging [2]
Kimi杨植麟称“训练成本很难量化”,仍将坚持开源策略
Di Yi Cai Jing· 2025-11-11 10:35
Core Insights - Kimi, an AI startup, has released its latest open-source model, Kimi K2 Thinking, with a reported training cost of $4.6 million, significantly lower than competitors like DeepSeek V3 at $5.6 million and OpenAI's GPT-3, which costs billions to train [1][2] - The company emphasizes ongoing model updates and improvements, focusing on absolute performance while addressing user concerns regarding inference length and performance discrepancies [1] - Kimi's strategy includes maintaining an open-source approach and advancing the Kimi K2 Thinking model while avoiding direct competition with major players like OpenAI through innovative architecture and cost control [2][4] Model Performance and Market Position - In the latest OpenRouter model usage rankings, five Chinese open-source models, including Kimi's, are among the top twenty, indicating a growing presence in the international market [2] - Kimi's current model can only be accessed via API due to platform limitations, but the team is utilizing H800 GPUs with InfiniBand technology for training, despite having fewer resources compared to U.S. high-end GPUs [2] - The company plans to balance text model development with multi-modal model advancements, aiming to establish a differentiated advantage in the AI landscape [4]
氪星晚报|光线传媒积极探索微短剧市场并筹划组建相关公司 ;DeepSeek V3.2、GLM4.6等大模型即将发布;工信部等六部门印发《机械行业稳增长工作方案(2025-2026年)》
3 6 Ke· 2025-09-29 11:43
Group 1: OPPO and Under Armour - OPPO has initiated a new imaging product series, leveraging over 17 years of mobile imaging technology, with plans to launch by 2026 [1] - Under Armour has opened its first flagship outdoor store in Shanghai, expanding its presence in 22 provinces and municipalities across China [1] Group 2: Strategic Partnerships and Projects - Xiamen Tungsten New Energy signed a strategic cooperation framework agreement with Zhongwei New Materials, projecting annual supply and demand of 40,000 tons for cobalt tetroxide and 50,000 tons for ternary precursors from 2025 to 2028 [2] - Donghua Technology's lithium carbonate project in Tibet has completed a 120-hour functional assessment, marking its readiness for official production [3] Group 3: Media and Entertainment - Light Chaser Animation is exploring the micro-short drama market and is planning to establish a related company [4] Group 4: Technology and Financing - DeepSeek V3.2 and GLM-4.6 models are set to be released soon, with the former already uploaded to HuggingFace [5] - "Linghou Robotics" completed over 100 million yuan in Series A financing, aimed at R&D and capacity expansion in industrial automation [7] - "Maike Technology" also secured Series A financing in the range of hundreds of millions, focusing on TGV process development [9] Group 5: Investment Trends - Fidelity International reports a significant increase in global investor interest in Chinese assets, with hedge funds actively participating in the Chinese stock market [10] - The National Development and Reform Commission supports private enterprises' deep involvement in the "Artificial Intelligence+" initiative, highlighting the growth of AI-related private companies [11] Group 6: Industry Growth Plans - The Ministry of Industry and Information Technology and other departments issued a plan for the mechanical industry, targeting an average annual revenue growth rate of around 3.5% and aiming to exceed 10 trillion yuan in revenue by 2026 [12]
氪星晚报|光线传媒积极探索微短剧市场并筹划组建相关公司 ;DeepSeek V3.2、GLM4.6等大模型即将发布;工信部等六部门印发《机械行业稳增长工...
3 6 Ke· 2025-09-29 11:42
Group 1: Company Developments - OPPO has initiated a new imaging product series, aiming to leverage over 17 years of mobile imaging technology, with plans to launch by 2026 [1] - Under Armour has opened its first flagship outdoor store in Shanghai, expanding its presence in high-end shopping centers across 22 provinces and municipalities [1] - Xiamen Tungsten has signed a strategic cooperation framework agreement with Zhongwei New Materials, projecting annual supply and demand for various lithium products from 2025 to 2028 [2] - Donghua Technology's lithium carbonate project in Tibet has completed a 120-hour functional assessment, marking its readiness for official production [3] - Light Media is exploring the micro-short drama market and plans to establish a related company [4] Group 2: Financing and Investment - "Linghou Robotics" has successfully completed over 100 million yuan in Series A financing, with funds allocated for R&D and capacity expansion in industrial automation and robotics [6] - "Maike Technology" has secured Series A financing in the range of hundreds of millions, aimed at enhancing TGV process R&D and production [8] - Nine丰 Energy plans to invest up to 3.455 billion yuan in a coal-to-natural gas project in Xinjiang, with a construction period not exceeding 36 months [7] Group 3: Market Trends and Insights - Fidelity International reports a significant increase in global investor interest in Chinese assets, with hedge funds actively increasing their positions in the Chinese stock market [9] - The National Development and Reform Commission supports private enterprises' deep participation in the "Artificial Intelligence+" initiative, highlighting the role of private firms in AI application [10] - The Ministry of Industry and Information Technology has issued a plan for the mechanical industry to maintain steady growth, targeting an average annual revenue growth rate of around 3.5% from 2025 to 2026 [10]