开源模型
Search documents
DeepSeek终于把OpenAI逼急了
阿尔法工场研究院· 2025-08-07 00:08
Core Viewpoint - The release of OpenAI's first open-source language model, GPT-OSS, marks a significant shift in the AI landscape, challenging the previously held belief that the strongest models would remain closed-source and proprietary [5][12][13]. Group 1: OpenAI's Strategic Shift - OpenAI has transitioned from a closed-source, paid model to an open-source collaborative ecosystem, potentially signaling a competitive stance against domestic Chinese models [14][16]. - The newly released models, GPT-OSS-120B and GPT-OSS-20B, are designed to be efficient and accessible, with the former featuring 117 billion parameters and the latter 21 billion parameters, both capable of running on consumer-grade hardware [9][10][11]. Group 2: Impact of Chinese Open-Source Models - The rapid development of Chinese open-source models, such as DeepSeek and Tongyi Qwen, has prompted OpenAI to reconsider its strategy, as these models have gained significant traction and market presence [18][20]. - The Chinese open-source model ecosystem is expected to flourish by 2025, with multiple influential teams emerging in various AI domains, including programming and multi-modal applications [20][22]. Group 3: Competitive Landscape - The competitive dynamics in the AI sector are intensifying, with Meta also reconsidering its open-source strategy, indicating a broader trend among major players to reassess their approaches in light of emerging competition [22].
资金动向 | 北水买日港股超90亿港元,加仓腾讯、阿里
Ge Long Hui· 2025-08-06 19:07
Group 1 - Tencent Holdings saw a net buy of HKD 15.18 billion, marking a total of HKD 59.44 billion in net buys over the last 10 days [1] - Alibaba-W experienced a net buy of HKD 8.76 billion, with a total of HKD 21.16 billion in net buys over the last 3 days [1] - SMIC recorded a net buy of HKD 6.11 billion, totaling HKD 9.51 billion in net buys over the last 3 days [1] Group 2 - Alibaba-W launched a new membership system integrating various Alibaba resources, aimed at enhancing consumer experience and user engagement [5] - SMIC is set to release its earnings report on August 7, with expected revenue of USD 2.185 billion for Q2 2025, a year-on-year increase of 14.91% [6] - Crystal Technology announced a significant order with DoveTree, totaling approximately USD 6 billion, setting a record for AI pharmaceutical overseas orders [6] Group 3 - Ideal Auto-W, along with China Automotive Research and Dongfeng Liuzhou Motor, issued a joint statement advocating for self-discipline and ethical competition in the automotive industry [6] - Bubble Mart received a report from Morgan Stanley stating that its intrinsic value exceeds its current IP holdings, maintaining an "overweight" rating with a target price of HKD 365 [6]
时隔六年,OpenAI 为什么再次开源?
Founder Park· 2025-08-06 14:00
Core Viewpoint - OpenAI's release of the open-source model gpt-oss marks a significant strategic shift, indicating a clearer understanding of its value proposition beyond just the model itself, focusing on its user base and application ecosystem [2][4][13]. Group 1: OpenAI's Open-Source Model Release - OpenAI has launched its first open-source model, gpt-oss, since GPT-2, with performance comparable to its proprietary o4 mini model while reducing costs by at least 10 times [2][10]. - The gpt-oss-120b model achieved a score of 90.0 on the MMLU benchmark, while the gpt-oss-20b scored 85.3, indicating competitive performance in the open-source landscape [3][8]. - The models are designed to run efficiently on various hardware, from consumer-grade GPUs to cloud servers, and are licensed under Apache 2.0, allowing for commercial deployment without downstream usage restrictions [7][8]. Group 2: Strategic Implications - OpenAI's move to open-source is not merely a technical sharing but aims to build an application ecosystem, targeting enterprises looking to deploy open-source AI models [5][12]. - The release reflects OpenAI's recognition that its core competitive advantage lies in its large user base and application ecosystem rather than just the models themselves [4][13]. - OpenAI's decision to avoid releasing training data, code, or technical reports suggests a strategy to attract businesses while potentially impacting academic research and the true open-source AI community [19][22]. Group 3: Competitive Landscape - The introduction of gpt-oss is expected to challenge existing API products, with OpenAI positioning itself aggressively in the market by offering a model that significantly undercuts the cost of its proprietary offerings [10][11]. - The architecture of gpt-oss aligns with industry trends towards sparse MoE models, indicating a shift in design preferences within the AI community [14]. - The competitive landscape is evolving, with OpenAI's release potentially reversing the previous lag in open-source model applications compared to Chinese counterparts [21][22]. Group 4: Future Considerations - The open-source model's ecosystem remains chaotic, with high-scoring models not necessarily being user-friendly, which could slow adoption rates [17][18]. - OpenAI's approach to model safety and fine-tuning raises questions about the balance between usability and security, which will need community validation [15][16]. - The ongoing competition between U.S. and Chinese open-source models highlights the need for strategic actions to maintain relevance and leadership in the AI space [20][22].
DeepSeek终于把OpenAI逼急了
Feng Huang Wang· 2025-08-06 08:21
Core Insights - OpenAI has launched its first open-source language model, GPT-OSS, which is expected to initiate a new wave of open-source development in the tech industry [1][6] - The release of GPT-OSS marks a significant shift from OpenAI's previous closed-source and paid model strategy to an open and collaborative ecosystem [6][9] Model Specifications - The GPT-OSS-120B model features a MoE architecture with 117 billion parameters, requiring only a single 80GB GPU for operation, and its performance is comparable to the closed-source O4-mini [4] - The GPT-OSS-20B model also utilizes MoE architecture, has 21 billion parameters, and can run smoothly on devices with 16GB of memory, performing similarly to O3-mini [4] Market Impact - OpenAI's decision to make GPT-OSS available for free commercial use is seen as a significant advantage for AI startups in China and globally [5] - The rapid development of Chinese open-source models, such as DeepSeek and Tongyi Qwen, has prompted OpenAI to reconsider its strategy, as these models have gained significant traction and market presence [7][8] Competitive Landscape - The emergence of competitive Chinese models has raised concerns within Silicon Valley, leading to a potential strategic shift among companies like Meta, which may abandon its open-source approach in favor of closed-source models [9] - OpenAI's recent actions indicate a heightened focus on protecting its intellectual property and maintaining a competitive edge in the evolving AI landscape [9]
Claude 小升级就赢了OpenAI 9年“开源神作”?高强度推理直接歇菜、幻觉率高达50%,写作还被Kimi 2吊锤?
3 6 Ke· 2025-08-06 07:32
值得一提的是,几乎与 gpt-oss 开源同时,谷歌 Deepmind 宣布推出 Genie 3 ,Anthropic 放出了 Claude Opus 4.1。有网友感叹,"我们生活在什么样的时 代。"马斯克也转发了这条帖子,并配了意味深长的词和表情。 刚刚,OpenAI 发布了首个开源语言模型系列 gpt-oss,包括 gpt-oss-120b 和 gpt-oss-20b 两款语言模型:完全可定制,提供完整的思维链(CoT)并支持结 构化输出。 现在,gpt-oss-120b 和 gpt-oss-20b 的权重均可在 Hugging Face 上免费下载,且它们原生采用 MXFP4 量化格式。这使得 gpt-oss-120B 模型可在 80GB 内存 内运行,而 gpt-oss-20b 仅需 16GB 内存。 下载链接:https://huggingface.co/collections/openai/gpt-oss-68911959590a1634ba11c7a4 Github 地址:https://github.com/openai/gpt-oss Claude Opus4.1 的最大亮点在于编程性能提 ...
OpenAI、谷歌等深夜更新多款模型 展示开源、智能体、世界模型进展
Di Yi Cai Jing· 2025-08-06 04:59
Core Insights - Major AI companies released new products, showcasing shifts in product strategies, particularly OpenAI's transition to open-source models and Anthropic's focus on incremental updates [1][3] OpenAI - OpenAI launched two open-source models: gpt-oss-120b with 117 billion parameters and gpt-oss-20b with 21 billion parameters, both utilizing MoE architecture [2] - The gpt-oss-120b model can run on an 80GB GPU, while gpt-oss-20b can operate on consumer devices with 16GB memory, allowing local deployment on laptops and mobile phones [2] - These models achieved top-tier performance in benchmark tests, with gpt-oss-120b scoring close to or exceeding the closed-source o4-mini model [2] Anthropic - Anthropic introduced Claude Opus 4.1, marking a shift towards more frequent, incremental updates rather than focusing solely on major version releases [3] - The new model demonstrated improved capabilities in complex multi-step problem-solving and coding tasks, with a SWE-bench Verify score of 74.5%, surpassing the previous version [4] Google - Google launched Genie 3, its first world model allowing real-time interaction, building on previous models Genie 1 and Genie 2 [5] - Genie 3 can simulate diverse environments and natural phenomena, maintaining visual consistency for up to several minutes at 720p resolution [6] - Despite advancements, Genie 3 has limitations, such as restricted action space and challenges in simulating multiple agents in shared environments [9]
OpenAI、谷歌等深夜更新多款模型,展示开源、智能体、世界模型进展
Di Yi Cai Jing· 2025-08-06 04:49
Core Insights - The recent product launches by OpenAI, Anthropic, and Google indicate a shift in product strategies among major AI model developers, with a focus on open-source models and incremental updates [1][3][5] OpenAI - OpenAI has released two open-source models, gpt-oss-120b with 117 billion parameters and gpt-oss-20b with 21 billion parameters, both utilizing the MoE architecture [2] - The gpt-oss-120b model can run on a single 80GB GPU, while gpt-oss-20b can operate on consumer devices with 16GB memory, allowing for local deployment on laptops and mobile devices [2] - OpenAI's CEO, Sam Altman, emphasized the importance of releasing powerful open-source models, which are the result of billions of dollars in research [1][2] Anthropic - Anthropic has shifted its strategy to focus on more frequent incremental updates rather than solely major version releases, exemplified by the launch of Claude Opus 4.1 [3] - Claude Opus 4.1 shows improvements in coding capabilities, scoring 74.5% on the SWE-bench Verify benchmark, surpassing its predecessor [4] - The new model is designed to handle complex multi-step problems more effectively, positioning it as a more capable AI agent [3][4] Google - Google introduced Genie 3, its first world model that supports real-time interaction, building on previous models like Genie 1 and Genie 2 [5] - Genie 3 can simulate diverse interactive environments and model physical properties, allowing for realistic navigation and interaction within generated worlds [5][6] - Despite its advancements, Google acknowledges limitations in Genie 3, such as restricted action spaces and challenges in simulating multiple agents in shared environments [9]
谁在拆 OpenAI 的围墙?
3 6 Ke· 2025-08-06 01:41
Core Insights - OpenAI's recent decision to open-source two new models, gpt-oss-120b and gpt-oss-20b, marks a strategic shift from its previous closed-source approach, which had established a dominant position in the large model market [1][2][3] - The move is seen as a response to the rising competition from open-source models that offer similar performance at significantly lower costs, prompting OpenAI to reconsider its strategy [2][4] Group 1: Strategic Implications - OpenAI's choice to use the Apache 2.0 license for its open-source models allows for commercial use and modifications, directly competing with Meta's Llama [3] - The models released are of medium scale, ensuring they do not threaten OpenAI's high-end closed-source products while still attracting developers [3][4] - OpenAI aims to maintain control over its core technology by keeping critical components, such as training data and optimization strategies, proprietary [4][8] Group 2: Market Dynamics - The AI industry is entering a phase of "layered competition," with OpenAI pursuing a dual strategy of open-source models to attract developers while retaining high-profit closed-source products for enterprise clients [5][7] - In contrast, Anthropic has chosen to focus on closed-source models targeting high-paying clients in sectors that prioritize safety and reliability, indicating a market segmentation based on user needs [6][7] Group 3: Regulatory Considerations - OpenAI's introduction of open-source models may serve as a proactive measure against increasing regulatory scrutiny on closed-source models, as open-source solutions are generally more transparent and easier to audit [8] - This strategic positioning could provide OpenAI with a competitive advantage as regulatory frameworks evolve, allowing it to maintain relevance in a changing landscape [8][10] Group 4: Developer Opportunities - The open-source models support local deployment and integration with popular frameworks, significantly lowering the barrier for independent developers to create advanced AI applications [8][10] - This shift could lead to a new wave of innovation, with the potential for groundbreaking AI applications emerging from smaller, independent developers [8][10]
奥特曼深夜官宣:OpenAI重回开源,两大推理模型追平o4-mini,号称世界最强
3 6 Ke· 2025-08-06 00:31
OpenAI深夜扔出开源核弹,gpt-oss 20B和120B两款模型同时上线。它们不仅性能比肩o3-mini和o4-mini,而且还能在消费级显卡甚至手机上轻松运行。 GPT-2以来,奥特曼终于兑现了Open AI。 他来了!他来了! 就在今夜,奥特曼带着两款全新的开源模型走来了! 正如几天前泄露的,它们分别是总参数1170亿,激活参数51亿的「gpt-oss-120b」和总参数210亿,激活参数36亿的「gpt-oss-20b」。 终于,OpenAI再次回归开源。 gpt-oss-120b 在核心推理基准测试中,120B模型的表现与OpenAI o4-mini相当,并且能在单张80GB显存的GPU上高效运行(如H100)。 gpt-oss-20b适用于低延迟、本地或专业化场景 在常用基准测试中,20B模型的表现与OpenAI o3-mini类似,并且能在仅有16GB显存的边缘设备上运行。 除此之外,两款模型在工具使用、少样本函数调用、CoT推理以及HealthBench评测中也表现强劲,甚至比OpenAI o1和GPT-4o等专有模型还要更强。 其他亮点如下: 宽松的Apache 2.0许可证:可自由用于 ...
OpenAI发布2款开源模型,北大校友扛大旗
Hu Xiu· 2025-08-06 00:15
本文来自微信公众号:APPSO (ID:appsolution),作者:发现明日产品的,原文标题:《刚刚,OpenAI发布2款开源模型!手机笔记本也能跑,北大校 友扛大旗》,题图来自:AI生成 时隔五年之后,OpenAI刚刚正式发布两款开源权重语言模型——gpt-oss-120b和gpt-oss-20b,而上一次他们开源语言模型,还要追溯到2019年的GPT-2。 OpenAI是真open了。 而今天AI圈也火药味十足,OpenAI开源gpt-oss、Anthropic推出Claude Opus 4.1(下文有详细报道)、Google DeepMind发布Genie 3,三大巨头不约而同在 同一天放出王炸,上演了一出神仙打架。 OpenAI CEO Sam Altman(山姆·奥特曼)在社交媒体上的兴奋溢于言表:"gpt-oss发布了!我们做了一个开放模型,性能达到o4-mini水平,并且能在高端 笔记本上运行。为团队感到超级自豪,这是技术上的重大胜利。" 模型亮点概括如下: gpt-oss-120b:大型开放模型,适用于生产、通用、高推理需求的用例,可运行于单个H100 GPU(1170亿参数,激活参数为5 ...