Workflow
开源模型
icon
Search documents
资金动向 | 北水买日港股超90亿港元,加仓腾讯、阿里
Ge Long Hui· 2025-08-06 19:07
Group 1 - Tencent Holdings saw a net buy of HKD 15.18 billion, marking a total of HKD 59.44 billion in net buys over the last 10 days [1] - Alibaba-W experienced a net buy of HKD 8.76 billion, with a total of HKD 21.16 billion in net buys over the last 3 days [1] - SMIC recorded a net buy of HKD 6.11 billion, totaling HKD 9.51 billion in net buys over the last 3 days [1] Group 2 - Alibaba-W launched a new membership system integrating various Alibaba resources, aimed at enhancing consumer experience and user engagement [5] - SMIC is set to release its earnings report on August 7, with expected revenue of USD 2.185 billion for Q2 2025, a year-on-year increase of 14.91% [6] - Crystal Technology announced a significant order with DoveTree, totaling approximately USD 6 billion, setting a record for AI pharmaceutical overseas orders [6] Group 3 - Ideal Auto-W, along with China Automotive Research and Dongfeng Liuzhou Motor, issued a joint statement advocating for self-discipline and ethical competition in the automotive industry [6] - Bubble Mart received a report from Morgan Stanley stating that its intrinsic value exceeds its current IP holdings, maintaining an "overweight" rating with a target price of HKD 365 [6]
时隔六年,OpenAI 为什么再次开源?
Founder Park· 2025-08-06 14:00
Core Viewpoint - OpenAI's release of the open-source model gpt-oss marks a significant strategic shift, indicating a clearer understanding of its value proposition beyond just the model itself, focusing on its user base and application ecosystem [2][4][13]. Group 1: OpenAI's Open-Source Model Release - OpenAI has launched its first open-source model, gpt-oss, since GPT-2, with performance comparable to its proprietary o4 mini model while reducing costs by at least 10 times [2][10]. - The gpt-oss-120b model achieved a score of 90.0 on the MMLU benchmark, while the gpt-oss-20b scored 85.3, indicating competitive performance in the open-source landscape [3][8]. - The models are designed to run efficiently on various hardware, from consumer-grade GPUs to cloud servers, and are licensed under Apache 2.0, allowing for commercial deployment without downstream usage restrictions [7][8]. Group 2: Strategic Implications - OpenAI's move to open-source is not merely a technical sharing but aims to build an application ecosystem, targeting enterprises looking to deploy open-source AI models [5][12]. - The release reflects OpenAI's recognition that its core competitive advantage lies in its large user base and application ecosystem rather than just the models themselves [4][13]. - OpenAI's decision to avoid releasing training data, code, or technical reports suggests a strategy to attract businesses while potentially impacting academic research and the true open-source AI community [19][22]. Group 3: Competitive Landscape - The introduction of gpt-oss is expected to challenge existing API products, with OpenAI positioning itself aggressively in the market by offering a model that significantly undercuts the cost of its proprietary offerings [10][11]. - The architecture of gpt-oss aligns with industry trends towards sparse MoE models, indicating a shift in design preferences within the AI community [14]. - The competitive landscape is evolving, with OpenAI's release potentially reversing the previous lag in open-source model applications compared to Chinese counterparts [21][22]. Group 4: Future Considerations - The open-source model's ecosystem remains chaotic, with high-scoring models not necessarily being user-friendly, which could slow adoption rates [17][18]. - OpenAI's approach to model safety and fine-tuning raises questions about the balance between usability and security, which will need community validation [15][16]. - The ongoing competition between U.S. and Chinese open-source models highlights the need for strategic actions to maintain relevance and leadership in the AI space [20][22].
DeepSeek终于把OpenAI逼急了
Feng Huang Wang· 2025-08-06 08:21
摘要: 中国开源模型的爆发式发展很难不触动OpenAI的神经,以及硅谷的神经。 北京时间8月6日凌晨,OpenAI突然发布了其首个开源语言模型 GPT-OSS,在全球科技圈投下了一枚炸弹。 具体来看,gpt-oss-120b采用了MoE架构,拥有1170亿参数,其中激活参数约51亿,仅需在单张80GB的GPU上就能运行,其性能与闭源的o4-mini十分接 近。 而gpt-oss-20b同样基于MoE架构,有210亿参数,激活参数约36亿,可在配备16GB内存的设备上流畅运行,性能表现接近o3-mini。 其实,回顾过去几年,OpenAI一直在走"闭源+收费"的路线。无论是GPT-4还是GPT-4o,核心模型始终没有开放。业界也一度认为,"最强模型永远不会开 源"。 但GPT-OSS的出现,打破了这一共识。 据OpenAI官方称,GPT-OSS是一款"小型但高效"的语言模型,训练数据涵盖多语种、多领域。 更重要的是,OpenAI声称该模型"可以免费用于商业用途",这对中国乃至全球的AI初创企业来说,简直是"天降神兵"。 准备向国产模型宣战? 作为ChatGPT世代的开创者,OpenAI此举意味着一个巨大的转向: ...
Claude 小升级就赢了OpenAI 9年“开源神作”?高强度推理直接歇菜、幻觉率高达50%,写作还被Kimi 2吊锤?
3 6 Ke· 2025-08-06 07:32
值得一提的是,几乎与 gpt-oss 开源同时,谷歌 Deepmind 宣布推出 Genie 3 ,Anthropic 放出了 Claude Opus 4.1。有网友感叹,"我们生活在什么样的时 代。"马斯克也转发了这条帖子,并配了意味深长的词和表情。 刚刚,OpenAI 发布了首个开源语言模型系列 gpt-oss,包括 gpt-oss-120b 和 gpt-oss-20b 两款语言模型:完全可定制,提供完整的思维链(CoT)并支持结 构化输出。 现在,gpt-oss-120b 和 gpt-oss-20b 的权重均可在 Hugging Face 上免费下载,且它们原生采用 MXFP4 量化格式。这使得 gpt-oss-120B 模型可在 80GB 内存 内运行,而 gpt-oss-20b 仅需 16GB 内存。 下载链接:https://huggingface.co/collections/openai/gpt-oss-68911959590a1634ba11c7a4 Github 地址:https://github.com/openai/gpt-oss Claude Opus4.1 的最大亮点在于编程性能提 ...
OpenAI、谷歌等深夜更新多款模型 展示开源、智能体、世界模型进展
Di Yi Cai Jing· 2025-08-06 04:59
Core Insights - Major AI companies released new products, showcasing shifts in product strategies, particularly OpenAI's transition to open-source models and Anthropic's focus on incremental updates [1][3] OpenAI - OpenAI launched two open-source models: gpt-oss-120b with 117 billion parameters and gpt-oss-20b with 21 billion parameters, both utilizing MoE architecture [2] - The gpt-oss-120b model can run on an 80GB GPU, while gpt-oss-20b can operate on consumer devices with 16GB memory, allowing local deployment on laptops and mobile phones [2] - These models achieved top-tier performance in benchmark tests, with gpt-oss-120b scoring close to or exceeding the closed-source o4-mini model [2] Anthropic - Anthropic introduced Claude Opus 4.1, marking a shift towards more frequent, incremental updates rather than focusing solely on major version releases [3] - The new model demonstrated improved capabilities in complex multi-step problem-solving and coding tasks, with a SWE-bench Verify score of 74.5%, surpassing the previous version [4] Google - Google launched Genie 3, its first world model allowing real-time interaction, building on previous models Genie 1 and Genie 2 [5] - Genie 3 can simulate diverse environments and natural phenomena, maintaining visual consistency for up to several minutes at 720p resolution [6] - Despite advancements, Genie 3 has limitations, such as restricted action space and challenges in simulating multiple agents in shared environments [9]
OpenAI、谷歌等深夜更新多款模型,展示开源、智能体、世界模型进展
Di Yi Cai Jing· 2025-08-06 04:49
Core Insights - The recent product launches by OpenAI, Anthropic, and Google indicate a shift in product strategies among major AI model developers, with a focus on open-source models and incremental updates [1][3][5] OpenAI - OpenAI has released two open-source models, gpt-oss-120b with 117 billion parameters and gpt-oss-20b with 21 billion parameters, both utilizing the MoE architecture [2] - The gpt-oss-120b model can run on a single 80GB GPU, while gpt-oss-20b can operate on consumer devices with 16GB memory, allowing for local deployment on laptops and mobile devices [2] - OpenAI's CEO, Sam Altman, emphasized the importance of releasing powerful open-source models, which are the result of billions of dollars in research [1][2] Anthropic - Anthropic has shifted its strategy to focus on more frequent incremental updates rather than solely major version releases, exemplified by the launch of Claude Opus 4.1 [3] - Claude Opus 4.1 shows improvements in coding capabilities, scoring 74.5% on the SWE-bench Verify benchmark, surpassing its predecessor [4] - The new model is designed to handle complex multi-step problems more effectively, positioning it as a more capable AI agent [3][4] Google - Google introduced Genie 3, its first world model that supports real-time interaction, building on previous models like Genie 1 and Genie 2 [5] - Genie 3 can simulate diverse interactive environments and model physical properties, allowing for realistic navigation and interaction within generated worlds [5][6] - Despite its advancements, Google acknowledges limitations in Genie 3, such as restricted action spaces and challenges in simulating multiple agents in shared environments [9]
谁在拆 OpenAI 的围墙?
3 6 Ke· 2025-08-06 01:41
Core Insights - OpenAI's recent decision to open-source two new models, gpt-oss-120b and gpt-oss-20b, marks a strategic shift from its previous closed-source approach, which had established a dominant position in the large model market [1][2][3] - The move is seen as a response to the rising competition from open-source models that offer similar performance at significantly lower costs, prompting OpenAI to reconsider its strategy [2][4] Group 1: Strategic Implications - OpenAI's choice to use the Apache 2.0 license for its open-source models allows for commercial use and modifications, directly competing with Meta's Llama [3] - The models released are of medium scale, ensuring they do not threaten OpenAI's high-end closed-source products while still attracting developers [3][4] - OpenAI aims to maintain control over its core technology by keeping critical components, such as training data and optimization strategies, proprietary [4][8] Group 2: Market Dynamics - The AI industry is entering a phase of "layered competition," with OpenAI pursuing a dual strategy of open-source models to attract developers while retaining high-profit closed-source products for enterprise clients [5][7] - In contrast, Anthropic has chosen to focus on closed-source models targeting high-paying clients in sectors that prioritize safety and reliability, indicating a market segmentation based on user needs [6][7] Group 3: Regulatory Considerations - OpenAI's introduction of open-source models may serve as a proactive measure against increasing regulatory scrutiny on closed-source models, as open-source solutions are generally more transparent and easier to audit [8] - This strategic positioning could provide OpenAI with a competitive advantage as regulatory frameworks evolve, allowing it to maintain relevance in a changing landscape [8][10] Group 4: Developer Opportunities - The open-source models support local deployment and integration with popular frameworks, significantly lowering the barrier for independent developers to create advanced AI applications [8][10] - This shift could lead to a new wave of innovation, with the potential for groundbreaking AI applications emerging from smaller, independent developers [8][10]
奥特曼深夜官宣:OpenAI重回开源,两大推理模型追平o4-mini,号称世界最强
3 6 Ke· 2025-08-06 00:31
OpenAI深夜扔出开源核弹,gpt-oss 20B和120B两款模型同时上线。它们不仅性能比肩o3-mini和o4-mini,而且还能在消费级显卡甚至手机上轻松运行。 GPT-2以来,奥特曼终于兑现了Open AI。 他来了!他来了! 就在今夜,奥特曼带着两款全新的开源模型走来了! 正如几天前泄露的,它们分别是总参数1170亿,激活参数51亿的「gpt-oss-120b」和总参数210亿,激活参数36亿的「gpt-oss-20b」。 终于,OpenAI再次回归开源。 gpt-oss-120b 在核心推理基准测试中,120B模型的表现与OpenAI o4-mini相当,并且能在单张80GB显存的GPU上高效运行(如H100)。 gpt-oss-20b适用于低延迟、本地或专业化场景 在常用基准测试中,20B模型的表现与OpenAI o3-mini类似,并且能在仅有16GB显存的边缘设备上运行。 除此之外,两款模型在工具使用、少样本函数调用、CoT推理以及HealthBench评测中也表现强劲,甚至比OpenAI o1和GPT-4o等专有模型还要更强。 其他亮点如下: 宽松的Apache 2.0许可证:可自由用于 ...
OpenAI发布2款开源模型,北大校友扛大旗
Hu Xiu· 2025-08-06 00:15
本文来自微信公众号:APPSO (ID:appsolution),作者:发现明日产品的,原文标题:《刚刚,OpenAI发布2款开源模型!手机笔记本也能跑,北大校 友扛大旗》,题图来自:AI生成 时隔五年之后,OpenAI刚刚正式发布两款开源权重语言模型——gpt-oss-120b和gpt-oss-20b,而上一次他们开源语言模型,还要追溯到2019年的GPT-2。 OpenAI是真open了。 而今天AI圈也火药味十足,OpenAI开源gpt-oss、Anthropic推出Claude Opus 4.1(下文有详细报道)、Google DeepMind发布Genie 3,三大巨头不约而同在 同一天放出王炸,上演了一出神仙打架。 OpenAI CEO Sam Altman(山姆·奥特曼)在社交媒体上的兴奋溢于言表:"gpt-oss发布了!我们做了一个开放模型,性能达到o4-mini水平,并且能在高端 笔记本上运行。为团队感到超级自豪,这是技术上的重大胜利。" 模型亮点概括如下: gpt-oss-120b:大型开放模型,适用于生产、通用、高推理需求的用例,可运行于单个H100 GPU(1170亿参数,激活参数为5 ...
OpenAI发布ChatGPT世代首个开源模型gpt-oss,4060Ti都能跑得动。
数字生命卡兹克· 2025-08-05 22:08
Core Viewpoint - The article discusses the recent advancements in AI models, particularly focusing on OpenAI's release of the open-source model GPT-oss, which is seen as a significant move in the AI landscape, potentially reshaping the open-source community and lowering barriers for developers [9][80]. Group 1: Model Releases - Google released a new world model, Genie 3, which has generated excitement in the gaming and VR community [3]. - Anthropic announced Claude Opus 4.1, showcasing advancements in programming capabilities [5]. - OpenAI launched GPT-oss, its first open-source model since GPT-2, which includes two models: GPT-oss-120B and GPT-oss-20B [9][14]. Group 2: Model Specifications - GPT-oss-120B has 117 billion parameters with 5.1 billion active parameters per token, while GPT-oss-20B has 21 billion parameters with 3.6 billion active parameters [15][16]. - Both models support a context length of 128K and are designed to be run on consumer-grade hardware, with the 20B model requiring only 16GB of memory [17][20]. Group 3: Performance Metrics - In various benchmarks, GPT-oss-120B and GPT-oss-20B scored 90.0 and 85.3 in MMLU, respectively, indicating strong reasoning and knowledge capabilities [32]. - The models performed well in competitive programming tests, scoring 2622 and 2516 points, respectively, although they were outperformed by OpenAI's previous models [32]. Group 4: Community Impact - The release of GPT-oss is expected to lower the entry barriers for developers and enrich the AI ecosystem, allowing more users to experiment with advanced AI capabilities [80]. - OpenAI's move is seen as a response to competitive pressure from other AI companies, indicating a shift towards more open and accessible AI technologies [78][80].