Claude 4.1

GPT-5 Isn't Chasing AGI; It Represents OpenAI's Commercial Ambitions
36Kr · 2025-08-08 10:28
In the early hours of August 8, Beijing time, OpenAI released GPT-5, the latest generation of its GPT models.

| | GPT-5 (high) | Gemini 2.5 Pro | Grok 4 | Claude 4.1 Opus |
| --- | --- | --- | --- | --- |
| AIME '25 (no tools) | 94.6% | 93.8% | 90.5% | 94.1% |
| FrontierMath (with python tool only) | 26.3% | 27.1% | 24.0% | 25.8% |
| GPQA diamond (no tools) | 85.7% | 86.1% | 83.2% | 85.9% |
| HLE[1] (no tools) | 24.8% | 23.5% | 21.1% | 24.2% |
| HMMT 2025 (no tools) | 93.3% | 92.9% | 89.7% | 93.0% |

GPT-5 leads its competitors by single-digit margins

This new application of synthetic data has the previous generation of advanced models generate high-quality data, and lets the ...
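To make the size of those margins concrete, the short sketch below subtracts the best competitor score from GPT-5's score on each benchmark. The numbers are copied from the table in this entry; the names `SCORES` and `gpt5_margins` are illustrative and do not come from the article.

```python
# Scores in percent, copied from the table in this entry. Column order:
# GPT-5 (high), Gemini 2.5 Pro, Grok 4, Claude 4.1 Opus.
SCORES = {
    "AIME '25 (no tools)": (94.6, 93.8, 90.5, 94.1),
    "FrontierMath (with python tool only)": (26.3, 27.1, 24.0, 25.8),
    "GPQA diamond (no tools)": (85.7, 86.1, 83.2, 85.9),
    "HLE (no tools)": (24.8, 23.5, 21.1, 24.2),
    "HMMT 2025 (no tools)": (93.3, 92.9, 89.7, 93.0),
}


def gpt5_margins(scores):
    """Return, per benchmark, GPT-5's score minus the best competitor's score."""
    return {name: round(row[0] - max(row[1:]), 1) for name, row in scores.items()}


margins = gpt5_margins(SCORES)
for name, margin in margins.items():
    # A positive value means GPT-5 is ahead; a negative value means it trails.
    print(f"{name}: {margin:+.1f} points")
```

On these figures every gap, in either direction, is smaller than one percentage point.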
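No Hype, No Hate: How Good Is GPT-5 at Coding, Really? Comparison Tests Against Gemini and Claude Give You the Answer
歸藏的AI工具箱 (Guizang's AI Toolbox) · 2025-08-08 09:44
Hi everyone, I'm 歸藏 (guizang), and I tested GPT-5's front-end capabilities. The long-awaited GPT-5 was finally released last night; I went to bed early for fear of the power drill upstairs, so I didn't watch the livestream. Compared with the buzz in China, there seems to be little discussion on Twitter. When I got up this morning and read people's comments, the consensus was that the release is unremarkable and that some capabilities have even regressed. Model testing is hard to do these days, because many people don't understand the difference between pure model capability and agent capability, and everyone has their own leanings and preferences on things like EQ and writing. So let's look at coding, where the progress is most obvious. There is also a video version here: Because o3's front-end capabilities were so poor, I didn't dare start with anything hard this time, and began with a Bento Grid promotional long-image web page.

Based on the key information in the product introduction article below, generate a Chinese-language dynamic web page in a Bento Grid visual style similar to Apple keynote slides. Specific requirements:
1. Show all the information on one page where possible; the background is #F8F6F5, card backgrounds are white, the text color is #010101, and highlighted buttons and text backgrounds use a #F69AAC-DF95E3-7DBDE9 gradient, with the layout inside each card being
2. Place the icons from the Markdown-format image links in suitable cards, and keep the icons from overlapping the text
3. Emphasize core points with oversized type or numbers; the page includes oversized visual elements that highlight the key points, with ...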
GPT-5 Is Finally Here, and the 982 Days in Which It Changed the World
36Kr · 2025-08-08 04:15
Core Insights
- GPT-5 was officially released on August 8, 2025, and quickly dominated the LMArena leaderboard, ranking first in all categories [3][7]
- The release of GPT-5 marks a significant advancement in AI capabilities, particularly in reasoning and agentic AI, although it does not represent a leap in performance compared to its predecessor GPT-4 [8][34]
- OpenAI has introduced four versions of GPT-5, catering to different user needs and scenarios, including a lightweight version and a chat-specific version [9][11]
Group 1: GPT-5 Release and Features
- GPT-5 integrates capabilities from both the GPT series and the o series, allowing it to automatically select the optimal model for specific tasks [11][12]
- The pricing for GPT-5 is competitive, with API costs lower than those of GPT-4, making it accessible for various applications [14][17]
- OpenAI aims to simplify the user experience by reducing the complexity of model selection, addressing the "choice paralysis" faced by users [11][12]
Group 2: Market Context and Competitive Landscape
- The AI landscape is increasingly competitive, with numerous companies releasing open-source models, leading to a narrowing gap between open-source and closed-source models [54][55]
- OpenAI's revenue has surged, reaching an annualized figure of $12 billion by July 2025, driven largely by consumer subscriptions [48][50]
- Major tech companies like Microsoft, Google, and Meta have also seen significant growth in market value and revenue due to advancements in AI technologies [52][53]
Group 3: User Engagement and Adoption
- ChatGPT has achieved remarkable user engagement, with 700 million weekly active users, reflecting its deep integration into daily life [42][45]
- The application has maintained a strong growth trajectory, becoming the fastest app to reach 1 billion downloads and 500 million monthly active users [47]
- OpenAI's strategic focus on user-friendly applications and real-world use cases has enhanced the appeal of GPT-5 across various sectors, including education and healthcare [25][28]
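GPT-5 Is Finally Here, and the 982 Days in Which It Changed the World
36Kr · 2025-08-08 00:07
The following article is from 智能涌现 (AIEmergence), by Deng Yongyi and Zhou Xinyu. 智能涌现 covers the industrial revolution emerging in the new AI era and is an account under 36Kr.

A big show, a modest update.

By Deng Yongyi and Zhou Xinyu | Editor: Su Jianxun | Source: 智能涌现 (ID: AIEmergence) | Cover image: 视觉中国 (Visual China)

The July just past was a frenzied month for open source: more than ten AI companies, including Alibaba (Qwen), Moonshot AI (Kimi), and Zhipu (GLM), released new open-source models. Chinese open-source models took 9 of the top 10 places on OpenRouter's trending chart. But needless to say, GPT-5 has now arrived, and on sheer strength it brings this contest to a close.

Source: OpenAI

GPT-5 was officially released at 1 a.m. on August 8, Beijing time. OpenAI did not disclose GPT-5's parameter count; the model uses a multi-tier architecture, integrates the reasoning capabilities of the o3 series, and focuses on improving agentic AI capabilities. After launch, GPT-5 quickly swept the LMArena leaderboard, ranking first in every category.

[LMArena leaderboard screenshot; columns: Model, Overall, Hard Prompts, Coding, Math, Creative Writing, Instruction Following, ...]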
The World's Largest AI Model Aggregation Platform Is Born! It Doesn't Compete for the Title, It Just Runs the Ring
量子位 (QbitAI) · 2025-08-07 09:02
Core Viewpoint
- The value of AI lies not in having the most powerful model but in selecting the most suitable model for each scenario, as articulated by Amazon Web Services (AWS) in its "Choice Matters" strategy [1][2].
Summary by Sections
AI Model Strategy
- AWS introduced the "Choice Matters" strategy, advocating a collaborative approach in which multiple models work together according to their strengths rather than relying on a single dominant model [2][13].
- The Amazon Bedrock platform lets businesses select models based on performance, cost, and task suitability, much as they would choose tools [2][21].
Cloud Services Insight
- AWS's extensive service offerings include 429 computing services, 266 storage services, 513 database services, and 421 AI and machine learning services, reflecting a deep understanding of diverse business needs [3][4].
Market Validation
- The strategy has been validated by market developments, including the recent collaboration with OpenAI, which makes OpenAI's open-source models available through Amazon Bedrock and Amazon SageMaker [6][24].
- New models such as gpt-oss-120b and gpt-oss-20b on Amazon Bedrock demonstrate impressive cost-performance ratios, outperforming competitors [8][24].
Model Collaboration
- The article outlines two typical collaboration modes: "best match" for specific scenarios and "synergistic enhancement" for complex tasks, where multiple models achieve more together [14][15][16].
- Examples include a complex translation system that uses DeepSeek R1 and Claude for high-level translation queries and Nova Lite for initial translations [16].
Ecosystem Development
- AWS has become the largest AI model aggregation platform, offering over 400 mainstream commercial and open-source models, with partnerships including Anthropic, Google, and Meta [22][23].
- The rapid development of the Amazon Bedrock ecosystem is highlighted by the addition of models from top AI companies, enhancing the platform's capabilities [23].
Shift in AI Demand
- Demand for AI models has shifted from seeking the "strongest" model to finding the "most suitable" one, driven by performance-cost balance, task complexity, and customization needs [24].
- Companies like Nomura Securities and Doordash are choosing models based on their specific requirements, illustrating this trend [24].
Future of AI
- The intersection of AI and business is expected to fundamentally reshape work processes, with significant job transformations anticipated in the coming decade [26].
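The "best match" idea in this summary, choosing a model by performance, cost, and task suitability, can be sketched as a small selection function. This is only an illustration under stated assumptions: the model names, quality scores, and prices below are invented placeholders, not real Amazon Bedrock model IDs or pricing, and the article does not describe how any such selection is actually implemented.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ModelOption:
    name: str           # hypothetical label, not a real model ID
    quality: float      # illustrative 0-1 suitability score for the task
    cost_per_1k: float  # illustrative price in USD per 1,000 tokens


def pick_model(options, min_quality, budget_per_1k):
    """Return the cheapest option that meets the quality bar within budget, or None."""
    eligible = [
        o for o in options
        if o.quality >= min_quality and o.cost_per_1k <= budget_per_1k
    ]
    return min(eligible, key=lambda o: o.cost_per_1k, default=None)


# Made-up catalog used only to exercise the function.
CATALOG = [
    ModelOption("large-reasoning-model", quality=0.95, cost_per_1k=0.060),
    ModelOption("mid-general-model", quality=0.85, cost_per_1k=0.010),
    ModelOption("small-fast-model", quality=0.70, cost_per_1k=0.001),
]

# A cheap first pass and a stricter review step pick different models.
draft = pick_model(CATALOG, min_quality=0.65, budget_per_1k=0.005)
review = pick_model(CATALOG, min_quality=0.90, budget_per_1k=0.100)
print(draft.name, review.name)
```

Picking the cheapest model that clears a quality threshold is one simple policy among many; a real system might also weigh latency, context length, or customization needs.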
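Who Is Tearing Down OpenAI's Walls?
36Kr · 2025-08-06 01:41
You have probably seen it in your feed too. Last night, OpenAI made a sudden big move: it announced that it was open-sourcing two new models, gpt-oss-120b and gpt-oss-20b. It is fair to say this is the first time since GPT-2 that OpenAI has opened model weights to the open-source community again. The internet is already flooded with coverage of the parameters, inference performance, and training details, so I won't repeat it here. But I want to ask: have you thought about what this open-source release actually means?

01

Zhiyuan (智远) sees this as a strategic turning point. Bear in mind that for the past few years OpenAI has been the leading representative of the "closed-source camp." Relying on the technical lead of GPT-3 and GPT-4, it made money through API fees and subscriptions, built high walls, and all but monopolized the entry point and pricing power of the large-model era. Put bluntly, it was the one setting the rules.

Then the wind shifted. After DeepSeek took off, the situation began to loosen. A batch of open-source models not only approached GPT-4 in performance but also cost just 1/20 as much. More importantly, they use extremely permissive open-source licenses that let you use them freely, modify them freely, and even use them commercially, with almost no barrier to entry.

Faced with this shock, even Sam Altman publicly admitted on February 1 this year, in a line that stung: we may have been on the wrong side of history. So, half a year later, OpenAI finally acted. But this wave of "open source" is really not an admission of defeat; look closely and you will find there is a lot going on here. Zhiyuan believes that it ...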