Opus 4.1

Search documents
X @Anthropic
Anthropic· 2025-10-03 19:45
We’ve focused on improving Claude’s skills in defensive cybersecurity. The results of this are visible in Claude Sonnet 4.5, which is comparable or superior to Opus 4.1 in cybersecurity tasks—yet both faster and cheaper.Read more: https://t.co/hwjRuOhlfJ ...
对AI的质疑,是“自欺欺人”?
Hu Xiu· 2025-09-30 04:08
Core Viewpoint - The article argues against the prevalent skepticism surrounding AI, labeling it as a misunderstanding of the exponential growth trend in technology, similar to the initial underestimation of the COVID-19 pandemic [2][6]. Group 1: AI Performance and Growth - AI models are showing exponential growth in their ability to perform complex tasks, with the latest models capable of handling over two hours of software engineering tasks [5][14]. - The METR study indicates that AI's success rate for completing long software tasks has doubled approximately every seven months, with the Sonnet 3.7 model achieving a 50% success rate for one-hour tasks [9][10]. - The GDPval assessment reveals that top AI models are nearing human performance levels across 44 professions, challenging the notion that AI is limited to software engineering [12][13]. Group 2: Future Predictions - By mid-2026, AI models are expected to autonomously work for an entire workday (8 hours), with at least one model achieving human expert performance in various industries by the end of that year [17][18]. - By the end of 2027, AI models are predicted to frequently surpass human experts in many tasks, indicating a significant shift in capabilities [18][19].
AI专家:对AI的质疑是对“指数级增长趋势”的“自欺欺人”
Hua Er Jie Jian Wen· 2025-09-30 02:13
一位来自AI研究前沿的专家坚定反驳了当前普遍存在的"AI泡沫论"。 AI明星公司Anthropic的研究员Julian Schrittwieser在其个人博客中撰文警告,当前对AI"泡沫"或"平台期"的普遍质疑,是对技术指数级增长趋势的 严重误读,这种心态与新冠疫情初期对指数级传播的忽视如出一辙。 当前围绕AI进步和所谓"泡沫"的讨论,让我想起了新冠疫情的最初几周。当指数趋势已经清晰预示了全球大流行的到来及其规模时, 政客、记者和大多数公众评论员却仍将其视为一种遥远的可能性或局部现象。 他指出,尽管AI在执行编程或网站设计等任务时仍会犯错,但人们因此断言其无法达到人类水平或影响甚微是"一种奇怪的现象",正如几年前人 们还认为AI编程是"科幻小说"。 人们注意到,虽然AI现在可以编写程序、设计网站等,但它仍然经常犯错或走向错误的方向,然后他们不知何故就得出结论,认为AI 永远无法在人类水平上完成这些任务,或者只会产生微小的影响。 Schrittwieser的核心论点基于两项关键研究:METR和OpenAI的GDPval。数据显示,AI模型自主完成复杂任务的时长正以指数级速度翻倍,最新 的模型已能处理超过两小时的 ...
X @Anthropic
Anthropic· 2025-09-24 17:44
Claude Sonnet 4 and Opus 4.1 are now available in Microsoft 365 Copilot, bringing Claude’s advanced reasoning capabilities to millions of enterprise users.Read more: https://t.co/3UTzA9A2Yk ...
X @Elon Musk
Elon Musk· 2025-08-21 07:00
AI Model Performance - Sonic MAX in Cursor is free and faster than Gemini [1] - Sonic MAX writes better code than Opus 41 [1] - Sonic MAX is comparable to Grok 4 in performance [1]
8月6日早餐 | OpenAI等多款AI大模型发布
Xuan Gu Bao· 2025-08-06 00:05
Market Overview - US stock markets declined, with the Dow Jones down 0.14%, Nasdaq down 0.65%, and S&P 500 down 0.49% [1] - Notable stock movements include Nvidia down 0.97%, Meta down 1.66%, Google A up 3.12%, Microsoft down 0.19%, Tesla down 0.17%, Apple down 0.21%, and Amazon up 0.99% [1] Economic Policies - Trump suggested leveraging Federal Reserve chair vacancies and announced potential tariffs on drugs and chips, which could reach up to 250% [2] - The Indian tariffs were significantly increased within 24 hours [2] AI Developments - OpenAI released two open-source AI models, GPT-oss-120b and GPT-oss-20b [3] - Anthropic launched Opus 4.1, enhancing programming, research, and data analysis capabilities [4] - Google DeepMind introduced Genie 3, a world model that breaks boundaries in video modeling and allows real-time interaction [5] Pharmaceutical Sector - Pfizer reported a 10% year-over-year revenue growth in Q2, exceeding expectations, and raised its full-year profit guidance [6] Technology Innovations - Meta developed a new wristband that enables "thought control" through a non-invasive neural interface, with results published in Nature [7] - The iPhone 17 series is expected to see significant price increases [8] Domestic Policy Initiatives - The People's Bank of China and six other departments issued guidelines to support financing for key manufacturing sectors, including integrated circuits and industrial mother machines [9] - The State Council announced the elimination of preschool education fees for public kindergartens, aiming to promote free early education [10] Market Strategies - Everbright Securities noted that the Shanghai Composite Index recovered above 3600 points, boosting market confidence, but cautioned that continued upward movement may face challenges due to reduced trading volume [11] Industry Insights - SK Hynix is supplying HBM4 to Nvidia at a price approximately 70% higher than HBM3E, reflecting significant technological advancements [12] - Huawei announced the full open-source of its Ascend hardware enabling CANN, aiming to enhance developer innovation [12][13] - NIO's new model, the L90, achieved impressive sales, surpassing competitors and receiving positive evaluations from top investment firms [13][14] Education Sector - The State Council's new policy on free preschool education is expected to alleviate financial burdens on families and potentially boost birth rates [15] Company Announcements - Tianfu Culture and Tourism signed cooperation agreements with various cultural and tourism entities [17] - Several companies, including Ancar Detection and Zhongchuan Special Gas, made significant announcements regarding acquisitions and certifications [19]
华尔街见闻早餐FM-Radio | 2025年8月6日
Hua Er Jie Jian Wen· 2025-08-05 23:12
Market Overview - The US ISM Services PMI shows signs of weakness, with the price index remaining high, raising concerns about stagflation risks affecting the Federal Reserve's policy path [2][3] - Large tech stocks led the decline in US markets, while small-cap stocks saw a 0.6% increase [2] - AMD's Q2 profits fell short of expectations, leading to a 5.7% drop in stock price post-earnings [2][5][16] - US Treasury yields rose broadly, with the 2-year yield increasing by nearly 5 basis points [2] - Gold experienced a V-shaped recovery, rising 1.2% from its daily low, while oil prices fell by 1.6% amid reports of potential ceasefire discussions between Russia and Ukraine [2][15] Key News - The People's Bank of China and seven departments are enhancing financial support for new industrialization and digital infrastructure [3][12] - The US ISM Services PMI for July was reported at 50.1, with the employment index contracting and the price index reaching a new high since October 2022 [3][13] - Trump is expected to announce new tariffs on drugs and chips, while also considering candidates for the Federal Reserve Board [3][13] - The US Treasury plans to issue a record $100 billion in four-week Treasury bills this week [3][14] Company Updates - AMD reported record Q2 revenue but faced a decline in profit margins and a 30% year-over-year drop in EPS, with uncertainty surrounding the export of MI308 products to China [5][16] - OpenAI released two free-to-use open-weight language models, marking its first release of this kind in six years [6][18] - Anthropic launched Opus 4.1, enhancing capabilities in programming, research, and data analysis [19] - Nvidia's stock surged, while Apple and Tesla faced declines, indicating a divergence among tech giants [29] Industry Insights - The Chinese Robotaxi industry is on the verge of large-scale deployment, with leading companies expected to achieve profitability by 2026 [24] - The high demand for high-end chips has led to a significant revenue increase for Haiguang Information, with Q2 revenue up 41.1% year-over-year [25] - The solar glass industry is facing challenges due to weak demand and high production costs, leading to a forecasted decline in profitability [24]
抢在ChatGPT-5之前,Anthropic发布功能更加强大的AI模型Opus 4.1,编程、研究、数据分析能力都更加强大
Hua Er Jie Jian Wen· 2025-08-05 16:33
市场有风险,投资需谨慎。本文不构成个人投资建议,也未考虑到个别用户特殊的投资目标、财务状况或需要。用户应考虑本文中的任何 意见、观点或结论是否符合其特定状况。据此投资,责任自负。 风险提示及免责条款 抢在ChatGPT-5之前,Anthropic发布功能更加强大的AI模型Opus 4.1,编程、研究、数据分析能力都更 加强大。 ...
X @Anthropic
Anthropic· 2025-08-05 16:27
Opus 4.1 is now available to paid Claude users and in Claude Code.It's also on our API, Amazon Bedrock, and Google Cloud's Vertex AI.Read more: https://t.co/ansKMHes5I ...