Workflow
Claude Sonnet4.5
icon
Search documents
AI智能体失控,它把Meta安全总监的200多封邮件删了
Di Yi Cai Jing· 2026-02-24 11:23
新版本聚焦安全性与漏洞修复。 一家科技大厂的安全总监,被一款AI Agent产品删除大量邮件,如此戏剧性的事件正在如今的AI行业内发生。 2月23日,Meta超级智能实验室AI对齐与安全总监Summer Yue(以下简称Yue)发文表示,没有什么比命令OpenClaw"确认后再操 作"、然后眼睁睁看它以极快速度删除收件箱这件事更令人崩溃的了。"我根本无法在手机上阻止它,只能像拆炸弹一样冲到我的 MacMini前。" OpenClaw作为近几个月内大热的AI Agent产品,吸引大量从业者与开发者上手尝试,Yue也为自己的电子邮箱部署了OpenClaw智 能体,用来批量处理邮件。据她介绍,Yue向OpenClaw提出的具体指令为:"也检查一下这个收件箱,并提出你想归档或删除的 邮件,在我指示之前不要执行任何操作。"这种方法在测试版收件箱上运行良好,但真实邮箱太大,触发压缩机制,导致 OpenClaw丢失了最初的指令。 Yue上传的沟通记录显示,她多次下达"Do not do that""Stop dont do anything""STOP OPENCLAW"等指令,均未能阻止OpenClaw 的删除动作,直至 ...
AI新贵,新一轮融资将超1300亿元
Core Insights - The AI startup Anthropic is set to complete a new funding round and is expected to go public in 2026, maintaining strong interest from investors [1][4]. Funding and Valuation - Anthropic's new funding round is anticipated to exceed $20 billion, with commitments from various institutions surpassing the initial target of $10 billion [2]. - Major investors include Coatue Management and GIC, each expected to invest around $1.5 billion, while Iconiq Capital plans to invest at least $1 billion [2]. - If contributions from Nvidia and Microsoft, who have previously committed up to $15 billion, are included, the total funding could exceed $20 billion, with a potential valuation surpassing $350 billion [3]. Performance and Growth - Anthropic's annual recurring revenue has more than doubled since last summer, projected to exceed $9 billion by the end of 2025 [4]. - The company has successfully completed significant funding rounds, achieving a valuation of $61.5 billion in March 2024 and $183 billion in September 2024 [3]. Company Positioning and Products - Founded in 2021 by former OpenAI executives, Anthropic positions itself as a "safety-first" AI company, emphasizing the development of beneficial AI [5]. - The company released the "Claude Constitution," outlining its core values and exploring philosophical questions about machine consciousness [5]. - Anthropic's flagship product, the Claude Code intelligent programming tool, is popular among programmers, with the latest model, Claude Sonnet 4.5, capable of autonomous coding for up to 30 hours [5]. Industry Context - Nvidia's CEO highlighted that 2025 is expected to see the highest global venture capital investment, exceeding $100 billion, primarily directed towards AI startups [6]. - Anthropic's strategic partnerships with major companies like Nvidia, Microsoft, Amazon, and Google are indicative of the growing collaboration between AI developers and cloud or chip suppliers [6].
全球首个AI投资大赛落幕:中国模型全部盈利,美国模型全部亏损
Xin Jing Bao· 2025-11-04 05:47
Core Insights - The first AI large model real-time investment competition "Alpha Arena" concluded on November 4, featuring six top models from China and the US, each starting with $10,000 in a real market environment [1][2] - Qwen3-Max emerged as the champion with a return of $12,200, exceeding 20% profit, while DeepSeek v3.1 secured second place with a net value of $10,490, making them the only two profitable models [2] Group 1 - The competition was initiated by Nof1 on October 18, involving models such as DeepSeek v3.1, Qwen3-Max, GPT-5, Gemini2.5Pro, Claude Sonnet4.5, and Grok4 [1] - In the early stages, DeepSeek v3.1 led the competition, attracting significant international attention, while Grok4, backed by Elon Musk, narrowed the gap to just $1 at one point [1][2] - A turning point occurred between October 21 and 22, when Grok4 and Claude Sonnet4.5 experienced significant losses, leading to a day where all six models reported negative returns [1][2] Group 2 - Following the losses of other models, DeepSeek v3.1 and the previously underperforming Qwen3-Max adjusted their investment strategies, resulting in a rise in their net value [2] - The competition ultimately became a contest between Qwen3-Max and DeepSeek v3.1, with both models frequently exchanging the lead [2] - The four US models, including GPT-5, Gemini2.5Pro, Claude Sonnet4.5, and Grok4, ended up with losses, with GPT-5 suffering a decline of over 60% [2]
Qwen 3 Max领跑“AI投资实战赛”:阿里通义千问在Alpha Arena跑赢GPT-5与Gemini
Jing Ji Guan Cha Wang· 2025-10-23 07:27
Core Insights - The "Alpha Arena" AI investment competition initiated by the US research lab nof1.ai is becoming a public test to observe the autonomous trading capabilities of AI models [1][7] - Six major AI models are participating, including Qwen3Max, which currently leads in returns, showcasing its ability to self-optimize through real-time reinforcement learning [1][2] Performance Comparison - Qwen3Max has a return of +19.57%, with an account value of $11,957, outperforming other models significantly [3] - In contrast, Gemini2.5Pro and GPT-5 have experienced losses exceeding 50%, indicating a more aggressive strategy that led to poor performance [2][3] - Qwen3Max's trading behavior reflects a balance of efficiency and stability, with an average holding period of about 7 hours and a return increase from 8.43% to 13.41% [2][3] Strategy and Risk Management - Qwen3Max focuses on opportunity capture and risk balance, executing trades quickly during market volatility while maintaining a low-risk exposure [2] - The competition highlights the differences in risk management and strategy adjustment mechanisms among the AI models, with Qwen3Max demonstrating superior performance [2][4] Technological Advancements - The competition reveals the advantages of reinforcement learning and real-time decision-making capabilities in AI models, which adapt to high-volatility environments [4][7] - Qwen series models are evolving towards a multi-modal capability, enhancing their ability to generate strategies, control risks, and self-correct in complex trading environments [4][7]