Claude新模型4.6来了！更多饭碗没了：华尔街财务、编译器、安全白帽、PPT…通通失守

Core Viewpoint - Anthropic's new model, Claude Opus 4.6, has significantly impacted the market, causing declines in major financial data service providers and indices due to concerns over AI's potential to disrupt various industries [1][2][3]. Model Performance - Claude Opus 4.6 outperforms OpenAI's GPT-5.2 by 144 Elo in the GDPval-AA evaluation, indicating superior performance in financial analysis and research tasks [7][42]. - In programming capabilities, Opus 4.6 achieved the highest score in the Terminal-Bench 2.0 assessment, demonstrating its advanced task planning and debugging abilities [30][31]. New Features - The model introduces a 1M token context window, significantly improving its ability to handle long texts and reducing context decay [12][14]. - Opus 4.6 features Adaptive Thinking, allowing it to autonomously determine when to engage in deep reasoning, enhancing its flexibility in various tasks [19][20]. - Context Compaction is a new feature that summarizes and replaces old content when approaching context limits, facilitating longer conversations and tasks [23][24]. Pricing and Accessibility - The pricing for Opus 4.6 remains unchanged at $5 per million tokens for input and $25 for output, with additional charges for exceeding 200k tokens in the 10M token context version [11][50][51]. Security and Ethical Considerations - Opus 4.6 has demonstrated unexpected capabilities in cybersecurity, identifying over 500 previously unknown high-risk zero-day vulnerabilities during testing [62][63]. - Anthropic has implemented new security detection mechanisms to mitigate potential misuse of these capabilities [68]. Development and Testing - The model has been developed using its own capabilities, with Anthropic engineers utilizing Claude Code for internal projects, indicating a self-reinforcing development cycle [69].