DeepSeek V4 Benchmark Leak? The Report Appears to Be Fake

Core Insights
- Leaked benchmark results for DeepSeek V4 have raised the stakes in the AI coding race: the leak claims a score of 83.7% on SWE-bench Verified, ahead of Claude Opus 4.5 (80.9%) and GPT-5.2 (80%) [1]
- DeepSeek V4 is expected to be released on February 17, at a cost reportedly 20 to 40 times lower than OpenAI's, which could change the competitive landscape [1]

Group 1
- The leaked results point to significant advances in DeepSeek V4, including a context length of over 1 million and an Engram memory mechanism, suggesting stronger reasoning abilities [1]
- The anticipated February 17 release could position it as a leading model in the AI space [1]

Group 2
- The authenticity of the leaked benchmarks is in doubt: it is claimed that scores above 99.2% are not possible under official scoring systems, which points to potential misinformation [2]
- Despite the skepticism about the leaked data, the attention and hype around DeepSeek suggest it has drawn significant interest and support within the AI community [2]