Core Viewpoint - The article discusses a significant bug in the DeepSeek V3.1 model, which has caused widespread concern among developers due to the unexpected appearance of the character "极" in generated code outputs, leading to potential compilation failures and issues in high-precision tasks [1][2][11]. Summary by Sections Bug Discovery and Impact - Developers have reported that during API calls for code development, the output occasionally includes the character "极", which can disrupt the coding process [2][5]. - The issue was first identified on platforms like Volcano Engine and Chutes, but it has since affected other platforms, including Tencent's CodeBuddy and DeepSeek's official channels [5]. Community Response and Solutions - The community has pointed fingers at the DeepSeek V3.1 model for the bug, and CodeBuddy has reached out to DeepSeek for a fix in an upcoming version [12]. - Users have begun sharing tips to mitigate the "极" bug, such as using specific prompt patterns to avoid triggering the issue [14][18]. Analysis of the Bug's Origin - A user on Zhihu, Huang Zhewai, suggested that this bug is not an isolated incident and may relate to a "malicious pattern" in large model programming [21]. - Huang observed that similar issues occurred in earlier models, where the output would unexpectedly include terms like "极长" after a series of repetitions, indicating a potential flaw in the model's reasoning process [21][22]. - He hypothesized that the root cause might be inadequate data cleaning during the supervised fine-tuning (SFT) phase, leading to the model learning to use "极" as a termination marker [22]. Future Outlook - The resolution of the "极" bug is contingent upon the release of a new version from DeepSeek, which is expected to address the underlying issues [24].
DeepSeek“极你太美”bug,官方回应了
量子位·2025-08-27 02:24