Core Viewpoint
- The article discusses a significant bug in the DeepSeek V3.1 model, which has caused widespread concern among developers due to the unexpected appearance of the character "极" in output results during API calls [1][2][12].

Group 1: Bug Discovery and Impact
- The bug was initially discovered on platforms like Volcano Engine and Chutes, but it has since affected more platforms, including Tencent's CodeBuddy and even the DeepSeek official platform [5].
- The issue has sparked discussions on platforms like Reddit, particularly focusing on the terms "extreme," "极," and "極" [7].
- The presence of the "极" character can lead to compilation failures in code, posing a serious risk for scenarios requiring high precision and structured output [11].

Group 2: Solutions and Workarounds
- While a complete fix is pending from DeepSeek, users have started sharing potential workarounds, such as using specific prompt patterns to mitigate the issue [14][19].
- One suggested workaround involves prohibiting certain symbol sequences in API calls, which is particularly relevant for third-party platforms [19].

Group 3: Analysis of the Bug's Origin
- A user on Zhihu, Huang Zhewai, provided insights suggesting that this bug is not an isolated incident and may relate to a "malicious pattern" in large model programming [20].
- Huang observed similar issues in earlier models, indicating that the bug might stem from inadequate data cleaning during the supervised fine-tuning (SFT) and pre-training phases [23].
- He hypothesized that the "极" character could have been learned as a termination symbol due to its presence in "dirty data" that was not properly cleaned [23].

Group 4: Future Outlook
- The resolution of the "极" bug, humorously referred to as "极你太美" or "'极'速版," is contingent upon the release of a new version from DeepSeek [25].
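
Because the summary notes that a stray "极" can break compilation of generated code, a client-side check on model output may be useful until a fix ships. The sketch below is not one of the workarounds cited in the article; it is a minimal illustration in Python, and the names `SUSPECT_CHARS`, `Hit`, and `find_stray_chars` are hypothetical. It assumes that a suspect character is only worth flagging when the prompt itself never used it.

```python
from dataclasses import dataclass

# Characters reported as leaking into output; treating exactly
# these two as suspects is an assumption made for illustration.
SUSPECT_CHARS = ("极", "極")


@dataclass
class Hit:
    line: int    # 1-based line number in the model output
    column: int  # 1-based column within that line
    char: str    # the suspect character that was found


def find_stray_chars(output: str, prompt: str = "",
                     suspects=SUSPECT_CHARS) -> list[Hit]:
    """Return positions of suspect characters that the prompt never used."""
    # A character the user typed themselves is probably intentional,
    # so only characters absent from the prompt are checked.
    active = [c for c in suspects if c not in prompt]
    hits = []
    for line_no, line in enumerate(output.splitlines(), start=1):
        for col_no, ch in enumerate(line, start=1):
            if ch in active:
                hits.append(Hit(line_no, col_no, ch))
    return hits
```

A caller could run `find_stray_chars(generated_code, prompt=user_prompt)` before compiling or parsing the output and retry the request when the returned list is non-empty. The check is deliberately conservative: it only reports positions and does not try to repair the text, since removing the character could hide other corruption nearby.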
DeepSeek "极你太美" Bug: The Official Response Is In
猿大侠·2025-08-29 04:12