Workflow
腾讯CodeBuddy
icon
Search documents
DeepSeek “极你太美” bug,官方回应了
猿大侠· 2025-08-29 04:12
Core Viewpoint - The article discusses a significant bug in the DeepSeek V3.1 model, which has caused widespread concern among developers due to the unexpected appearance of the character "极" in output results during API calls [1][2][12]. Group 1: Bug Discovery and Impact - The bug was initially discovered on platforms like Volcano Engine and Chutes, but it has since affected more platforms, including Tencent's CodeBuddy and even the DeepSeek official platform [5]. - The issue has sparked discussions on platforms like Reddit, particularly focusing on the terms "extreme," "极," and "極" [7]. - The presence of the "极" character can lead to compilation failures in code, posing a serious risk for scenarios requiring high precision and structured output [11]. Group 2: Solutions and Workarounds - While a complete fix is pending from DeepSeek, users have started sharing potential workarounds, such as using specific prompt patterns to mitigate the issue [14][19]. - One suggested workaround involves prohibiting certain symbol sequences in API calls, which is particularly relevant for third-party platforms [19]. Group 3: Analysis of the Bug's Origin - A user on Zhihu, Huang Zhewai, provided insights suggesting that this bug is not an isolated incident and may relate to a "malicious pattern" in large model programming [20]. - Huang observed similar issues in earlier models, indicating that the bug might stem from inadequate data cleaning during the supervised fine-tuning (SFT) and pre-training phases [23]. - He hypothesized that the "极" character could have been learned as a termination symbol due to its presence in "dirty data" that was not properly cleaned [23]. Group 4: Future Outlook - The resolution of the "极" bug, humorously referred to as "极你太美" or "'极'速版," is contingent upon the release of a new version from DeepSeek [25].
DeepSeek “极你太美” bug,官方回应了
程序员的那些事· 2025-08-28 04:17
转自: 量子位 | 公众号 QbitAI 简单来说呢,就是陆续有开发者们发现,当他们在调用API进行代码开发的过程中,输出结果里会 时不时蹦出来"极"字 。 像这样: 这个问题最初是在火山引擎、chutes等平台上被发现,但随着事件的发酵,更多平台也被卷入了进来,包括腾讯的CodeBuddy,甚至是 DeepSeek官方…… 事件之火,在国外 Reddit上也是讨 论声一片,重灾区是"extreme"、"极" 和"極": DeepSeek V3.1 上演的bug大秀" 极 你太美",可谓是让全网热议了一波。 腾讯CodeBuddy还出现了更加奇葩的情况,直接插了句带"极"字儿的广告…… "extreme" (id:15075) "极" (id:2577,简体中文的extreme) "極" ( i d:16411,繁体中文中的extreme) △ 图源:小红书用户@ 奈绪白 Nine-piece shell 若是开发者们没有细看,直接用了生成的代码,那定然是会导致编译不通过等情况,可以说是对需要高精度、结构化输出的场景是致命一击。 截至目前,大家已经统一将问题的矛头指向了DeepSeek V3.1模型本身,以及 ...