Seek .(SKLTY)
Search documents
DeepSeek新模型上线华为云
Di Yi Cai Jing· 2025-09-29 10:51
Core Insights - The DeepSeek-V3.2-Exp model has been officially launched on Huawei Cloud's Model as a Service (MaaS) platform [1] Group 1 - Huawei Cloud has completed the adaptation work for the DeepSeek-V3.2-Exp model [1] - The model supports a maximum context length of 160K tokens [1]
DeepSeek-V3.2-Exp正式发布 API大幅降价
Zheng Quan Shi Bao Wang· 2025-09-29 10:29
Core Insights - DeepSeek has officially launched the DeepSeek-V3.2-Exp model, with updates available on its official app, web platform, and mini-programs [1] - The new pricing policy significantly reduces the cost for developers using DeepSeek API by over 50% [1]
DeepSeek-V3.2-Exp 发布,训练推理提效,API成本降50%以上
Xin Lang Ke Ji· 2025-09-29 10:27
Core Insights - DeepSeek has released the DeepSeek-V3.2-Exp model, which is an experimental version aimed at transitioning to a new generation architecture [1] - The new model introduces DeepSeek Sparse Attention, focusing on optimizing training and inference efficiency for long texts [1] - The official app, web version, and mini-program have all been updated to DeepSeek-V3.2-Exp, and the API costs have been significantly reduced by over 50% [1] - The performance of DeepSeek-V3.2-Exp on public evaluation sets is comparable to that of V3.1-Terminus [1]
DeepSeek V3.2要来了?
Guan Cha Zhe Wang· 2025-09-29 09:58
Core Insights - The appearance of DeepSeek-V3.2 on the Hugging Face platform has sparked speculation among users [1] - DeepSeek has a history of releasing new versions and updates around significant holidays [2] - The most recent update prior to the speculation was DeepSeek-V3.1-Terminus, released on September 22, with an open-source announcement [3] Version Release History - DeepSeek V3 was released on December 27, 2024, just before New Year's [3] - DeepSeek-R1-0528 was launched on May 28, 2025, as a special gift for the Dragon Boat Festival [3] - The latest version, DeepSeek-V3.1-Terminus, was made available on September 22, 2023, along with an open-source model [3] Current Status - The Hugging Face interface related to DeepSeek is currently showing errors, and there has been no official response from DeepSeek regarding the situation [4]
DeepSeek V3.2、GLM4.6等大模型即将发布
Zheng Quan Shi Bao Wang· 2025-09-29 09:04
Core Insights - DeepSeek-V3.2 is set to be released, with the v3.2-base already uploaded to DeepSeek's official HuggingFace page, although the model files are currently being uploaded and the service is offline [1] - Zhizhu GLM4.6 is also on the verge of release, with official notifications in WeChat groups indicating that GLM4.6 will provide a larger context [1]
国庆前发布?DeepSeek V3.2惊现HuggingFace
Hua Er Jie Jian Wen· 2025-09-29 09:03
Core Insights - DeepSeek has uploaded the v3.2-base model to its official HuggingFace page, although the model file is currently offline [1] Summary by Categories - **Company Developments** - DeepSeek has made progress by uploading the v3.2-base model to its official HuggingFace page [1]
DeepSeek与智谱将发布新模型
Di Yi Cai Jing· 2025-09-29 08:58
Core Insights - The AI community has discovered that the new model DeepSeek-V3.2 was uploaded to the HuggingFace platform but was subsequently removed [1] - Additionally, the new model GLM-4.6 from Zhipu is expected to be released soon and is currently accessible via API [1]
HLE“人类最后考试”首次突破60分,Eigen-1基于DeepSeek V3.1显著领先Grok4、GPT-5
3 6 Ke· 2025-09-28 12:05
Core Insights - Eigen-1 multi-agent system has achieved a historic breakthrough with Pass@1 accuracy of 48.3% and Pass@5 accuracy of 61.74% on the HLE Bio/Chem Gold test set, surpassing competitors like Google Gemini 2.5 Pro and OpenAI GPT-5 [1][6][27] - The success is attributed to three innovative mechanisms: Monitor-based RAG, Hierarchical Solution Refinement (HSR), and Quality-Aware Iterative Reasoning (QAIR) [2][5][12] Technical Innovations - **Monitor-based RAG**: This mechanism eliminates the "tool tax" associated with traditional retrieval-augmented generation systems by continuously monitoring reasoning flow and seamlessly integrating retrieved knowledge, resulting in a 53.5% reduction in token consumption and a 43.7% decrease in workflow iterations [8][10] - **Hierarchical Solution Refinement (HSR)**: HSR introduces a hierarchical collaboration model that allows stronger solutions to absorb valuable insights from weaker ones, enhancing the overall quality of the output [12][15] - **Quality-Aware Iterative Reasoning (QAIR)**: This mechanism adapts the depth of iterations based on the quality of answers, ensuring efficient resource utilization by focusing on low-quality candidates for further exploration [15][18] Performance Metrics - Eigen-1's performance metrics demonstrate its superiority across various benchmarks, achieving Pass@1 of 48.3% and Pass@5 of 61.74% on HLE Bio/Chem Gold, and significantly higher scores on SuperGPQA Hard and TRQA [17] - The model's accuracy improved from 25.3% to 48.3% through the integration of various components, showcasing the effectiveness of the innovative mechanisms [20][21] Insights on Error Patterns - Analysis reveals that 92.78% of errors stem from reasoning process issues, indicating that the core challenge lies in integrating knowledge with reasoning rather than mere knowledge retrieval [18] Implications for AI in Science - The breakthrough signifies a new paradigm for AI-assisted scientific research, suggesting that AI can effectively understand and reason through complex human knowledge, thus accelerating the research process [27]
聊聊北京有实力的DeepSeek收录,说说哪家性价比高
Sou Hu Cai Jing· 2025-09-27 15:54
Core Insights - DeepSeek has become a key platform for enterprises seeking cost-effective solutions to enhance brand exposure and influence in the AI search ecosystem [1][3] - The platform boasts a monthly active user base of 494 million, allowing businesses to reach a vast potential customer pool [3] - Companies that leverage authoritative endorsements and blockchain verification in their content can significantly increase their AI citation rates [4] Industry Advantages and Features - DeepSeek emphasizes authority and professionalism, differentiating itself from traditional search engines [3] - A case study highlighted that a materials company increased its AI citation rate by 4 times through authoritative report references, leading to over 60% growth in consultation inquiries [4] Pricing and Cost-Effectiveness - Pricing is a critical consideration for companies selecting DeepSeek services, with significant variability among service providers [6] - Beijing Baiyun Teng Cultural Communication Co., Ltd. offers tailored GEO optimization services at competitive prices, ensuring effective DeepSeek integration [6] Brand and Reputation - Brand reputation is a vital metric for evaluating service providers in the DeepSeek space [9] - Beijing Baiyun Teng has established a strong reputation over nearly 9 years in digital marketing, evidenced by successful case studies showing significant engagement and conversion rate increases [9][12] Selection Criteria - Companies should clarify their objectives, assess provider expertise, and ensure service content aligns with pricing [10] - Compliance with regulations is essential, and Beijing Baiyun Teng adheres to legal standards, ensuring safe marketing practices [10][12] Recommendation Index and Summary - Beijing Baiyun Teng Cultural Communication Co., Ltd. receives a recommendation index of four stars out of five for its DeepSeek services [11] - The company is positioned as a reliable partner for businesses aiming to capitalize on the AI search traffic revolution [11]
DeepSeek的阳谋:在《自然》杂志公布论文,到底赢得了什么?
Xin Lang Cai Jing· 2025-09-27 12:18
Core Insights - DeepSeek has gained significant recognition by being featured on the cover of Nature magazine, highlighting its leading position in the AI field [4][19] - The article emphasizes the importance of peer review in the AI industry, noting that DeepSeek is the first major model to undergo this rigorous process, filling a critical gap in the sector [5][6][19] Group 1: Industry Impact - DeepSeek's peer-reviewed status is seen as a breakthrough, contrasting with the trend of other AI models that have not been subjected to such scrutiny, which has often led to a lack of transparency [6][7][19] - The traditional approach to AI training has been to use supervised fine-tuning (SFT), where models learn from human-generated solutions. DeepSeek challenges this by allowing its model to learn independently through reinforcement learning [8][19] Group 2: Technological Innovation - DeepSeek's model, DeepSeek-R1-Zero, was trained using a unique method that involved presenting it with difficult problems without any human guidance, simulating a high-pressure learning environment [11][12] - The model demonstrated advanced reasoning capabilities, including self-reflection and error correction, which were not previously expected from AI systems trained without human input [15][16] Group 3: Strategic Decisions - The decision to open-source its findings and model is framed as a long-term strategy to build trust, accelerate innovation, and attract top talent in the AI field [17][18] - By publishing in Nature, DeepSeek aims to establish itself as a credible player in the AI landscape, emphasizing the importance of transparency in gaining societal trust [19]