Workflow
DeepSeek
icon
Search documents
实测低调上线的DeepSeek新模型:编程比Claude 4还能打,写作...还是算了吧
3 6 Ke· 2025-08-20 12:14
Core Insights - DeepSeek has officially launched and open-sourced its new model, DeepSeek-V3.1-Base, following the release of GPT-5, despite not having released R2 yet [1] - The new model features 685 billion parameters and supports multiple tensor types, with significant optimizations in inference efficiency and an expanded context window of 128k [1] Model Performance - Initial tests show that DeepSeek V3.1 achieved a score of 71.6% on the Aider Polyglot programming benchmark, outperforming other open-source models, including Claude 4 Opus [5] - The model successfully processed a long text and provided relevant literary recommendations, demonstrating its capability in handling complex queries [4] - In programming tasks, DeepSeek V3.1 generated code that effectively handled collision detection and included realistic physical properties, showcasing its advanced programming capabilities [8] Community and Market Response - Hugging Face CEO Clément Delangue noted that DeepSeek V3.1 quickly climbed to the fourth position on the trends chart, later reaching second place, indicating strong market interest [79] - The update removed the "R1" label from the deep thinking mode and introduced native "search token" support, enhancing the search functionality [79][80] Future Developments - The company plans to discontinue the mixed thinking mode in favor of training separate Instruct and Thinking models to ensure higher quality outputs [80] - As of the latest update, the model card for DeepSeek-V3.1-Base has not yet been released, but further technical details are anticipated [81]
DeepSeek V3.1发布后,投资者该思考这四个决定未来的问题
3 6 Ke· 2025-08-20 10:51
Core Insights - DeepSeek has quietly launched its new V3.1 model, which has generated significant buzz in both the tech and investment communities due to its impressive performance metrics [1][2][5] - The V3.1 model outperformed the previously dominant Claude Opus 4 in programming capabilities, achieving a score of 71.6% in the Aider programming benchmark [2] - The cost efficiency of V3.1 is notable, with a complete programming task costing approximately $1.01, making it 68 times cheaper than Claude Opus 4 [5] Group 1: Performance and Cost Advantages - The V3.1 model's programming capabilities have surpassed those of Claude Opus 4, marking a significant achievement in the open-source model landscape [2] - The cost to complete a programming task with V3.1 is only about $1.01, which is a drastic reduction compared to competitors, indicating a strong cost advantage [5] Group 2: Industry Implications - The emergence of V3.1 raises questions about the future dynamics between open-source and closed-source models, particularly regarding the erosion and reconstruction of competitive advantages [8] - The shift towards a "hybrid model" is becoming prevalent among enterprises, combining private deployments of fine-tuned open-source models with the use of powerful closed-source models for complex tasks [8][9] Group 3: Architectural Innovations - The removal of the "R1" designation and the introduction of new tokens in V3.1 suggest a potential exploration of "hybrid reasoning" or "model routing" architectures, which could have significant commercial implications [11] - The concept of a "hybrid architecture" aims to optimize inference costs by using a lightweight scheduling model to allocate tasks to the most suitable expert models, potentially enhancing unit economics [12] Group 4: Market Dynamics and Business Models - The drastic reduction in inference costs could lead to a transformation in AI application business models, shifting from per-call or token-based billing to more stable subscription models [13] - As foundational models become commoditized due to open-source competition, the profit distribution within the value chain may shift towards application and solution layers, emphasizing the importance of high-quality private data and industry-specific expertise [14] Group 5: Future Competitive Landscape - The next competitive battleground will focus on "enterprise readiness," encompassing stability, predictability, security, and compliance, rather than solely on performance metrics [15] - Companies that can provide comprehensive solutions, including models, toolchains, and compliance frameworks, will likely dominate the trillion-dollar enterprise market [15]
芯片股午后大爆发!寒武纪股价突破千元
Market Performance - The A-share market experienced a rebound on August 20, with the Shanghai Composite Index, Shenzhen Component Index, and STAR Market Index all reaching new highs for the year [2] - Chip stocks surged in the afternoon, with Cambrian Technology's stock price surpassing 1,000 yuan, making it one of only two stocks in A-shares to reach this milestone [2] - Several stocks, including Shengke Communication, hit the 20% daily limit up, alongside others like Xingye Co., Hanzhong Precision, and Yueling Co. [2] AI and Semiconductor Industry - DeepSeek announced an upgrade to its online model version V3.1, extending context length to 128k, with a 43% improvement in multi-step reasoning performance compared to the previous version [2] - This upgrade is expected to enhance accuracy in fields such as mathematical calculations, code generation, and scientific analysis [2] - CITIC Securities believes that AI will be the primary growth driver for the semiconductor industry, with sustained demand for cloud AI and accelerated deployment of terminal AI applications [2] - Chinese semiconductor manufacturers are anticipated to significantly benefit from the ongoing development of the AI industry, with investment logic focusing on domestic production for cloud applications and downstream growth for terminal applications [2]
DeepSeek 开源新模型 V3.1:上下文长度拓展至 128K
Huan Qiu Wang Zi Xun· 2025-08-20 04:54
来源:环球网 【环球网科技综合报道】8月20日消息,DeepSeek日前在Hugging Face上开源了新模型 V3.1-Base。 此外,日前DeepSeek 还发布通知称,线上模型版本已升级至 V3.1,上下文长度拓展至 128k,可通过官 方网页、App、小程序测试,API 接口调用方式保持不变。 就在8月14日,DeepSeek App发布了1.3.0版本,此次更新在修复已知问题、优化文本操作体验的基础 上,首次引入"对话内容生成分享图"功能,为用户提供更便捷、个性化的内容传播方式。(思瀚) ...
DeepSeek V3.1 Base突袭上线,击败Claude 4编程爆表,全网在蹲R2和V4
3 6 Ke· 2025-08-20 03:52
Core Insights - The newly released DeepSeek V3.1 model features 685 billion parameters and supports various precision formats, from BF16 to FP8 [1] - The model demonstrates exceptional programming capabilities, achieving a score of 71.6% in the Aider programming benchmark, surpassing Claude Opus 4 [1][11] - V3.1 introduces native search token support, enhancing search functionalities [1] - The architecture has been innovated by removing the "R1" designation, indicating a potential shift towards a hybrid architecture in future models [1][10] - The cost for a complete programming task is only $1.01, significantly lower than proprietary systems, which are 60 times more expensive [1][13][16] Performance Metrics - DeepSeek V3.1 has 671 billion parameters activated with a context length of 128K tokens, ranking fourth on Hugging Face's trend list even before the model card was released [2] - The model's programming performance is 1% higher than Claude 4, with a cost reduction of 68 times [16] - In the SVGBench benchmark, V3.1 ranks just below GPT-4.1-mini, outperforming its predecessor, DeepSeek R1 [17] User Engagement - The DeepSeek community has grown to over 80,000 followers, indicating strong interest and anticipation for future releases [4] - Users have reported significant improvements in understanding and output speed, particularly in context length tests [21][25]
AI与机器人盘前速递丨DeepSeek线上模型版本升级;宇树预热新款人形机器人
Mei Ri Jing Ji Xin Wen· 2025-08-20 01:14
Market Overview - The AI and robotics sectors continued their upward trend, achieving a "three consecutive days" gain, with the Huaxia Sci-Tech AI ETF (589010) closing up 0.98%, reaching a peak intraday increase of 2.62 [1] - The Robotics ETF (562500) rose by 0.71%, experiencing significant intraday volatility with a maximum fluctuation of 3.67% [1] - Total trading volume reached 2.022 billion yuan, indicating robust market activity and sustained liquidity [1] - The latest scale of the Robotics ETF reached 17.35 billion yuan, setting a new record and significantly surpassing comparable funds [1] Key Developments - DeepSeek announced an upgrade to its online model version V3.1, featuring a longer context window and readiness for testing [2] - Yushu Technology teased a new humanoid robot with a height of 1.8 meters and 31 degrees of freedom, suggesting advanced agility and elegance [2] - Shanghai's new implementation plan aims to accelerate "AI + manufacturing" development, targeting 3,000 manufacturing companies for smart applications over three years [2] Institutional Insights - Guojin Securities expressed optimism regarding the domestic advantages in AI applications, particularly in the integration of software and hardware, with positive growth expected in the second half of the year [3] Popular ETFs - The Robotics ETF (562500) is noted as the only fund exceeding 10 billion yuan in scale, offering the best liquidity and comprehensive coverage of China's robotics industry [4] - The Huaxia Sci-Tech AI ETF (589010) is characterized as the "brain" of robotics, with a 20% fluctuation range and potential for capturing significant moments in the AI industry [4]
DeepSeek线上模型版本升级;宇树预热新款人形机器人
Mei Ri Jing Ji Xin Wen· 2025-08-20 01:08
Market Review - The AI and robotics sectors continued their upward trend, achieving a "three consecutive days" increase, with the Huaxia AI ETF (589010) closing up 0.98% and reaching a peak intraday increase of 2.62% [1] - Key holdings included Chipone Technology leading with a 13.39% increase, followed by CloudWalk Technology at 4.79%, and both Daotong Technology and Yuntian Lifeng rising over 3% [1] - The Robotics ETF (562500) closed up 0.71%, experiencing significant volatility with an intraday peak fluctuation of 3.67% [1] - Notable performers included Hechuan Technology leading with a 14.50% increase, followed by Fengli Intelligent at 13.30%, Huachen Equipment at 11.11%, and Xiasha Precision hitting a 10% limit-up [1] - Total trading volume reached 2.022 billion yuan, indicating robust market activity and sustained liquidity [1] - The latest scale of the Robotics ETF reached 17.35 billion yuan, setting a new record and significantly surpassing comparable funds [1] Hot News - DeepSeek announced the upgrade of its online model to version 3.1, featuring a longer context window and readiness for testing [2] - Yuzhu Technology teased a new humanoid robot with a height of 1.8 meters and 31 degrees of freedom, suggesting agility and elegance [2] - Shanghai's implementation plan for accelerating "AI + manufacturing" aims to promote intelligent applications in 3,000 manufacturing enterprises over three years, establish 10 industry benchmark models, and develop around 10 "AI + manufacturing" demonstration factories [2] Institutional Views - Guojin Securities expressed optimism regarding the domestic advantages in the integration of AI with software and hardware, particularly in consumer and overseas markets, as evidenced by the preliminary validation from Meitu and Kuaishou's ARR [3] - The firm also anticipates positive growth in the second half of the year, with expected increases in revenue contributions from AI applications in enterprise service software and manufacturing information systems [3] Popular ETFs - The Robotics ETF (562500) is noted as the only fund exceeding 10 billion yuan in scale, offering the best liquidity and comprehensive coverage of the Chinese robotics industry [4] - The Huaxia AI ETF (589010) is characterized as the "brain" of robotics, with a 20% fluctuation limit and small to mid-cap elasticity, aimed at capturing pivotal moments in the AI industry [4]
久违的美国科技股大跌,AI和数字币领跌,发生了什么?
Hua Er Jie Jian Wen· 2025-08-20 00:44
Group 1 - The core viewpoint of the articles highlights a significant sell-off in U.S. tech stocks, driven by concerns over the commercialization returns of AI and warnings of a potential bubble from industry leaders [1][3][5] - The Nasdaq Composite Index experienced its largest single-day drop since August 1, closing down 1.4%, with notable declines in major tech stocks such as Nvidia (-3.5%), Palantir (-9.4%), and Arm (-5%) [1][3] - A report from MIT indicated that up to 95% of organizations have seen no returns from generative AI investments, raising doubts about the profitability of AI projects [3][5] Group 2 - The market is increasingly concerned about high valuations in tech stocks, with the Nasdaq 100 Index's expected P/E ratio at 27, significantly above its long-term average [3][5] - Sam Altman, CEO of OpenAI, expressed concerns about over-excitement among investors regarding AI, suggesting a bubble may be forming [3][5] - The sell-off was characterized by a rotation of funds from high-risk tech stocks to defensive sectors, with consumer staples, utilities, and real estate showing gains [7][8] Group 3 - The decline in tech stocks was particularly pronounced among high-momentum stocks, which had previously seen significant gains since mid-May, with the S&P 500 Information Technology sector rising 14% during that period [6][8] - Other risk assets, including Bitcoin, also faced declines, with Bitcoin dropping 2.7% and reaching a near three-week low [10] - Investor sensitivity to AI-related risks has been highlighted, with previous events causing market fluctuations, indicating a heightened vigilance towards negative news in the AI sector [11]
刚刚,DeepSeek新模型开源,五大能力变化明显,附一手体验
3 6 Ke· 2025-08-20 00:14
Core Insights - DeepSeek has upgraded its online model to DeepSeek V3.1, expanding the context window from 64k to 128k, available across web, app, and mini-program platforms [3][21] - The new model has shown improvements in various capabilities, including programming, understanding physical laws, creative writing, and mathematical problem-solving [4][19] Model Upgrade - The model is now open-sourced on Hugging Face, with only the Base version available for download, showing no significant changes in parameters or tensor types compared to DeepSeek-V3-0324 [2] - Initial experiences with DeepSeek V3.1 indicate enhanced performance in web development, with more complex and aesthetically pleasing outputs compared to the previous version [4][6] Performance Enhancements - In web development tasks, DeepSeek V3.1 produced longer code with improved completion and aesthetics, demonstrating better layout and content planning [4][6] - The model successfully recreated a simple game similar to the Chrome dinosaur game, although some aspects of the game were not accurately rendered [8] Question Answering and Interaction - DeepSeek V3.1 provided more detailed and factually accurate responses to niche historical questions, showing a reduction in "hallucination" compared to its predecessor [10][12] - The model's tone has shifted to be more conversational and nuanced, using conditional statements and emphasizing complexity in its answers [13] Creative Outputs - The model demonstrated its creative capabilities by generating poetry and engaging in playful comparisons between notable figures in AI, showcasing a balanced approach in its responses [17][16] Mathematical Abilities - DeepSeek V3.1 displayed a mixed performance in basic arithmetic, initially providing incorrect answers before correcting itself [18] User Engagement - Users have quickly adopted the new model, with feedback highlighting improvements in physical simulations and creative outputs [19][21]
税率50%!美国将407类钢铁和铝衍生产品纳入关税清单;纳指跌超300点;个人养老金新增3种领取情形丨每经早参
Mei Ri Jing Ji Xin Wen· 2025-08-19 22:32
2025年8月20日 星期3 ll c LT V 中国8月一年、五年期贷款市场报 将公布 2 国务院新闻办公室将于8月20日上 兵准备工作有关情况举行新闻发布会 3 2025第十四届中国煤焦钢产业大 日至22日在青岛召开 4 谷歌将于8月20日举办发布会, 品牌硬件 欧洲三大股指收盘全线上涨,德国DAX指数涨0.45%报24423.07点,法国CAC40指数涨1.22%报7979.08点,英国富时100指数涨0.34%报9189.22点。 2 印度总理莫迪会见王毅 当地时间2025年8月19日,印度总理莫迪在新德里总理府会见中共中央政治局委员、中央外办主任王毅。 莫迪表示,印中都是文明古国,友好交往历史悠久。去年10月,两国领导人喀山会晤是双边关系改善发展的转折点。印中是伙伴而不是对手,都面临加快发 展的共同任务,应该加强交流,增进了解,拓展合作,让世界感受到印中合作的巨大潜力和光明前景。双方还要稳妥管控和处理边界问题,不能让分歧变成 争端。 王毅表示,中印关系经历起伏,其中的经验教训值得铭记。无论面临什么样情况,双方都应坚持彼此是伙伴而不是对手的正确定位,坚持稳妥管控分歧,不 让边界争议影响两国关系大局。( ...