Workflow
DeepSeek
icon
Search documents
外媒:中国企业还得依靠英伟达
半导体行业观察· 2025-08-21 01:12
Core Viewpoint - The article discusses the implications of the U.S. allowing NVIDIA's key AI chips to return to China, highlighting the complex dynamics between U.S.-China trade negotiations and China's AI ambitions [1][2]. Group 1: U.S.-China Relations and NVIDIA - The U.S. has permitted the sale of H20 chips to China, which is crucial for China's AI development, while China is leveraging this in trade negotiations [1]. - Despite the U.S. announcement, there are concerns in China regarding potential security risks associated with NVIDIA's chips, leading to warnings from state media [1][2]. - The U.S. Treasury Secretary indicated that China's reaction reflects concerns about NVIDIA chips becoming a standard in China, suggesting a deeper anxiety about technological dominance [1][2]. Group 2: China's AI Industry and Domestic Alternatives - Chinese companies are still eager to purchase H20 chips despite warnings about potential backdoors, indicating a strong reliance on NVIDIA's technology [2]. - Domestic alternatives to NVIDIA's products are not yet capable of matching the performance or production levels required for AI development, as evidenced by delays in projects like DeepSeek's new model [2]. - The Chinese government is aware of the need for domestic chips but faces challenges in achieving the desired technological capabilities [2]. Group 3: Financial Implications and Security Concerns - President Trump's announcement that NVIDIA would pay 15% of its AI chip sales revenue in China raises questions about the transactional nature of national security concerns [3]. - This payment structure could provoke strong reactions globally, emphasizing the intertwining of trade and security in the semiconductor industry [3].
DeepSeek又更新了,期待梁文锋「炸场」
Xin Lang Ke Ji· 2025-08-21 00:52
Core Viewpoint - The recent upgrade of DeepSeek to version 3.1 has shown significant improvements in context length and user interaction, while also merging features from previous models to reduce deployment costs [1][11][12]. Group 1: Model Improvements - DeepSeek V3.1 now supports a context length of 128K, enhancing its ability to handle longer texts [4]. - The model's parameter count increased slightly from 671 billion to 685 billion, but the user experience has improved noticeably [5]. - The model's programming capabilities have been highlighted, achieving a score of 71.6% in multi-language programming tests, outperforming Claude 4 Opus [7]. Group 2: Economic Efficiency - The merger of V3 and R1 models allows for reduced deployment costs, requiring only 60 GPUs instead of the previous 120 [12]. - Developers noted that the performance could improve by 3-4 times with the new model due to increased cache size [12]. - The open-source release of DeepSeek V3.1-Base on Huggingface indicates a move towards greater accessibility and collaboration in the AI community [13]. Group 3: Market Context - The AI industry is closely watching the developments of DeepSeek, especially in light of the absence of the anticipated R2 model [19]. - Competitors like OpenAI, Google, and Alibaba have released new models, using R1 as a benchmark for their advancements [1][15]. - The market is eager for DeepSeek's next steps, particularly regarding the potential release of a multi-modal model following the V3.1 update [23].
Time for a Sector Rotation Away from Tech? ETFs in Focus
ZACKS· 2025-08-20 18:01
Market Overview - U.S. stocks experienced a decline on August 19, 2025, primarily driven by a drop in technology shares, with the Nasdaq-100-based ETF Invesco QQQ Trust (QQQ) falling by 1.4% [1] - Notable declines were observed in Palantir (PLTR) shares, which dropped by 9.4%, and NVIDIA (NVDA), which retreated by approximately 3% [1] Company Performance - Palantir shares surged over 150% from their April low leading up to its second-quarter earnings report, where the company reported quarterly revenue exceeding $1 billion for the first time [2] - However, the stock faced its longest losing streak since March, indicating a potential shift in investor sentiment [2] Sector Rotation - There is a noticeable shift away from Big Tech, with other sectors, such as consumer staples, beginning to show renewed strength [3] - Home Depot (HD) reported a boost in U.S. sales, resulting in a 3.2% increase in its stock price on August 19, 2025, contributing to overall market optimism [3] AI Market Concerns - OpenAI CEO Sam Altman expressed concerns about a potential bubble in the artificial intelligence (AI) industry, likening the current environment to the dot-com boom of the late 1990s [4][5] - Despite significant advancements, such as OpenAI's projected annual recurring revenue exceeding $20 billion, the company remains unprofitable, raising questions about the sustainability of current AI spending levels [6] Valuation Metrics - The P/E ratio of the Invesco QQQ Trust stands at 59.27X, significantly higher than the 10-year median of 25.8X, indicating overvaluation concerns [7] - Conversely, the price-to-book (P/B) ratio of QQQ is currently at 3.6X, the lowest in the past 10 years, suggesting some valuation support [7] Investment Strategies - The consumer staples sector is highlighted as a safe investment area, typically performing well during economic slowdowns and high inflation [9] - Value stocks, represented by ETFs like S&P 500 Pure Value Invesco ETF (RPV) and Morningstar Dividend Leaders ETF (FDL), have recently reached a one-month high, indicating a potential shift in investor focus towards stability and dividends [11]
突发利好!A股深v再创新高,寒武纪股价突破1000元
Sou Hu Cai Jing· 2025-08-20 12:15
前两天提示风险后,再加上昨天美股大跌,今天A股大幅低开跳水,高位的AI方向暴跌,但低位的白酒、化工等板块站了出来, 稳住了大盘。午盘国产算力、半导体发力,浪潮信息涨停,寒武纪再度暴涨股价突破1000元,带动市场情绪回升。今天A股走出 了深v,上证指数再创年内新高,美中不足的是量能缩了近2000亿。 今天的深v并没有让我打消"A股短期有风险"的念头,牛市的惯性还在,刚开始回调肯定有资金迫不及待的低吸,所以还需要多观 察两天,看看多空的强弱。 另外,Harris Financial Group管理合伙人James Cox表示:"投资者似乎在为杰克逊霍尔提前避险,担心鲍威尔的表态会比目前市场 预期更为鹰派。" 我提示风险不仅仅是因为近期量能、融资都飙的太快了,还因为大部分板块都没那么有性价比了:景气度最高但位置也高的海外 算力,已经开始用明年的业绩来算估值了;国产算力虽然没海外算力涨的多,但估值却要高得多;低位的消费、地产链估值低, 但行业趋势还没有反转。 | < w | 中国:融资余额 2 | | | --- | --- | --- | | 一 中国:融资余额 | | | | 相关指标 中国:融券余额 中国:融资 ...
实测低调上线的DeepSeek新模型:编程比Claude 4还能打,写作...还是算了吧
3 6 Ke· 2025-08-20 12:14
Core Insights - DeepSeek has officially launched and open-sourced its new model, DeepSeek-V3.1-Base, following the release of GPT-5, despite not having released R2 yet [1] - The new model features 685 billion parameters and supports multiple tensor types, with significant optimizations in inference efficiency and an expanded context window of 128k [1] Model Performance - Initial tests show that DeepSeek V3.1 achieved a score of 71.6% on the Aider Polyglot programming benchmark, outperforming other open-source models, including Claude 4 Opus [5] - The model successfully processed a long text and provided relevant literary recommendations, demonstrating its capability in handling complex queries [4] - In programming tasks, DeepSeek V3.1 generated code that effectively handled collision detection and included realistic physical properties, showcasing its advanced programming capabilities [8] Community and Market Response - Hugging Face CEO Clément Delangue noted that DeepSeek V3.1 quickly climbed to the fourth position on the trends chart, later reaching second place, indicating strong market interest [79] - The update removed the "R1" label from the deep thinking mode and introduced native "search token" support, enhancing the search functionality [79][80] Future Developments - The company plans to discontinue the mixed thinking mode in favor of training separate Instruct and Thinking models to ensure higher quality outputs [80] - As of the latest update, the model card for DeepSeek-V3.1-Base has not yet been released, but further technical details are anticipated [81]
DeepSeek V3.1发布后,投资者该思考这四个决定未来的问题
3 6 Ke· 2025-08-20 10:51
Core Insights - DeepSeek has quietly launched its new V3.1 model, which has generated significant buzz in both the tech and investment communities due to its impressive performance metrics [1][2][5] - The V3.1 model outperformed the previously dominant Claude Opus 4 in programming capabilities, achieving a score of 71.6% in the Aider programming benchmark [2] - The cost efficiency of V3.1 is notable, with a complete programming task costing approximately $1.01, making it 68 times cheaper than Claude Opus 4 [5] Group 1: Performance and Cost Advantages - The V3.1 model's programming capabilities have surpassed those of Claude Opus 4, marking a significant achievement in the open-source model landscape [2] - The cost to complete a programming task with V3.1 is only about $1.01, which is a drastic reduction compared to competitors, indicating a strong cost advantage [5] Group 2: Industry Implications - The emergence of V3.1 raises questions about the future dynamics between open-source and closed-source models, particularly regarding the erosion and reconstruction of competitive advantages [8] - The shift towards a "hybrid model" is becoming prevalent among enterprises, combining private deployments of fine-tuned open-source models with the use of powerful closed-source models for complex tasks [8][9] Group 3: Architectural Innovations - The removal of the "R1" designation and the introduction of new tokens in V3.1 suggest a potential exploration of "hybrid reasoning" or "model routing" architectures, which could have significant commercial implications [11] - The concept of a "hybrid architecture" aims to optimize inference costs by using a lightweight scheduling model to allocate tasks to the most suitable expert models, potentially enhancing unit economics [12] Group 4: Market Dynamics and Business Models - The drastic reduction in inference costs could lead to a transformation in AI application business models, shifting from per-call or token-based billing to more stable subscription models [13] - As foundational models become commoditized due to open-source competition, the profit distribution within the value chain may shift towards application and solution layers, emphasizing the importance of high-quality private data and industry-specific expertise [14] Group 5: Future Competitive Landscape - The next competitive battleground will focus on "enterprise readiness," encompassing stability, predictability, security, and compliance, rather than solely on performance metrics [15] - Companies that can provide comprehensive solutions, including models, toolchains, and compliance frameworks, will likely dominate the trillion-dollar enterprise market [15]
芯片股午后大爆发!寒武纪股价突破千元
Market Performance - The A-share market experienced a rebound on August 20, with the Shanghai Composite Index, Shenzhen Component Index, and STAR Market Index all reaching new highs for the year [2] - Chip stocks surged in the afternoon, with Cambrian Technology's stock price surpassing 1,000 yuan, making it one of only two stocks in A-shares to reach this milestone [2] - Several stocks, including Shengke Communication, hit the 20% daily limit up, alongside others like Xingye Co., Hanzhong Precision, and Yueling Co. [2] AI and Semiconductor Industry - DeepSeek announced an upgrade to its online model version V3.1, extending context length to 128k, with a 43% improvement in multi-step reasoning performance compared to the previous version [2] - This upgrade is expected to enhance accuracy in fields such as mathematical calculations, code generation, and scientific analysis [2] - CITIC Securities believes that AI will be the primary growth driver for the semiconductor industry, with sustained demand for cloud AI and accelerated deployment of terminal AI applications [2] - Chinese semiconductor manufacturers are anticipated to significantly benefit from the ongoing development of the AI industry, with investment logic focusing on domestic production for cloud applications and downstream growth for terminal applications [2]
DeepSeek 开源新模型 V3.1:上下文长度拓展至 128K
Huan Qiu Wang Zi Xun· 2025-08-20 04:54
来源:环球网 【环球网科技综合报道】8月20日消息,DeepSeek日前在Hugging Face上开源了新模型 V3.1-Base。 此外,日前DeepSeek 还发布通知称,线上模型版本已升级至 V3.1,上下文长度拓展至 128k,可通过官 方网页、App、小程序测试,API 接口调用方式保持不变。 就在8月14日,DeepSeek App发布了1.3.0版本,此次更新在修复已知问题、优化文本操作体验的基础 上,首次引入"对话内容生成分享图"功能,为用户提供更便捷、个性化的内容传播方式。(思瀚) ...
DeepSeek V3.1 Base突袭上线,击败Claude 4编程爆表,全网在蹲R2和V4
3 6 Ke· 2025-08-20 03:52
就在昨晚,DeepSeek官方悄然上线了全新的V3.1版本,上下文长度拓展到128k。 对于这波更新,大家的热情可谓是相当高涨。 即便还未公布模型卡,DeepSeek V3.1就已经在Hugging Face的趋势榜上排到了第四。 本次开源的V3.1模型拥有685B参数,支持多种精度格式,从BF16到FP8。 综合公开信息和国内大咖karminski3的实测,V3.1此次更新亮点有: 编程能力:表现突出,根据社区使用Aider测试数据,V3.1在开源模型中霸榜。 性能突破:V3.1在Aider编程基准测试中取得71.6%高分,超越Claude Opus 4,同时推理和响应速度更快。 原生搜索:新增了原生「search token」的支持,这意味着搜索的支持更好。 架构创新:线上模型去除「R1」标识,分析称DeepSeek未来有望采用「混合架构」。 成本优势:每次完整编程任务仅需1.01美元,成本仅为专有系统的六十分之一。 值得一提的是,官方群中强调拓展至128K上下文,此前V3版本就已经支持。 | Model | #Total | #Activated | Context | Download | | --- ...
AI与机器人盘前速递丨DeepSeek线上模型版本升级;宇树预热新款人形机器人
Mei Ri Jing Ji Xin Wen· 2025-08-20 01:14
Market Overview - The AI and robotics sectors continued their upward trend, achieving a "three consecutive days" gain, with the Huaxia Sci-Tech AI ETF (589010) closing up 0.98%, reaching a peak intraday increase of 2.62 [1] - The Robotics ETF (562500) rose by 0.71%, experiencing significant intraday volatility with a maximum fluctuation of 3.67% [1] - Total trading volume reached 2.022 billion yuan, indicating robust market activity and sustained liquidity [1] - The latest scale of the Robotics ETF reached 17.35 billion yuan, setting a new record and significantly surpassing comparable funds [1] Key Developments - DeepSeek announced an upgrade to its online model version V3.1, featuring a longer context window and readiness for testing [2] - Yushu Technology teased a new humanoid robot with a height of 1.8 meters and 31 degrees of freedom, suggesting advanced agility and elegance [2] - Shanghai's new implementation plan aims to accelerate "AI + manufacturing" development, targeting 3,000 manufacturing companies for smart applications over three years [2] Institutional Insights - Guojin Securities expressed optimism regarding the domestic advantages in AI applications, particularly in the integration of software and hardware, with positive growth expected in the second half of the year [3] Popular ETFs - The Robotics ETF (562500) is noted as the only fund exceeding 10 billion yuan in scale, offering the best liquidity and comprehensive coverage of China's robotics industry [4] - The Huaxia Sci-Tech AI ETF (589010) is characterized as the "brain" of robotics, with a 20% fluctuation range and potential for capturing significant moments in the AI industry [4]