Workflow
DeepSeek
icon
Search documents
DeepSeek线上模型版本升级
第一财经· 2025-08-19 14:10
Core Insights - The DeepSeek online model has been upgraded to version 3.1, expanding the context length to 128K [1] Group 1 - The upgrade maintains the same testing methods across the official website, App, mini-program, and API interface [1]
AI进化速递 | DeepSeek线上模型版本升级
Di Yi Cai Jing· 2025-08-19 13:19
Group 1 - DeepSeek has upgraded its online model to version 3.1, expanding the context length to 128k [1] - Alibaba's Tongyi Qianwen has launched an image editing model called Qwen-Image-Edit [1] - Nvidia has released a small language model named Nemotron-Nano-9B-v2 [1] Group 2 - Xiaopeng Motors' chairman He Xiaopeng announced that humanoid robots and L4-supported vehicles are expected to be mass-produced by 2026 [3] - Nvidia is collaborating with Foxconn to develop a humanoid robot, which is expected to debut in November [1] - Figure's founder indicated that Helix is set to undergo a significant upgrade [1] - Arm has hired Amazon's AI chip head Rami Shino to support its self-developed chip initiatives [1]
X @Bloomberg
Bloomberg· 2025-08-19 13:12
DeepSeek announced what appeared to be an update to its older V3 artificial intelligence model on Tuesday, declaring an enhanced version ready for testing https://t.co/O9JXVomXhQ ...
DeepSeek新版本突袭上线,R2发布时间仍未明确
Feng Huang Wang· 2025-08-19 12:20
Group 1 - The core focus of the recent update is the expansion of context length, which has been increased to 128k, allowing for improved memory and processing capabilities [1][3] - Users have reported enhancements in front-end coding capabilities following the update [1][3] - There are rumors regarding the potential release of DeepSeek R2 in late August, but no official release date has been confirmed [3]
DeepSeek线上模型版本升级至V3.1
Mei Ri Jing Ji Xin Wen· 2025-08-19 11:43
Core Viewpoint - DeepSeek has upgraded its online model to version 3.1, expanding the context length to 128k [1] Company Summary - The upgrade to version 3.1 indicates a significant enhancement in the capabilities of DeepSeek's online model, allowing for a larger context length which can improve the model's performance and usability [1]
中国股市创10年来高点,科技和EV崛起
日经中文网· 2025-08-19 02:31
Core Viewpoint - The Chinese stock market is showing signs of recovery, with the Shanghai Composite Index reaching its highest level since August 2015, driven by the rise of new enterprises like DeepSeek and a focus on technology and electric vehicle (EV) stocks [2][4][6]. Group 1: Market Performance - The Shanghai Composite Index closed at 3728.0273 points on August 18, marking the highest level since mid-August 2015, with a nearly 20% increase from its recent low in early April [4]. - The index had previously experienced a significant decline, dropping below 2500 points in late 2018 to early 2019 due to factors such as the devaluation of the yuan and escalating trade tensions with the U.S. [4][6]. Group 2: Leading Companies - As of August 15, the largest companies by market capitalization include Tencent Holdings, which has a market cap of $694.1 billion, representing a 4.3 times increase over the past decade [6][7]. - Other notable companies include Industrial and Commercial Bank of China ($349.5 billion, 53% increase), Agricultural Bank of China ($325.7 billion, 2.2 times increase), and Alibaba Group ($288 billion, 73% increase) [7]. Group 3: Emerging Industries - The rise of electric vehicle-related stocks is significant, with CATL (Contemporary Amperex Technology Co., Limited) achieving a market cap of $180.3 billion after its secondary listing in Hong Kong [7]. - BYD, another major player in the EV sector, has seen its market cap increase nearly sevenfold over the past decade [7]. Group 4: Government Support and Strategy - Government subsidies have played a crucial role in the growth of emerging industries, with CATL receiving over 16.9 billion yuan in subsidies from 2015 to mid-2024 [9]. - The Chinese government is strategically allocating funds to boost specific industries, which can enhance competitiveness but may also distort stock market valuations [9].
核心模型被曝蒸馏DeepSeek?前女友一纸控诉,曝出欧版OpenAI塌房真相
3 6 Ke· 2025-08-18 12:12
曾被誉为「欧洲OpenAI」的Mistral AI,陷入「抄袭」丑闻!在分手小作文中,前员工爆料核心技术是蒸馏DeepSeek,却误导外界称为自主RL成果。 Mistal套壳DeepSeek,被当场抓现行了? 几天前就有人在X上爆料:Mistral的新模型是直接蒸馏自DeepSeek,而且基准测试结果还被歪曲了。 这个被视为欧洲版OpenAI「全村希望」的公司,地位就如同中国的DeepSeek一般,如今居然塌房了? 这实在是太魔幻了。 更为劲爆的是,这个重磅大瓜还是从一篇Mistral女员工的「分手小作文」里曝出来的。 原话是这样的—— 你早知道Mistral做事不讲道德:把DeepSeek蒸馏后当成自己的模型,使用OpenAI的数据,对外却误导称是RL在发挥作用,但它实际上只是DS3的产物, 还歪曲基准测试结果。 你不仅明知这些,还积极参与其中。当我指出这些问题时,你没有承担任何责任,反而选择无视我、对我冷处理。 情感纠纷小作文,曝出套壳大瓜 也就是说,这位Mistral离职的女员工,不仅在小作文中曝光了自己和前男友、Mistral同事的感情纠葛,还爆出Mistral套壳DeepSeek的丑闻。 这个消息一 ...
高性能计算群星闪耀时
雷峰网· 2025-08-18 11:37
Core Viewpoint - The article emphasizes the critical role of high-performance computing (HPC) in the development and optimization of large language models (LLMs), highlighting the synergy between hardware and software in achieving efficient model training and inference [2][4][19]. Group 1: HPC's Role in LLM Development - HPC has become essential for LLMs, with a significant increase in researchers from HPC backgrounds contributing to system software optimization [2][4]. - The evolution of HPC in China has gone through three main stages, from self-developed computers to the current era of supercomputers built with self-developed processors [4][5]. - Tsinghua University's HPC research institute has played a pioneering role in China's HPC development, focusing on software optimization for large-scale cluster systems [5][11]. Group 2: Key Figures in HPC and AI - Zheng Weimin is recognized as a pioneer in China's HPC and storage fields, contributing significantly to the development of scalable storage solutions and cloud computing platforms [5][13]. - The article discusses the transition of Tsinghua's HPC research focus from traditional computing to storage optimization, driven by the increasing importance of data handling in AI applications [12][13]. - Key researchers like Chen Wenguang and Zhai Jidong have shifted their focus to AI systems software, contributing to the development of frameworks for optimizing large models [29][31]. Group 3: Innovations in Model Training and Inference - The article details the development of the "Eight Trigrams Furnace" system for training large models, which significantly improved the efficiency of training processes [37][39]. - Innovations such as FastMoE and SmartMoE frameworks have emerged to optimize the training of mixture of experts (MoE) models, showcasing the ongoing advancements in model training techniques [41][42]. - The Mooncake and KTransformers systems have been developed to enhance inference efficiency for large models, utilizing shared storage to reduce computational costs [55][57].
港股科技板块确实可能成为「第二波」行情的主导力量
Sou Hu Cai Jing· 2025-08-18 11:34
Core Viewpoint - The Hong Kong technology sector is poised to lead the "second wave" of market momentum, supported by valuation, capital flow, and industry trends [2] Group 1: Historical Performance and Capital Trends - The Hang Seng Hong Kong Stock Connect China Technology Index has seen a year-to-date increase of 39.03% and an impressive 88.81% rise over the past year, significantly outperforming the broader market [2] - Continuous inflow of southbound capital, coupled with expectations of a 100 basis point rate cut by the Federal Reserve in 2024, alleviates liquidity pressure on Hong Kong stocks [2] Group 2: Sector Structure and Complementarity - The Hong Kong technology sector, primarily focused on internet, AI, and information technology services (e.g., Tencent, Alibaba, DeepSeek), complements the A-share market, which is more manufacturing-oriented [2] - Seven out of the top ten weighted stocks in the Hang Seng Technology Index are not listed on the A-share market, highlighting their scarcity [2] Group 3: Policy and Fundamental Support - Continued liquidity easing (e.g., LPR reduction) and supportive industrial policies (e.g., digital economy, AI development plans) provide a recovery space for technology companies [2] - In Q2 2025, leading companies like Tencent reported better-than-expected earnings, confirming the trend of fundamental improvement [2] Group 4: Institutional Perspectives and Divergence - Optimistic views from institutions like Qianhai Kaiyuan suggest that the Hong Kong technology sector has entered a "slow bull second phase," with profit growth expected to follow valuation recovery [2] - Cautious perspectives highlight short-term volatility risks, such as profit-taking pressure, sector rotation towards pharmaceuticals/consumption, and potential liquidity disturbances from fluctuating Federal Reserve policies [2] Group 5: Investment Opportunities - Recommended elastic targets include the Hang Seng Internet ETF (05188.hk) and the Hang Seng Technology Index ETF (07188.hk) [2] - Individual stock opportunities are identified in leading AI application companies and internet giants with better-than-expected performance [2]
GPT-5“让人失望”,AI“撞墙”了吗?
华尔街见闻· 2025-08-18 10:44
当OpenAI近日发布其新模型GPT-5时,本应是该公司的又一个高光时刻。Sam Altman曾预告,GPT-5是"通往AGI道路上重要的一步"。然而,模型发布后迅速 引发了失望情绪。 OpenAI备受期待的GPT-5未能带来革命性突破。虽然通往通用人工智能(AGI)的道路似乎遭遇瓶颈, 但市场焦点正转向如何利用现有技术,在产品和服务层 面创造更广泛的商业价值。 用户在社交媒体上分享了新模型犯下的低级错误,例如错误标注美国地图,而资深用户则对其性能和"个性"变化感到不满,认为其在基准测试中表现平平。 这也许不是OpenAI 的本意,但 GPT-5 的推出清楚地表明,人工智能竞赛的性质已经发生了变化。即使这不会在AGI 或所谓的超级智能方面带来非凡的进步, 也可能为使用人工智能模型创造的产品和服务带来更多创新。 这场风波让一个尖锐的问题席卷了硅谷: 在投入了数千亿美元的投资后,生成式AI的技术进展是否已接近当前阶段的极限? 这不仅挑战了OpenAI高达5000亿 美元的估值基础,也让外界开始重新审视AI技术的发展轨迹。 尽管技术前沿的讨论充满疑虑,但资本市场和产业应用的热情并未消退。 投资者似乎更看重AI在商业 ...