Seek .(SKLTY)
DeepSeek's Low-Key "New Release" Sparks Widespread Discussion
Huan Qiu Wang Zi Xun· 2025-03-25 22:39
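In late February this year, Reuters cited three people familiar with the matter as saying that DeepSeek had originally planned to release R2 in early May but now hopes to launch it as early as possible; no specific date has been disclosed. DeepSeek also hopes the new model will perform better in code generation and multilingual reasoning. These foreign media reports, however, have not been confirmed or answered by DeepSeek.
Shen Yang said the launch of DeepSeek-V3-0324 further highlights the competitiveness of Chinese AI companies in technology and cost. US restrictions on GPU exports to China may push Chinese companies to speed up adaptation to domestic hardware, while DeepSeek's open-source approach may trigger a chain of moves by Western vendors, such as releasing stronger closed-source models. 2025 may prove to be a watershed in China-US AI competition.
Shen Yang noted that, with OpenAI planning to merge general-purpose and reasoning models in its GPT line, observers are watching whether China's leading large models, DeepSeek included, will eventually follow the same merging trend. "That possibility exists, because users do not care what type of model is used to answer their questions; they care more about whether the model can give a more intelligent and reasonable reference answer."
[Image: DeepSeek mobile page. Source: Visual China]
Asked by a Global Times reporter which capabilities the new DeepSeek-V3 version improves, DeepSeek said that, first, the new version's coding ability is significantly improved, approaching Cla ...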
DeepSeek Officially Announces a Minor V3 Upgrade: Where It Excels, and Why It Is Hailed as an "Open-Source Milestone"
Di Yi Cai Jing· 2025-03-25 15:12
Core Insights
- DeepSeek has officially announced a minor-version upgrade of its V3 model, which has drawn significant attention for its improvements in reasoning, front-end development, Chinese writing, and search capabilities. The model is now recognized as the highest-scoring non-reasoning model, surpassing competitors such as xAI's Grok3 and OpenAI's GPT-4.5 [1][4]
Group 1: Model Enhancements
- The new V3 model draws on the reinforcement learning techniques used in training the R1 model, significantly improving its performance on reasoning tasks [6]
- In code-related tasks, the V3 model generates more usable and visually appealing code, exemplified by a program simulating multiple balls in motion with adjustable parameters [6]
- The model has improved the quality of mid- to long-form Chinese text creation and provides more detailed, better-formatted output for report generation in online search scenarios [6]
Group 2: Performance Metrics
- The V3 model has achieved a 7% increase in intelligence index, leading all other non-reasoning models, although it still trails DeepSeek's own reasoning model R1 and other reasoning models from OpenAI, Anthropic, and Alibaba [7]
- Because it is a non-reasoning model, the V3 model responds immediately, which makes it particularly useful in latency-sensitive scenarios [7]
Group 3: Developer Feedback
- Developers have reported significant improvements with the V3 model, noting that it surpasses the R1 model and even Claude-3.7 in practical coding tests and shows visible advances in physical motion simulation [7]
- An overseas developer used the new model to build a website and write over 800 lines of code without any errors, highlighting the competitive pressure open-source models are placing on larger tech companies [8]
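New DeepSeek-V3 Tops the Non-Reasoning Model Leaderboard! National Business Daily Reporter Tests Its Coding Ability. Is the R2 Model Coming Too?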
Mei Ri Jing Ji Xin Wen· 2025-03-25 13:48
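National Business Daily reporter: Yue Chupeng. Editor: Lan Suying.
On the evening of March 24, Beijing time, DeepSeek quietly uploaded the latest version of its DeepSeek-V3 model to the open-source platform HuggingFace.
The new model's version number is DeepSeek-V3-0324. It has 685 billion parameters, a slight increase over the original V3's 671 billion.
Although DeepSeek kept a very low profile, many people noticed the update right away and began testing it.
According to community test feedback, the most noticeable change in DeepSeek-V3-0324 is a dramatic improvement in coding ability. Many developers, judging from their overall experience with the new model, say its coding ability is now close to that of Claude 3.7 Sonnet, currently the strongest coding model.
On March 25, the latest ranking published by Artificial Analysis, a professional AI model evaluation organization, showed that the new V3 jumped 7 places over the old V3 in benchmark testing, ranking first among all non-reasoning models.
In hands-on testing, National Business Daily reporters found that DeepSeek-V3-0324's coding ability is indeed strong, but that hallucinations still occur.
Some foreign media speculated: "The timing and characteristics of the new V3 release strongly suggest that it will become the foundation for DeepSeek-R2, which is ...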
Minecraft Becomes AI's New "Exam Room"? A High School Senior Uses the Game to Evaluate AI: DeepSeek-R1 Ranks Third
36Kr · 2025-03-25 12:45
Core Insights
- A high school student, Adi Singh, has developed a new AI evaluation benchmark called MC-Bench, utilizing the game Minecraft to assess AI models' capabilities in a more intuitive manner [1][2][10]
- Traditional standardized tests often give AI models an unfair advantage, as they are optimized for specific tasks, leading to discrepancies in real-world performance [2][8]
- MC-Bench allows users to vote on AI-generated architectural designs in Minecraft, providing a crowdsourced method for evaluating AI performance [5][9]
Group 1: MC-Bench Overview
- MC-Bench is designed to evaluate AI models by having them create structures in Minecraft based on user prompts, such as "a crystal-clear wine glass filled with deep red wine" [2][5]
- The evaluation process involves user voting to select the best creations, with results revealed only after voting concludes [5][10]
- The project has garnered attention from major AI companies like OpenAI, Google, and Anthropic, which provide computational resources but are not officially collaborating [10][13]
Group 2: Advantages of Game-Based Evaluation
- Minecraft serves as a familiar and visually engaging platform, making it easier for the general public to understand and participate in AI assessments [7][8]
- The game environment allows for a controlled testing space, enabling the evaluation of AI's reasoning and planning abilities in a safe manner [7][8]
- Game-based assessments can simulate real-world complexities, test AI's decision-making skills, and provide a repeatable environment for comparison [7][8]
Group 3: Current Status and Future Plans
- As of now, MC-Bench primarily tests basic construction abilities of AI models, tracking their progress since the GPT-3 era [10][16]
- Future plans include expanding the benchmark to more complex tasks that require long-term planning and goal-oriented actions [10][16]
- The leaderboard of MC-Bench shows that Claude 3.7 Sonnet ranks first, while DeepSeek-R1 is currently in third place, indicating the platform's effectiveness in reflecting user experiences with these models [14][16]
Foreign Media: After DeepSeek's Breakout Success, Chinese AI Startups Are Overhauling Their Business Models
Guan Cha Zhe Wang· 2025-03-25 12:29
Core Insights
- The Chinese AI startup landscape is undergoing significant changes as companies adjust their business models in response to the success of DeepSeek, which has led to a concentration of market power among a few leading firms [1][2][3]
Group 1: Business Model Adjustments
- Many Chinese AI startups are shifting resources toward application development rather than foundational model development because of competitive pressure from DeepSeek [1]
- 01.AI, founded by former Google China head Kai-Fu Lee, is transitioning its business to align with what it calls the "DeepSeek era," ceasing pre-training of large language models by the end of 2024 [1]
- The company announced it will offer customized enterprise-level DeepSeek deployment solutions, leveraging its expertise in mixture-of-experts (MoE) models [1]
Group 2: Funding and Investment
- The startup Moonshot AI is reducing the marketing budget for its chatbot Kimi and focusing on model training to enhance performance, having raised over $1.3 billion (approximately 9.4 billion RMB) in funding in 2024 [2]
- Alibaba has shown interest in acquiring Moonshot AI, having invested $800 million, which includes rights for a future purchase, although recent shifts in focus may lower the likelihood of this acquisition [2]
Group 3: Sector Focus Changes
- Baichuan Intelligence is pivoting toward the healthcare sector, having dissolved its financial AI sales team to concentrate on developing AI technologies for medical diagnostics [3]
- Zhipu AI, founded by renowned computer scientist Tang Jie, is exploring multiple business avenues and aims for an IPO by the end of 2025, although DeepSeek's growth may affect this plan [3]
- Zhipu AI reported sales of 300 million RMB in 2024, with losses amounting to 2 billion RMB [3]
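The Post-DeepSeek Era: The Six Little Tigers Go Left, BAT Goes Right
36Kr · 2025-03-25 11:23
DeepSeek was like a boulder thrown into a calm lake, setting off towering waves across the AI industry. One could even say, with some exaggeration, that it directly rewrote the rules of competition among China's large models.
DeepSeek gave the large-model industry a free round of nationwide market education, yet it also hung a sword of Damocles, impartially, over the heads of the early movers.
Zhipu, one of the AI "Six Little Tigers," is a case in point. Zhipu grew out of Tsinghua University's Knowledge Engineering Laboratory and has long been known as the "national team." Recently, however, it has made frequent fundraising moves, replenishing its war chest with 1.5 billion RMB within 10 days. At the same time, organizational turbulence appears to be intensifying, affecting everyone from frontline teams to senior executives and star researchers. This mix of fire and ice is an accurate portrait of most large-model practitioners today, DeepSeek aside.
At the end of 2024, Zhipu completed a 3 billion RMB funding round at a valuation of 20 billion RMB. After that, state-backed investors including Hangzhou Chengtou (杭州城投), Shangcheng Capital (上乘资本), and Huafa Group (华发集团) rushed in with capital.
However, a venture capitalist told 「新熵」 that DeepSeek's sudden emergence did have some negative impact on Zhipu's valuation, and that raising money quickly may also be an attempt to lock in a relatively high price that has already hit a ceiling.
In contrast to its wide-open cash coffers, Zhipu is contracting in both team size and outside investment. At its peak, ...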
Kai-Fu Lee: DeepSeek Has Narrowed the China-US AI Gap to Just Three Months
Sou Hu Cai Jing· 2025-03-25 09:30
Core Insights
- Kai-Fu Lee, CEO of 01.AI, stated that the gap between China and the U.S. in AI development has narrowed to just three months in certain areas due to advancements by companies like DeepSeek [3]
- Lee emphasized that the rise of DeepSeek indicates China's leading position in infrastructure and software engineering [3]
- He noted that U.S. semiconductor sanctions act as a "double-edged sword," presenting challenges but also driving innovation within Chinese companies [3]
Company Developments
- 01.AI is focusing on practical AI applications, specifically software solutions that help clients better deploy foundational models [4]
- The company recently launched an all-in-one AI work platform called "Wanzhi," aimed at helping enterprises deploy AI technology [4]
- 01.AI has begun generating revenue and anticipates significant growth, projecting that its 2025 revenue will reach several times last year's $15 million [4]
Boao Report: DeepSeek Highlights China's Development Resilience Under US Sanctions
Nan Fang Du Shi Bao· 2025-03-25 06:50
Core Insights
- The report highlights the resilience of China's economy and key industries under U.S. sanctions, using DeepSeek as a case study to illustrate innovation and growth potential in the face of geopolitical challenges [3].
Economic Outlook
- The "Asian Economic Outlook and Integration Process 2025 Report" predicts a moderate recovery in Asian economies, with growth expected to rise to 4.5% in 2025 from 4.4% in 2024. The share of Asian economies in world GDP is projected to increase from 48.1% in 2024 to 48.6% in 2025 [2].
- Excluding China, the weighted real GDP growth rate for other East Asian economies is expected to decline by 1.0 percentage points to 3.3% in 2025, while the growth rate for other Asian economies is projected to decrease by 0.3 percentage points to 4.2% [2].
Trade and Investment Challenges
- Ongoing trade tensions, particularly with the U.S. imposing tariffs on goods from Mexico, Canada, and China, are expected to create significant uncertainty for global trade and investment in 2025 [2].
- The report emphasizes that these trade frictions will put overall pressure on trade and investment in Asia [2].
E-commerce and Digital Trade
- E-commerce and digital trade are highlighted as key growth areas, with the Asia-Pacific region's retail e-commerce growth expected to reach 8.4% in 2024. China's cross-border e-commerce import and export total is projected at 2.63 trillion yuan (approximately 369 billion USD), reflecting year-on-year growth of 10.8% [4].
- Southeast Asia's e-commerce sector is also experiencing significant growth, with a gross merchandise value (GMV) of 263 billion USD, up 15% year-on-year [4].
- However, compliance risks related to cross-border e-commerce are noted, particularly due to changes in U.S. customs policies and new EU regulations on low-value goods [4].
Employment Trends
- The employment growth rate in Asia is expected to decline significantly from 1.94% in 2024 to 1.22% in 2025, which is lower than the global growth rate of 1.28% [6].
- Despite this decline, the overall unemployment rate in Asia is projected to decrease slightly from 4.40% in 2024 to 4.39% in 2025, remaining below the global rate of 4.96% [6].
- The impact of artificial intelligence on employment is highlighted, with varying effects across regions and genders. The highest impact is expected in Northern, Southern, and Western Europe, while East Asia shows the highest proportion of affected employment in Asia [6][8].
Morgan Stanley: China's DeepSeek Moment
Morgan Stanley · 2025-03-25 06:35
Investment Rating
- The report suggests a positive outlook for investment in China's AI sector, particularly highlighting the emergence of DeepSeek as a significant milestone in the industry [1][3].
Core Insights
- DeepSeek's development represents China's ambition to lead in the tech revolution, potentially inspiring a new generation of talent and contributing to national pride [1][7].
- The cost-effective training of DeepSeek, reportedly under $6 million, challenges the narrative that China lags behind the U.S. in AI innovation, as it achieves near-parity with top models [2][3].
- The MSCI China Index surged 26% following DeepSeek's unveiling, indicating strong investor enthusiasm for AI-driven economic growth [3].
Summary by Sections
DeepSeek's Impact
- DeepSeek's breakthrough is seen as a symbol of China's resurgence in innovation and competitiveness, with implications for emerging market investors [1][14].
- The emergence of other AI agents, such as Butterfly Effect's Manus, further illustrates the competitive landscape in China's AI sector [4][5].
Policy and Market Dynamics
- A shift in policy from regulatory crackdowns to support for private-sector innovation is noted, with high-level meetings between political leaders and tech executives [8].
- China's AI ecosystem is positioned as a unique opportunity for investors, focusing on consumer-facing applications rather than hardware [9].
Future of AI Development
- The report outlines a dual-track future for AI, contrasting China's efficiency-driven approach with the capital-intensive models in the U.S. [13][14].
- Both models are expected to coexist, providing a diversified opportunity set for emerging market investors [14].
Kunming, Yunnan Launches Four "DeepSeek + Substation" Applications
Zhong Guo Dian Li Bao· 2025-03-24 06:22
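To further improve fault diagnosis and handling in the substation discipline, on March 13 the Kunming Power Supply Bureau of China Southern Power Grid in Yunnan, building on the DeepSeek large model newly opened within the grid's "大瓦特" (Big Watt) model system, became the first in Yunnan to launch four typical model applications for substation operations: "Intelligent Generation and Verification of Operation Tickets," "Intelligent Analysis of Equipment Test Data," "Acceptance Assistant for New, Renovation, and Expansion Projects," and "Acceptance Assistant for Substation Equipment Maintenance and Overhaul." Through local deployment of "DeepSeek + Substation," the main transformer acceptance process has been made fully intelligent, cutting the time for a single acceptance from 1.5 hours to 15 minutes.
In substation operation and maintenance, equipment acceptance suffers from pain points such as sprawling regulatory documents and inefficient manual verification. Taking main transformer acceptance as an example, operations staff must check 62 items of test data one by one against 10 documents, including 《电力设备交接验收规程》 (the handover acceptance code for power equipment) and 《绝缘油击穿电压测定法》 (the method for determining the breakdown voltage of insulating oil). A single acceptance takes as long as 90 minutes, and lapses of memory create a risk of missed checks.
The "Intelligent Analysis of Equipment Test Data" application launched by the Kunming Power Supply Bureau builds a structured knowledge base covering 45 national standards and industry codes. Using the DeepSeek large model with targeted training on power-industry scenarios, it automates the whole flow from equipment model input, through intelligent data comparison, to precise problem location. After staff enter the equipment parameters, the system can parse in real time how well the test data matches the regulatory clauses and quickly identify issues such as "winding DC resistance exceeding the limit ...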