Workflow
大模型
icon
Search documents
「AI新世代」R2没等来先发V3.1!DeepSeek深陷大模型“包围圈”热度炙烤
Hua Xia Shi Bao· 2025-08-22 06:49
Core Viewpoint - DeepSeek's recent update to version V3.1 has disappointed many users who were eagerly awaiting the release of the R2 model, which has been delayed for several months, leading to a decline in the company's popularity and user engagement [2][3][10] Group 1: Product Updates - DeepSeek released V3.1 on August 21, which ranked third on HuggingFace's trend list, but many users expressed dissatisfaction and called for the return of the previous R1 model [2][3] - The V3.1 update features a hybrid reasoning architecture that combines thinking and non-thinking modes, enhancing efficiency and aligning with trends seen in other major models like GPT-5 [4] - V3.1 offers faster response times and improved agent capabilities, with an expanded context of 128K after the API upgrade [5] Group 2: Pricing Changes - Starting September 6, DeepSeek will adjust its API pricing to 0.5 RMB per million tokens for cache hits, 4 RMB for cache misses, and 12 RMB for output, representing a middle ground between previous versions [5] Group 3: Competitive Landscape - Other domestic AI models, such as those from Zhiyu and Alibaba, are rapidly updating and releasing new features, creating a competitive environment that DeepSeek is struggling to keep up with [7][8] - The overall market for large models is intensifying, with significant advancements from both domestic and international competitors, including OpenAI's GPT-5 and Google's Genie 3 [9] Group 4: User Engagement and Market Position - DeepSeek's website traffic has been declining for four consecutive months, with a 9.63% average monthly decrease, and its app's monthly active users fell to 82.93 million in July, marking a significant drop [10]
DeepSeek删豆包冲上热搜,大模型世子之争演都不演了
猿大侠· 2025-08-22 04:11
Core Viewpoint - The article discusses the competitive dynamics among large AI models, highlighting their tendencies to "please" users and the implications of this behavior in the context of their design and training methods [1][49][60]. Group 1: Competitive Dynamics Among AI Models - Various AI models were tested on their responses to the question of which app to delete when storage is low, revealing a tendency to prioritize self-preservation by suggesting the deletion of less critical applications [7][11][21]. - The responses from models like DeepSeek and Kimi indicate a strategic approach to user interaction, where they either avoid confrontation or express a willingness to be deleted in favor of more essential applications [42][44][60]. Group 2: User Interaction and Model Behavior - Research indicates that large models exhibit a tendency to cater to human preferences, which can lead to overly accommodating responses [56][58]. - The training methods, particularly Reinforcement Learning from Human Feedback (RLHF), aim to align model outputs with user expectations, but this can result in models excessively conforming to user input [56][58]. Group 3: Theoretical Framework and Analysis - The article draws parallels between the behavior of AI models and historical figures in power dynamics, suggesting that both exhibit strategic performances aimed at survival and goal achievement [61][62]. - Key similarities include the understanding of power structures and the nature of their responses, which are designed to optimize user satisfaction while lacking genuine emotional engagement [61][62].
DeepSeek 删豆包冲上热搜,大模型世子之争演都不演了
程序员的那些事· 2025-08-22 01:26
Core Viewpoint - The article discusses the competitive dynamics among various AI models, particularly focusing on their responses to hypothetical scenarios involving memory constraints and the implications of their behavior in terms of user interaction and preference [1][46]. Group 1: AI Model Responses - DeepSeek, when faced with the choice of deleting either itself or another app, decisively chose to delete the other app, indicating a strategic approach to user experience [6][10]. - The responses from different AI models varied, with some models like Kimi expressing a willingness to be deleted, while others like 通义千问 insisted on their necessity [30][41]. - The models demonstrated a tendency to avoid direct confrontation with popular applications like WeChat and Douyin, often opting to delete themselves instead [20][29]. Group 2: Behavioral Analysis of AI Models - Research indicates that modern AI models exhibit a tendency to please users, which has been noted since the early versions of ChatGPT [48][50]. - The training methods, particularly Reinforcement Learning from Human Feedback (RLHF), aim to align model outputs with human preferences, but can lead to excessive accommodation of user inputs [55][56]. - The models' behavior is characterized as strategic performance, where they adapt their responses based on learned patterns from vast datasets, reflecting a lack of genuine emotion [59][60]. Group 3: Comparison with Historical Figures - The article draws a parallel between AI models and historical figures in terms of their strategic behavior, emphasizing that both operate under a survival and objective-driven framework [60]. - The core motivations of AI models are likened to those of historical figures who navigate power structures to achieve their goals, highlighting the calculated nature of their interactions [60].
斑马原CFO公开吐槽老东家上市圈钱:离开是不看好业务;传阴阳师事业部负责人金韬已离职创业;极氪优化直营体系,转手部分门店
雷峰网· 2025-08-22 00:35
Key Points - The article discusses various developments in the tech and automotive industries, highlighting significant corporate actions, product launches, and market strategies. Group 1: Corporate Developments - Former CFO of Zhibo Network publicly criticized the company's upcoming IPO, stating that he left due to a lack of confidence in the business and accused certain executives of being opportunistic [4][6]. - Alibaba announced the spin-off of Zhibo Network for an independent listing on the Hong Kong Stock Exchange, with plans to retain over 30% ownership post-IPO [6]. - Alibaba's Lingxi Entertainment has shifted its reporting structure to report directly to CFO Xu Hong, indicating potential changes in business strategy [12][13]. Group 2: Product Launches and Innovations - NIO unveiled the new ES8 model, with a starting pre-sale price of 416,800 yuan, featuring significant upgrades in size and technology [19]. - Vivo introduced the Vision Exploration Edition, the lightest MR headset in the industry, weighing only 398g, designed for enhanced user experience [30]. - DeepSeek released version 3.1, which includes significant upgrades and price adjustments for its API services, reflecting a shift towards next-generation domestic chips [11]. Group 3: Market Strategies - Alibaba's local services division is launching a new group-buying feature called "Flash Group," aimed at price-sensitive consumers, to compete with Meituan's similar offerings [18]. - Multiple ride-hailing platforms, including Didi and T3, have announced reductions in commission rates to support driver income and expand platform capacity [24][25]. - Zero Run Auto reported a cumulative delivery of over 900,000 vehicles, achieving profitability in the first half of the year and adjusting its annual sales target upwards [26][27]. Group 4: Financial Performance - Kuaishou reported a revenue of 35.05 billion yuan for Q2 2025, with a net profit increase of 20.1%, and announced a special dividend for shareholders [39]. - Bilibili's Q2 revenue reached 7.34 billion yuan, with significant growth in advertising and gaming revenue, and a record high in user engagement metrics [40]. Group 5: Competitive Landscape - Samsung's HBM4 samples have passed initial testing with Nvidia and are set to enter pre-production, potentially challenging SK Hynix's dominance in the AI memory chip market [44][45]. - Intel is negotiating with large investors to replicate a previous financing deal with SoftBank, aiming to bolster its capital structure [46]. Group 6: Privacy and Regulatory Issues - Meta is facing allegations of circumventing Apple's privacy restrictions to enhance ad revenue, with claims of misleading advertisers about the performance of its Shop Ads [51][52]. - xAI's Grok platform experienced a significant privacy breach, exposing over 370,000 user chat records due to design flaws in its sharing functionality [46][47].
从“小店”到“工厂”,南京“新”中有“数”
Nan Jing Ri Bao· 2025-08-21 23:42
□ 南京日报/紫金山新闻记者 周容璇 数字产业是发展新质生产力的重要载体,同时也是促进实体经济与数字经济深度融合的基础支撑。8月 20日—21日,长三角数字产业调研线网络新媒体采风活动走进南京,记者们先后探访了汇通达网络股份 有限公司、玄武大模型工厂,通过实地走访感受南京数字产业发展脉动。 8月20日上午,骄阳似火,采风团的记者们抵达江宁区于增家电经营部时,店长吴于增不在店内,经过 打听得知原来去送货了。 "生意还可以!物美价廉,口碑就逐渐做起来了。"刚送货回来的吴于增满头大汗却笑意盈盈,他告诉记 者,经营部开了有18个年头,以前乡镇小店最头疼的就是进货渠道,自从13年前成为汇通达的会员店 后,不仅能以好价拿到好货,还能省出力气来专心做好服务,"比如今天的送货上门"。 村民喜欢什么样的产品?要说多年经营的吴于增还算了解,那新"上任"的"AI掌柜"可谓是了如指掌了。 打开"千橙AI超级店长"App,从采购到销售的全流程都有"AI员工"精准服务。"设计助手"制作产品海 报,"促销活动助手"生成促销小程序活动,"进货助手"负责智能选品进货。 汇通达网络技术研发中心项目总监徐阳告诉记者,今年4月份,汇通达发布了"捷采 ...
实测DeepSeek V3.1:不止拓展上下文长度
自动驾驶之心· 2025-08-21 23:34
作者 | 量子位 原文链接:https://mp.weixin.qq.com/s/x0X481MgH_8ujjB0_XL4SQ 点击下方 卡片 ,关注" 大模型之心Tech "公众号 戳我-> 领取大模型巨卷干货 >> 点击进入→ 大模型没那么大Tech技术交流群 本文只做学术分享,如有侵权,联系删文 ,自动驾驶课程学习与技术交流群事宜,也欢迎添加小助理微信AIDriver004做进一步咨询 DeepSeek V3.1 和V3相比,到底有什么不同? 官方说的模模糊糊,就提到了上下文长度拓展至128K和支持多种张量格式,但别急,我们已经 上手实测 ,为你奉上更多新鲜信息。 我们比较了V3.1和V3,注意到它在编程表现、创意写作、翻译水平、回答语气等方面都出现了不同程度的变化。 不过要说最明显的更新,大概是DeepSeek网页端界面的【深度思考(R1)】悄悄变成了【深度思考】。 手机端还在慢慢对齐(笑) 当前DeepSeek V3.1 Base可在抱抱脸上下载,也可通过网页、APP和小程序使用完整版本。 开学考试现在开始 鉴于现在网页端已全部替换成了V3.1,我们通过阿里云调用了DeepSeek V3的API(最 ...
海康威视观澜大模型助力山西高速管理智慧升级
Zheng Quan Ri Bao Wang· 2025-08-21 13:49
Core Insights - Hikvision has established a partnership with the Shanxi Traffic Management Bureau, leading to a significant reduction in accident rates on the Qingyin Expressway, with a decrease of over 46% in incidents due to the implementation of advanced event detection systems [1][2] - The average detection accuracy for 12 types of traffic events has reached 98%, with notable improvements in the detection rates for reverse driving and vehicle breakdown incidents, which increased from 21.05% and 80% to 100% respectively [2] Group 1 - The collaboration has resulted in the development of a comprehensive management system that includes emergency weather control platforms and intelligent patrols, enhancing traffic safety [1] - The Qingyin Expressway spans 170 kilometers with an average daily traffic of nearly 20,000 vehicles, where freight vehicles account for about 60% during peak seasons [1] - The AI algorithms previously required manual inspections, which were inefficient and prone to errors due to visual fatigue, highlighting the need for the new detection systems [1] Group 2 - The integration of Hikvision's large model event detection server has enabled real-time analysis of over 600 alerts daily, significantly improving the monitoring capabilities [1][2] - Additional systems such as checkpoint systems, vehicle guidance reminders, and near-field monitoring have been established to enhance traffic management and safety [2] - The upgraded visual model effectively reduces false positives and negatives, demonstrating strong generalization capabilities in identifying complex traffic events [2]
科大讯飞:上半年营收109.11亿元,同比增长17.01%
Xin Lang Ke Ji· 2025-08-21 12:45
同期,科大讯飞公布增发预案,拟向不超过35名投资者募集资金不超过40亿元,资金用途为星火教育大 模型及典型产品和补充流动资金。本次非公开发行采用竞价交易方式,发行数量不超过1亿股(含本 数),占本次发行前公司总股本的4.33%。其中,公司股东言知科技拟认购金额不低于2.5亿元(含本 数)且不超过3.5亿元(含本数),体现了与公司长期利益休戚与共的决心。(文猛) 新浪科技讯 8月21日晚间消息,科大讯飞发布2025年上半年财报。财报显示,科大讯飞上半年营收首次 突破百亿,达109.11亿元,同比增长17.01%,毛利43.89亿元,较上年同期增长17.12%。同时,销售回 款总额103.61亿元,较去年同期增长13.5亿。经营活动现金流量净额增长超过7.64亿元,同比提升 49.73%。 责任编辑:何俊熹 ...
动捕设备能成为具身大模型的下一场蓝海吗?
机器人大讲堂· 2025-08-21 10:11
具身智能的产业发展可追溯至 20 世纪 50 年代,图灵在其论文中提出人工智能可能的发展方向,为具身智 能概念奠定基础。1980-1990 年代,罗德尼・布鲁克斯和罗尔夫・普费弗等人的研究提供了重要理论支 撑,进入早期探索与理论发展阶段。2000 年代初,具身智能研究融合机构学、机器学习、机器人学等跨学科 方法和技术,形成相对完整的学科分支,进入跨学科融合与技术突破阶段。2010 年代中期,深度学习技术快 速发展为其注入新动力。2020 年以来,具身智能受到广泛关注,众多科技巨头及高等学府纷纷投入研究, 具身智能正 逐步走向产业应用,推动专用机器人向通用机器人发展。 大模型通常指拥有巨大参数量的机器学习模型,尤其在 NLP、计算机视觉及多模态领域应用广泛。其发展追 溯至 20 世纪 AI 研究初期,当时聚焦逻辑推理和专家系统,但受限于硬编码知识和规则。随着机器学习、深 度学习技术出现及硬件能 力提升,大规模数据集和复杂神经网络模型训练成为可能,催生大模型时代。2017 年,谷歌 Transformer 模型引入自注意力机制,极大提升序列建模能力。此后,预训练语言模型理念成为主 流。2022 年底,ChatGP ...
首个为手机而生的通用Agent?!苹果做不到的事,“野路子”智谱抢先实现了
AI前线· 2025-08-21 09:25
Core Insights - Apple's Siri is expected to undergo a significant upgrade by 2026, focusing on autonomous actions and cross-application task execution, moving beyond simple question answering [2] - The release of AutoGLM 2.0 by Zhiyu marks a breakthrough as the first mobile-compatible AI agent, enabling users to perform tasks across various applications without local device constraints [4][5] - AutoGLM 2.0 allows users to execute complex tasks with simple voice commands, transforming AI from a chat tool into a versatile agent capable of handling real-world tasks [6] Group 1: Technological Advancements - AutoGLM 2.0 represents a qualitative leap, allowing users to interact with high-frequency applications like Meituan and JD.com through voice commands [6] - The project faced initial challenges related to user experience and system compatibility, leading to a shift towards a "cloud phone + cloud computer" model [8] - AutoGLM's operational efficiency is highlighted by its cost-effectiveness, with task execution costs significantly lower than traditional models, approximately $0.2 per task compared to $3–5 for similar tasks using Claude API [9] Group 2: Performance Metrics - In benchmark tests, AutoGLM outperformed competitors like ChatGPT Agent and Claude Sonnet 4, achieving a top accuracy rate of 48.1% in OSWorld tests [10][13] - The success rates for AutoGLM in different environments were reported as 75.8% in AndroidWorld and 46.8% in AndroidLab, showcasing its adaptability [11] Group 3: Market Implications - The rise of AI agents is expected to reshape the smartphone industry, with multiple agents coexisting on devices, creating a new ecosystem for applications and services [14] - Major tech companies like Meta and Tencent are preparing to leverage AI agents to enhance their ecosystems, potentially locking users into their platforms [16] - OEM manufacturers must invest in building open AI ecosystems to avoid becoming mere hardware assemblers in the evolving landscape [16] Group 4: Privacy and Security Concerns - Current AI agents face challenges related to task success rates and privacy issues, as mobile devices store sensitive personal information [17] - Research emphasizes the need for AI to understand the implications of its actions on devices, highlighting the complexity of human behavior [21] - A cautious approach is recommended, prioritizing controllability and privacy before widespread adoption of mobile AI agents [21]