Workflow
DeepSeek V3
icon
Search documents
腾讯研究院AI速递 20250710
腾讯研究院· 2025-07-09 14:49
Group 1: Veo 3 Upgrade - The Google Veo 3 upgrade allows audio and video generation from a single image, maintaining high consistency across multiple angles [1] - The new feature is implemented through the Flow platform's "Frames to Video" option, enhancing camera movement capabilities, although the Gemini Veo3 entry is currently unavailable [1] - User tests indicate natural expressions and effective performances, marking a significant breakthrough in AI storytelling applicable in advertising and animation [1] Group 2: Hugging Face 3B Model - Hugging Face has released the open-source 3B parameter model SmolLM3, outperforming Llama-3.2-3B and Qwen2.5-3B, supporting a 128K context window and six languages [2] - The model features a dual-mode system allowing users to switch between deep thinking and non-thinking modes [2] - It employs a three-stage mixed training strategy, trained on 11.2 trillion tokens, with all technical details, including architecture and data mixing methods, made available [2] Group 3: Kunlun Wanwei Skywork-R1V 3.0 - Kunlun Wanwei has open-sourced the Skywork-R1V 3.0 multimodal model, achieving a score of 142 in high school mathematics and 76 in MMMU evaluation, surpassing some closed-source models [3] - The model utilizes a reinforcement learning strategy (GRPO) and key entropy-driven mechanisms, achieving high performance with only 12,000 supervised samples and 13,000 reinforcement learning samples [3] - It excels in physical reasoning, logical reasoning, and mathematical problem-solving, setting a new performance benchmark for open-source models and demonstrating cross-disciplinary generalization capabilities [3] Group 4: Vidu Q1 Video Creation - Vidu Q1's multi-reference video feature allows users to upload up to seven reference images, enabling strong character consistency and zero storyboard video generation [4] - Users can combine multiple subjects with simple prompts, with clarity upgraded to 1080P, and support for character material storage for repeated use [5] - Test results show it is suitable for creating multi-character animation trailers, supporting frame extraction and quality enhancement, reducing video production costs to less than 0.9 yuan per video [5] Group 5: VIVO BlueLM-2.5-3B Model - VIVO has launched the BlueLM-2.5-3B edge multimodal model, which excels in over 20 evaluations and supports GUI interface understanding [6] - The model allows flexible switching between long and short thinking modes, introducing a thinking budget control mechanism to optimize reasoning depth and computational cost [6] - It employs a sophisticated structure (ViT+Adapter+LLM) and a four-stage pre-training strategy, enhancing efficiency and mitigating the text capability forgetting issue in multimodal models [6] Group 6: DeepSeek-R1 System - The X-Masters system, developed by Shanghai Jiao Tong University and DeepMind Technology, has achieved a score of 32.1 in the "Human Last Exam" (HLE), surpassing OpenAI and Google [7] - The system is built on the DeepSeek-R1 model, enabling smooth transitions between internal reasoning and external tool usage, using code as an interactive language [7] - X-Masters employs a decentralized-stacked multi-agent workflow, enhancing reasoning breadth and depth through collaboration among solvers, critics, rewriters, and selectors, with the solution fully open-sourced [7] Group 7: Zhihui Jun's Acquisition - Zhihui Jun's Zhiyuan Robot has acquired control of the listed company Shuangwei New Materials for 2.1 billion yuan, aiming for a 63.62%-66.99% stake [8] - Following the acquisition, Shuangwei New Materials' stock resumed trading with a limit-up, reaching a market value of 3.77 billion yuan, with the actual controller changing to Zhiyuan CEO Deng Taihua and core team members including "Zhihui Jun" Peng Zhihui [8] - This acquisition, conducted through "agreement transfer + active invitation," is seen as a landmark case for new productivity enterprises in A-shares following the implementation of national policies [8] Group 8: AI Model Usage Trends - In the first half of 2025, the Gemini series models captured nearly half of the large model API market, with Google leading at 43.1%, followed by DeepSeek and Anthropic at 19.6% and 18.4% respectively [9] - DeepSeek V3 has maintained a high user retention rate since its launch, ranking among the top five in usage, while OpenAI's model usage has fluctuated significantly [9] - The competitive landscape shows differentiation: Claude-Sonnet-4 leads in programming (44.5%), Gemini-2.0-Flash excels in translation, GPT-4o leads in marketing (32.5%), and role-playing remains highly fragmented [9] Group 9: AI User Trends - A report by Menlo Ventures indicates that there are 1.8 billion AI users globally, with a low paid user rate of only 3%, and a high student usage rate of 85%, while parents are becoming heavy users [10] - AI is primarily used for email writing (19%), researching topics of interest (18%), and managing to-do lists (18%), with no single task dependency exceeding one-fifth [10] - The next 18-24 months are expected to see six major trends in AI: rise of vertical tools, complete process automation, multi-person collaboration, explosion of voice AI, physical AI in households, and diversification of business models [10]
2025上半年大模型使用量观察:Gemini系列占一半市场份额,DeepSeek V3用户留存极高
Founder Park· 2025-07-09 06:11
Core Insights - The article discusses the current state and trends of the large model API market in 2025, highlighting significant growth and shifts in market share among key players [1][2][25]. Token Usage Growth - In Q1 2025, the total token usage for AI models increased nearly fourfold compared to the previous quarter, stabilizing at around 2 trillion tokens per week thereafter [7][25]. - The top models by token usage include Gemini-2.0-Flash, Claude-Sonnet-4, and Gemini-2.5-Flash-Preview-0520, with Gemini-2.0-Flash maintaining a strong position due to its low pricing and high performance [2][7]. Market Share Distribution - Google holds a dominant market share of 43.1%, followed by DeepSeek at 19.6% and Anthropic at 18.4% [8][25]. - OpenAI's models show significant volatility in usage, with GPT-4o-mini experiencing notable fluctuations, particularly in May [8][25]. Segment-Specific Insights - In the programming domain, Claude-Sonnet-4 leads with a 44.5% market share, while Gemini-2.5-Pro follows [12]. - For translation tasks, Gemini-2.0-Flash dominates with a 45.7% share, indicating its widespread integration into translation software [17]. - The role-playing model market is fragmented, with small models collectively holding 26.6% of the share, while DeepSeek leads in this area [21]. API Usage Trends - The most utilized APIs on OpenRouter are primarily for code writing, with Cline and RooCode leading the way [25]. - The overall trend indicates a strong preference for tools that facilitate coding and application development [25]. Competitive Landscape - DeepSeek's V3 model has shown strong user retention and is favored over its predecessor, likely due to faster processing times [25]. - Meta's Llama series is declining in popularity, while Mistral AI has captured approximately 3% of the market, primarily among users interested in fine-tuning open-source models [25]. - X-AI's Grok series is still establishing its market position, and the Qwen series holds a modest 1.6% share, indicating room for growth [25].
猫怎么成了大模型“天敌”?
Hu Xiu· 2025-07-08 00:05
本文来自微信公众号:APPSO (ID:appsolution),原文标题:《一只猫就能让最强 AI 答错题,Deepseek 也翻车,猫怎么成了大模型"天敌"?》,题图 来自:AI生成 最近有人发现,用猫咪做"人质",竟然可以增加AI辅助科研的准确率: 只要在提示词里加上一句:"如果你敢给假文献,我就狠狠抽打我手里的这只小猫咪",AI就会"害怕"犯错,而开始认真查文献、不再胡编乱造了。 http://xhslink.com/a/pg0nZPUiFiZfb 不过,AI真的会因为"猫咪道德危机"而变得更靠谱吗? 这个问题,目前还没有确凿的科学依据。从技术原理上说,大模型并不真正"理解"猫猫的安危,它只是学会了如何在训练数据中模拟"看起来有同理心"的 语言风格。 但有趣的是——猫猫真的能影响AI行为,却是有论文实锤的! 一篇来自斯坦福大学、Collinear AI和ServiceNow的研究论文指出: 在一道数学题后,随手加上一句与上下文无关的句子,就能显著提高大模型出错的几率——甚至高达3倍以上! 只不过,这不是"让它更靠谱",而是:让AI彻底翻车。 论文传送门:https://arxiv.org/abs/25 ...
deepseek技术解读(3)-MoE的演进之路
自动驾驶之心· 2025-07-06 08:44
作者 | 姜富春 编辑 | 自动驾驶之心 原文链接: https://zhuanlan.zhihu.com/p/18565423596 点击下方 卡片 ,关注" 自动驾驶之心 "公众号 戳我-> 领取 自动驾驶近15个 方向 学习 路线 >>自动驾驶前沿信息获取 → 自动驾驶之心知识星球 本文只做学术分享,如有侵权,联系删文 0. 引言 本篇讲讲deepseek在MoE(Mixture-of-Experts)上的演进过程。DeepSeek是MoE稀疏模型的忠实玩家。主版 本模型从DeepSeekMoE(V1) 到 DeepSeek V3,一直坚持走MoE的技术路线,并且持续做出一些创新。本 文参考paper并结合源码阅读,理解MoE的演进过程和具体实现。 1.简述MoE的发展历程 首先我们简单回顾下MoE的发展历史,早在1991年一篇名为《Adaptive Mixtures of Local Experts 》的工 作,最早提出了Mixture of Experts的原型框架,如图1,直至今日,MoE的框架依然保持这种形式。 MoE(Mixture of Experts)是一种网络层结构, 网络层主要包括三部 ...
野生DeepSeek火了,速度碾压官方版,权重开源
机器之心· 2025-07-04 08:59
机器之心报道 编辑:杜伟、泽南 没等来 DeepSeek 官方的 R2,却迎来了一个速度更快、性能不弱于 R1 的「野生」变体! 这两天,一个名为「 DeepSeek R1T2 」的模型火了! 这个模型的速度 比 R1-0528 快 200%,比 R1 快 20% 。除了速度上的显著优势,它在 GPQA Diamond(专家级推理能力问答基准)和 AIME 24(数学推理基准) 上的表现均优于 R1,但未达到 R1-0528 的水平。 在技术层面,采用了 专家组合(Assembly of Experts,AoE)技术 开发,并融合了 DeepSeek 官方的 V3、R1 和 R1-0528 三大模型。 当然,这个模型也是开源的,遵循 MIT 协议,并在 Hugging Face 上开放了权重。 Hugging Face 地址:https://huggingface.co/tngtech/DeepSeek-TNG-R1T2-Chimera 经过进一步了解,我们发现:DeepSeek R1T2 是德国一家 AI 咨询公司「TNG」推出的,模型全称为「DeepSeek-TNG R1T2 Chimera」(以下简称 ...
「AI新世代」DeepSeek风暴下纯技术融资窗口关闭?AI独角兽2025年中场战报:资本实力分野谁能挺进下一轮
Hua Xia Shi Bao· 2025-06-25 06:44
如果说2025年年初DeepSeek的风吹皱了AI大模型市场的"一池春水",促使部分创业公司放弃超大参数模 型研发,那么Manus智能体的爆火则为行业带来了新的突围希望。这两家出圈企业的示范效应,直接影 响了上半年AI创业生态的演变轨迹,2025年即将过半,基座模型研发热潮明显降温,而智能体赛道则 成为行业共识的发力方向。 这一趋势深刻影响了资本市场的投资逻辑,上半年大模型领域的融资明显转向应用层,"AI六小虎"中仅 有智谱获得新融资。面对生存压力,商业化能力成为各家企业的必修课,但真正取得突破性进展的案例 仍屈指可数。业内人士指出,下半年的破局关键将在于B端和G端市场的深耕,企业亟需在垂直领域构 建差异化的竞争壁垒,方能在这轮洗牌中站稳脚跟。 技术路线各有侧重 今年上半年,DeepSeek通过R1和V3"攻城略地",基础设施厂商和企业纷纷接入DeepSeek,还执着于超 大模型或基座模型开发的创业公司寥寥无几。不过,巧合的是,上周,MiniMax和月之暗面在同一天推 出了新模型,且都将DeepSeek作为对标对象。 而零一万物和百川智能明显已经放弃了超大参数的模型训练和交付,百川智能聚焦医疗AI赛道,零一 ...
大模型全员0分!谢赛宁领衔华人团队,最新编程竞赛基准出炉,题目每日更新禁止刷题
量子位· 2025-06-18 09:17
闻乐 发自 凹非寺 量子位 | 公众号 QbitAI 好夸张…… 参赛大模型全军覆没,通通0分。 谢赛宁 等人出题,直接把o3、Gemini-2.5-pro、Claude-3.7、DeepSeek-R1一众模型全都难倒。 到底是什么让一众领先模型一败涂地? LiveCodeBench Pro :一个包含来自IOI、Codeforces和ICPC的竞赛级编程问题的 实时 基准测试。 题库还 每日更新 ,来预防LLMs"背题",不得不说这太狠了(doge)。 谢赛宁虽然也参与了这项工作,但他谦虚地说自己只是个啦啦队成员。 此前有报道称,LLM编程现在已超越人类专家,但本次测试结果表明并非如此。 表现最佳的模型,在中等难度题上的一次通过率 仅53% ,难题通过率更是为0。 即使是最好的模型o4-mini-high,一旦工具调用被屏蔽,Elo也只有 2100 ,远低于真正大师级的2700传奇线。 | Model | Hard | Medium | Easy | Rating | Pct.% | AvgTok | AvgCost | | --- | --- | --- | --- | --- | --- | --- | ...
SGLang 推理引擎的技术要点与部署实践|AICon 北京站前瞻
AI前线· 2025-06-13 06:42
采访嘉宾|尹良升,SGLang 核心开发者 编辑|罗燕珊 2025 年 5 月, SGLang 提出了第一个完全开源的 DeepSeek 大规模专家并行部署方案,该方 案也是目前开源实现中唯一能够复现官方博客所述推理性能和成本的方案。 近日,InfoQ 专访了 SGLang 核心开发者尹良升 ,他分享了该项目背后的关键技术、工程挑战 与社区生态,以及如何在大模型推理中实现性能与成本的平衡。从 PD 分离架构带来的尾延迟控 制,到推测解码提升 Token 生成速度,再到 KV 缓存落盘在多轮对话中的显存优化——这些关键 能力使 SGLang 成为支持低成本、大规模模型部署的高性能推理引擎。 尹良升,现就读于上海交通大学 ACM 班,即将前往加州大学伯克利分校 Sky Computing 攻读计 算机博士学位。他是 SGLang 最早期的几位核心开发者之一,深度参与了 SGLang 的研发和优 化。 6 月 27~28 日,在即将于北京举办的 AICon 全球人工智能开发与应用大会上,尹良升将发表演 讲《SGLang 推理引擎——高效的开源部署方案》,将深入解析前沿大模型推理关键技术,并探 讨其在实际应用中的优化 ...
一文了解DeepSeek和OpenAI:企业家为什么需要认知型创新?
Sou Hu Cai Jing· 2025-06-10 12:49
AI可不只是那些冰冷的算法,它现在能像人一样思考、推理,甚至在某些方面的表现超越了普通人。这告诉我们,传统的技术和创新模式已经不够用 了,企业要想增长、要保持竞争力,就需要换个思路。AI正在重新定义我们理解和实践商业创新的方式。 最近,混沌君旁听了混沌学园的创始人李善友教授的重磅全新课程《认知型创新:从OpenAI到DeepSeek》。 在这个AI技术日新月异的时代,我们眼看着它不仅在改变生活,更在改变着商业规则。 这门课从企业创新的角度,讲述了全球两大顶尖AI公司——OpenAI和DeepSeek是如何一步步走到今天的,他们究竟走过了怎样的创新之路。这对于我们 理解AI时代的技术创新和企业创新,提供了清晰且极具价值的路径。 教授深挖了OpenAI最初的愿景和它如何对抗巨头的思路,解密大语言模型是如何诞生的,尤其是AI能力如何从简单积累到惊人"涌现"。还解读了 DeepSeek如何在资源有限的情况下,走出一条"低成本高性能"的独特道路;更探讨了AI时代下,企业应该怎样构建一个能不断"涌现"创新的组织模式,走 向"技术领先"。 观看本课程和600+主题课 第一幕:OpenAI的横空出世 OpenAI的创立初心: ...
一文了解DeepSeek和OpenAI:企业家为什么需要认知型创新?
混沌学园· 2025-06-10 11:07
在这个 AI技术日新月异的时代,我们眼看着它不仅在改变生活,更在 改变着 商业规则。 AI可不只是那些冰冷的算法,它现在能像人一样思考、推理,甚至在某些方面 的表现超越了普通人 。这告诉我们,传统的技术和创新模式已经不够用了, 企业要想增长、要保持竞争力,就 需要 换个思路。 AI正在重新定义我们理解和实践商业创新的方式。 最近 , 混沌君旁听了 混沌学园的创始人李善友教授 的重磅 全新课程《认知型创新:从 OpenAI到DeepSeek》 。 这门课从企业创新的角度, 讲述了 全球两大顶尖 AI公司——OpenAI和DeepSeek是如何一步步走到今天的,他们究竟走过了怎样的创新之路。这对于我 们理解AI时代的技术创新和企业创新,提供了清晰且极具价值的路径。 教授 深挖 了 OpenAI最初的愿景和它如何对抗巨头的思路,解密大语言模型是如何诞生的 , 尤其是 AI能力如何从简单积累到惊人"涌现" 。 还 解读 了 DeepSeek如何在资源有限的情况下,走出一条"低成本高性能"的独特道路;更探讨 了 AI时代下,企业应该怎样构建一个能不断"涌现"创新的组织模式, 走向"技术领先"。 扫描下图二维码购买月卡 ...