Gemini3 Pro
DeepSeek Releases Two Models: One a "Taciturn Assistant," the Other a "Lopsided Genius"
Ke Ji Ri Bao· 2025-12-08 10:03
Core Insights
- DeepSeek has released two new models, DeepSeek-V3.2 and DeepSeek-V3.2-Speciale, which have drawn attention for how they perform against leading models such as OpenAI's GPT-5 and Google's Gemini 3 Pro [1][2]

Model Features
- DeepSeek-V3.2 is designed as a high-efficiency assistant with strong reasoning and agent capabilities, aimed at automating complex tasks such as report generation and coding [2]
- DeepSeek-V3.2-Speciale focuses on high-difficulty mathematical problems and academic research, pushing the limits of open-source model reasoning [2]

Technological Innovations
- The new models incorporate two significant breakthroughs: DeepSeek Sparse Attention (DSA) and Thinking Tool Invocation technology [2]
- DSA improves efficiency by letting the model attend only to the most relevant information, reducing resource consumption [2]
- Thinking Tool Invocation enables multi-round reasoning combined with tool use, so the model can think, execute, and iterate on tasks autonomously [2]

Market Positioning
- The release aims to narrow the performance gap between open-source and closed-source models, giving open-source development a competitive edge [3][4]
- DeepSeek's focus on practicality and generalization is intended to put pressure on closed-source vendors, turning the aspiration of catching up into actual competition [4]

Community Engagement
- DeepSeek has updated its official web platform, app, and API to the new version, while the Speciale version is currently available only as a temporary API for community evaluation [4]
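The think, execute, and iterate cycle attributed to Thinking Tool Invocation can be pictured as a simple loop. The sketch below is only an illustration of that general pattern, not DeepSeek's implementation or API: `TOOLS`, `scripted_model`, and `run_agent` are hypothetical names, and the "model" is a hard-coded stand-in that requests one tool call and then answers.

```python
from typing import Callable, Dict, List, Tuple

# Hypothetical tool registry: each tool maps a string argument to a string result.
TOOLS: Dict[str, Callable[[str], str]] = {
    "add": lambda arg: str(sum(int(x) for x in arg.split("+"))),
    "upper": lambda arg: arg.upper(),
}


def scripted_model(history: List[str]) -> Tuple[str, str]:
    """Stand-in for a real model (an assumption for illustration).

    Returns ("call", "tool:arg") to request a tool, or ("final", answer).
    """
    results = [h for h in history if h.startswith("result:")]
    if not results:
        # "Think": no tool output yet, so ask for the addition tool.
        return ("call", "add:2+3")
    # A tool result is available, so produce the final answer from it.
    return ("final", "The sum is " + results[-1][len("result:"):])


def run_agent(task: str,
              model: Callable[[List[str]], Tuple[str, str]],
              max_rounds: int = 5) -> Tuple[str, List[str]]:
    """Alternate between model decisions and tool execution until done."""
    history = ["task:" + task]
    for _ in range(max_rounds):
        kind, payload = model(history)
        if kind == "final":
            return payload, history
        name, _, arg = payload.partition(":")
        history.append("call:" + payload)
        # "Execute": run the requested tool and feed the result back.
        history.append("result:" + TOOLS[name](arg))
    return "no answer within round limit", history


answer, trace = run_agent("What is 2+3?", scripted_model)
print(answer)
print(trace)
```

The `max_rounds` cap is a design choice in this sketch: without it, a model that never returns a final answer would loop indefinitely.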
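DeepSeek Launches New Models Again! Taking On Google Head-On While Admitting the Gap Between Open Source and Closed Source Is Widening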
Di Yi Cai Jing· 2025-12-01 23:13
Core Insights
- DeepSeek has launched two new models, DeepSeek-V3.2 and DeepSeek-V3.2-Speciale, positioned to compete with leading proprietary models such as GPT-5 and Gemini 3 Pro and showing significant advances in reasoning capability [1][4].

Model Overview
- DeepSeek-V3.2 aims to balance reasoning ability and output length, making it suitable for everyday applications such as Q&A and general-purpose agent tasks. It performs on par with GPT-5 and slightly below Google's Gemini 3 Pro in public reasoning tests [4].
- DeepSeek-V3.2-Speciale is designed to push the limits of reasoning, combining enhanced long-thinking with the theorem-proving abilities of DeepSeek-Math-V2. It has surpassed Gemini 3 Pro on several reasoning benchmarks, including prestigious math competitions [4][5].

Benchmark Performance
- DeepSeek's models post competitive results across benchmarks:
  - AIME 2025: DeepSeek-V3.2 scored 93.1, while GPT-5 and Gemini 3 Pro scored 94.6 and 95.0 respectively [5].
  - Harvard-MIT Mathematics Tournament (HMMT): DeepSeek-V3.2 scored 92.5, below Gemini 3 Pro's 97.5 [5].
  - International Mathematical Olympiad: DeepSeek-V3.2 scored 78.3, close to Gemini 3 Pro's 83.3 [5].

Limitations and Future Plans
- Despite these achievements, DeepSeek acknowledges limitations compared with proprietary models, including narrower world knowledge and lower token efficiency. The team plans to scale up pre-training and optimize reasoning chains to improve model performance [6][7].
- DeepSeek has identified three key areas where open-source models lag behind proprietary ones: reliance on standard attention mechanisms, insufficient computational resources during post-training, and gaps in generalization and instruction-following capabilities [7].

Technological Innovations
- DeepSeek has introduced a sparse attention mechanism (DSA) to reduce computational complexity without sacrificing long-context performance. This innovation has been integrated into the new models, contributing to significant performance improvements [7].

Availability
- The official website, app, and API have been updated to DeepSeek-V3.2, while the enhanced Speciale version is currently available only through a temporary API for community evaluation [8].

Community Reception
- The release has been well received on social media, with users noting that DeepSeek's models have effectively matched the capabilities of GPT-5 and Gemini 3 Pro and that this highlights the importance of rigorous engineering design over sheer parameter count [9].
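The sparse attention idea described under Technological Innovations, attending only to the most relevant positions, can be illustrated with a generic top-k attention sketch. This is an assumption-laden toy and not DeepSeek's DSA: the function name `topk_sparse_attention` and the choice of plain dot-product scores for selection are illustrative only. Because it still scores every key before selecting, it shows the selection step rather than the actual compute savings, which would require a cheaper way to pick candidates.

```python
import math
from typing import List, Tuple


def softmax(xs: List[float]) -> List[float]:
    """Numerically stable softmax over a list of scores."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]


def dot(a: List[float], b: List[float]) -> float:
    return sum(x * y for x, y in zip(a, b))


def topk_sparse_attention(query: List[float],
                          keys: List[List[float]],
                          values: List[List[float]],
                          k: int) -> Tuple[List[float], List[int]]:
    """Attend only to the k keys that score highest against the query.

    Returns the attention output and the sorted indices that were kept.
    """
    scale = math.sqrt(len(query))
    scores = [dot(query, key) / scale for key in keys]
    # Keep the indices of the k highest-scoring keys; drop everything else.
    top = sorted(range(len(keys)), key=lambda i: scores[i], reverse=True)[:k]
    # Softmax is taken over the selected scores only.
    weights = softmax([scores[i] for i in top])
    out = [0.0] * len(values[0])
    for w, i in zip(weights, top):
        for d, v in enumerate(values[i]):
            out[d] += w * v
    return out, sorted(top)


query = [1.0, 0.0]
keys = [[1.0, 0.0], [0.0, 1.0], [-1.0, 0.0], [2.0, 0.0]]
values = [[1.0, 0.0], [0.0, 1.0], [5.0, 5.0], [0.0, 2.0]]
output, kept = topk_sparse_attention(query, keys, values, k=2)
print(kept, output)
```

Setting `k` equal to the number of keys reduces this to ordinary dense softmax attention for a single query, which makes the sparse variant easy to compare against a dense baseline.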
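DeepSeek Launches New Models Again! Taking On Google Head-On
Di Yi Cai Jing · 2025-12-01 14:05
By Liu Xiaojie, Di Yi Cai Jing

On the evening of December 1, DeepSeek launched two new models, DeepSeek-V3.2 and DeepSeek-V3.2-Speciale, which rank among the global leaders in reasoning capability.

The two models are positioned differently. DeepSeek-V3.2 aims to balance reasoning ability against output length, making it suitable for everyday use such as Q&A and general-purpose agent tasks. DeepSeek released the experimental V3.2-Exp at the end of September; this update is the official release. In public reasoning tests, V3.2 reached the level of GPT-5, only slightly below Google's Gemini 3 Pro.

According to figures published by DeepSeek, Speciale surpasses Google's most advanced model, Gemini 3 Pro, on several reasoning benchmarks. Specifically, V3.2-Speciale beat Gemini 3 Pro on tests including the American Invitational Mathematics Examination, the Harvard-MIT Mathematics Tournament, and the International Mathematical Olympiad, but trailed Google slightly on coding and on the PhD-level science test.

DeepSeek-V3.2-Speciale is the centerpiece of this release. Its goal is "to push the reasoning ability of open-source models to the limit and explore the boundaries of model capability." According to DeepSeek, Speciale is a long-thinking enhanced version of V3.2 that also incorporates DeepSee ...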
DeepSeek Launches New Models Again! Taking On Google Head-On, Admitting the Gap Between Open Source and Closed Source Is Widening
Di Yi Cai Jing· 2025-12-01 13:31
Core Insights
- DeepSeek has launched two new models, DeepSeek-V3.2 and DeepSeek-V3.2-Speciale, which rank among the global leaders in reasoning capability [1][3].

Model Overview
- DeepSeek-V3.2 aims to balance reasoning ability and output length, making it suitable for everyday use such as Q&A and general-purpose agent tasks. It has reached the level of GPT-5 in public reasoning tests, slightly below Google's Gemini 3 Pro [3].
- DeepSeek-V3.2-Speciale is designed to push the reasoning capabilities of open-source models to the extreme. It incorporates the theorem-proving abilities of DeepSeek-Math-V2 and excels at instruction following and logical verification [3][4].

Performance Metrics
- Speciale has surpassed Google's Gemini 3 Pro on several reasoning benchmarks, including the American Invitational Mathematics Examination, the Harvard-MIT Mathematics Tournament, and the International Mathematical Olympiad [4].
- Across benchmarks DeepSeek's performance is competitive, with specific scores given in a table comparing it against GPT-5 and Gemini 3 Pro [5].

Technical Limitations
- Despite these achievements, DeepSeek acknowledges limitations compared with proprietary models such as Gemini 3 Pro, particularly in breadth of knowledge and token efficiency [6].
- The company plans to increase pre-training compute and optimize reasoning chains to improve model efficiency and capabilities [6][7].

Mechanism Innovations
- DeepSeek introduced a sparse attention mechanism (DSA) to reduce computational complexity, which has proven effective at improving performance without sacrificing long-context capabilities [7][8].
- Both new models incorporate this mechanism, making DeepSeek-V3.2 a cost-effective alternative that narrows the performance gap with proprietary models [8].

Community Reception
- The release has been well received by the community, with users noting that DeepSeek's models are now comparable to GPT-5 and Gemini 3 Pro, a significant achievement for open-source model development [8].
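Gemini 3 Pro Hands-On: Liberal Arts Majors Really Can Build Their Own Web Pages Now
Huxiu APP · 2025-11-27 23:58
This article comes from the WeChat official account 刺猬公社 (Ciwei Gongshe), which covers and researches the internet content industry. Author: 刺猬公社 editorial team. Header image: AI-generated.

After Gemini 3 Pro was released, the market was awash with cheers. What drew the most attention was its "Vibe Coding" ability: by describing what they want in natural language, users can direct the AI workhorse to write their code for them.

But over the past two years the AI world has had far too many "mind-blowing" moments, perhaps second in number only to the film industry, and after too many cries of wolf each new one looks suspect. Compared with Vibe Coding products already on the market such as Cursor and Trae, what is actually new in Gemini 3 Pro?

刺猬公社 decided to send a liberal arts major as its representative, to test what an ordinary user with no coding background at all can build with Gemini 3 Pro.

Test 1: Generating a workhorse clock

The workhorse clock was the simplest request in this test. On the interaction side, the user only needs to choose a planned retirement date, a clock-in time, and a clock-out time; everything else is calculation and front-end display. Without any debugging, the final result was almost flawless.

The only point worth discussing is that the battery-drain progress bar in the lower-right corner of the page is green rather than gray. But since the user gets to clock out once the battery has drained to 100%, a state that really is happier than being at work, it is hard to say whether this is a bug or a feature.

While doing this ...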
AI Model Upgrades Heat Up Expectations for Edge Hardware; Consumer Electronics ETF (159732.SZ) Rises 2.02%, Huanxu Electronics Up 5.58%
Mei Ri Jing Ji Xin Wen· 2025-11-27 03:09
Group 1
- All three major A-share indices rose, with the Shanghai Composite Index up 0.61%. Electronics, communications, and power equipment led the gains, while the comprehensive and transportation sectors declined [1]
- The Consumer Electronics ETF (159732.SZ) rose 2.02%, with significant gains in component stocks such as Huanxu Electronics (up 5.58%), Zhaoyi Innovation (up 5.37%), and XWANDA (up 5.07%) [1]

Group 2
- Google launched its latest AI model, Gemini 3 Pro, which features native multimodal capabilities and strong reasoning, scoring 81% on the MMMU-Pro benchmark and 87.6% on the Video-MMMU benchmark [3]
- Gemini 3 achieved a record score of 1501 on the LMArena leaderboard and 91.9% on GPQA Diamond, indicating advanced reasoning capabilities [3]
- According to Guosen Securities, continued AI model upgrades and the maturing of AI agents will improve user experience as edge hardware connects more closely with cloud models, suggesting a potential new surge in the AI sector [3]
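Flooding Social Feeds! Praised by Overseas Tech Media! Lingguang Tops 2 Million Downloads in 6 Days, Unlocking a New Way to Bring AI to the Masses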
Bei Jing Shang Bao· 2025-11-26 14:18
Core Insights
- Ant Group's Lingguang AI application achieved over 1 million downloads in just 4 days, surpassing the record set by Sora2 and demonstrating the rapid growth of Chinese AI applications on a global scale [1][9]
- Lingguang's distinctive features, such as "Lingguang Flash Applications," let users create applications from natural language prompts, significantly lowering the barrier for non-programmers to engage with AI technology [5][8]

Group 1: Performance and Popularity
- Lingguang reached 1 million downloads in 4 days and 2 million downloads in 6 days, outperforming major global AI applications like ChatGPT and DeepSeek [1][9]
- The application topped the App Store's free tools category in China and ranked sixth overall, marking a significant shift in a competitive landscape previously dominated by ByteDance [5][9]

Group 2: Features and User Engagement
- Lingguang lets users create applications through simple text prompts, enabling functions like homework assistance and meal planning without any coding knowledge [6][8]
- The application has sparked a wave of user-generated content, with individuals sharing their custom applications on social media, indicating a growing trend of "DIY applications" among the general public [7][8]

Group 3: Market Impact and Future Directions
- Lingguang's rapid success reflects a broader shift in the AI application market, with Ant Group positioning itself as a strong competitor alongside Alibaba's other AI initiatives [9]
- Ant Group's CTO emphasized the commitment to enhancing user experience and meeting personalized needs, indicating ongoing development and innovation in the AI space [9]
Overseas Tech Media: AI Assistant "Lingguang" Makes Tackling Hard Problems "as Easy as a Gentle Breeze"
Huan Qiu Wang· 2025-11-26 10:11
Core Insights
- Ant Group launched its multimodal AI assistant "Lingguang" on November 18. It quickly gained attention at home and abroad, with Tech Times describing it as making problem-solving feel effortless [1][3]
- Within six days of release, Lingguang passed 2 million downloads, indicating strong market interest and adoption [3]
- Lingguang has been compared with Google's Gemini 3 Pro, with the two products offering similar functionality, and improvements are expected in future iterations of Lingguang [3]

Product Features
- Lingguang integrates language, images, sound, and data to produce engaging outputs such as 3D models, animations, charts, interactive maps, and flash applications [3]
- Its core functionality is breaking tasks down into manageable subtasks and switching flexibly between modes, presenting clear and logical results [3]
- Ant Group's CTO described Lingguang as akin to having a personal AI developer in one's pocket, simplifying coding and visualization [3]

Industry Context
- Before Lingguang's launch, other Chinese AI models such as Qianwen and DeepSeek had already outperformed Western counterparts in an AI investment competition, showcasing China's rapid advances in AI innovation [4]
- Lingguang has drawn attention from mainstream media, with Business Insider highlighting its "vibe coding" capabilities and IT Boltwise noting its potential to fundamentally change application development [4]
- Professional users on overseas social media praised Lingguang's multimodal delivery capabilities, indicating a shift in multimodal technology from experimental to practical use [6]
Overseas Tech Outlet Tech Times: AI Assistant "Lingguang" Makes Tackling Hard Problems "as Easy as a Gentle Breeze"
Qi Lu Wan Bao· 2025-11-26 08:20
Core Insights
- Ant Group's multimodal AI assistant "Lingguang" launched on November 18 and quickly gained attention at home and abroad, with Tech Times highlighting how easy it makes problem-solving [1][3]
- Within six days of release, Lingguang passed 2 million downloads, indicating strong market interest and adoption [3]
- Lingguang has been compared with Google's Gemini 3 Pro, showing similar functionality, including the ability to integrate language, images, sound, and data into engaging outputs [3]

Group 1
- Lingguang's core functionality is breaking tasks down into manageable components and switching between modes to present clear and logical results [3]
- Ant Group's CTO described Lingguang as akin to having a personal AI developer in one's pocket, helping with coding, visualization, and simplifying complex issues [3]
- The rapid iteration speed of technology companies like Ant Group suggests that improvements to Lingguang will follow [3]

Group 2
- Before Lingguang's launch, other Chinese AI models had already outperformed Western counterparts in an AI investment competition, indicating China's competitive edge in AI innovation [4]
- Lingguang has attracted attention from major international media, with Business Insider noting its "vibe coding" and application development capabilities [4]
- IT Boltwise emphasized that Lingguang's application development features represent a significant advance in AI, potentially transforming how applications are created [4]

Group 3
- On overseas social media, Lingguang's multimodal delivery capabilities received positive feedback from professional users, with comments highlighting its impressive integration of search, reasoning, and creativity [6]
- Users noted that the code-driven outputs meet developers' needs, suggesting that multimodal AI is becoming more engaging and practical [6]