DeepSeek
Search documents
The best open weight AI models are coming from China, says Corridor's Alex Stamos
CNBC Television· 2025-12-12 19:51
Alex Stamos is chief product officer at Corridor and former chief security officer at Facebook. Alex, it's great to see you. I don't know if you'd have any comment on whether Meta is training off of this Chinese model. >> Yeah, I don't have any specific insight.I'm not at Meta anymore. Uh what that's called is distillation. So when you're building a model, one of the ways you can train it is you could ask another model millions or billions of questions, take that output uh and then build your model to give ...
中国AI再现全球级爆款,算力、应用呈两端协同跃升态势
Xin Lang Cai Jing· 2025-12-12 14:13
12月10日,阿里巴巴宣布,千问APP公测23天,月活跃用户数已突破3000万。据悉,千问APP于11月17 日上限公测版,仅两天就成功冲入APPStore免费总榜前三,首周下载量破1000万,超越ChatGPT、 Sora、DeepSeek,刷新了"增长最快AI应用"的记录。 这与年初另一家中国AI大模型企业?DeepSeek的高光时刻,形成强烈呼应。1月27日,DeepSeek发布仅 一周的开源推理模型DeepSeek-R1,登顶苹果中国区及美国区应用商店免费榜,同样引发全球关注。 一是新加坡国家人工智能计划在其新的东南亚语言大模型项目中,放弃了Meta模型,转向Qwen的开源 架构;二是美国科技巨头Meta在研发代号为"牛油果"的全新AI模型时,采用了阿里巴巴开源的Qwen模 型进行蒸馏优化。 其他中国大模型方面,诸如硅谷大佬ChamathPalihapitiya宣布用Kimi取代OpenAI作为生产力工具,美国 Vercel、Windsurf等编程平台接入智谱模型,德国NovoAI、英国NetMind.AI与丹麦EmpatikAI弃用 ChatGPT而转投DeepSeek之类案例,同样层出不穷。 单个 ...
“千问速度”引爆科技圈,中国AI开启全球领先叙事
Quan Jing Wang· 2025-12-12 08:39
12月10日,阿里巴巴宣布,千问APP公测23天,月活跃用户数已突破3000万。据悉,千问APP于11月17 日上限公测版,仅两天就成功冲入APP Store免费总榜前三,首周下载量破1000万,超越ChatGPT、 Sora、DeepSeek,刷新了"增长最快AI应用"的记录。 这与年初另一家中国AI大模型企业DeepSeek的高光时刻,形成强烈呼应。1月27日,DeepSeek发布仅一 周的开源推理模型DeepSeek-R1,登顶苹果中国区及美国区应用商店免费榜,同样引发全球关注。 两者一在年初,一在年尾,既像巧合,又像隐喻,完美诠释了中国AI大模型在2025年整个年度的全新 叙事:不再是全球竞争中的模仿与跟随者,而是成功蜕变为行业领先者甚至是行业规则的制定者,与其 他国家和地区的领先大模型并肩而行,成为AI产业生态发展中不容忽视的重要力量。 知名AI投资人Nathan Benaich和伦敦AI专项基金Air Street Capital联合发布的2025年版《State of AI Report》中,"中国AI体系"首次被从"外围追赶者"提升为"平行竞争者",并获得了"它正在开源AI和商业 化部署方面设 ...
对谈刘知远、肖朝军:密度法则、RL 的 Scaling Law 与智能的分布式未来丨晚点播客
晚点LatePost· 2025-12-12 03:09
Core Insights - The article discusses the emergence of the "Density Law" in large models, which states that the capability density of models doubles every 3.5 months, emphasizing efficiency in achieving intelligence with fewer computational resources [4][11][19]. Group 1: Evolution of Large Models - The evolution of large models has been driven by the "Scaling Law," leading to significant leaps in capabilities, surpassing human levels in various tasks [8][12]. - The introduction of ChatGPT marked a steep increase in capability density, indicating a shift in the model performance landscape [7][10]. - The industry is witnessing a trend towards distributed intelligence, where individuals will have personal models that learn from their data, contrasting with the notion that only a few large models will dominate [10][36]. Group 2: Density Law and Efficiency - The Density Law aims to maximize intelligence per unit of computation, advocating for a focus on efficiency rather than merely scaling model size [19][35]. - Key methods to enhance model capability density include optimizing model architecture, improving data quality, and refining learning algorithms [19][23]. - The industry is exploring various architectural improvements, such as sparse attention mechanisms and mixed expert systems, to enhance efficiency [20][24]. Group 3: Future of AI and AGI - The future of AI is expected to involve self-learning models that can adapt and grow based on user interactions, leading to the development of personal AI assistants [10][35]. - The concept of "AI creating AI" is highlighted as a potential future direction, where models will be capable of self-improvement and collaboration [35][36]. - The timeline for achieving significant advancements in personal AI capabilities is projected around 2027, with expectations for models to operate efficiently on mobile devices [33][32].
AI 价值链-Google Gemini 3 Pro、Claude Opus 4.5、Grok 4.1 与 DeepSeek 3.2…… 谁才是真正的领导者?这意味着什么
2025-12-12 02:19
Summary of Key Points from the Conference Call Industry Overview - The conference call discusses the U.S. semiconductor and internet industries, focusing on the AI value chain and the competition among leading AI models: Google Gemini 3 Pro, Claude Opus 4.5, Grok 4.1, and DeepSeek 3.2 [1][2][3]. Core Insights and Arguments - **Model Performance Comparison**: - Gemini 3 Pro and Claude Opus 4.5 are viewed as closely matched, while skepticism surrounds DeepSeek's claim to leadership. All three models have published benchmarks that favor their performance, but third-party benchmarking is still ongoing [3][4][14]. - Early results indicate that Gemini and Claude are neck and neck, with Grok 4.1 outperforming GPT-5 [3][14]. - **Scaling Laws**: - The scaling laws for AI models remain intact, suggesting renewed confidence among AI labs and their investors to expand AI infrastructure. Continued access to superior compute resources and unique data is essential for scaling [4][15]. - **OpenAI's Challenges**: - OpenAI is reportedly lagging behind its competitors, facing issues such as disappointing GPT-5 performance, failed pre-training runs, and significant talent departures. This situation raises concerns about its future leadership in the AI space [6][18][19]. - **Compute Infrastructure**: - The competition between GPUs and TPUs is highlighted, with concerns about Nvidia's market position. The defining theme is compute scarcity, which benefits both GPU and ASIC technologies [7][20][22]. - **Market Dynamics**: - There is a potential shift from model benchmarking to product adoption and monetization, as evidenced by Gemini's inability to displace ChatGPT despite superior performance [8][21]. Important but Overlooked Content - **DeepSeek's Position**: - DeepSeek's ability to quickly follow leading models raises concerns about the sustainability of frontier model economics if model improvement slows down. However, current model improvements are still strong [5][17]. - **Investment Implications**: - Nvidia (NVDA) is rated as outperforming with a target price of $275, citing a significant datacenter opportunity. Broadcom (AVGO) is also rated outperforming with a target price of $400, driven by a strong AI trajectory. AMD (AMD) is rated market perform with a target price of $200, contingent on OpenAI's success [10][11][12]. - **Consumer Behavior**: - OpenAI's large user base, with 800 million monthly active users, may provide a competitive moat despite its current challenges. The sticky nature of consumer behavior in technology could offer OpenAI some breathing room [18][19]. - **Future Monitoring**: - Investors are advised to closely monitor developments in the AI space, particularly regarding OpenAI's performance and the broader implications for the semiconductor and AI infrastructure markets [19][21]. This summary encapsulates the key points discussed in the conference call, providing insights into the competitive landscape of AI models, the challenges faced by leading companies, and the implications for investors in the semiconductor and AI sectors.
“连姥姥都问我,你知道DeepSeek吗?”
第一财经· 2025-12-12 01:11
Core Viewpoint - The emergence of DeepSeek has significantly impacted MiniMax and other large model companies, prompting introspection on their performance and strategic choices [5][6]. Group 1: Challenges and Reflections - MiniMax's founder, Yan Junjie, faced numerous challenges during the startup phase, including the bankruptcy of Silicon Valley Bank, which affected payroll [3]. - The team recognized that their performance was hindered by a lack of deep thinking and lowered expectations, contrasting with DeepSeek's unique insights and technical accumulation [6][8]. Group 2: Team Morale and Incentives - To boost team morale during tough times, Yan emphasized the importance of encouragement and financial incentives, stating that monetary rewards are effective [7]. - In September, MiniMax initiated a million-dollar stock option incentive program, offering varying amounts based on employee contributions, covering various roles within the company [7]. Group 3: Strategic Direction - MiniMax's approach involves a unique strategy of ToC (Technology of Communication) and international expansion, with their Talkie application gaining significant user traction overseas [8]. - The company experienced a period of indecision regarding whether to prioritize technology or product development, ultimately deciding on a technology-driven approach despite the associated risks [8][9]. Group 4: Market Position and Talent - The gap between domestic large model companies and top international models is narrowing, with Chinese companies achieving this with significantly lower investment [12]. - Yan highlighted the importance of local AI talent, noting that many key contributors to success in companies like DeepSeek and MiniMax are homegrown, often in their first jobs [12]. Group 5: Future Outlook - Yan remains optimistic about the future of AGI, noting that the number of companies in the large model space is decreasing, leading to a more concentrated market [13]. - The AI industry is not merely an extension of the internet; the core product in the large model era is the model itself, with blurred boundaries between roles in product management, development, and algorithms [14].
Interview: brace for volatility as AI reshapes markets in 2026, says Erlen Capital's Schneller
Invezz· 2025-12-11 13:30
2025 delivered no shortage of drama across the global economic, business, and financial landscape. From DeepSeek's shockwaves to tariff battles and the relentless march of AI reshaping boardrooms and stock charts, the year had it all. ...
X @Polyhedra
Polyhedra· 2025-12-11 13:00
Why zkML? Because DeepSeek just released two V3.2 models that deliver frontier-class performance in reasoning, mathematics, and coding — while dramatically lowering the compute required to achieve it.Frontier intelligence is no longer confined to hyperscaler-grade infrastructure. ...
连姥姥都在问DeepSeek!一位AI六小龙掌门的反思与进击
Di Yi Cai Jing· 2025-12-11 12:18
明年大模型公司或许会更少,中国做AI最大的优势是人才。 AI六小龙之一的MiniMax创始人闫俊杰在创业初期遇到过很多挑战,其中包括硅谷银行破产,"所有的 钱都在那个银行里,那个时候已经没法发工资了。" 但或许更大的挑战来自DeepSeek,连他的姥姥都在问"你知道 DeepSeek吗?" 他在近日做客罗永浩的播客节目中提到,团队反思过"为什么没有做得那么好"。很多问题是自己作为掌 舵者的认知不够,行业里有很多像梁文锋这样厉害的人,他也提到和梁文锋在DeepSeek 成立前就认 识,每年会交流几次。 "连姥姥都问我,你知道DeepSeek吗?" DeepSeek的出现给MiniMax在内的大模型公司带来了很多变化。 闫俊杰提到,今年春节回老家时,发现所有人都在关注DeepSeek,"就连我姥姥都问我,你知道 DeepSeek吗?" 对于是否坚持技术驱动,模型还是产品优先,闫俊杰表示,MiniMax经历过大约半年时间的迷茫和纠 结。摇摆的点在于,他们认为,MiniMax不应该靠纯烧钱来增长,即便是为了AGI,也必须得用可商业 化的方式来实现。因为AGI需要足够长的时间,成本也会比传统互联网公司要高得多。 在访谈 ...
AI巨头制定AI“宪法”:捐赠核心技术,推动“智能体联合国”标准化
3 6 Ke· 2025-12-11 10:05
Group 1 - The core idea of the news is the establishment of the AI Agent Foundation (AAIF) by OpenAI, Anthropic, and Block to promote interoperability and open standards in the AI agent ecosystem [2][3] - The foundation aims to provide neutral management and infrastructure for AI agents, facilitating their transition from experimental stages to real-world applications [3][4] - The collaboration reflects a strategic shift among Silicon Valley giants, recognizing that open standards are more beneficial for long-term interests than closed competition in the commercialization of AI agents [3][5] Group 2 - The establishment of AAIF addresses two major industry pain points: interoperability issues and the risk of monopolistic practices in the AI agent ecosystem [4][5] - The three founding companies have donated their core technologies to ensure the foundation's neutrality, including Anthropic's MCP protocol, OpenAI's AGENTS.md, and Block's Goose framework [6][7] - These contributions aim to reduce redundant labor in building connectors, enhance consistency in agent behavior across systems, and facilitate easier deployment of agent systems in a secure environment [7] Group 3 - OpenAI and Anthropic, despite being fierce competitors in the large language model space, are collaborating to ensure an open and expansive market for AI agents [8] - The strategic interest in preventing market fragmentation or monopolization is crucial for accelerating the commercialization of AI technologies [8] - The trend towards open-source solutions is being recognized as a significant advantage, with companies like OpenAI increasing their open-source efforts to attract global developers and expand their ecosystems [8][9] Group 4 - The grand vision of AAIF is to create a modular, composable, and auditable AI agent ecosystem, akin to the internet, rather than isolated applications [9] - By leveraging the donated technologies, AAIF aims to accelerate innovation and keep the doors of the AI agent ecosystem open [9]