腾讯研究院
Search documents
2025年AI治理报告:回归现实主义
腾讯研究院· 2026-01-22 08:44
宏观格局: 发展优先,安全"软着陆" 2025年2月的巴黎"人工智能行动峰会"是一个标志性时刻,与两年前布莱切利峰会笼罩的"安全焦虑"不 同,巴黎峰会的关键词悄然变更为"创新"与"行动",这一变化折射出全球治理的底层逻辑重构。在这种 背景下,全球监管竞速出现了"逆转",过去被视为"监管高地"的区域开始主动寻求松绑。 欧盟的自我修正 。随着《AI法案》进入实施期,复杂的合规成本开始显现,为了挽救产业竞争力,欧 盟在2025年不得不推出"数字综合提案 (Digit al O mnibus) ",推迟高风险义务生效时间并试图简化规 则,这表明即便是最坚定的监管者也必须在发展现实面前低头。 美国的"去监管化" 。特朗普政府展现了鲜明的"美国优先"色彩,撤销了前任政府侧重安全的行政令, 转而通过《确保国家人工智能政策框架》限制各州分散立法,试图以统一的联邦规则为产业扫清障碍。 如果说前两年全球对AI的态度还夹杂着"末日恐惧",那么2025年,风向已彻底改变。全球AI治理正在经 历一场深刻的"去理想化"进程。面对技术与产业的双重压力,各主要经济体不约而同地调整了身位:治 理的重心从"防范假设性的末日风险",迅速转移到了" ...
腾讯研究院AI速递 20260122
腾讯研究院· 2026-01-21 16:01
Group 1 - DeepSeek's Model 1 has been discovered in the FlashMLA codebase, potentially indicating an upcoming release, featuring a 512-dimensional architecture and support for NVIDIA's Blackwell architecture [1] - Liquid AI has launched the open-source inference model LFM2.5-1.2B-Thinking, which operates on a liquid neural network architecture and requires only 900MB of memory on mobile devices, achieving a score of 88 on MATH-500 [2] - The xAI engineer revealed that AI is being tested as a "colleague" in the MacroHard project, achieving human speeds eight times faster, and the company is considering utilizing idle computing power from approximately 4 million Tesla vehicles in North America [3] Group 2 - Research indicates that models like DeepSeek-R1 can spontaneously form multi-role debate mechanisms, significantly improving accuracy through internal social dialogue [4][5] - Medical SAM3, a new model developed by the University of Central Florida, allows for expert-level segmentation in medical imaging using only text prompts, achieving an average accuracy increase from 11.9% to 73.9% across 33 datasets [6] - Anthropic's CEO predicts that AI will fully take over software engineering roles within 6-12 months, with a significant portion of entry-level jobs expected to disappear in the next 1-5 years [7] Group 3 - The Sequoia xbench team reported that top agents can handle over 60% of 104 daily tasks, indicating that foundational agent capabilities have become commoditized [8] - OpenAI's CFO discussed the maturation of multi-agent systems by 2026, emphasizing that AI bubbles should be measured by API call volumes rather than stock prices, with productivity increases of 27-33% for cutting-edge companies [9]
AI健康助手,正风起云涌
腾讯研究院· 2026-01-21 08:44
Group 1 - The article discusses the global trend of conversational AI health assistants, highlighting their emergence as a new focus in the AI + healthcare sector, with significant investments from major internet companies in China and abroad [5][10][11] - OpenAI and Anthropic have launched health-related products and features, with OpenAI's ChatGPT being utilized by over 400 million users for medical inquiries, indicating a strong demand for AI in healthcare [4][13][14] - The article emphasizes the rapid development expected in the conversational AI health assistant market by 2026, driven by changing user habits, technological advancements, and regulatory support [5][21][25] Group 2 - The article identifies key drivers behind the rise of AI health assistants, including changes in user interaction habits, increased AI capabilities, and the competitive landscape in healthcare [22][23][26] - It notes that while the AI health assistant market is growing, challenges such as technological limitations, unclear business models, and regulatory risks remain [29][30][32] - The article suggests that the healthcare sector is a promising area for AI commercialization due to its essential nature and the potential for long-term service revenue [27][28][34] Group 3 - The article outlines strategies for the healthy development of AI health assistants, including increasing data openness and innovation support, as well as promoting industry self-regulation [37][40] - It emphasizes the importance of high-quality healthcare data for training AI models and the need for collaboration between healthcare providers and AI developers [38][39] - The article calls for a clear regulatory framework to ensure the safe and effective deployment of AI health assistants, highlighting the need for industry standards and guidelines [43][44][46] Group 4 - The article concludes with a vision for the future, suggesting that health applications have the potential to become super apps, meeting essential user needs and integrating with various aspects of life [49][50] - It posits that the AI health assistant could become a necessary tool in the AI era, reflecting a shift in user expectations towards technology that actively supports health management [51]
腾讯研究院AI速递 20260121
腾讯研究院· 2026-01-20 16:03
Group 1 - Musk has fulfilled his promise by open-sourcing the new recommendation algorithm for the X platform, which is 100% AI-driven and removes manual features and rules [1] - The algorithm utilizes Thunder and Phoenix engines to construct information streams, predicting 15 types of user behaviors with weighted scoring, where the weight of replying to authors' comments is 75 times that of likes [1] - Negative feedback such as blocking and reporting significantly reduces visibility, while time spent and genuine interactions are core metrics, allowing even small accounts to gain exposure and diminishing the advantage of having a large follower base [1] Group 2 - Zhipu AI has open-sourced the lightweight model GLM-4.7-Flash, which has 30 billion total parameters but only 3 billion activated, aimed at "local programming and intelligent assistants," with free API access [2] - This model is the first to adopt the MLA architecture from DeepSeek, supporting a context window of 200K and scoring 59.2 in the SWE-bench code repair test [2] - Local deployment tests show that it can run at 43 tokens per second on Apple's M5 chip and is compatible with HuggingFace, vLLM, and Huawei's Ascend NPU [2] Group 3 - MiniMax has unveiled Agent 2.0, defined as an "AI-native workspace," which offers a desktop application for seamless local and cloud connectivity, allowing operations on local files and initiating web automation tasks [3] - The Expert Agents feature encapsulates private knowledge and industry SOPs to create vertical domain expert avatars, enhancing general expertise scores from 70 to as high as 100 [3] - Users can customize Expert Agents, achieving a closed-loop capability from research to delivery, with desktop versions available for both Windows and Mac [3] Group 4 - Jieyue Xingchen has open-sourced the multimodal small model Step3-VL-10B, which, with only 10 billion parameters, competes with and even surpasses models like GLM-4.6V (106 billion) and Qwen3-VL (235 billion) in various evaluations [4] - The model possesses exceptional visual perception, deep logical reasoning, and interactive capabilities with edge agents, achieving top-tier performance in the AIME math competition [4] - It employs 1.2 trillion data for full parameter joint pre-training, over 1400 reinforcement learning iterations, and an innovative PaCoRe parallel coordination reasoning mechanism, with both Base and Thinking versions open-sourced [4] Group 5 - "Moon's Dark Side" is undergoing a new round of financing, with a valuation of $4.8 billion, an increase of $500 million from the previously announced $4.3 billion valuation just 20 days ago, with financing expected to complete soon [5] - The company currently holds over 10 billion yuan in cash and is not in a hurry to go public, planning to time its IPO as a means to accelerate AGI development [5] Group 6 - Superparameter Technology has launched the game agent COTA, which is entirely driven by a large model, achieving professional-level performance in FPS games with a visible reasoning chain [6] - It uses a "dual-system hierarchical architecture" to simulate human fast and slow thinking, with the Commander responsible for strategic decisions and the Operator executing operations in milliseconds, reducing response time to 100 ms [6] - This product validates the feasibility of large models in high-frequency competitive gaming scenarios, providing reference ideas for embodied intelligence and other real-world issues [6] Group 7 - Microsoft CEO Satya Nadella stated at the Davos Forum that mastering model orchestration capabilities is essential for establishing a competitive edge in the AI era [7] - The proliferation of AI requires enhancing "token efficiency per dollar per watt" from the supply side, while the demand side necessitates companies to drive transformation across "concepts, capabilities, and data" [7] - True "enterprise sovereignty" involves converting unique experiences and knowledge into proprietary AI models to prevent core value from flowing to model providers [7] Group 8 - a16z's analysis indicates that while ChatGPT maintains a dominant position with 800-900 million weekly active users, Gemini is growing at 155%, indicating a "winner-takes-most" market in AI assistants [8] - OpenAI's new experiences pushed through the ChatGPT interface for shopping, tasks, and learning have not truly broken through, limited by the existing chatbox interface's inability to provide a top-tier product experience [8] - Successful AI products like Replit, Suno, and Character AI share a common trait of having a distinct and focused interface, suggesting that startup opportunities lie in deep optimization for specific workflows [8] Group 9 - Anthropic's research team has discovered that model personalities can be quantified, with a dominant dimension called the "assistance axis" measuring the extent to which models operate in "intelligent assistant" mode [9] - Interventions along the assistance axis can control role-playing willingness, significantly reducing harmful response rates and defending against personality jailbreak attacks [9] - The proposed "activation ceiling" technique can lower the success rate of personality jailbreaks by nearly 60% without significantly impairing model performance, opening new pathways for human control over AI [9]
超越“第四次工业革命”:关于人工智能与人类主体性的再思考
腾讯研究院· 2026-01-20 09:53
王鹏 腾讯研究院资深专家 在当下的科技舆论场中,当我们在谈论人工智能时,最不假思索的叙事框架无疑是"第四次工业革命"。 这确实是一个充满诱惑力的线性类比:蒸汽机是对肌肉的解放,电力是对能源的解放,而 AI 则是对智能的 解放。在这种叙事里,历史是一条不断上升的直线,而我们正站在生产力曲线最陡峭的拐点上。 然而,随着大模型能力的涌现与社会震荡的加剧,我们发现,仅用工业革命的逻辑来解释当下,虽然在生 产力维度是正确的,但在 认识论维度 上却是匮乏的。 工业革命的底色是工具理性。无论是瓦特的蒸汽机还是现代的流水线,它们追求的是效率、规模、标准化 以及对物理世界的征服。它们主要解决的是"怎么做" (H o w) 的问题。 但生成式 AI 不同。当机器开始以一种令人不安的逼真度进行对话、推理、创作时,它冲击的不再单纯是生 产力的边界,而是认知、创造与存在的本质。它触碰的不是人类的手脚,而是大脑皮层中最敏感的区域。 如果我们愿意拉长历史的焦距,透过五百年的迷雾回望,你会发现:此刻硅谷发生的一切,不仅呼应了 18 世纪的工业变革,更与 14 至 16 世纪那场发生在佛罗伦萨的思想巨变——文艺复兴,存在着惊人的、深层 的 拓 ...
【全球招募】用AI唤醒千年文明!探元计划NextGen数智活化赛道:五大文化场景等您“揭榜挂帅”
腾讯研究院· 2026-01-20 09:53
Core Viewpoint - The article emphasizes the integration of advanced technologies like AI to revitalize cultural heritage and enhance public engagement with historical narratives and experiences [2][56]. Group 1: Cultural Revitalization through Technology - The initiative aims to create immersive experiences that allow users to interact with cultural heritage, such as AI-generated historical narratives and personalized experiences [2][5]. - The "NextGen" plan by Tencent focuses on leveraging cutting-edge technologies to address the challenges of cultural heritage revitalization, aiming to create new forms of expression and engagement [5][56]. Group 2: Specific Topics and Challenges - The program identifies three main topics for innovation: 1. Development of multi-modal intelligent agents for cultural content generation [5]. 2. Creation of immersive interactive experiences that combine sensory data and emotional computing [6]. 3. Human-machine collaboration for the transmission and development of traditional crafts through digital means [7]. Group 3: Specific Cultural Scenarios - Five specific cultural scenarios have been outlined for technological application: 1. "Cloud Residence Intelligent Companion" for enhancing public understanding of historical texts [8][9]. 2. "Hangzhou West Lake Experience" focusing on personalized immersive tourism experiences [15][16]. 3. "Dawenkou Culture Interactive Experience" to facilitate understanding of ancient pottery techniques [19]. 4. "Bridge Wisdom Transmission" for teaching traditional wooden bridge construction techniques [29]. 5. "Cantonese Lion Dance Digital Activation" to enhance interaction and experience in traditional performances [36]. Group 4: Collaboration and Support - The initiative invites global technology teams to collaborate with cultural institutions to propose innovative solutions, with funding and resources available for selected projects [43][52]. - The project will undergo a structured process from proposal submission to implementation, ensuring thorough evaluation and support [48][50].
腾讯研究院AI速递 20260120
腾讯研究院· 2026-01-19 16:03
Group 1 - Tesla has announced that the design of its AI5 chip is nearing completion, with the AI6 chip in early stages, aiming to shorten the chip design cycle to 9 months and predicting it will become the highest production AI chip globally [1] - The AI5 chip will utilize Samsung's 2nm and TSMC's 3nm processes, boasting overall performance 50 times that of AI4 and memory capacity 9 times greater, with mass production expected in 2027 [1] - Tesla signed a $16.5 billion agreement with Samsung for the production of the AI6 chip in the U.S., anticipated to launch in 2028 [1] Group 2 - Anthropic has upgraded Claude Cowork with a "permanent memory" feature, allowing the AI to categorize and store information, enhancing user understanding over time [2] - The upgrade includes an MCP connector system to improve automation, voice mode development, and a new UI area for continuous management of results [2] - Continuous learning is viewed as a key breakthrough for AGI, with OpenAI and Google also investing in memory functionalities [2] Group 3 - Kunlun Wanwei has launched Skywork Design Agent, focusing on poster design, social media materials, logo branding, and general creative image generation [3] - The product features a self-developed canvas engine that supports manual editing, AI photo retouching, and layer separation, streamlining the entire process from material import to export [3] - It offers multiple export formats (PNG, JPG, PDF) and includes a unique "add to knowledge base" feature to address material management issues, now fully launched overseas [3] Group 4 - Douzi 2.0 has introduced the Coze Skill feature, allowing users to encapsulate personal methodologies and industry experiences into reusable "skill packages" [4] - A new "long-term plan" feature enables goal-oriented AI collaboration, breaking down vague objectives into clear steps for automatic execution [4] - The launch of a skill marketplace facilitates the exchange of industry skill packages, allowing professionals to monetize their expertise, alongside the beta release of video Agent Skill [4] Group 5 - Giant Network's "Supernatural Action Group" has introduced an "AI large model challenge" mode, integrating large model technology into game combat, marking a significant application in a high DAU game [5] - AI characters are driven by large models in real-time, capable of voice interaction and mimicking human behavior, with over 25 million AI matches recorded in the first week [5] Group 6 - Anker and Feishu have collaborated to create a 10-gram AI recording device, addressing the portability issues of traditional AI recording cards [7] - The device features real-time summarization capabilities, generating structured logical maps during meetings and supporting real-time translation in 24 languages [7] - Recordings are directly streamed to the Feishu knowledge base, integrating with the entire Feishu ecosystem to reduce the burden of knowledge base construction [7] Group 7 - Roboto has open-sourced its bipedal humanoid robot prototype, achieving a running speed of 3 m/s, making it one of the most advanced open-source humanoid robots globally [8] - The open-source content includes hardware schematics, EBOM material lists, supplier information, and control algorithm code, enabling reproducibility and verification [8] - The team, originating from Harbin Institute of Technology, has secured millions in seed funding and aims to reduce the cost of embodied intelligence development by 80% [8] Group 8 - Galaxy General has launched the Galbot S1, a heavy-duty robot capable of carrying loads up to 50 kg, currently operating in key production processes at CATL [9] - It features an industry-first fully autonomous, zero-remote operation "embodied handling model," utilizing pure visual perception without QR code markers [9] - Galaxy General has recently completed a 2.1 billion yuan financing round, with a valuation exceeding 20 billion yuan, and has established partnerships with leading manufacturers [9] Group 9 - OpenAI's product manager reported that since the release of ChatGPT-5, the Codex platform has seen a 20-fold growth, processing trillions of characters weekly [10] - The Sora Android app achieved a rapid development cycle, going from zero to launch in 28 days and topping the App Store, significantly improving team efficiency [10] - The manager noted that human typing speed and multitasking capabilities are often the limiting factors for AGI, rather than the models themselves [10]
我们正在亲手撰写历史
腾讯研究院· 2026-01-19 13:24
研究院诸友: 见字如面,展信开颜:) 刚刚过去的 2025 年,AI 倾泻如潮,世界在数字中深潜。 中国的大模型像一道弧光,向世界证明,智慧不仅可以轻盈,还可以开放,我们又在全球AI叙事中刻下来自东方的笔迹; 机器人在无数次跌倒后,又笨拙着爬起; "人工智能+"推动技术从实验室走向工厂,走向医院与家庭; 面对这一切,我们终于感觉到: 未来不再是预言,而是正在发生的此刻。 当 日程、工作、学习、健康、乃至情感的星图,皆可交由AI打理 , 我们却在某个月夜惊觉: 那些被 以效率之名 优化 掉 的,是否正是人之为人的根本? 迷途时的 意外风景、 阅读中艰涩的顿悟 、 人际间笨拙而真诚的摩擦——这些构成生命质感的 瑕疵 与低效 ,是否正被我们亲手裁剪? 上半场,我们教会机器学习。 下半场,真正的挑战或许在于,我们如何 对"何以为人"这一古老命题,持续探索与捍卫。 2026年的晨光,照亮的将不再是模型的参数之战,而是一条更为艰深的道路: 27岁的姚顺雨作为首席AI科学家加入腾讯,他平静 断言: AI的上半场已经结束。 我们又恍然:狂飙的技术史诗,第一章已然合上 。而 下半场的序曲,是关于我们自身的诘问。 当"人"与"机 ...
腾讯研究院AI速递 20260119
腾讯研究院· 2026-01-18 16:01
Group 1 - xAI's Colossus 2 is the world's first supercomputer cluster to reach 1GW power, with plans to upgrade to 1.5GW in April and a final capacity of 2GW [1] - The cluster will house 555,000 GPUs, surpassing Meta and Microsoft, dedicated to training Grok 5 with 60 trillion parameters [1] - The surge in power demand from data centers may lead to rolling blackouts for 67 million residents in the US PJM grid area, prompting xAI to deploy 168 Tesla Megapack energy storage systems [1] Group 2 - OpenAI has launched an $8/month ChatGPT Go subscription service, offering the GPT-5.2 Instant version with message and image creation limits ten times that of the free version [2] - The company plans to test advertisements in the US on both free and Go versions, with ads clearly marked and not affecting response content [2] - OpenAI assures that user data will not be sold to advertisers, and users can opt out of personalized ads and delete related data [2] Group 3 - OpenAI has quietly launched the ChatGPT Translate tool, supporting over 50 languages and allowing users to adjust the tone of translations [3] - Google has responded with the open-source TranslateGemma model, supporting 55 languages and featuring 12 billion parameters, surpassing the previous 27 billion baseline [3] - TranslateGemma retains multimodal capabilities to translate text in images, with a 4 billion version that can run on mobile devices [3] Group 4 - Black Forest Labs has open-sourced the FLUX.2 Klein model, achieving end-to-end inference in under 0.5 seconds on modern hardware, unifying text-to-image generation and editing [4] - The 4 billion parameter model requires only 13GB of VRAM to run on consumer-grade GPUs, while the 9 billion version matches the performance of models with five times the parameters [4] - The model offers FP8 and NVFP4 quantized versions, achieving inference speedups of up to 1.6x and 2.7x on RTX GPUs, with VRAM usage reduced by 40% to 55% [4] Group 5 - Meituan has released the LongCat-Flash-Thinking-2601 model with 560 billion parameters, introducing a rethinking mode that allows for simultaneous parallel thinking [7] - The model shows significant improvements in tool usage and search benchmarks, with a new evaluation method for generalization capabilities in automated environment scaling [7] - The model employs environment scaling and multi-environment reinforcement learning, enhancing adaptability in out-of-distribution scenarios [7] Group 6 - The court has unsealed over 100 documents in the lawsuit between Musk and OpenAI, revealing that Altman indirectly holds shares in OpenAI through the YC fund [8] - A diary entry from Brockman in 2017 admits to wanting to turn OpenAI into a for-profit company and remove Musk, stating it was the only chance to get rid of him [8] - OpenAI refutes claims that Musk sought a 50%-60% equity stake and CEO position, with the judge deeming the evidence too contentious for a jury trial set for April 27 [8] Group 7 - Neuralink's first subject revealed that brain chips can be upgraded without surgery through three methods: Telepathy app updates, OTA firmware updates, and hardware iterations [9] - After 85% of electrodes detached, the team used software algorithms to enhance the performance of the remaining 15%, achieving better results than intact electrodes [9] - Future plans include a "dual-chip configuration" to create a "digital bridge" between the brain and spinal cord, potentially allowing paralyzed individuals to walk again [9] Group 8 - Sequoia Capital partners have published a blog asserting that AGI has arrived, defining it as the ability to clarify tasks [10] - The article cites an example of an intelligent agent completing a recruitment task autonomously in 31 minutes, demonstrating its capability to form hypotheses and validate them [10] - The capabilities of long-cycle intelligent agents are expected to double every seven months, with predictions that by 2028 they could complete a human expert's daily work [10] Group 9 - OpenAI's post-training lead stated that the intelligence of a model is determined by how well it understands user queries [11] - GPT-5.1 has transformed all chat models into reasoning models, allowing them to autonomously decide on thinking duration based on question difficulty [11] - Improvements have been made in context memory, automatic model switching, and user-defined expression styles, with future models expected to be more customizable [11] Group 10 - Anthropic's new Economic Index report indicates that AI accelerates significantly with task complexity, achieving speedups of 9 times for high school tasks and 12 times for college tasks [12] - Human-AI collaboration has extended the time limit for AI tasks from 2 hours to 19 hours, nearly a tenfold increase, emphasizing the importance of human feedback [12] - The report warns of the "de-skilling" risk, as AI systematically removes high-intelligence components from work, with tasks now requiring an average of 14.4 years of education [12]
腾讯研究院AI每周关键词Top50
腾讯研究院· 2026-01-17 02:33
Group 1: Core Insights - The article highlights the top 50 keywords related to AI developments for the week of January 12-16, emphasizing the dynamic nature of the AI landscape [2][3]. Group 2: Applications - AI applications are diverse, including AI drug screening by Tsinghua University, AI programming impacts by Tailwind CSS, and AI audio devices by OpenAI [3]. - Notable advancements include the Gemini collaboration by Apple and the acquisition of Torch Medical by OpenAI, indicating a trend towards integrating AI in healthcare [3]. - The introduction of various AI models and tools, such as the Niji 7 anime model by Midjourney and the Video v1.0 by Kunlun Wanwei, showcases the expanding capabilities of AI in creative fields [3]. Group 3: Technology - Significant technological advancements are noted, including the Spirit v1.5 by Qianxun Intelligent and the COSA system by Zhujidi Power, reflecting ongoing innovation in AI technology [4]. - The emergence of AI-driven personal intelligence applications by Google and autonomous driving tests by NVIDIA indicates a focus on practical AI applications in everyday life [4]. Group 4: Perspectives - Various viewpoints are presented, such as OpenAI's discussion on capability surplus and predictions about AGI by Elon Musk, highlighting the ongoing debates in the AI community [4]. - The article also mentions the 2026 top ten breakthrough technologies identified by MIT, suggesting a forward-looking perspective on AI advancements [4].