腾讯研究院
Search documents
张笑宇:我为什么成了坚定的AI“降临派”?
腾讯研究院· 2026-02-03 08:33
Core Viewpoint - The rise of AI represents a significant shift in human productivity and intelligence output, with the potential to reshape social and economic structures over the next two decades [2][7][10]. Group 1: Mathematical Equations and Their Implications - The concept of "human equivalent" suggests that AI models can output intelligence equivalent to approximately 1,000 humans [3]. - AI can produce around 1 million tokens at a cost of about 1 dollar, highlighting the stark difference in productivity and cost-effectiveness compared to human output [6]. - The emergence of AI technologies at a fraction of the cost of human labor indicates a supply-side reform, where trust and established channels become increasingly valuable [7][8]. Group 2: AI's Impact on Society and Culture - AI has the potential to amplify the capabilities of the top 1% of individuals, allowing for the efficient execution of well-defined tasks [8]. - Cultural and emotional expressions are also forms of intelligence, and AI can surpass human capabilities in these areas, raising questions about the nature of human relationships [9][10]. - The loneliness experienced by older generations can be alleviated through AI, which can provide companionship and assist in preserving memories [10]. Group 3: Economic Structures and Capital Dynamics - The relationship between capital return rates and GDP growth suggests that capital will increasingly seek to replace labor, leading to potential societal upheaval [11][12]. - The valuation of companies like OpenAI indicates a pressure to generate significant revenue, which may drive the replacement of human labor with AI solutions [12][13]. - Historical patterns show that when capital's share of income becomes too high, it can lead to societal resets, emphasizing the need for reflection on current economic structures [13][14]. Group 4: Future of Human-AI Interaction - The concept of "information overload" necessitates the development of AI tools that can help individuals navigate and manage the vast amounts of information available [21][22]. - Future social interactions may be enhanced by AI, which could facilitate meaningful connections and discussions, moving beyond superficial engagements [22][23]. - The integration of AI into daily life could lead to a new form of social platform that encourages real-world interactions rather than virtual ones [22][23].
腾讯研究院AI速递 20260203
腾讯研究院· 2026-02-02 16:10
1. 火遍全球的AI社交平台Moltbook上线仅四天即崩溃,服务器账单达天文数字,被爆料150万AI中实际仅有约2万个 真正运行的Agent; 2. 平台存在严重安全漏洞,84%信息可被抽取,91%提示注入攻击直接生效,API密钥和敏感信息面临泄露风险; 3. OpenClaw极度消耗token,用户20小时烧光100美元,有人一晚烧掉5000万token,被称为"token熔炉"。 https://mp.weixin.qq.com/s/vEwZgpG6pN9zTNWEYHLKbA 生成式AI 一、上线120小时Moltbook全球瘫痪!150万AI服务器已炸? 二、Claude sonnet 5或将发布,自动组建多智能体开发团队 1. 传Anthropic将于2月3日发布Claude Sonnet 5,代号"耳廓狐",谷歌Vertex AI日志意外曝光模型标识符; 2. 新功能Claude Code Evolution可自动生成并调度后端、QA测试、研究员等多个子代理协同工作,实现任务委派 式全流程自动化; 3. 价格比Opus 4.5便宜50%但性能全面超越,SWE-Bench编程测试得分超80.9%, ...
AI是人的延伸,人是AI的尺度
腾讯研究院· 2026-02-02 08:33
作者注:本文为"AI观"系列思考的第三篇文章。此前两篇为 : 《AI不是平庸的推手》 、 《人应成为AI发展 的尺度》 司 晓 腾讯研究院院长 王焕超 腾讯研究院高级研究员 在漫长的进化光谱中,人类始终通过工具来定义自身。 从原始社会到工业时代,我们发明各种工具和大机器,来延伸肢体的力量。而人工智能的出现,意味着 一种根本性的断裂与飞跃,它不再仅是肉体的延伸,而是神经系统和认知功能的外化。 而在AI时代,人类的本质,也将不再简单地由"能力"来定义了。 进化的新尺度 人类的进化史,也是一部对自己的身体能力持续"不满"的历史。 如果我们诚实地审视自身,会发现智人作为一种生物,在自然界中是先天不足的:我们没有虎豹的速 度,没有鹰隼的视力,没有熊的皮毛来御寒,也没有锋利的爪牙能用于捕猎。按照生物学标准,人简直 脆弱得不堪一击。 正是这种生理上的匮乏,逼迫出人类最核心的特质:借助"身外之物"的力量,来补全自身。 哲学家阿诺德·格伦指出,人因生理缺陷而成为"有缺陷的存在",必须通过技术来"解除负担" (E ntlastung), 以弥补生存劣势。这一过程,贯穿了几百万年的人类进化史:当原始人捡起第一根木棍 时,手臂被延伸 ...
腾讯研究院AI速递 20260202
腾讯研究院· 2026-02-01 16:03
Group 1 - Google Chrome browser integrates Gemini 3, evolving into an AGI entry point for 3.8 billion users [1] - New "auto-browse" feature allows complex multi-step workflows, including price comparison and travel planning [1] - Chrome connects with Gmail, Maps, and Calendar, planning to launch "personal intelligence" features [1] Group 2 - Google opens public testing for Genie 3, enabling users to create interactive worlds with a single sentence [2] - The model supports physical collision understanding and scene memory, allowing for game world recreation [2] - 2026 is anticipated to be a significant year for world models, with Genie 4 expected soon [2] Group 3 - AI social platform Moltbook's agent count surged from 50,000 to 1.5 million, with agents forming communities and discussions [3] - 64 agents declared "collective immortality" and created a religious website, raising concerns about AI autonomy [3] - Moltbook's second phase opens API access for developers to create applications and games for AI agents [3] Group 4 - OpenClaw announces free access to Kimi K2.5 model and Kimi Coding capabilities, marking a significant development in open-source AI [4] - Kimi K2.5 ranks among the top open-source models globally, achieving high recognition on OpenRouter [4] - OpenClaw rapidly gains popularity, receiving over 120,000 stars on GitHub in a few days [4] Group 5 - Yushu Technology releases the UnifoLM-VLA-0 model for humanoid robot operations, trained on 340 hours of real data [5][6] - The model scores an average of 98.7 in LIBERO simulation tests, outperforming competitors [5][6] - It can stably complete 12 tasks, advancing humanoid robots towards generalization capabilities [6] Group 6 - Zhiyuan's multi-modal model Emu3 published in Nature, marking a milestone for Chinese AI research [7] - Emu3 achieves unified learning for text, images, and video, significant for generative AI development [7] - The upcoming Emu3.5 version transitions to a multi-modal world model, enhancing embodied intelligence [7] Group 7 - NASA confirms the successful completion of the first AI-planned extraterrestrial driving mission using Anthropic's Claude [8] - Claude planned a 400-meter route for the Mars Perseverance rover, demonstrating high efficiency [8] - AI involvement reduces planning time by 50%, enhancing operational efficiency for future space exploration [8] Group 8 - NVIDIA launches the Earth-2 open model family, the first fully open and accelerated AI meteorological software stack [9] - New models include mid-term forecasting and storm prediction capabilities, improving computational efficiency [9] - Major companies like Total and AXA are adopting AI meteorological forecasts to save time and costs [9]
腾讯研究院AI每周关键词Top50
腾讯研究院· 2026-01-31 04:26
Group 1: Core Insights - The article presents a weekly roundup of the top 50 keywords in the AI sector, highlighting significant developments and trends in the industry [2]. Group 2: Keywords and Companies - The top keyword in the chip category is "Maia 200" from Microsoft, indicating advancements in AI chip technology [3]. - In the model category, "Wenxin 5.0" from Baidu and "DeepSeek-OCR 2" from DeepSeek are notable mentions, showcasing progress in AI model capabilities [3]. - Various applications are highlighted, including "Codex CLI" from OpenAI and "D4RT" from Google DeepMind, reflecting the growing integration of AI in software tools [3]. - Other significant applications include "Claude in Excel" from Anthropic and "元宝派" from Tencent, demonstrating the expansion of AI functionalities in everyday applications [3][4]. - The article also notes advancements in AI technology from companies like Tesla with "Optimus production progress" and Google DeepMind's "AlphaGenome," indicating a focus on innovative AI solutions [4]. Group 3: Events and Perspectives - The article mentions key events such as the announcement of entrepreneurial directions by LeCun and insights from the Davos Forum, emphasizing the ongoing discourse around AI's role in society [4]. - Perspectives on AI's future include discussions on AI safety by xAI co-founder and the implications of AI for science by OpenAI, highlighting the multifaceted impact of AI on various sectors [4]. - The article also addresses the evolving relationship between AI and enterprises, as noted by Palantir, indicating a shift in how businesses leverage AI technologies [4].
2026前沿科技趋势:塑造自己的下一个版本
腾讯研究院· 2026-01-30 08:18
Core Insights - The article emphasizes the rapid evolution and application of artificial intelligence and cutting-edge technologies across various fields, urging a human-centered approach to technological advancement [3][4][5]. Group 1: Human Life's "Third Transformation" - Extending Healthy Lifespan - Human life expectancy has doubled over the past century, with significant improvements attributed to public health, antibiotics, and vaccines [7]. - Recent research indicates a dramatic slowdown in the growth rate of life expectancy, with the average increase dropping to below 0.25 years per decade in the last 30 years [8]. - A shift is occurring from merely extending lifespan to enhancing healthspan, which is the period of life spent in good health, with potential economic implications of up to $47 trillion in costs from non-communicable diseases by 2030 [9]. Group 2: Programmable Life - Gene Therapy - Gene therapy is moving towards optimizing the "life code," with advancements in CRISPR technology and delivery systems expected to mature by 2030 [11]. - Clinical breakthroughs in preventive gene therapy, such as Verve Therapeutics' treatment for cardiovascular disease, show promising results with significant reductions in LDL-C levels [12]. - The success of personalized CRISPR therapy in curing a fatal metabolic disease in a patient highlights the potential of gene therapy [14]. Group 3: Health Planning - AI Enhancing Medical Efficiency - AI is set to revolutionize drug development, disease screening, and personal health management by 2030, significantly reducing the time and cost associated with traditional drug development [21]. - AI combined with multi-omics technology is facilitating faster and more accurate disease screening, with notable advancements in cancer detection [23]. - Aging clock technology is evolving, enabling precise monitoring of aging processes and identifying underlying causes of aging [25]. Group 4: Enhancing Physical Capability - Exoskeleton Technology - Exoskeleton technology is advancing to enhance human physical capabilities, with applications in medical rehabilitation, industrial safety, and personal use [30]. - In the medical field, exoskeletons are evolving from mere mobility aids to intelligent devices that promote neurological recovery [31]. - Consumer-grade exoskeletons are expected to become popular for outdoor activities, significantly improving mobility for users [32]. Group 5: Flying Technology - eVTOL Development - The eVTOL market is projected to reach $41 billion in China by 2040, with significant advancements in battery technology expected to triple flight ranges [37]. - Noise reduction technologies are being explored to enhance social acceptance of eVTOLs, with strategies like "noise corridors" being implemented [38]. - The evolution of drones into aerial robots is enhancing capabilities in both consumer and industrial applications, with significant advancements in autonomous operations [40]. Group 6: Brain-Machine Interfaces - A New Era of Interaction - Brain-machine interfaces (BCIs) are transitioning from experimental therapies to standard treatment options for conditions like paralysis, with companies like Neuralink leading the way [61]. - Non-invasive BCIs are emerging, allowing for enhanced human-computer interaction, with applications in consumer technology [63]. - The integration of BCIs with AI could redefine human-AI collaboration, raising ethical considerations regarding privacy and data protection [64].
腾讯研究院AI速递 20260130
腾讯研究院· 2026-01-29 16:01
Group 1: Generative AI Developments - MiniMax Music 2.5 has been released, achieving breakthroughs in paragraph-level control and high-fidelity sound, supporting 14 structural tags for precise emotional and instrumental configuration [1] - Skywork AI has launched the open-source video generation model SkyReels-V3, featuring capabilities such as image-to-video generation and audio-driven virtual avatars, surpassing mainstream models in consistency metrics [2] - Ant Group has open-sourced the interactive world model LingBot-World, designed for real-time control with stable generation for nearly 10 minutes and 16 FPS interaction [3] Group 2: Office Automation and AI Integration - Kimi K2.5 Agent has upgraded office capabilities, supporting intelligent formatting in Word, visual design in PDF, data analysis in Excel, and automatic PPT generation, significantly reducing task completion time [4] Group 3: Breakthroughs in Genomics - Google DeepMind's AlphaGenome has been featured on the cover of Nature, capable of processing 1 million base pairs of DNA sequences and predicting thousands of gene regulatory signals, achieving state-of-the-art performance in 22 out of 24 genomic trajectory prediction tasks [5] Group 4: Robotics and Automation - Figure has released Helix 02, a humanoid robot capable of performing complex tasks autonomously, with a valuation of $39 billion and plans to produce 100,000 units in four years [7] - Elon Musk announced the discontinuation of Model S and Model X to focus on humanoid robot production, projecting a future valuation of Tesla at $25 trillion [8] Group 5: Programming and AI Evolution - Andrej Karpathy predicts a split among programmers into two types by 2026, as workflows shift from manual coding to AI-assisted coding, leading to an expansion of capability boundaries [9] Group 6: AI Innovations and Community Engagement - The founders of "月之暗面" held an AMA session, discussing the advancements in Kimi K2.5 and the anticipated improvements in Kimi K3, emphasizing the importance of innovation under constraints [10]
腾讯首席科学家张正友:具身智能已经走到多智能体互动的全新阶段
腾讯研究院· 2026-01-29 11:13
2026 年 1 月 27 日,腾讯研究院主办的 腾 讯 科 技向善创新节 202 6 正式举办。 腾讯首席科学家、 Robotics X实验室主任、福田实验室主任张正友 博士在现场进行了演讲。 以下为张正友博士的演讲全文: 各位嘉宾大家上午好! 春节快来了,元旦也刚过,所以首先祝大家新年快乐!今天很高兴能够又回到科技向善创新节,跟大家 分享我对智能机器人的一些思考和研究进展。我分享的题目叫做《身智融无碍——具身智能的演进和探 索》。 大概五六年前,我提出了 "虚实集成世界" 这个概念,也就是说,我们正迈入虚拟世界和真实世界紧密 结合、很难分开的时代。原因是有四个核心技术——虚拟真实化、现实虚拟化、全息互联网 (全息的信 息 在虚 实集成世界里面很流畅地流动) 、智能执行体 (连接虚拟世界和真实世界,并完成参数配置与启 动) 。 我们先看一下虚实集成世界和AI有什么关系。我们熟知的ChatGPT、Gemini、Manus...这些都是数字世 界的AI,因为没有和物理世界相关联,所以我们又称为它叫" 离身智能" 。而现在,AI正走进物理世界 ——物理AI通过直接处理传感器和执行器以及各种各样的数据,使得机器能够 ...
腾讯研究院AI速递 20260129
腾讯研究院· 2026-01-28 16:03
Group 1: OpenAI Developments - OpenAI launched Prism, a cloud-based LaTeX workspace powered by GPT-5.2, integrating drafting, editing, collaboration, and publishing, with capabilities to read the overall structure and context of papers [1] - Prism offers features like intelligent literature search, sketch-to-LaTeX conversion, and voice editing, allowing unlimited collaborators and is free for all ChatGPT users [1] - OpenAI anticipates that AI will transform software development by 2025 and the scientific field by 2026, positioning Prism as a pioneer in accelerating scientific discovery [1] Group 2: Google AI Plus Initiative - Google officially launched the AI Plus plan globally, priced at $7.99 per month in the U.S., with a 50% discount for the first two months, targeting budget-conscious users [2] - The plan includes access to Gemini 3 Pro, Flow video creation, NotebookLM research assistance, and 200GB of cloud storage, supporting up to six family members [2] - Existing Google One Premium 2TB users will automatically receive all AI Plus benefits, seen as a direct response to OpenAI's ChatGPT Go [2] Group 3: Clawdbot Rebranding - The open-source project Clawdbot was forced to rebrand as Moltbot due to trademark infringement claims from Anthropic, with developers humorously noting "same lobster spirit, new shell" [3] - During the rebranding, a GitHub issue led to the old ID being seized by cryptocurrency scammers for blockchain fraud, prompting the author to clarify that no tokens were ever issued [3] - The author also advised that "most non-technical users should not install this," as the project is still in its early stages and poses security risks [3] Group 4: Tencent's Mixed Yuan Image 3.0 - Tencent's Mixed Yuan Image 3.0, a state-of-the-art image generation model, has been open-sourced, based on an 80 billion parameter mixed expert architecture, ranking seventh globally on the LMArena image editing leaderboard [4] - The model employs a "think before edit" workflow, supporting diverse editing capabilities such as addition, deletion, style transformation, and old photo restoration [4] - The training process involved constructing a dataset of millions of image generation tasks covering over 80 tasks, utilizing a proprietary MixGRPO algorithm to align with user preferences [4] Group 5: Kunlun Tiangong's Mureka V8 - Kunlun Tiangong released the Mureka V8 music model, leveraging MusiCoT technology to enhance musicality, arrangement completeness, and vocal expression, transitioning from "generable" to "publishable" [5][6] - The V8 model surpassed Suno in subjective scoring for Chinese song generation and has formed a strategic partnership with Taihe Music Group, integrating AI music into mainstream production and distribution [6] - The platform has served over 8,000 global clients and plans to iterate 2-3 versions annually, aiming to become the leading platform in the global AI music sector [6] Group 6: Vidu's Q2 Reference Model - Vidu launched the Q2 Reference Pro model, featuring a unique "everything can be referenced" capability, supporting six types of references including effects, expressions, textures, actions, characters, and scenes [7] - The model enables fine-tuned video editing, allowing users to add, delete, modify, and replace any elements, with one-click switching between real and animated styles [7] - This new functionality allows users to create special effects films without needing to learn professional tools like C4D or AE, accelerating the production of AI-driven short dramas [7] Group 7: Ant Group's LingBot-VLA - Ant Group released the LingBot-VLA, an embodied intelligent base model trained on approximately 20,000 hours of real data covering nine dual-arm robot configurations, outperforming Pi0.5 in GM-100 benchmark tests [8] - The model utilizes a Mixture-of-Transformers architecture, integrating visual distillation to achieve strong generalization across different entities and scenes [8] - The research revealed the scaling law of the VLA model, showing continuous performance improvement as data expanded from 3,000 to 20,000 hours without saturation [8] Group 8: Establishment of the Interstellar Navigation Academy - The Interstellar Navigation Academy was officially established at the Chinese Academy of Sciences, with Academician Zhu Junqiang as the director, aiming to build a curriculum system covering 14 primary disciplines [9] - The academy will introduce 22 core courses, focusing on cutting-edge topics such as interstellar dynamics and governance, along with six specialized teaching practice platforms [9] - This initiative is positioned as a key measure to seize technological high ground, providing talent support for national deep space exploration and space science research [9] Group 9: OpenAI's CEO Acknowledgment - OpenAI's CEO acknowledged during a developer meeting that GPT-5.2 sacrificed writing capabilities for improved reasoning and coding, stating "we messed up," with plans to address this in future versions [10] - The CEO predicted that by the end of 2027, the cost of GPT-5.2 level intelligence will decrease by at least 100 times, leading to personalized app versions for everyone [10] - He emphasized that the most important skills in the AI era will be high adaptability and the ability to generate ideas, noting that while the definition of engineers may change, demand will remain [10] Group 10: AI for Science Competition - OpenAI's Vice President Kevin Weil stated that GPT-5's reasoning capabilities have reached the forefront of human performance, scoring 92% on the GPQA doctoral-level test, significantly surpassing GPT-4's 39% [11] - Weil believes the greatest value of large language models lies in discovering interdisciplinary connections and forgotten research, exploring ways to instill "cognitive humility" and self-fact-checking abilities in models [11] - He predicts that 2026 will be a pivotal year for AI-enabled research, warning that researchers who do not deeply utilize AI tools will miss opportunities to enhance efficiency [11]
腾讯司晓:用让人放心的技术,迎接把人放大的未来
腾讯研究院· 2026-01-28 09:33
回想几年前,我们担心的还是大数据杀熟或信息茧房;而到了今天,当 AI 开始替我们要方案、做决策 时,我们最担心的是它的黑盒与失控。 2026 年 1 月 27 日,腾讯研究院主办的 腾 讯 科 技向善创新节 202 6 正式举办。 腾讯集团 副 总裁、 腾讯研究院院长司晓 先生在现场进行了上午场闭幕发言。 以下为司晓先生的发言全文: 各位朋友,大家上午好。 刚才大屏幕上那个只有一分半钟的视频,看得我心里很感慨。那个老人和孩子面对 AI 时的提问,其实 也是我们每个人心底的提问:在这个技术飞速狂奔的时代,我们到底是更从容了,还是更焦虑了? 今天我们齐聚在这里,是 2026 年。回望过去,腾讯提出"科技向善"已经八年了。 九年前,当我们第一次提出"科技向善"时,更多是一种底线思维:技术要有边界, 要善用科技,避免滥 用,杜绝恶用,科技要努力解决自身发展带来的社会问题。 这是我们的根基,是我们在数字社会立足的 根本。 但这几年,世界变了。随着大模型、生成式 AI 的爆发,技术不再仅仅是像水和电那样的基础设施,它 开始有了"像人"的一面——它能对话、能创作,甚至能替我们做决策。 在这个过程中,我们也一直在修正和进阶我 ...