腾讯研究院

Search documents
意识的七大理论,走到哪一步了?
腾讯研究院· 2025-09-05 08:01
Core Viewpoint - The article explores the complex phenomenon of consciousness from various interdisciplinary perspectives, aiming to connect different theories and establish a computational framework for understanding consciousness and its implications for artificial intelligence [2][9]. Group 1: Introduction and Definition of Consciousness - Consciousness is defined as a multifaceted concept involving awareness, wakefulness, and subjective experience, with distinctions made between these related but different concepts [7][16]. - The article emphasizes the importance and difficulty of understanding human consciousness, aiming to engage various research communities in this exploration [7][8]. Group 2: Theoretical Frameworks - The article outlines several influential theories of consciousness, including Information Integration Theory (IIT), Orchestrated Objective Reduction Theory (Orch OR), Global Workspace Theory (GWT), High-Order Theories (HOT), Attention Schema Theory (AST), and Conscious Turing Machine (CTM) [8][38]. - IIT posits that consciousness corresponds to the ability of a system to integrate information, with a focus on the causal power of the system [42][46]. Group 3: Measurement of Consciousness - Recent research has developed effective methods for measuring human consciousness, including indices based on electrical signals and behavioral assessments [18][19]. - The Perturbational Complexity Index (PCI) is highlighted as a significant measure for distinguishing between conscious and unconscious states [19][20]. Group 4: Consciousness and Intelligence - The article discusses the distinction between consciousness and intelligence, noting that consciousness is often considered more mysterious and difficult to measure than intelligence [22][23]. - The relationship between consciousness and free will is explored, with ongoing debates about the existence of true free will and its connection to consciousness [28][29]. Group 5: Sleep and Consciousness - The article examines consciousness during sleep, noting that different sleep stages (REM and NREM) exhibit varying levels of awareness and perception [35][36]. - Information Integration Theory suggests that consciousness diminishes during deep sleep due to reduced integration of brain activity [36][37]. Group 6: Biological Evidence and Theories - The article discusses biological evidence supporting the theories of consciousness, particularly the role of the brain's cortical areas in information integration [49]. - The Orch OR theory is presented as a hypothesis linking consciousness to quantum processes, suggesting that true randomness may be necessary for free will [65].
腾讯研究院AI速递 20250905
腾讯研究院· 2025-09-04 22:42
Group 1 - OpenAI has acquired the Alex team, known for its powerful coding assistant plugin for Xcode, indicating its intention to expand influence in the Apple app development ecosystem [1] - Apple plans to launch an AI search engine called "World Knowledge Answers" in Spring 2026, competing directly with ChatGPT and Perplexity [2] - Apple is facing a talent drain in its AI division, having lost 10 AI researchers, including key personnel, to Meta in recent weeks [2] Group 2 - The new Kimi K2-0905 model from Moonlight has enhanced programming capabilities and supports 256K context length, doubling the previous version's capacity [3] - Kimi K2-0905 maintains state-of-the-art performance in creative writing and improves compatibility with Claude Code [3] Group 3 - Kuaishou has open-sourced its 8 billion parameter multimodal model Keye-VL-1.5, achieving state-of-the-art results in video understanding benchmarks [4] - Keye-VL-1.5 can process several minutes of video content in under 10 seconds and introduces innovative training strategies for video recommendation and content review [4] Group 4 - OpenAI has introduced the Projects feature to the free version of ChatGPT, allowing users to manage up to 5 files and customize project settings [5][6] - This feature enhances the efficiency of ChatGPT by enabling centralized management of related content [5] Group 5 - Salesforce has announced the layoff of approximately 4,000 customer support positions, attributing this to the efficiency gains from its AI system, Agentforce [7] - The CEO stated that AI now handles up to 50% of internal workloads, while the company plans to hire 1,000 to 2,000 sales personnel to promote AI's value [7] Group 6 - A comprehensive review of scientific large language models (Sci-LLMs) has been published, detailing over 600 datasets and models, and highlighting four paradigm shifts from 2018 to 2025 [9] - The review emphasizes the importance of data quality and proposes a dynamic assessment model for scientific knowledge [9] Group 7 - OpenAI released a white paper outlining leadership strategies for maintaining a competitive edge in the AI era, noting that early adopters of AI see revenue growth 1.5 times faster than their peers [10] - The report suggests five core principles for organizations to follow in their AI strategy and implementation [10]
泡泡共同体
腾讯研究院· 2025-09-04 08:33
Core Viewpoint - The article discusses the concept of "bubble community" as a complex and dynamic structure in the digital age, emphasizing the coexistence of isolation and connection among individual information bubbles, challenging the traditional narrative of "information cocoons" [31][22]. Group 1: Conceptual Framework - The terms "information cocoon," "filter bubble," and "echo chamber" attempt to describe a shared experiential space, which can be categorized into three forms: bubbles, spheres, and foams, as proposed by philosopher Peter Sloterdijk [3][5]. - Bubbles represent the smallest unit of coexistence, symbolizing intimate relationships, while spheres denote larger, closed communities that provide safety from external threats [3][5][6]. - Foams consist of interconnected bubbles, representing an open yet fragile social structure, where individual bubbles maintain their independence while influencing one another [6][8]. Group 2: Algorithmic Influence - Algorithms create a "pampered space" for users by filtering out uncomfortable information and reinforcing cognitive biases, leading to a "cognitive immune space" [10][12]. - The filtering mechanism passively constructs boundaries, while the "adhesion mechanism" actively strengthens these boundaries through user interactions, such as likes and shares [11][12]. - This results in a parasitic structure where users become laborers in data capitalism, trading their data for a comfortable cognitive environment [12][13]. Group 3: Challenges of Digital Interaction - The article highlights the shift from emotional resonance to adversarial stimuli as the glue that binds groups together, leading to polarization and a lack of diverse viewpoints [13][15]. - Individual fluidity is constrained within algorithmic pampered spaces, where users appear to have freedom but are actually confined to predetermined cognitive frameworks [14][15]. - The self-consuming cycle of information within these bubbles leads to a gradual loss of vitality and diversity, as members become increasingly homogeneous in their views [15][16]. Group 4: Cognitive Navigation - The concept of "cognitive cocoon" is introduced, emphasizing that the real challenge lies in cognitive rigidity rather than mere information isolation [19][20]. - Individuals often reject opposing viewpoints due to confirmation bias and the fear of cognitive dissonance, reinforcing their existing beliefs [19][20]. - The article argues that breaking free from cognitive cocoons requires more than just exposure to diverse information; it necessitates a shift in mindset and the ability to engage with differing perspectives [16][19]. Group 5: Optimizing Bubble Communities - The article proposes three strategies for optimizing bubble communities: algorithmic intervention, sphere re-gasification, and social engineering [24][25][27]. - Algorithmic intervention involves injecting "heterogeneous bacteria" into recommendation systems to enhance cognitive diversity [24]. - Sphere re-gasification aims to make rigid boundaries more permeable, facilitating interaction between different bubbles [25]. - Social engineering emphasizes the need for a collective social contract among users, platforms, and governments to foster a healthier information ecosystem [27][28].
腾讯研究院AI速递 20250904
腾讯研究院· 2025-09-03 16:01
生成式AI 一、 OpenAI斥资11亿美金,收购软件开发平台公司Statsig 1. OpenAI以11亿美元收购软件实验和分析平台Statsig,并任命创始人Vijaye Raji为OpenAI应用部门新CTO; 2. Statsig帮助软件团队判断代码是否该上线及效果评估,这次收购显示OpenAI战略重心从纯底层模型研发转向打造 数据驱动的终端用户产品; 3. Vijaye Raji曾在微软工作十年,后在Meta担任高管十年,2021年创立Statsig,期间表现出卓越的编程能力和领 导才能,将直接向OpenAI应用CEO Fidji Simo汇报。 https://mp.weixin.qq.com/s/_eGHvABP5GK2FfCEl1jlfw 二、 又一万亿AI独角兽诞生,Anthropic完成130亿美元融资 1. Anthropic宣布完成130亿美元F轮融资,投后估值达1830亿美元,成为仅次于OpenAI、字节跳动的全球第三大AI 独角兽; 1. 谷歌发布nano banana(Gemini 2.5 Flash Image)官方Prompt模板,覆盖写实摄影、贴纸、文本渲染、商业摄 影、留 ...
腾讯研究院AI速递 20250903
腾讯研究院· 2025-09-02 16:01
生成式AI 一、 这次真不一样,谷歌Gemini解锁「详解网页」新技能 1. 谷歌Gemini API全面上线"URL Context"功能,使模型能深度访问并处理来自URL的内容,包括网页、PDF和图 像等; 2. 该功能采用两步检索流程,可深度解析PDF中的表格、文本结构、脚注等,处理容量上限达34MB,单次请求最多 处理20个URL; 3. URL Context被评价为"RAG的又一颗棺材钉",无需提取、分块、矢量化和存储等繁琐流程,直接解析特斯拉50 页PDF并精准提取数据。 https://mp.weixin.qq.com/s/alV-czwScS_CSsdP3nWZHQ 二、 混元世界模型上新,综合能力问鼎WorldScore排行榜 1. 腾讯发布混元3D世界模型系列最新成员HunyuanWorld-Voyager,业界首个支持原生3D重建的超长漫游世界模 型; 2. 混元Voyager突破传统视频生成限制,能生成长距离、世界一致的漫游场景,支持将视频直接导出为3D格式,且 与混元世界模型1.0高度适配; 3. 该模型在斯坦福大学李飞飞团队发布的WorldScore基准测试中位居综合能力首位,支 ...
所有人都在谈“人工智能+”,到底怎么落地?
腾讯研究院· 2025-09-02 08:23
Core Viewpoint - The article discusses the transition from "Internet+" to "Artificial Intelligence+" as a new phase in technological integration, emphasizing the transformative potential of AI in reshaping industries and societal operations [5]. Group 1: Differences Between "Artificial Intelligence+" and "Internet+" - The technological stage differs, with "Internet+" being based on mature digital technologies while "Artificial Intelligence+" is characterized by rapid iteration and uncertainty in technology and applications [7]. - The value creation mechanism varies; "Internet+" enhances connectivity, while "Artificial Intelligence+" focuses on computational enhancement, improving productivity at each node and expanding the network's value [10]. - The diffusion paths are distinct; "Internet+" follows a consumer-to-producer model, while "Artificial Intelligence+" is more producer-focused, requiring deep integration into business processes before reaching consumers [12]. Group 2: Economic Impact of AI - AI's productivity effects are expected to grow exponentially, with predictions that AI could contribute to a 15% increase in global economic growth over the next decade [11]. - The rapid evolution of AI capabilities, with task completion abilities doubling approximately every seven months, indicates a significant potential for economic value creation [11]. Group 3: Practical Exploration of "Artificial Intelligence+" - Companies should prioritize high-value AI use cases that are data-rich and core to their business, as demonstrated by Pfizer's use of AI to enhance drug development efficiency [17]. - The engineering of AI systems is crucial, with companies needing to adapt general models to specific business needs through techniques like prompt engineering and retrieval-augmented generation [18]. - Building AI datasets should focus on business needs rather than data collection for its own sake, ensuring that data strategies are integrated throughout the AI application lifecycle [19]. Group 4: Recommendations for Promoting "Artificial Intelligence+" - A top-level design is necessary to create an innovative environment for "Artificial Intelligence+", similar to the strategic guidance that supported "Internet+" [22]. - Encouraging a diverse range of developers and startups in AI applications can foster innovation and investment in the sector [23]. - Establishing a comprehensive data element market and promoting open industry application scenarios can enhance the sustainable development of AI applications [25].
腾讯研究院AI速递 20250902
腾讯研究院· 2025-09-01 16:01
Group 1 - Meta and Scale AI partnership has deteriorated, with Ruben Mayer, a high-ranking executive who joined Meta from Scale AI, leaving the company just two months after the collaboration began [1] - Meta's internal researchers have complained about the low data quality from Scale AI, prompting Meta to shift its focus to competitors Mercor and Surge [1] - Following the loss of Meta's support, Scale AI has also lost major clients like OpenAI and Google, leading to significant layoffs [1] Group 2 - Users reported a significant performance decline in Claude Opus 4.1 during the daytime, particularly between 10-11 AM, with frequent errors in document processing [2] - Analysis suggests that the performance drop may be due to Anthropic's use of 1.58-bit quantization during the day, which resulted in the loss of critical information [2] - Anthropic has acknowledged the issue as a problem with the inference stack and has rolled back to previous versions 4.1 and 4.0 to restore quality [2] Group 3 - Tencent has open-sourced the 7B parameter translation model Hunyuan-MT-7B, which supports 33 languages and has achieved first place in 30 out of 31 languages in the WMT2025 competition [3] - The company also released the first translation integration model, Hunyuan-MT-Chimera-7B, which generates superior translations based on original text and multiple model outputs [3] - The model utilizes AngelSlim compression for FP8 quantization, improving inference performance by 30% and is integrated into various Tencent services [3] Group 4 - Jieyue Star has launched the end-to-end speech model Step-Audio 2 mini, which integrates speech understanding, audio reasoning, and generation, along with native Tool Calling capabilities [4] - The model has excelled in multiple benchmark tests, achieving an MMAU score of 73.2, ranking first among open-source end-to-end speech models [4] - It employs a true end-to-end multimodal architecture, incorporating chain reasoning and reinforcement learning for enhanced understanding of emotions, tones, and non-verbal signals [4] Group 5 - Shanghai AI Laboratory has released the Shusheng·Wanxiang InternVL3.5 series models, featuring nine sizes with parameters ranging from 1 billion to 241 billion, enhancing general capabilities, reasoning abilities, and deployment efficiency [5] - The flagship model InternVL3.5-241B-A28B surpasses GPT-5 in several benchmarks, achieving a score of 77.7 in MMMU, the highest for open-source models [5] - Innovations include dynamic visual resolution routing and a decoupled deployment framework, reducing inference latency from 369ms to 91ms, enhancing core capabilities [6] Group 6 - The South Korean government has distributed AI dolls developed by startup Hyodol to tens of thousands of elderly individuals living alone, providing companionship and health monitoring [7] - The dolls feature a ChatGPT-based dialogue system and sensors to detect movements, with the ability to alert caregivers in emergencies [7] - Over 12,000 Hyodol dolls are currently in use, priced at approximately 8,160 RMB each, significantly lower than the cost of caregiving staff, addressing the shortage of nursing personnel in South Korea [7] Group 7 - As of September 1, the "Identification Method for AI-Generated Synthetic Content" has been implemented, requiring AI-generated content to include identity tags [8] - Providers of synthetic content must add explicit and implicit identifiers, while platforms must verify metadata and provide clear indications [8] - Major platforms like Tencent, Douyin, Kuaishou, Bilibili, and DeepSeek have announced detailed rules and functionalities for adding identifiers to AI content, prohibiting users from deleting or altering these tags [8] Group 8 - Tsinghua University and partners have released RLinf, the first large-scale reinforcement learning framework for embodied intelligence, featuring a new hybrid execution model [9] - The framework achieves over 120% system acceleration in training scenarios for embodied intelligence [9] - It integrates Megatron+SGLang/vLLM and FSDP+HuggingFace backends, designed for different training needs, and includes adaptive communication libraries and automatic scheduling modules [9] Group 9 - DeepSeek has published an official announcement in response to the new regulations, committing to label AI-generated content and warning users against modifications [10] - The company has disclosed training details for its models, including a scale of 685 billion parameters and the pre-training and optimization processes [10] - DeepSeek has outlined its data governance system, employing filters to eliminate harmful content while ensuring user rights to information, choice, and control, acknowledging the ongoing challenge of "hallucinations" in models [10]
段永朝:在AI缔造的新知识时代,刷题和应试将不再有意义
腾讯研究院· 2025-09-01 09:04
段永朝 苇草智酷创始合伙人 【 精彩观点整理 】 本文根据腾讯研究院对 段永朝老师 的访谈整理 访谈时间:2025年8月1日 对未来的想象不能再沿用基于三百年前笛卡尔主客二分法,或将世界进行分层 (如物理世界、观念世界 、人 造 世界) 的旧理论框架,因为这些理论一定有其目光不及之处。 我们需要看到一个新的世界的出现, 即"机器世界"。这个世界的崛起,意味着未来生命的概念可能会被重新定义,从而诞生出"人造生 命"或"机器生命"。讨论未来人机关系的第一步,首先要讨论这个"新世界"。 人类通过神话、传说等叙 事传统,一直在想象和创造"新物种"。如今,基因编辑、脑机接口、人工合成生命等生物科技的发展, 使得改变人体乃至创造新物种成为技术上势不可挡的趋势,科幻小说中的"超能人"将来可能会出现。未 来,在"纯种人"与"纯机器人"之间,会出现由不同比例合成的"赛博格"构成的模糊地带。即便在当下, 佩戴眼镜、摄入化学合成药物等,也已使我们在一定程度上成为了"赛博格"。 目前的大模型就像一个"话痨",有问必答,从不承认"不知道"。这种无论对错都要给出答案的特 性,恰恰是其技术尚处初级阶段的体现。 AI时代,个体的独立性在下降 ...
腾讯研究院AI速递 20250901
腾讯研究院· 2025-08-31 16:02
Group 1: Generative AI Developments - xAI launched Grok Code Fast 1, which is five times faster than GPT-5 and ranks among the top five coding models globally, focusing on real programming tasks and supporting multiple languages [1] - Meta is seeking partnerships with OpenAI or Google to enhance its AI capabilities, as its internal flagship model Llama 5 is progressing slowly, reflecting a sense of urgency in the AI race [2] - OpenAI introduced GPT-realtime, featuring advanced voice generation and improved accuracy, with a new API that lowers costs and enhances application flexibility [3] Group 2: Data Privacy and User Engagement - Claude updated its privacy policy to allow user data collection for model training, which has drawn criticism for contradicting its earlier stance on data security [4] Group 3: Model Performance and Innovations - Meituan open-sourced the LongCat-Flash model with 560 billion parameters, achieving high efficiency and speed, and performing well in various benchmarks [5] - GPT-5 demonstrated superior social reasoning and manipulation skills in a series of games, achieving a 96.7% win rate, highlighting its dominance in social intelligence [6][7] Group 4: Talent Movement and Legal Issues - xAI's founding engineer was accused of stealing core code and moving to OpenAI after cashing out approximately $7 million in stock, leading to a lawsuit over trade secrets [8] Group 5: Robotics and AI Interaction - Tsinghua University's team developed a framework allowing a robot to play table tennis with high accuracy, showcasing advancements in dynamic interaction capabilities [9] Group 6: AI Hardware Insights - a16z's Bryan Kim emphasized the need for hardware to facilitate more natural interactions with AI, identifying key factors for success in AI hardware applications [10]
腾讯研究院AI每周关键词Top50
腾讯研究院· 2025-08-30 02:33
Core Viewpoint - The article provides a weekly summary of the top 50 keywords related to AI developments, highlighting significant advancements, applications, and events in the industry [2]. Group 1: Chips - Jetson Thor and NVFP4 are key chip developments from NVIDIA, indicating a focus on enhancing computational power [3]. - UE8M0 FP8 is a notable chip from DeepSeek, showcasing innovation in AI hardware [3]. Group 2: Models - The release of Grok-2 as an open-source model by xAI reflects the trend towards collaborative AI development [3]. - Meta and others are advancing with the DeepConf method, indicating a push for improved model training techniques [3]. - NVIDIA's Jet-Nemotron and MiniCPM-V 4.5 from 面壁 are significant model advancements, showcasing the competitive landscape in AI modeling [3]. - The introduction of M2N2 evolution by Sakana AI and the V3.1 Bug by DeepSeek highlight ongoing improvements and challenges in model performance [3]. - OpenAI and Anthropic are collaborating on peer evaluation models, emphasizing the importance of model validation [3]. Group 3: Applications - Coinbase's mandatory use of AI tools signifies a shift towards integrating AI in operational processes [3]. - OpenAI's GPT-4b micro and Tencent's AI meeting summary feature demonstrate the growing application of AI in various sectors [3]. - Other notable applications include SpatialGen by 群核科技, Video Ocean's video intelligence, and DingTalk A1 by 钉钉, indicating diverse use cases for AI technology [3][4]. Group 4: Events - OpenAI's leadership transition and Midjourney's collaboration with Meta are significant events impacting the AI landscape [4]. - The monopoly lawsuit involving X company and Musk's Macrohard initiative reflect ongoing regulatory and competitive challenges in the industry [4]. Group 5: Perspectives - Insights from Claude Code on product iteration mechanisms and a16z on the generative platform landscape highlight strategic considerations in AI development [4]. - Google's AI energy consumption report and Stanford University's study on AI's impact on employment provide critical perspectives on the societal implications of AI [4]. - The discussion on digital immortality by Delphi and Geoffrey Hinton's baby hypothesis indicate philosophical considerations surrounding AI advancements [4].