AGI
Search documents
倒计时3周离职,LeCun最后警告:硅谷已陷入集体幻觉
3 6 Ke· 2025-12-16 07:11
Core Viewpoint - LeCun criticizes the obsession with large language models (LLMs) in Silicon Valley, asserting that this approach is a dead end and will not lead to artificial general intelligence (AGI) [1][3][26] Group 1: Critique of Current AI Approaches - LeCun argues that the current trend of stacking LLMs and relying on extensive synthetic data is misguided and ineffective for achieving true intelligence [1][3][26] - He emphasizes that the real challenge in AI is not achieving human-like intelligence but rather understanding basic intelligence, as demonstrated by simple creatures like cats and children [3][12] - The focus on LLMs is seen as a dangerous "herd mentality" in the industry, with major companies like OpenAI, Google, and Meta all pursuing similar strategies [26][30] Group 2: Introduction of World Models - LeCun is advocating for a different approach called "world models," which involves making predictions in an abstract representation space rather than relying solely on pixel-level outputs [3][14] - He believes that world models can effectively handle high-dimensional, continuous, and noisy data, which LLMs struggle with [14][12] - The concept of world models is tied to the idea of planning, where the system predicts the outcomes of actions to optimize task completion [14][12] Group 3: Future Directions and Company Formation - LeCun plans to establish a new company, Advanced Machine Intelligence (AMI), focusing on world models and maintaining an open research tradition [4][5][30] - AMI aims to not only conduct research but also develop practical products related to world models and planning [9][30] - The company will be global, with headquarters in Paris and offices in other locations, including New York [30] Group 4: Perspectives on AGI and AI Development Timeline - LeCun dismisses the concept of AGI as meaningless, arguing that human intelligence is highly specialized and cannot be replicated in a single model [31][36] - He predicts that significant advancements in AI could occur within 5-10 years, potentially achieving intelligence levels comparable to dogs, but acknowledges that unforeseen obstacles may extend this timeline [31][33] Group 5: Advice for Future AI Professionals - LeCun advises against pursuing computer science as a primary focus, suggesting instead to study subjects with long-lasting relevance, such as mathematics, engineering, and physics [45][46] - He emphasizes the importance of learning how to learn and adapting to rapid technological changes in the AI field [45][46]
8点1氪:麦当劳多款餐品涨价;深圳一地厕所安装“吸烟会变透明”玻璃;纳斯达克称申请将工作日交易时长延长至23小时
36氪· 2025-12-16 00:12
Group 1 - McDonald's has increased the prices of several menu items by 0.5 to 1 yuan, including various burgers, snacks, and meals, with the Big Mac and Double Filet-O-Fish both rising by 1 yuan [3][4] - The "1+1 Flexible Combo" meal, humorously referred to as the "poor man's meal," remains unchanged at a starting price of 13.9 yuan, although some combinations within it have seen a 1 yuan increase [4] Group 2 - In Shenzhen, a restroom has been equipped with glass that turns transparent when smoke is detected, aimed at discouraging smoking [5] Group 3 - The breakfast combo at Mixue Ice City priced at 7.9 yuan has faced criticism for being expensive, as consumers compare it to cheaper local options [7][8] - The breakfast items sold at Mixue are pre-packaged rather than freshly made, leading to perceptions of lower value compared to local street food [7][8] Group 4 - The U.S. has seen a rise in tuition fees at several universities, with total costs for a bachelor's degree nearing 2.8 million yuan, particularly at prestigious institutions [9] Group 5 - Nasdaq has filed to extend trading hours to nearly 23 hours on weekdays, aiming to enhance trading flexibility [7] Group 6 - The first L3 level autonomous driving vehicles in China have received approval for commercial testing, marking a significant step towards the commercialization of autonomous driving technology [10]
估值1.05万亿!DeepSeek双登《自然》封神,中国AI如何做到颠覆?
Sou Hu Cai Jing· 2025-12-15 22:07
2025年末,一位中国创业者再度引爆科技圈。 国际顶级期刊《自然》新鲜出炉的年度十大科学人物榜单上,DeepSeek创始人梁文锋赫然在列。 要知道,该榜单每年仅甄选十位真正推动科学进步的领军者。梁文锋的入选,源自其带领团队研发的 DeepSeek大模型对全球AI格局的颠覆性重塑。 而这并非他与《自然》的首次邂逅——今年9月,他作为DeepSeek-R1论文核心作者已登上期刊封面, 短短三月内再次上榜,实力毋庸置疑。 正如《自然》赋予他的"Tech disruptor"评语,这位40岁的创业者已是公认的AI领域革命者。 接连的高光时刻,让梁文锋的崛起之路格外耀眼。他与估值1.05万亿的DeepSeek所缔造的传奇,究竟是 时运眷顾还是实力使然? 一、破局者之路,从10万到万亿的逆袭 长期以来,海外科技巨头始终认定中国AI难触核心技术,只能在产业链下游挣扎。然而,一位年轻企 业家的实践路径,正在系统性地扭转这一认知。 2013年,职业生涯起步阶段的梁文锋带着有限资本,进入变幻莫测的金融市场。当时他对人工智能的理 解尚处于探索阶段,却已展现出敢于挑战常规的勇气与远见。 两年后,他创立幻方科技,专注于量化投资这一专业 ...
DeepMind科学家惊人预测:AGI在2028年实现,大规模失业要来了
3 6 Ke· 2025-12-15 02:50
Core Insights - DeepMind's Chief Scientist Shane Legg predicts a 50% chance of achieving Minimal AGI by 2028, indicating a significant shift in human labor dynamics and potential for large-scale unemployment [1][25][27] - The development of AGI is seen as a critical turning point, with the potential to fundamentally reshape society and the economy [6][19][22] AGI Development Stages - Minimal AGI: Capable of performing typical cognitive tasks that humans can do, expected to be achieved by 2028 with a 50% probability [3][9] - Full AGI: Expected to follow Minimal AGI within 3-6 years, capable of performing tasks of the most outstanding humans, such as creating new theories and art [11] - Superintelligence (ASI): Will surpass human cognitive abilities across all domains, leading to unprecedented changes in society [13][19] Implications of AGI - The arrival of AGI could lead to structural unemployment, particularly affecting high-level cognitive jobs, while lower-skilled jobs may remain safer for the time being [22][24] - A rethinking of resource distribution and societal values will be necessary as human labor becomes less central to value creation [24][31] Future Vision - Shane Legg emphasizes the need for public policy and social structures to evolve alongside AGI to ensure equitable benefits and prevent potential risks [31][32] - The ultimate significance of AGI may lie in redefining what constitutes a meaningful human life, moving away from work-centric values [30][34] Call to Action - A collective effort from various societal sectors, including philosophers, educators, and policymakers, is essential to navigate the challenges and opportunities presented by AGI [35][39]
腾讯研究院AI速递 20251215
腾讯研究院· 2025-12-14 16:01
Group 1 - OpenAI's GPT-5.2 received negative feedback from users on platforms like X and Reddit, citing issues such as blandness, excessive safety checks, and poor emotional intelligence [1] - SimpleBench testing revealed GPT-5.2 scored lower than Claude Sonnet 3.7 from a year ago, with errors in simple questions, while LiveBench scores were below Opus 4.5 and Gemini 3.0 [1] - The strict safety refusal mechanism was criticized for reducing the model's empathy and contextual awareness, leading to mechanical and unrealistic suggestions in emotional support scenarios [1] Group 2 - Google launched the new Gemini Deep Research Agent just before GPT-5.2, enhancing accuracy and reducing hallucinations through multi-step reinforcement learning [2] - The new version achieved leading scores of 46.4% in the Humanity's Last Exam test set, 66.1% in DeepSearchQA, and 59.2% in BrowseComp [2] - Google also introduced an open-source benchmark for network research agents and a new interactive API for server-side state management and long inference loops [2] Group 3 - Runway released significant updates, including the Gen-4.5 flagship video model and the first general world model, GWM-1, which supports native audio generation and multi-camera editing [3] - GWM-1 is an autoregressive model that allows frame-by-frame prediction and real-time intervention, featuring variants for exploring environments, dialogue characters, and robotic operations [3] - NVIDIA's CEO congratulated Runway, indicating a shift from simple video generation to true world simulation, with AI beginning to understand the underlying logic of the physical world [3] Group 4 - Google integrated Gemini model capabilities into its translation service, launching a real-time voice translation beta that supports over 70 languages while preserving speaker tone and rhythm [4] - The text translation engine has been restructured to intelligently parse idioms and context rather than relying on literal translations, supporting translations between English and nearly 20 other languages [4] - The Chrome team introduced an experimental browser called Disco, featuring GenTabs that convert web content into interactive mini-apps [4] Group 5 - TuoZhu Technology upgraded its 3D model platform MakerWorld by integrating Tencent's Hunyuan 3D 3.0, launching a new figurine generator that allows users to create printable 3D models from a single image [6] - Hunyuan 3D 3.0 introduced a pioneering 3D-DiT sculpting technology, enhancing modeling precision threefold with a geometric resolution of 1536³ and supporting ultra-high-definition modeling with 3.6 billion voxels [6] - MakerWorld has attracted over 2 million users with 20 unique modeling tools, significantly shortening design cycles by leveraging advanced generative AI technology [6] Group 6 - Disney invested $1 billion in OpenAI, acquiring warrants for additional equity, marking a significant content licensing partnership for the Sora platform [7] - The three-year licensing agreement grants exclusivity in the first year, allowing Sora and ChatGPT Images to use over 200 Disney characters, including those from Marvel and Pixar, excluding live-action likenesses [7] - Disney plans to utilize OpenAI's API to develop new products for its Disney+ streaming platform and deploy ChatGPT for internal workflows, with selected fan-created videos to be featured on Disney+ [7] Group 7 - The Erdős 1026 problem, proposed in 1975, was solved with AI assistance in just 48 hours, showcasing AI's potential to provide new mathematical insights rather than merely searching existing literature [8] - The AI system Aristotle automatically proved a formula in Lean proof assistant language, while AlphaEvolve helped refine a clean formula from numerical results [8] - This achievement demonstrates AI's capability to generate new mathematical insights, significantly reducing the time required for traditional problem-solving methods [8] Group 8 - Yuzhu Technology launched the first humanoid robot application store, aimed at standardizing and modularizing humanoid robot functionalities to lower the development barrier for complex movements [9] - The application store includes core modules such as user forums, action libraries, datasets, and developer centers, allowing users to deploy cloud-based motion control algorithms without coding skills [9] - Initial applications include preset martial arts and dance routines for the G1 series robots, utilizing proprietary dynamics algorithms and high-precision motion capture data [9] Group 9 - Google DeepMind's chief AGI scientist predicts a 50% chance of achieving minimal AGI by 2028, with complete AGI expected within 3-6 years after that, leading to a phase of superintelligent AI [10] - AGI is viewed as a continuous spectrum rather than a critical point, with three stages: minimal AGI for typical cognitive tasks, complete AGI for exceptional human tasks, and ASI surpassing all human cognitive domains [10] - The emergence of AGI is anticipated to cause structural unemployment, primarily affecting high-level cognitive jobs, while lower-level physical jobs may remain temporarily safe [10] Group 10 - A report by Similarweb indicates that global GenAI platform monthly visits exceeded 7 billion, a 76% year-on-year increase, with mobile app downloads reaching 1.9 billion, more than tripling in a year [12] - The proportion of users aged 18-34 decreased by approximately 15%, indicating a rapid influx of older users, while ChatGPT has become one of the top five websites globally, with 95% of users still using Google [12] - AI Mode has become the first generative AI search feature to surpass 100 million visits, marking a shift in the internet from being search-driven to being AI-driven [12]
2026 将近,世界模型到底更「世界」了吗?
机器之心· 2025-12-13 02:30
Core Viewpoint - The recent launch of GWM Worlds and GWM Robotics by Runway pushes video generation towards an interactive "world simulation" paradigm, reigniting discussions on the definition and scope of "world models" as interfaces for creation and interaction, simulators for training and evaluation, or cognitive frameworks for reasoning and decision-making [1]. Group 1: Evolution of World Models - Over the past two years, world models have evolved to be considered on par with LLMs in the AGI landscape, transitioning from a narrow definition focused on reinforcement learning to a broader understanding that includes generative modeling [4]. - Initially, world models were seen as internal environment models for agents, predicting future states based on current conditions and actions, allowing for internal simulation and decision-making [5]. - The engineering perspective defined world models as a combination of three capabilities: compressing high-dimensional perception into usable representations, predicting future states over time, and utilizing predictions for planning and decision-making [6]. - By 2024, the understanding of world models expanded to encompass general world evolution modeling, with a trend from language generation to image generation, and ultimately to 3D and world generation [6]. - The boundaries of the world model concept have become more ambiguous, with ongoing debates about the nature of representations, the incorporation of physical laws, and the organization of input relationships [6]. Group 2: Industry Layout and Trends - Major companies are investing in world models, questioning whether they are enhancing their "data engines" or building new frameworks for "spatiotemporal cognition" [3]. - In February 2024, OpenAI referred to the video generation model Sora as "world simulators," emphasizing their ability to learn the three-dimensional structure and physical laws of the real world [6]. - Concurrently, LeCun introduced V-JEPA, which focuses on predicting masked video segments in abstract representation space, allowing for higher training efficiency by discarding unpredictable information [6]. - The current discourse has shifted from whether to develop world models to how to model them, with debates on whether to abstract from pixel levels or to directly operate in abstract spaces [7]. - There is a recognition that existing approaches may only capture partial physical laws, indicating a need for representations of isolated objects and a priori laws of change across space and time to achieve a coherent world model [7]. Group 3: Definition and Ambiguity of World Models - By 2025, world models are positioned alongside LLMs, with companies like Google DeepMind, Meta, and Nvidia shifting focus from pure LLMs to world models, aiming for "Physical AI + superintelligence" due to stagnation in LLM advancements [8]. - The distinction between world models and existing generative AI lies in the former's goal to construct internal representations of environments that include physical, temporal, and spatial dimensions for planning and decision-making [9]. - The term "world model" has become ambiguous, referring to latent states within systems, game-like simulators for training agents, or any content pipeline capable of generating navigable 3D scenes [9]. - An analysis from Entropy Town in November 2025 categorized world models into three technical routes: interface, simulator, and cognitive framework, highlighting the ongoing ambiguity in the field [9].
安永企业家奖2025获奖企业家介绍专辑(四)
Sou Hu Cai Jing· 2025-12-12 07:49
"安永企业家奖"2025获奖名单正式公布,十二位来自中国内地和中国香港/澳门的杰出企业家获得了"安永企业家奖"2025殊荣。 让我们来认识一下获奖企业家。 安永企业家奖2025获奖者 科技业 黄伟博士是云知声创始人兼CEO,他毕业于中国科学技术大学,获信号与信息处理博士学位。作为国内最早一批从事人工智能语音语义相关研究的科研人 员,曾主导开发全球首款手机声纹认证系统,连续三年获美国国家标准技术署说话人识别评测的世界第一。2012年,黄伟博士洞察到人工智能语音语义技 术的商业化前景,创立云知声,并带领企业于2025年6月成功登陆香港交易所主板市场,成为"AGI第一股",也是全球首批实现大模型商业化的人工智能 企业,公司市值后续一度突破600亿港元。 黄伟博士深耕AI领域多年,是国内AI产业重要推动者。在他的带领下,云知声主要以包括大模型技术、智算平台、多模态交互技术、AI芯片、领域知识 图谱等的全栈式AI硬核技术为核心,并以成熟且领先的工程化能力实现了在医疗、家居、楼宇、教育、交通、汽车、政务、金融等十余个实体经济场景 下的AI应用落地,取得了骄人的发展成绩。 黄伟博士连续五批参与国家"科技创新2030"新一代 ...
别让米其林主厨削土豆,英伟达用“小脑指挥大脑”,重构AGI生产力
3 6 Ke· 2025-12-12 01:35
觉得大模型消耗的算力过大,英伟达推出的8B模型Orchestrator化身「拼好模」,通过组合工具降本增效,使用30%的预算,在HLE上拿下37.1%的成 绩。 最近,NVIDIA Research发现,只要经过适当微调,小模型已足以「指挥」大模型 英伟达研究团队的新模型Orchestrator仅有 80 亿参数(8B)的模型,不仅比以往的工具使用类AI智能体准确率更高、成本更低,还能在工具选择上精准对 齐用户的偏好。 在HLE基准测试中,Orchestrator斩获了37.1%的高分,一举超越了GPT-5(35.1%),同时在效率上提升了2.5倍。 在tau2-Bench和FRAMES测试中,Orchestrator同样以大幅优势领先 GPT-5,而其成本仅为后者的30%左右。 在多项指标上,Orchestrator均实现了性能与成本的最佳平衡,并能出色地泛化至未曾见过的工具中。 预印本链接:https://arxiv.org/abs/2511.21689 为什么「强模型+工具」还是不够好? 面对Humanity's Last Exam(HLE)这类超难综合推理考试,现在的大模型虽然「什么都懂一点」,但一到 ...
“连姥姥都问我,你知道DeepSeek吗?”
第一财经· 2025-12-12 01:11
Core Viewpoint - The emergence of DeepSeek has significantly impacted MiniMax and other large model companies, prompting introspection on their performance and strategic choices [5][6]. Group 1: Challenges and Reflections - MiniMax's founder, Yan Junjie, faced numerous challenges during the startup phase, including the bankruptcy of Silicon Valley Bank, which affected payroll [3]. - The team recognized that their performance was hindered by a lack of deep thinking and lowered expectations, contrasting with DeepSeek's unique insights and technical accumulation [6][8]. Group 2: Team Morale and Incentives - To boost team morale during tough times, Yan emphasized the importance of encouragement and financial incentives, stating that monetary rewards are effective [7]. - In September, MiniMax initiated a million-dollar stock option incentive program, offering varying amounts based on employee contributions, covering various roles within the company [7]. Group 3: Strategic Direction - MiniMax's approach involves a unique strategy of ToC (Technology of Communication) and international expansion, with their Talkie application gaining significant user traction overseas [8]. - The company experienced a period of indecision regarding whether to prioritize technology or product development, ultimately deciding on a technology-driven approach despite the associated risks [8][9]. Group 4: Market Position and Talent - The gap between domestic large model companies and top international models is narrowing, with Chinese companies achieving this with significantly lower investment [12]. - Yan highlighted the importance of local AI talent, noting that many key contributors to success in companies like DeepSeek and MiniMax are homegrown, often in their first jobs [12]. Group 5: Future Outlook - Yan remains optimistic about the future of AGI, noting that the number of companies in the large model space is decreasing, leading to a more concentrated market [13]. - The AI industry is not merely an extension of the internet; the core product in the large model era is the model itself, with blurred boundaries between roles in product management, development, and algorithms [14].
腾讯研究院AI速递 20251212
腾讯研究院· 2025-12-11 16:25
Group 1 - Meta is betting on the mysterious project Avocado, with the release originally planned for the end of 2025 now postponed to Q1 2026, utilizing distillation learning from Google Gemma, OpenAI gpt-oss, and Qwen models, potentially adopting a closed-source approach [1] - After the release of Llama 4 failed to attract enough developers and faced benchmark testing issues, Zuckerberg is rethinking the open-source strategy, establishing the MSL Super Intelligence Lab and bringing in AI executive Alexandr Wang with an investment of $14.3 billion [1] - MSL is laying off 600 employees, excluding the core TBD Lab team, while simultaneously announcing a $27 billion investment in the Hyperion data center [1] Group 2 - Adobe has announced the integration of Photoshop, Express, and Acrobat into ChatGPT, allowing users to enhance photos, design letters, and edit PDFs directly within the chat interface [2] - These tools are available for free within ChatGPT, although advanced features like Generative Fill are not included, aiming to showcase products to over 800 million weekly active users [2] - This move is part of OpenAI's initiative to incorporate more third-party applications into ChatGPT, with Spotify, Zillow, and Figma being among the first to join in October [2] Group 3 - Zhiyu has officially released the industrial-grade speech synthesis system GLM-TTS, achieving "3 seconds" voice replication and strong text comprehension capabilities with only 100,000 hours of training data [3] - The model employs a two-stage generation paradigm and integrates a four-dimensional regularization reward mechanism based on GRPO algorithm [3] - The model weights are open-sourced on Hugging Face and ModelScope, allowing users to experience and call APIs on platforms like Z.ai and Zhiyu Qingyan [3] Group 4 - SenseTime has launched the Seko 2.0 multi-episode creation feature, enabling a single person to complete an episode of a short drama in just 30 minutes, automating the entire process from script to final production [4] - The core advantage lies in maintaining consistency in the subject and scenes across episodes, with data collection costs reduced to only 10% of traditional remote operation solutions [4] - The platform integrates mainstream video models and is currently offering a limited-time promotion for its self-developed image generation model [4] Group 5 - Tencent's Yuanbao AI assistant has introduced a feature for summarizing unread messages in QQ groups, utilizing AI technology to distill chat records into clear and structured summary reports [5] - The functionality includes categorizing hot discussion topics, tracking specific mentions, and integrating group files with direct links to original messages [6] - Yuanbao can now be added as a QQ friend for one-on-one conversations, with support available on desktop, browser plugins, and mobile apps [6] Group 6 - Starcloud has launched the Starcloud-1 satellite equipped with the H100 chip, which boasts 100 times the computing power of previous space GPUs, successfully running Google Gemma and training the first space-based LLM [6] - The model was trained using Shakespearean texts and can respond in Renaissance language styles while performing real-time intelligence analysis [6] - Starcloud plans to build a 5GW orbital data center with solar panels, significantly reducing costs compared to ground data centers, with major players like SpaceX and Google already investing in space computing [6] Group 7 - Lingchu Intelligent has released the world's first embodied native human data collection solution, Psi-SynEngine, which includes a portable exoskeleton tactile glove data collection kit and a large-scale data pipeline [7] - The data acquisition cost is only 10% of traditional remote operation solutions, with positioning accuracy reaching sub-millimeter levels [7] - The company has also launched the Psi-SynNet-v0 large-scale real-world multimodal dataset, covering visual, linguistic, tactile, and motion data, with plans to expand from thousands to millions of hours of data [7] Group 8 - a16z predicts that by 2026, AI will not only be a tool for efficiency but will fundamentally reshape various industries, with agent-native infrastructure becoming essential [8] - The focus of consumer AI products is shifting from "helping me" to "connecting with me," with products that understand users' inner feelings showing better retention [8] - Most market opportunities in AI are expected to arise in traditional vertical industries rather than Silicon Valley, with video becoming an accessible simulation environment and CRM evolving into a foundational infrastructure [8] Group 9 - MiniMax's founder emphasizes that multimodal development is essential for AGI, with the company leading globally in language models, audio, and video sectors [9] - MiniMax-M2 ranks fifth globally among large language models and first in open-source, achieving low computing costs with a MoE architecture [9] - The core competitive advantage in the AI era is imagination rather than skills, with a call for local innovation and the cultivation of homegrown talent [10]