Workflow
世界模型
icon
Search documents
赵何娟独家对话李飞飞:“我信仰的是人类,不是AI”
Xin Lang Cai Jing· 2025-12-22 05:27
Core Insights - The article discusses the advancements in AI, particularly in the realm of "world models" and spatial intelligence, led by Professor Fei-Fei Li and her company World Labs, which is expected to see significant application-level breakthroughs within two years [2][5]. Group 1: AI Developments - Professor Fei-Fei Li's World Labs has launched the first commercial "world model" called Marble, which allows for the generation of sustainable, navigable, and geometrically consistent 3D worlds from images or text prompts [4][5]. - The concept of "world models" is becoming a competitive frontier in the industry, with companies like Google DeepMind releasing models that emphasize interactive environments and spatial understanding [5][6]. - The transition from "language generation" to "world generation" is anticipated to accelerate, with spatial intelligence expected to experience an application-level explosion in the next two years [5][6]. Group 2: Historical Context and Impact - The article reflects on the historical significance of the ImageNet project, which was pivotal in demonstrating the importance of large datasets in AI development, and how it laid the groundwork for advancements in generative AI [2][3][29]. - Li's leadership in the ImageNet initiative has been recognized as a milestone in the evolution of AI, showcasing the critical role of data alongside algorithms in enhancing AI capabilities [3][29]. Group 3: Challenges and Future Directions - The development of spatial intelligence faces a "data bottleneck," which poses challenges for the advancement of world models, as the collection of spatial data is inherently more complex than that of visual or textual data [32][37]. - Li emphasizes the need for patience in the AI field, acknowledging that while expectations for rapid advancements are high, meaningful progress often takes time [6][20]. - The article suggests that the journey towards achieving Artificial General Intelligence (AGI) is incremental, with spatial intelligence being a crucial component in this ongoing quest [25][26].
旧金山“黑夜迷途”:一次停电暴露自动驾驶单车智能的当下困境
Tai Mei Ti A P P· 2025-12-22 04:38
文 | 山自 2025年12月20日,旧金山遭遇一场罕见的大规模停电。太平洋煤气与电力公司(PG&E)一座变电站突 发火灾,导致全市约12.5万用户断电,覆盖西区、里士满、海特-阿什伯里、中国城等近三分之一区 域。交通信号灯集体熄灭,Muni公交停运,市长丹尼尔·卢瑞紧急呼吁市民非必要不出行。 2025年12月20日下午4:50的PG&E停电图 然而,在这场城市应急事件中,最引人注目的并非人类司机的混乱应对,而是——Waymo自动驾驶车 队的集体"宕机"。 有分析指出,伴随停电的蜂窝网络波动或实时交通数据中断,可能是另一重打击。Waymo等公司依赖 远程协助(Tele-assist)处理边缘场景。一旦通信中断,车辆不仅"看不见",还成了"信息孤岛",无法获 取后台指令或路径重规划。 社交媒体上,视频疯传:多辆白色Waymo车辆在漆黑潮湿的十字路口一动不动,红色尾灯在夜色中闪 烁,后方排起长龙,人类司机或鸣笛催促,或无奈绕行。有用户调侃:"停电摧毁了Waymos RIP。"更 有观察者尖锐指出:"看起来它们根本没接受过停电训练。" 停电期间,一辆Waymo被困在路上 次日,Waymo官方承认已暂停旧金山服务,并 ...
智能体如何学会「想象」?深度解析世界模型嵌入具身系统的三大技术范式
机器之心· 2025-12-22 04:23
Core Insights - The article discusses the integration of world models into embodied intelligent systems, emphasizing the shift from reactive loops to predictive capabilities [2][10] - It highlights the importance of world models in enhancing sample efficiency, long-term reasoning, safety, and proactive planning in embodied agents [11][12] Summary by Sections Introduction to World Models - Embodied intelligent systems traditionally relied on a "perception-action" loop, lacking the ability to predict future states [2] - The introduction of world models allows agents to "imagine" future scenarios, enhancing their operational capabilities [10] Research Overview - A comprehensive survey from a research team involving multiple universities presents a framework for integrating world models into embodied systems [5][7] - The paper categorizes existing research into three paradigms based on architectural integration [5][14] Paradigm Classification - The relationship between world models (WM) and policy models (PM) is described as a "coupling strength spectrum," ranging from weak to strong dependencies [15] - Three categories are identified: Modular, Sequential, and Unified architectures, each with distinct characteristics [15][16] Modular Architecture - In this architecture, WM and PM operate as independent modules with weak coupling, focusing on causal relationships between actions and states [20] - The world model acts as an internal simulator, allowing agents to predict outcomes based on potential actions [20] Sequential Architecture - This architecture involves a two-stage process where WM predicts future states, and PM executes actions based on those predictions [21] - The world model generates a valuable goal, simplifying complex long-term tasks into manageable sub-problems [22][23] Unified Architecture - The unified architecture integrates WM and PM into a single end-to-end network, allowing for joint training and optimization [24][25] - This configuration enables the agent to anticipate future states and produce appropriate actions without explicitly separating simulation and decision-making [25] Future Directions - The article outlines potential research directions, including the representation space of world models, structured intent generation, and the balance between interpretability and optimality [27][28][29] - It emphasizes the need for effective alignment mechanisms to ensure performance while exploring unified world-policy model paradigms [29]
速递|Yann LeCun(杨立坤)新公司AMI Labs聚焦“世界模型”,寻求超50亿美元估值融资
Z Potentials· 2025-12-22 03:40
Core Insights - Renowned AI scientist LeCun has confirmed the establishment of a new startup named "Advanced Machine Intelligence" (AMI), with LeCun serving as Executive Chairman and Alex LeBrun as CEO [1][2] - AMI is reportedly planning to raise €500 million (approximately $586 million) at a valuation of €3 billion (around $3.5 billion) even before its official launch [2] - AMI is developing a world model AI, which aims to address the structural hallucination issues of large language models (LLMs) by understanding environments and simulating causal relationships [3] Company Overview - LeCun, a professor at NYU and former VP and Chief AI Scientist at Meta, is recognized for his contributions to reinforcement learning and is a recipient of the prestigious A.M. Turing Award [3] - The world model AI being developed by AMI is seen as a potential solution to the inherent uncertainties of LLMs, which can lead to misinformation [3] Funding and Market Position - The valuation and funding goals of AMI are considered ambitious, especially in comparison to other AI startups, such as "Mind Machine Lab," which was valued at $12 billion during its seed round [2][3] - Nabla, the company from which LeBrun is transitioning, has raised a total of $120 million, including a $70 million Series C round completed in June [6][7] Leadership Transition - LeBrun is transitioning from CEO of Nabla to CEO of AMI, while Nabla's COO, Delphine Grol, will temporarily manage Nabla during this transition [4][6] - LeBrun has indicated that Nabla is experiencing significant growth, with annual recurring revenue expected to exceed $1 billion [7]
研究生实验到什么程度可以写小论文?
自动驾驶之心· 2025-12-22 03:23
Core Viewpoint - The article emphasizes the importance of timely submission of academic papers, particularly for graduate students, highlighting that a complete story in research is more valuable than novelty [1]. Group 1: Academic Guidance Services - The company offers a paper guidance service aimed at efficiently producing research results within a limited timeframe, helping students avoid common pitfalls in self-writing [2]. - The guidance covers various advanced topics such as reinforcement learning, 3D object detection, and multi-sensor fusion, among others, providing tailored advice based on individual research directions [3]. - The service is designed to assist students who face challenges such as unclear direction, difficulty in code reproduction, and lack of systematic research training [5]. Group 2: Instructor Qualifications - All instructors associated with the service are from globally recognized universities ranked in the top 100 by QS, with multiple publications in A-level conferences and extensive project experience [6]. Group 3: Comprehensive Academic Support - The company provides a wide range of academic support services, including assistance with journal papers, conference papers, and thesis projects, ensuring a comprehensive approach to academic success [8]. - The service is results-oriented, offering continuous support until the paper is submitted, with a focus on enhancing coding skills alongside research guidance [8]. Group 4: FAQs and Additional Information - The company assures that even students with no prior experience can publish papers by following structured courses, with the potential to produce a small paper within six months [11]. - Outstanding students may receive recommendation letters from prestigious institutions and opportunities for internships in leading companies, indicating that publishing papers is just the beginning of their academic journey [11]. - Pricing for the services varies based on the publication target, with detailed consultations provided to tailor support to individual needs [11].
回望2025·实物见变迁丨车轮上的新体验——2025年汽车“智变”里的科技跃迁
Xin Hua She· 2025-12-22 01:37
Core Insights - The article discusses the rapid adoption of intelligent driving technologies in the Chinese automotive industry, highlighting the shift from traditional driving to smart driving experiences by 2025 [1][2]. Group 1: Market Trends - By the third quarter of 2025, new passenger cars equipped with Level 2 (L2) driving assistance features saw a year-on-year sales increase of 21.2%, with a penetration rate of 64%, indicating that over 6 out of every 10 new cars sold have basic smart driving capabilities [1]. - The focus of consumers is shifting from single highway scenarios to complex urban environments, with a growing preference for driving assistance systems that can handle city traffic and intersections [2]. Group 2: Technological Advancements - Continuous technological breakthroughs and rapidly decreasing costs are driving the smart driving revolution, with hardware costs halving every two years and user experience expected to improve tenfold in the same period [3]. - The Chinese smart driving market is at a critical turning point in 2025, transitioning from "technology validation" to "scene implementation," with L2 features becoming standard across all vehicle models [3]. Group 3: Industry Dynamics - The market is experiencing intense competition, leading to a significant industry reshuffle where only companies with technical strength and mass production experience will survive [4]. - The focus of market competition is shifting towards user experience, cost control, and product ecosystem, with a predicted market structure that will be characterized by significant stratification and specialization [5].
李艳:透过“AI泡沫”之争,看何为历史必然
Huan Qiu Wang· 2025-12-21 23:02
Group 1: Core Perspectives on AI Bubble Debate - The debate around the "AI bubble" is characterized by contrasting views, with some investors concerned about inflated valuations and others optimistic about long-term technological advancements [1][2][3] - The "bubble theorists" argue that there is a significant disconnect between capital market valuations and actual performance, citing a capital expenditure to revenue ratio of 6:1 in the US AI industry, which is higher than historical bubbles [1] - Conversely, "anti-bubble theorists" maintain that the strategic investments by major nations and the expanding application of AI from consumer to industrial sectors indicate a stable and long-term growth trajectory for the industry [2][3] Group 2: Structural Imbalances in AI Application - There is a structural imbalance in the AI industry, with upstream hardware remaining robust while mid-tier model developers struggle, and many downstream applications failing to achieve profitability [4] - Approximately 80% of companies deploying AI have not seen net profit increases, leading to divergent views on the industry's future profitability [4] - Optimists believe that current investments are essential for nurturing the industry, while pessimists fear that this imbalance could lead to financial crises reminiscent of past bubbles [4] Group 3: Policy and Regulatory Landscape - AI governance is still in an exploratory phase, with significant differences in regulatory approaches across countries, affecting the stability of expectations for multinational companies [4] - The US has pursued a deregulation strategy, while the EU emphasizes risk-based regulation, and China focuses on balancing development with security [4] Group 4: Understanding the AI Bubble - Traditional financial theories define bubbles as deviations of asset prices from intrinsic values, but the intrinsic value of AI is challenging to measure using conventional methods [5][6] - The emergence of an AI bubble may not necessarily be negative, as it represents capital's vote on future potential, providing necessary funding for technological development [5][6] - The core of the AI bubble debate involves a complex interplay between technological change, capital logic, short-term gains, and long-term value [6]
LeCun离职前的吐槽太猛了
量子位· 2025-12-21 05:45
Core Viewpoint - LeCun expresses skepticism about the potential of large language models (LLMs) to achieve artificial general intelligence (AGI), arguing that the path to superintelligence through LLMs is fundamentally flawed [2][78]. Group 1: Departure from Meta - LeCun is leaving Meta after nearly 12 years, criticizing the company's increasingly closed approach to research and its focus on short-term projects [3][11][26]. - He plans to establish a new company named Advanced Machine Intelligence (AMI), which will prioritize open research and focus on world models [10][19]. Group 2: World Models vs. LLMs - LeCun believes that world models, which handle high-dimensional and continuous data, are fundamentally different from LLMs, which excel at discrete text data [28][29]. - He argues that relying solely on text data will never allow AI to reach human intelligence levels, as the complexity of real-world data is far greater than that of text [31][32]. Group 3: Research Philosophy - LeCun emphasizes the importance of open research and publication, stating that without sharing results, research lacks validity [15][17]. - He critiques Meta's shift towards short-term projects, suggesting that true breakthroughs require long-term, open-ended research [18][26]. Group 4: Future of AI - LeCun envisions that the development of world models and planning capabilities could lead to significant advancements in AI, but achieving human-level intelligence will require substantial foundational work and theoretical innovation [84][85]. - He asserts that the most challenging aspect of AI development is not reaching human intelligence but rather achieving the intelligence level of dogs, as this requires a deep understanding of foundational theories [88][89]. Group 5: Personal Mission - At 65, LeCun remains committed to enhancing human intelligence, viewing it as the most scarce resource and a key driver for societal progress [92][94]. - He reflects on his career, expressing a desire to continue contributing to the field and emphasizing the importance of open collaboration in scientific advancement [103].
王晓刚和他的“世界模型”:一人管十狗,先让四足机器人上街干活|智能涌现专访
3 6 Ke· 2025-12-21 04:38
Core Insights - The article discusses the advancements in robotics technology, particularly focusing on the launch of the "A1 embodied super brain module" and the "KAIWU" world model 3.0 by the company, which enhances the capabilities of robotic dogs to perform various tasks autonomously [2][4][6]. Group 1: Technological Advancements - The A1 module allows robotic dogs to gain "spatial intelligence" and "autonomous decision-making" capabilities, transforming them from simple machines into intelligent entities capable of complex tasks [2][4]. - The KAIWU world model 3.0 establishes the physical laws of the world within AI models, enabling robots to learn tasks more efficiently and adapt to new environments [3][4][18]. - The world model addresses the limitations of previous VLA models, which struggled with understanding physical laws and required vast amounts of data for training [3][17]. Group 2: Commercialization Strategy - The company plans to initially deploy robotic dogs for urban management tasks, such as traffic monitoring and law enforcement, in collaboration with local authorities [1][8]. - The commercialization roadmap includes expanding from four-legged robots to wheeled dual-arm robots for logistics, and eventually to bipedal humanoid robots for more complex household tasks [8][40]. - The company aims to leverage its existing resources and partnerships to accelerate market entry and reduce costs in various applications, including security and inspection [9][38]. Group 3: Data Collection and Model Validation - The world model's effectiveness relies on a closed-loop validation process, where real-world scenarios are used to test and refine the model's capabilities [7][20]. - Data collection focuses on human interactions with the physical world, allowing for scalable data acquisition that can be applied across different robotic platforms [27][29]. - The company emphasizes the importance of integrating the world model with real-world applications to build trust and demonstrate its utility [7][25].
Alex Wang“没资格接替我”!Yann LeCun揭露Meta AI“内斗”真相,直言AGI是“彻头彻尾的胡扯”
AI前线· 2025-12-20 05:32
Core Viewpoint - Yann LeCun criticizes the current AI development path focused on scaling large language models, arguing it leads to a dead end and emphasizes the need for a different approach centered on understanding and predicting the world through "world models" [2][3]. Group 1: AI Development Path - LeCun believes the key limitation in AI progress is not reaching "human-level intelligence" but rather achieving "dog-level intelligence," which challenges the current evaluation systems focused on language capabilities [3]. - He is establishing a new company, AMI, to pursue a technology route that builds models capable of understanding and predicting the world, moving away from the mainstream focus on generating outputs at the pixel or text level [3][9]. - The current industry trend prioritizes computational power, data, and parameter scale, while LeCun aims to redefine the technical path to general AI by focusing on cognitive and perceptual fundamentals [3][9]. Group 2: Research and Open Science - LeCun emphasizes the importance of open research, stating that true research requires public dissemination of results to ensure rigorous methodologies and reliable outcomes [7][8]. - He argues that without allowing researchers to publish their work, the quality of research diminishes, leading to a focus on short-term impacts rather than meaningful advancements [7][8]. Group 3: World Models and Planning - AMI aims to develop products based on world models and planning technologies, asserting that current large language model architectures are inadequate for creating reliable intelligent systems [9][10]. - LeCun highlights that world models differ from large language models, as they are designed to handle high-dimensional, continuous, and noisy data, which LLMs struggle with [10][11]. - The core idea of world models is to learn an abstract representation space that filters out unpredictable details, allowing for more accurate predictions [11][12]. Group 4: Data and Learning - LeCun discusses the vast amount of data required to train effective large language models, noting that a typical model pre-training scale is around 30 trillion tokens, equating to approximately 100 trillion bytes of data [20]. - In contrast, video data, which is richer and more structured than text, offers greater learning value, as it allows for self-supervised learning due to its inherent redundancy [21][28]. Group 5: Future of AI and General Intelligence - LeCun expresses skepticism about the concept of "general intelligence," arguing it is a flawed notion based on human intelligence, which is highly specialized [33][34]. - He predicts that significant advancements in world models and planning capabilities could occur within the next 5 to 10 years, potentially leading to systems that approach "dog-level intelligence" [35][36]. - The most challenging aspect of AI development is achieving "dog-level intelligence," after which many core elements will be in place for further advancements [37]. Group 6: Safety and Ethical Considerations - LeCun acknowledges the concerns surrounding AI safety, advocating for a design approach that incorporates safety constraints from the outset rather than relying on post-hoc adjustments [43]. - He argues that AI systems should be built with inherent safety features, ensuring they cannot cause harm while optimizing for their objectives [43][44].