Workflow
世界模型
icon
Search documents
Alex Wang“没资格接替我”!Yann LeCun揭露Meta AI“内斗”真相,直言AGI是“彻头彻尾的胡扯”
AI前线· 2025-12-20 05:32
Core Viewpoint - Yann LeCun criticizes the current AI development path focused on scaling large language models, arguing it leads to a dead end and emphasizes the need for a different approach centered on understanding and predicting the world through "world models" [2][3]. Group 1: AI Development Path - LeCun believes the key limitation in AI progress is not reaching "human-level intelligence" but rather achieving "dog-level intelligence," which challenges the current evaluation systems focused on language capabilities [3]. - He is establishing a new company, AMI, to pursue a technology route that builds models capable of understanding and predicting the world, moving away from the mainstream focus on generating outputs at the pixel or text level [3][9]. - The current industry trend prioritizes computational power, data, and parameter scale, while LeCun aims to redefine the technical path to general AI by focusing on cognitive and perceptual fundamentals [3][9]. Group 2: Research and Open Science - LeCun emphasizes the importance of open research, stating that true research requires public dissemination of results to ensure rigorous methodologies and reliable outcomes [7][8]. - He argues that without allowing researchers to publish their work, the quality of research diminishes, leading to a focus on short-term impacts rather than meaningful advancements [7][8]. Group 3: World Models and Planning - AMI aims to develop products based on world models and planning technologies, asserting that current large language model architectures are inadequate for creating reliable intelligent systems [9][10]. - LeCun highlights that world models differ from large language models, as they are designed to handle high-dimensional, continuous, and noisy data, which LLMs struggle with [10][11]. - The core idea of world models is to learn an abstract representation space that filters out unpredictable details, allowing for more accurate predictions [11][12]. Group 4: Data and Learning - LeCun discusses the vast amount of data required to train effective large language models, noting that a typical model pre-training scale is around 30 trillion tokens, equating to approximately 100 trillion bytes of data [20]. - In contrast, video data, which is richer and more structured than text, offers greater learning value, as it allows for self-supervised learning due to its inherent redundancy [21][28]. Group 5: Future of AI and General Intelligence - LeCun expresses skepticism about the concept of "general intelligence," arguing it is a flawed notion based on human intelligence, which is highly specialized [33][34]. - He predicts that significant advancements in world models and planning capabilities could occur within the next 5 to 10 years, potentially leading to systems that approach "dog-level intelligence" [35][36]. - The most challenging aspect of AI development is achieving "dog-level intelligence," after which many core elements will be in place for further advancements [37]. Group 6: Safety and Ethical Considerations - LeCun acknowledges the concerns surrounding AI safety, advocating for a design approach that incorporates safety constraints from the outset rather than relying on post-hoc adjustments [43]. - He argues that AI systems should be built with inherent safety features, ensuring they cannot cause harm while optimizing for their objectives [43][44].
对话小马智行王皓俊:Robotaxi正进入1到1000的阶段
Hua Er Jie Jian Wen· 2025-12-20 05:31
Core Insights - The global autonomous driving industry is undergoing a paradigm shift, transitioning from experimental phases to tangible financial performance, with companies like Baidu and Pony.ai achieving operational profitability [2][4][11] - The competitive landscape for Robotaxi has evolved, focusing on profitability and operational efficiency as hardware costs decrease and AI reshapes operational rules [3][11] Commercialization Progress - Pony.ai's Robotaxi achieved unit economic model (UE) profitability in Guangzhou, indicating a successful transition from R&D to commercial viability [4][5] - The average daily revenue for Pony.ai's seventh-generation Robotaxi is approximately 299 RMB, with a target of 24 rides per day to ensure positive cash flow [4][5] - The company aims to scale its fleet to 1,000 vehicles by 2025, 3,000 by 2026, and 100,000 by 2030, integrating Robotaxi into daily life [2][11] Cost Management and Operational Efficiency - Significant cost reductions have been achieved, with the BOM cost of the seventh-generation vehicle dropping by 70% compared to the sixth generation [5][6] - The use of mass-produced components and optimized algorithms has enhanced operational efficiency, allowing for better performance with lower costs [5][6] - Insurance costs for Robotaxi are 50% lower than traditional taxis, reflecting the safety record of AI drivers [6] Industry Competition - The Robotaxi market is becoming increasingly competitive, with major players like Waymo and Tesla entering the fray, each adopting different strategies [8][10] - Waymo's recent funding round has pushed its valuation to nearly $100 billion, while Tesla is focusing on a low-cost, vision-based approach [8][10] - New entrants like XPeng and Hello are also planning to launch their own Robotaxi services, intensifying competition [9][10] Market Potential and Future Outlook - The Robotaxi market could reach $80 billion in major Chinese cities by 2030, with potential global market size reaching $3.94 trillion when including overseas markets [12] - As hardware costs decline, operational expenses will become a larger portion of the cost structure, emphasizing the importance of operational efficiency [12] - The industry is shifting from a focus on technology to one centered on operational capabilities and market presence [11][12] Strategic Shifts - Pony.ai is transitioning to a "light asset" model, partnering with vehicle manufacturers and service platforms to reduce capital expenditure [7][14] - The company is focusing on creating a value chain where it provides AI technology while others handle vehicle production and service distribution [7][14] - The emphasis is on building partnerships and leveraging local resources in international markets, particularly in the Middle East [6][18]
「一脑多形」圆桌:世界模型、空间智能在具身智能出现了哪些具体进展?丨GAIR 2025
雷峰网· 2025-12-20 04:07
Core Viewpoint - The article discusses the current state and future potential of embodied intelligence, focusing on the challenges and opportunities presented by world models and spatial intelligence in the field of robotics and AI [2][4][10]. Group 1: Development of Embodied Intelligence - The technology route for embodied intelligence is still in an exploratory phase, with no convergence yet, which is seen as a positive sign for innovation [4][3]. - There is a consensus among experts that the core issues of embodied intelligence, such as interaction and human-machine collaboration, should be addressed by academic institutions, while industries focus on practical applications [4][5]. - The integration of AI with physical entities is expected to lead to significant advancements in intelligence, but the field must avoid reverting to industrial automation without achieving generalized intelligence [4][5][30]. Group 2: World Models in Autonomous Driving - World models are currently being utilized by leading companies like Tesla to enhance data generation and improve decision-making processes through closed-loop testing [11][12]. - The concept of world models has gained traction in autonomous driving due to the simplicity of generating scenarios compared to robotics, with advancements in generative AI enabling the creation of realistic training samples [12][13]. - There is ongoing debate regarding the definition and application of world models in both autonomous driving and robotics, with differing opinions on the necessity of pixel-level reconstruction versus latent state representation [12][13][14]. Group 3: Spatial Intelligence in Robotics - Spatial intelligence is a critical aspect of robotics, with a focus on perception and understanding spatial relationships, which has evolved from traditional SLAM techniques to more learning-based approaches [20][21]. - The current challenges in spatial intelligence include the need for better data representation and understanding of complex spatial relationships, which are still underdeveloped in robotic systems [22][23]. - The integration of visual and semantic information is essential for enhancing robots' spatial capabilities, but the field is still in its early stages [22][23][24]. Group 4: Commercialization and Future Applications - The future of drone applications is expected to expand significantly, with potential uses in various sectors, but the timeline for widespread adoption remains uncertain [26][27]. - The gap between technological capabilities and market needs poses challenges for entrepreneurs, as there is often a mismatch between innovative ideas and practical industrial requirements [30][31]. - The shift towards learning-based control paradigms is anticipated to increase the applicability of drones and robots in real-world scenarios, moving beyond traditional automation [28][29].
世界模型工作正在呈现爆发式增长
自动驾驶之心· 2025-12-20 02:16
Core Viewpoint - The article discusses the distinction between world models and end-to-end models in autonomous driving, emphasizing that world models are a means to achieve end-to-end autonomous driving rather than a specific technology [2]. Group 1: World Model Overview - The article highlights the recent surge in publications related to world models, particularly in the context of closed-loop simulation, which is becoming a trend in the industry due to the high costs associated with corner cases [2]. - It introduces a new course focused on world models, covering various algorithms such as general world models, video generation, and OCC generation, with applications in Tesla's world model and the Marble project by Fei-Fei Li's team [2][5]. Group 2: Course Structure - The course consists of six chapters, starting with an introduction to world models and their relationship with end-to-end autonomous driving, followed by a discussion on the historical development and current applications of world models [5][6]. - The second chapter covers foundational knowledge related to world models, including scene representation and technologies like Transformer and BEV perception, which are crucial for understanding subsequent chapters [5][6]. Group 3: Advanced Topics - The third chapter focuses on general world models, discussing notable models such as Marble, Genie 3 from DeepMind, and the latest developments from Meta, including the VLA+ world model algorithm [6][7]. - The fourth chapter delves into video generation-based world models, presenting classic works and recent advancements in the field, including projects like GAIA-1 & GAIA-2 and OpenDWM [7][8]. - The fifth chapter addresses OCC generation methods, explaining their potential for trajectory planning and end-to-end implementation [8]. Group 4: Industry Application and Career Preparation - The sixth chapter provides insights into the practical applications of world models in the industry, discussing pain points and how to prepare for job interviews in this field [9]. - The course aims to equip participants with the skills to understand and implement world model technologies, preparing them for roles as world model algorithm engineers [10][13].
让人工智能“睁眼看世界” 走在国际科技变革最前沿 上海量子城市建设画卷正从复兴岛展开
Jie Fang Ri Bao· 2025-12-20 00:59
Core Insights - The launch of the Global Maker Island and the 2025 Shanghai Quantum City Annual Conference marks a significant step in building smart infrastructure on Fuxing Island, aiming for a standard of 100,000 intelligent sensing facilities per square kilometer [1] - The rapid evolution of next-generation artificial intelligence technologies is set to transform urban landscapes, with Fuxing Island positioned as a key player in this transformation [2] Group 1: Artificial Intelligence and Urban Development - The Shanghai Quantum City Time-Space Innovation Base will open in December 2024, focusing on building a "world model" for artificial intelligence, which is essential for capturing technological changes [3] - The city aims to enhance AI's understanding of the physical world by creating training environments, such as the first heterogeneous humanoid robot training ground and the issuance of operational licenses for smart connected vehicles [3][4] - The complexity of urban environments and AI's ability to interpret them are central to the mission of the Shanghai Quantum City [4] Group 2: Scientific and Technological Advancements - Leading scientists are increasingly focusing on "spatial intelligence" as the next frontier for AI, which will define the development direction for the next decade [5] - The Shanghai Quantum City has already achieved significant milestones, including the release of a time-space data sharing platform and a specialized corpus for planning and natural resources [6] Group 3: Talent and Innovation Ecosystem - The construction of the Shanghai Quantum City emphasizes the importance of talent investment to gain strategic advantages in the new technological revolution [8] - Fuxing Island is actively inviting global creators to participate in its innovation ecosystem, offering a low-cost entrepreneurial environment and support for startups [9] - Currently, 12 well-known incubators and 14 innovative startups have officially settled in Fuxing Island, indicating a growing entrepreneurial landscape [10]
前Meta首席AI科学家再创业,AI新公司估值直指30亿欧元
Hua Er Jie Jian Wen· 2025-12-19 14:27
Group 1 - Meta's Chief AI Scientist Yann LeCun is seeking €500 million in funding for his newly established AI company, which will value the company at approximately €3 billion before its official launch [1] - The new company, named Advanced Machine Intelligence Labs (AMI Labs), will focus on developing next-generation superintelligent AI systems, particularly "world models" that can simulate and understand the physical world [2] - AMI Labs' technology foundation is based on research led by LeCun during his time at Meta, aiming to create a new AI architecture capable of learning from text, video, and spatial data, with abilities for continuous memory, complex reasoning, and planning [2] Group 2 - Alexandre LeBrun, co-founder of French health tech startup Nabla, has been appointed as the CEO of AMI Labs, while Nabla will maintain a strategic research partnership with AMI Labs [3] - Meta is undergoing a significant strategic shift in its AI approach, with CEO Mark Zuckerberg aiming to compete directly with OpenAI and Google by moving away from long-term exploratory work initiated by LeCun [4] - Meta has recently laid off approximately 600 employees from its AI research team to reduce costs and accelerate the productization process, reflecting ongoing leadership changes within the company [4]
王晓刚和他的「世界模型」:一人管十狗,先让四足机器人上街干活丨36氪专访
36氪· 2025-12-19 10:31
Core Viewpoint - The article discusses the emergence of world models in AI, highlighting their significance in overcoming the limitations of previous VLA models and their potential applications in robotics and autonomous systems [4][9][22]. Group 1: World Model Development - The world model is a concept that addresses the inherent limitations of VLA models, which struggle to understand physical laws and require vast amounts of data for training [9][28]. - The introduction of the "Awakening" world model 3.0 allows robots to learn physical interactions and adapt to new environments, significantly reducing the dependency on specific scene data [8][10]. - The world model enables robots to transition from rote learning to understanding general principles, enhancing their ability to perform tasks across various scenarios [10][28]. Group 2: Practical Applications - The "Daxiao Robot" is utilizing the world model to deploy robotic dogs for urban management tasks, such as monitoring illegal parking and drone activity [6][7][12]. - The company plans to validate the world model's capabilities through real-world applications, starting with robotic dogs and expanding to more complex robotic forms in the future [16][56]. - The integration of the world model into robotic systems aims to create a closed-loop feedback mechanism, allowing for continuous improvement based on real-world performance [14][15][16]. Group 3: Commercial Strategy - The company intends to focus on B2B applications initially, targeting sectors like smart cities and urban management where autonomous capabilities are in high demand [58]. - Future plans include expanding into logistics and home environments, leveraging existing resources and partnerships to reduce entry costs [17][56]. - The strategy emphasizes collaboration with existing platform providers while also developing proprietary solutions to enhance product reliability and performance [50][52].
最近收到了很多同学关于自驾方向选择的咨询......
自动驾驶之心· 2025-12-19 09:25
Core Insights - The article discusses various advanced directions in autonomous driving research, emphasizing the importance of deep learning and traditional methods for different academic backgrounds [2][3]. Group 1: Research Directions - Key areas of focus include VLA, end-to-end learning, reinforcement learning, 3DGS, and world models, which are recommended for students in computer science and automation [2]. - For mechanical and vehicle engineering students, traditional methods like PnC and 3DGS are suggested due to their lower computational requirements and ease of entry [2]. Group 2: Paper Guidance Services - The article announces the launch of a paper guidance service that covers various topics such as end-to-end learning, multi-sensor fusion, and trajectory prediction [3][6]. - The service includes support for topic selection, full process guidance, and experimental assistance [6]. Group 3: Publication Success - The guidance service has a high acceptance rate for papers submitted to top conferences and journals, including CVPR, AAAI, and ICLR [7]. - The article highlights the range of publication venues, including CCF-A, CCF-B, and various SCI categories [10].
7.1万亿美元期权合约到期,美股“疯狂一日”即将爆发
3 6 Ke· 2025-12-19 02:45
Economic Indicators - The latest November core CPI increased by 2.6% year-on-year, marking a new low since 2021 and falling below expectations, indicating a trend of slowing inflation [1] - Kevin Hassett, a leading candidate for the Federal Reserve chair, praised the CPI report as "shockingly good," noting that wage growth is outpacing price increases and predicting substantial tax refunds for American taxpayers next year [3][5] Federal Reserve Outlook - Hassett suggested that the Federal Reserve has significant room to lower interest rates due to improving supply-side conditions that are expected to suppress inflation in the long term [5] - Following the CPI data release, market expectations for aggressive easing by the Fed in 2026 increased, particularly regarding the likelihood of rate cuts in March next year [5] Market Dynamics - The S&P 500 faced technical resistance while attempting to breach the 6800-point level, leading to significant volatility and a rapid sell-off triggered by quantitative funds and high-leverage long positions [7][9] - The current market is in a "negative Gamma" zone, where small retreats can lead to systemic sell-offs, emphasizing the need for the index to maintain above 6800 points for stability [9][16] Options Market - A record $7.1 trillion in nominal value of options contracts is set to expire on the upcoming "quadruple witching" day, which typically amplifies market volume and volatility [14] - The dominance of zero-day-to-expiration (0DTE) options is expected to disrupt true asset pricing, shifting the focus from inflation data to liquidity pressures [16] Meta's AI Strategy - Meta is developing two ambitious AI models, "Mango" for video and "Avocado" for language, with plans for a 2026 release, reflecting a strategic pivot amid a $70 billion capital expenditure [17][19] - The company has shifted its approach from open-source to potentially proprietary models, aiming to establish a competitive edge in AI technology [20][22] - Meta's dual development of Mango and Avocado is seen as a defensive strategy to enhance user engagement and maintain its competitive position against rivals like Google and OpenAI [21][22]
LeCun创业首轮估值247亿!Alexandre当CEO
量子位· 2025-12-19 01:01
这家名为 Advanced Machine Intelligence Labs (AMI Labs)的新公司,计划于明年一月正式亮相,目标估值 30亿欧元 (约247亿人 民币)。 克雷西 发自 凹非寺 量子位 | 公众号 QbitAI LeCun在Meta的Last Day还没来,新公司又被曝出更多细节。 前脚LeCun本人在播客当中宣布了新公司名称,现在融资和估值目标就被《金融时报》曝光了。 AMI Labs的研究方向,就是LeCun一直主推的"世界模型",而且将走开源路线,老东家Meta也将与其保持合作。 另外,曝料也透露了AMI Labs的 CEO人选并非LeCun本人 ,而是他的一位老部下。 LeCun不当CEO 新公司AMI Labs定于2026年1月在巴黎正式启动,在Meta逐渐转向封闭生态的背景下,LeCun选择了他在学术界一贯坚持的开源路线。 而且在技术层面,AMI Labs选择了比主流的LLM更具挑战性的道路—— 死磕"世界模型" 。 因为在LeCun看来,基于自回归机制的LLM存在根本性的逻辑缺陷,它们只是在统计概率上预测下一个字符,并不真正理解物理世界的运行规 律。 为此,新公司将通过 ...