Workflow
世界模型
icon
Search documents
端到端VLA剩下的论文窗口期没多久了......
自动驾驶之心· 2026-01-12 09:20
Core Viewpoint - The article emphasizes the importance of deep learning and emerging technologies in the fields of automation and computer science, suggesting that students should focus on these areas to remain competitive in the job market [2]. Group 1: Recommended Learning Paths - For students in automation and computer science, deep learning, VLA, end-to-end systems, and world models are highlighted as promising areas with significant potential for research and career development [2]. - Mechanical and vehicle engineering students are advised to start with traditional PnC and 3DGS, which are easier to grasp and require lower computational power [2]. Group 2: Research Guidance Services - The article announces the launch of a paper guidance service that covers various advanced topics such as end-to-end systems, VLA, world models, reinforcement learning, and more [3]. - The service includes support for paper topic selection, full process guidance, experimental guidance, and doctoral application assistance [6][9]. Group 3: High Acceptance Rates - The guidance service boasts a high acceptance rate for papers, with several already published in top conferences and journals such as CVPR, AAAI, and ICLR [7]. - Different pricing structures are available based on the level of the paper, indicating a tailored approach to support [7].
高德扫街榜重磅升级:全球首推飞行实景探店
Xin Lang Cai Jing· 2026-01-11 17:16
Core Insights - Gaode's Street Ranking has announced three major upgrades after 100 days of launch, including the world's first "Flying Street View," a dynamic service ranking based on seasons and locations, and the introduction of friend relationships and personal ranking features [2] User Growth and Engagement - The user base of Gaode's Street Ranking has surpassed 660 million, with the Gaode app adding 46 million monthly active users in a single month, bringing the total to 996 million [2] - Since the launch of the Street Ranking, 860,000 new merchants have joined Gaode, with merchant order volume increasing by over 330% and merchant revenue rising by over 270% [2] Technological Innovation - The "Flying Street View" utilizes a world model for the first time in the lifestyle service industry, setting a global precedent [2] - Gaode's self-developed world model ranks first in the international benchmark WorldScore, receiving recognition from top AI academic circles, with related papers accepted at major AI conferences [2]
颠覆测绘界!游戏极客改写地图史,谷歌阿里把地图变数字孪生
Sou Hu Cai Jing· 2026-01-11 14:55
Core Insights - The evolution of maps from static paper versions to dynamic digital representations has transformed how people interact with their environment, making maps a vital part of daily life [1][3] Group 1: Historical Development - Maps were once expensive and slow to update, creating a disconnect with everyday users, until the advent of digital maps [3] - The founding of Keyhole in 2001 by a team skilled in 3D graphics and gaming marked a significant turning point, as they aimed to create an interactive digital globe [5] - NVIDIA's founder, Jensen Huang, recognized the potential of Keyhole's technology, leading to an investment that shifted the focus from geographic accuracy to computational power [5][8] Group 2: Google’s Acquisition and Innovation - Google acquired Keyhole, rebranding it as Google Earth, and recognized maps as the ultimate gateway to the physical world [8] - Google pushed the boundaries of web technology to create seamless map experiences, leading to the development of Google Street View [8][10] - This transformation elevated maps from mere navigation tools to comprehensive digital archives of human civilization [10] Group 3: Adaptation in China - In China, the rapid urban development and demand for real-time services forced map applications to evolve into essential urban infrastructure [10][12] - After Alibaba's acquisition, Amap (Gaode) transitioned from a navigation tool to a core component of ride-hailing and food delivery services [10][12] - Amap's introduction of the "world model" concept signifies a shift towards understanding urban dynamics, moving from static representation to predictive capabilities [12][13] Group 4: Future Implications - The ability to accurately reconstruct dynamic urban environments positions companies to influence the operational rules of the real world [13] - The ongoing evolution of mapping technology suggests that the disruption initiated by tech innovators is far from over [13]
锦秋被投企业Manifold AI流形空间完成超亿元天使+轮融资,国产世界模型让机器人大脑超进化|Jinqiu Spotlight
锦秋集· 2026-01-10 06:13
Core Insights - Manifold AI has completed over 100 million yuan in angel+ round financing, with Jinqiu Capital continuing to invest. The funding will be used for the iteration of its world model and the application of embodied intelligence [4] - The company has developed a universal spatial world model called WorldScape, which matches the quality and real-time capabilities of leading global models [6] - Manifold AI is the first team globally to deploy a comprehensive outdoor, indoor, and aerial embodied world model, significantly enhancing data efficiency and model performance [9] Financing and Investment - The latest funding round was led by Junlian Capital, with participation from Meihua Venture Capital, Huawei Hubble, and existing investors including Inno Fund and Jinqiu Capital [4] - Manifold AI has raised several hundred million yuan in total funding over the past six months [4] Technological Advancements - WorldScape enables single-image generation of interactive spaces, providing a foundation for physical AI applications [8] - The company utilizes a vast amount of physical video data for pre-training, enhancing WorldScape's operational interaction capabilities [8] - Manifold AI's approach replaces traditional VLM models with its world model, resulting in superior performance in real-world applications [10] Future Prospects - The integration of NVIDIA Jetson Thor for deploying embodied world models is a significant step towards scaling operations [14] - The involvement of Huawei Hubble is expected to facilitate the integration of domestic chips and robotic brains, laying the groundwork for large-scale implementation [14]
“机器人一次性卖完太亏!”真机智能刘智勇:今年中国本体厂商将大淘汰,拼的是世界模型?
AI前线· 2026-01-10 05:57
Core Insights - The article discusses advancements in embodied intelligence, particularly focusing on Visual Language Navigation (VLN) technology and its implications for the robotics industry by 2025 [2][4][16]. Group 1: Technological Advancements - VLN technology has emerged as a significant breakthrough, allowing robots to navigate without pre-built maps, thus enabling zero-shot generalization in new environments [4][5]. - The shift from SLAM (Simultaneous Localization and Mapping) to VLN represents a paradigm change, enhancing semantic understanding and adaptability in dynamic environments [8][12]. - World models are recognized as crucial for improving long-term planning and dynamic adaptation, although they currently face challenges related to their black-box nature [7][12]. Group 2: Industry Trends - By 2026, it is anticipated that the number of core robotics companies in China will shrink to 5 to 8, driven by a focus on profitability in specific scenarios rather than relying on extensive after-sales support [16][17]. - The competition landscape will evolve, with a shift from single-point technological advancements to overall system efficiency becoming more critical [17]. - The article highlights the potential for new business models, such as combining hardware sales with annual service fees, to create sustainable revenue streams [15]. Group 3: Challenges and Opportunities - The primary bottlenecks for large-scale deployment of embodied intelligence include high costs of data collection and insufficient scene coverage in existing datasets [9][10]. - Hardware limitations, particularly in tactile feedback and durability, pose significant challenges for the practical application of robotics in complex environments [11][12]. - The focus for companies like Zhenji Intelligent is on achieving door-to-door delivery capabilities without pre-deployment, which could significantly reduce deployment costs [13][14].
何小鹏为高德“飞行街景”上线点赞:将持续探索物理世界与 AI 深度耦合的更多可能
Xin Lang Cai Jing· 2026-01-10 04:59
Core Insights - The chairman and CEO of XPeng Motors, He Xiaopeng, praised the launch of Gaode's "Flying Street View" on Weibo, highlighting its potential to allow small businesses to create "flying" storefronts using mobile phones [1] - This development is seen as not just an upgrade in experience but also a significant breakthrough in AI's ability to understand and replicate the physical world [1] - The company believes that world models will become the core technological foundation driving the next generation of vehicles, robots, and flying cars [1] - XPeng Motors is committed to deepening its exploration of physical AI and the integration of the physical world with AI [1] - The company looks forward to collaborating with Gaode Map and other industry players to further advance this initiative [1]
聊一聊AI硬件和软件
傅里叶的猫· 2026-01-09 15:58
Group 1: AI Hardware Market - The recent performance of AI hardware is not strong, but the US stock market's hardware sector showed some resilience [1] - The memory shortage is exaggerated; a report from Macquarie suggests that the new DRAM capacity in the next two years can only support about 15GW of AI data center construction, which may delay global AI expansion plans [3] - A different perspective from a memory industry expert indicates that the capacity could support 20GW and 33GW this year and next year, respectively [5] - The global data center installation capacity is projected to reach 17.4GW by 2025, with an expected increase to 30.2GW this year [5] - Due to memory constraints, the growth of AI data centers (AIDC) will not be as rapid as anticipated, contributing to the recent decline in hardware market sentiment [7] Group 2: AI Software and Applications - The AI software and application market is exceeding many expectations, with a positive outlook for AI applications this year [8] - The government is intensifying support for AI policies, with initiatives in various sectors like healthcare, education, and manufacturing, aiming for quantifiable goals by 2026 [9] - Major tech companies are competing for AI traffic entry points and ecosystem development, with strategies focusing on both consumer (C-end) and business (B-end) markets [10][11] - For the C-end, companies are enhancing user engagement and monetization capabilities, while for the B-end, they are driving cloud revenue through developer ecosystems [12] - The competition has extended to physical scenarios, with companies like Waymo and Tesla accelerating their efforts in ROBOTAXI [13] - Key technological advancements in AI models are expected to focus on world models, native multimodality, and self-evolving agents, with significant breakthroughs anticipated by 2026 [14][15] - The core competitiveness of AI application companies lies in their ability to integrate technology quickly and effectively into specific scenarios, achieving commercial viability [15]
CES观察|能跑跳、能干活、能签单,中国人形机器人站上C位
Bei Ke Cai Jing· 2026-01-09 09:35
Core Insights - The CES 2026 showcased a significant dominance of Chinese companies in the robotics sector, with 149 out of 598 exhibitors being Chinese, accounting for nearly one-quarter of the total [1] - In the humanoid robot segment, 21 out of 38 exhibitors were from China, representing over half of the participants [1] - The event highlighted advancements in various applications of humanoid robots, including industrial, commercial, and home companionship [2][8] Group 1: Company Highlights - Companies like Yushu Technology and Zhiyuan Robotics demonstrated their capabilities in motion control and application scenarios, showcasing robots capable of dance and combat [4] - The Beijing Humanoid Robot Innovation Center made its debut, emphasizing the importance of showcasing the practical capabilities of humanoid robots to enhance international influence [8] - The company Lingqiao Intelligent presented its high-performance dexterous hand, significantly reducing costs to make it more accessible for research institutions and startups [10] Group 2: Technological Advancements - The event featured robots that integrated advanced AI capabilities, such as language interaction and autonomous sorting, demonstrating their practical applications in real-world scenarios [6][7] - Breakthroughs in tactile technology were showcased, with companies like Pasini Sensory Technology presenting advanced multi-dimensional tactile sensors [11] - The development of world models as data engines in embodied intelligence was highlighted, indicating a shift towards more sophisticated evaluation and reinforcement learning environments [11] Group 3: Market Expansion and Sales - The CES 2026 served as a critical platform for Chinese robotics companies to expand their global market presence, with several companies reporting immediate sales during the event [8] - Companies like Songyan Power are shifting their focus from product display to commercial implementation, targeting key regions for market expansion [8] - The event underscored the importance of a well-defined and efficient industrial ecosystem, indicating a maturation of the robotics industry [9][12]
马斯克diss英伟达自动驾驶:再等五六年
Sou Hu Cai Jing· 2026-01-09 08:00
Core Viewpoint - The competition between Tesla and Nvidia is intensifying, with both companies aiming to dominate the autonomous driving market, leveraging their unique strengths and strategies [1][5][22]. Group 1: Company Strategies - Nvidia's Alpamayo platform aims to reshape the autonomous driving development ecosystem by providing a framework for AI reasoning, integrating visual, language, and action models [3][7][11]. - Tesla's approach relies on extensive real-world driving data, claiming that achieving safe, unsupervised autonomous driving requires approximately 100 billion miles of training data, which Tesla is already accumulating at a rapid pace [16][18]. - Nvidia's business model focuses on empowering automotive companies by offering a "teacher model" rather than directly selling autonomous driving solutions, allowing companies to create tailored models using their own data [11][26]. Group 2: Competitive Landscape - Tesla asserts that traditional automakers will take years to integrate AI and camera systems into their designs, suggesting that Nvidia's collaboration with these companies will not pose a significant threat to Tesla in the near term [14][15]. - The competition is not just about technology but also about data ownership and ecosystem control, with Tesla's data monopoly being a significant advantage over Nvidia's more open platform [24][26]. - The battle is evolving from a focus on individual vehicle intelligence to a broader competition involving data ecosystems, development paradigms, and industry alliances [26][27]. Group 3: Market Dynamics - The automotive industry's shift towards intelligent systems is characterized by a multi-dimensional competition, where both Tesla and Nvidia are vying for leadership in different aspects of autonomous driving technology [27]. - The emergence of strong competitors from China, with robust engineering backgrounds and market scales, adds another layer of complexity to the competition between Tesla and Nvidia [26].
最前线|吉利发布全域AI2.0架构和世界行为模型,“1-2周可迭代一次”
3 6 Ke· 2026-01-09 07:34
Core Viewpoint - The automotive industry is reaching a consensus on the use of large models for intelligent assisted driving, with Geely launching its World Action Model (WAM) as part of its AI technology upgrade to version 2.0, enhancing its assisted driving system named G-ASD [1][3][6] Group 1: Technology Development - Geely's WAM is an enhanced "world model" that incorporates self-reflection and evolution capabilities, allowing for a closed-loop process from understanding to planning, simulation, judgment, and correction [3][5] - The WAM model integrates safety data from Volvo and various parameters from the vehicle, creating a unified "vehicle universal brain" that facilitates cross-domain integration [3][5] - The G-ASD system has improved capabilities in driving, safety, and parking, with features such as one-click NZP for roadside parking and automatic emergency steering [4][6] Group 2: Product Implementation - The G-ASD system covers L2, L3 conditional automated driving, and L4 automated driving, with the latest version G-ASD3 being rolled out via OTA updates to various models from Zeekr and Lynk & Co [4][6][7] - Geely's strategy includes a unified approach to its assisted driving systems, enhancing AI training and model capabilities across its brands [5][6] Group 3: Future Goals and Market Position - Geely aims to lead the industry in assisted driving and cabin technology by 2026, focusing on integrating knowledge from various domains into its models [13][14] - The company emphasizes the importance of safety in its development, with WAM being a core competitive advantage that allows for high-frequency data analysis and integration [14][15]