Workflow
端到端学习
icon
Search documents
黄仁勋点赞三款中国大模型,英伟达押宝物理AI
Guan Cha Zhe Wang· 2026-01-06 11:22
被称为科技春晚的国际消费电子产品展览会(CES)于1月6日正式开幕。 作为英伟达新年战略和新品发布的重要窗口,本次英伟达CEO依然身着去年同款亮面鳄鱼皮衣登场,发表了他在2026年的首场演讲。 在CES开始前,英伟达在社交媒体发布称"(CES2026)不会发布新款GPU。"这也是英伟达五年来首次不在CES发布新款GPU产品。 相比去年CES官宣了RTX50系产品,本次演讲中,黄仁勋把重点放在了新一代计算平台和英伟达在物理AI领域的进展,包括自动驾驶和机器人,相关开源 模型和工具等。 演讲开始。黄仁勋表示过去十年投入的约10万亿美元计算资源,正在被彻底现代化。 但这不仅仅是硬件的升级,更多的是软件范式的转移。 他特别提到了具备自主行为能力(Agentic)的智能体模型,并点名了Cursor,彻底改变了英伟达内部的编程方式。 随后,黄仁勋对2025年开源社区给予了高度评价。他表示,去年DeepSeek的突破让全世界感到意外,它作为第一个开源推理系统,直接激发了整个行业 的发展浪潮。 而在介绍开源生态时,黄仁勋的PPT后出现了三家中国模型的名字,分别是月之暗面的Kimi K2,深度求索的DeepSeek V3.2和 ...
2025新汽车年度盛典:中国汽车如何破局存量市场
21世纪经济报道· 2025-11-21 04:10
Core Viewpoint - The Chinese automotive industry is at a critical juncture, facing both opportunities and challenges, necessitating a transformation from traditional practices to a new paradigm centered on user value and technological integration [3][5]. Group 1: Industry Overview - In 2025, Chinese brands have captured over 60% market share, transitioning from followers to leaders, yet the industry faces declining profits and intense price wars [3]. - The theme of the "2025 New Automotive Annual Ceremony" is "Breaking the Old and Establishing the New," focusing on user value, technology, ecology, and strategy to navigate the competitive landscape [3][5]. Group 2: User Value and Design - Automotive design is evolving to become a key differentiator, merging emotional and rational elements to enhance user experience [8]. - The concept of "new luxury" in automotive experiences is highlighted, with companies like AITO achieving significant sales milestones, indicating a shift in consumer expectations [15]. Group 3: Technological Innovation - Companies are adopting a "soft and hard dual repair" approach in the intelligent driving sector, emphasizing the integration of AI and traditional automotive technologies [10]. - Safety remains a top priority in technological advancements, with companies like Li Auto reporting significant accident avoidance statistics through their AEB systems [12]. Group 4: Market Expansion and Challenges - The overseas market is becoming essential for long-term growth, with challenges such as regulatory barriers and cultural differences being addressed through initiatives like NESTA-Global [34]. - The automotive industry is transitioning from a growth phase to a competitive phase, requiring companies to listen to user feedback and adapt accordingly [18][20]. Group 5: Future Directions - The integration of AI and smart technologies is reshaping user interactions and driving the evolution of automotive experiences [24][26]. - The industry is moving towards a model where electric vehicles and charging infrastructure evolve together, with a focus on smart home integration and high-power charging solutions [27]. Group 6: Recognition and Awards - The ceremony recognized various companies for their contributions, including Xiaopeng for design, Horizon Robotics for AI, and AION for manufacturing excellence, showcasing the industry's diverse strengths [46].
著名机器人专家:人型机器人的未来是不像人
3 6 Ke· 2025-09-30 08:43
Group 1 - The article discusses the challenges faced by humanoid robots in achieving dexterity despite significant investments from venture capital firms and large tech companies [2][3][5] - Humanoid robots are designed to mimic human body structures and perform tasks in human environments, with the goal of creating versatile robots capable of handling various jobs [5][6] - Companies like Tesla and Figure are optimistic about the economic potential of humanoid robots, with predictions of generating trillions in revenue, but the timeline for achieving human-level dexterity remains uncertain [6][7] Group 2 - The history of humanoid robot development spans over six decades, with significant contributions from various researchers and institutions, including early models from Waseda University and Honda [8][9] - Despite advancements, no humanoid robot has demonstrated significant dexterity comparable to human capabilities, and existing designs have not been successfully applied in practical industrial settings [20][21] - The article emphasizes the importance of tactile feedback and dexterity in humanoid robots, arguing that current training methods relying on visual data are insufficient for achieving the desired level of skill [23][24][44] Group 3 - The article critiques the reliance on "learning from demonstration" methods, highlighting the limitations of current approaches that do not incorporate tactile or force feedback [23][24][25] - Companies like Figure and Tesla are shifting towards training humanoid robots using first-person videos of humans performing tasks, betting on the effectiveness of visual learning [26][27] - The article concludes that achieving true dexterity in humanoid robots will require a deeper understanding of tactile perception and the integration of such feedback into training methodologies [44][45]
著名机器人专家:人型机器人的未来是不像人
Core Viewpoint - Despite significant investments from venture capital firms and large tech companies, humanoid robots still struggle to achieve dexterity, which is essential for performing tasks in human environments [2][3][4]. Group 1: Historical Context of Humanoid Robots - The concept of humanoid robots has been explored for over 65 years, with early developments including a computer-controlled robotic arm capable of stacking blocks in 1961 [3]. - The evolution of humanoid robots has seen contributions from various institutions, including WABOT-1 from Waseda University in the 1970s and Honda's ASIMO in 2000 [11][12]. Group 2: Current State and Future Predictions - Humanoid robots are currently in the early stages of development, with Gartner indicating they have not yet reached their peak hype [4]. - Companies like Tesla and Figure are optimistic about the economic potential of humanoid robots, with predictions of creating trillions in revenue [9][10]. Group 3: Challenges in Dexterity - Achieving human-level dexterity in humanoid robots remains a significant challenge, as current robotic hands lack the necessary finesse and adaptability for a wide range of tasks [23][24]. - Existing methods for training robots often rely on visual demonstrations, which do not adequately capture the tactile feedback necessary for dexterous manipulation [27][28]. Group 4: Learning Approaches - The industry has seen a shift towards end-to-end learning methods, where robots learn from observing human actions, but this approach has limitations due to the lack of tactile feedback and precision [30][31]. - Successful applications of end-to-end learning in other fields, such as speech recognition and image labeling, highlight the importance of pre-processing and human-like structures in achieving effective learning outcomes [49][50]. Group 5: Importance of Tactile Feedback - Human dexterity is heavily reliant on rich tactile feedback, which current humanoid robots do not possess, leading to challenges in replicating human-like manipulation [51][52]. - The complexity of human touch perception and the integration of multiple body parts in dexterous tasks further complicate the development of humanoid robots capable of similar actions [52].
叫板FSD?日产新智驾上街了
汽车商业评论· 2025-09-23 17:37
Core Viewpoint - Nissan aims to compete with Tesla by launching its next-generation ProPILOT driver assistance system, which utilizes technology from UK-based Wayve, with plans for implementation in mass production vehicles by the fiscal year 2027 [4][10]. Group 1: Technology Development - The new ProPILOT system, currently in L2 level, will require drivers to monitor the vehicle and road conditions at all times, similar to Tesla's Full Self-Driving (FSD) system [4][16]. - The ProPILOT system has evolved from its initial launch in 2016, with the latest version capable of handling complex urban environments using fewer sensors [7][9]. - Wayve's AI Driver software, which focuses on real-world data absorption and transfer capabilities, will be the core component of Nissan's next-generation ProPILOT [7][13]. Group 2: Strategic Partnerships - Nissan's collaboration with Wayve marks a significant step from theoretical partnerships to practical road testing, with a dedicated development center established in Yokohama to adapt to Japan's unique driving conditions [8][10]. - The partnership aims to deliver safer and smarter mobility technologies, leveraging Wayve's expertise in AI and real-world driving data [8][10]. Group 3: Market Positioning - Nissan's strategy emphasizes a "Japan first, then overseas" approach, showcasing its prototype's capabilities in Tokyo's urban settings [9][10]. - The company plans to initiate small-scale L4 autonomous services in Japan by the fiscal year 2027, starting with vehicles equipped with safety drivers [10][16]. - Nissan's focus on L2 capabilities in urban environments reflects a broader trend among Japanese automakers to explore diverse partnerships and regulatory pathways for autonomous driving [16].
电脑、笔记本、手机生产5年内或实现全智能化
Ke Ji Ri Bao· 2025-09-22 09:26
Core Viewpoint - The production of computers, laptops, and mobile phones is expected to become fully intelligent within approximately five years, significantly impacting China's manufacturing industry [1] Group 1 - Current manufacturing in China is primarily semi-manual and semi-automated, requiring production line changes for new mobile phone models [1] - The concept of a "universal production line" driven by knowledge and end-to-end learning could revolutionize the manufacturing process in China [1]
港科&理想最新!OmniReason: 时序引导的VLA决策新框架
自动驾驶之心· 2025-09-10 23:33
Core Insights - The article discusses the development of the OmniReason framework, a novel Vision-Language-Action (VLA) model designed to enhance spatiotemporal reasoning in autonomous driving by integrating dynamic 3D environment modeling and decision-making processes [2][6][8]. Data and Framework - OmniReason-Data consists of two large-scale VLA datasets: OmniReason-nuScenes and OmniReason-Bench2Drive, which provide dense spatiotemporal annotations and natural language explanations, ensuring physical realism and temporal coherence [2][6][8]. - The OmniReason-Agent architecture incorporates a sparse temporal memory module for persistent scene context modeling and an explanation generator for human-interpretable decision-making, effectively capturing spatiotemporal causal reasoning patterns [2][7][8]. Performance and Evaluation - Extensive experiments on open-loop planning tasks and visual question answering (VQA) benchmarks demonstrate that the proposed methods achieve state-of-the-art performance, establishing new capabilities for interpretable and time-aware autonomous vehicles operating in complex dynamic environments [3][8][25][26]. - The OmniReason-Agent shows competitive results in open-loop planning with an average L2 error of 0.34 meters, matching the top method ORION, while achieving a new record for violation rate at 3.18% [25][26]. Contributions - The introduction of comprehensive VLA datasets emphasizes causal reasoning based on spatial and temporal contexts, setting a new benchmark for interpretability and authenticity in autonomous driving research [8]. - The design of a template-based annotation framework ensures high-quality, interpretable language-action pairs suitable for diverse driving scenarios, reducing hallucination phenomena and providing rich multimodal reasoning information [8][14][15]. Related Work - The article reviews the evolution of datasets for autonomous driving, highlighting the shift from single-task annotations to comprehensive scene understanding, and discusses the limitations of existing visual language models (VLMs) in dynamic environments [10][11].
Figure自曝完整技术:60分钟不间断打工,我们的机器人如何做到?
量子位· 2025-06-13 05:07
Core Viewpoint - The article highlights the advancements in robotics, particularly focusing on the capabilities of the Helix system developed by Figure, showcasing its ability to handle a wider variety of packages with improved efficiency and accuracy [1][7][19]. Technical Improvements - The Helix system has undergone significant enhancements due to the expansion of high-quality demonstration datasets and architectural improvements in its visuo-motor policy, leading to increased stability under high-speed workloads [7][20]. - The introduction of state awareness and force sensing has enhanced the robustness and adaptability of the robots without sacrificing efficiency [8]. Data Expansion - The range of packages that the Helix system can handle has expanded to include not only standard cardboard boxes but also polyethylene bags, envelopes, and other flexible or crumpled items [10]. - The system has developed adaptive strategies for different package shapes, such as flipping cardboard boxes with both hands or gently pinching the edges of envelopes [13][15]. Performance Metrics - The average processing speed for packages is approximately 4.05 seconds, with throughput increasing by 58% and barcode success rates rising from 88.2% to 94.4% [17][30]. - The improvements indicate a more agile and reliable system capable of operating at speeds and accuracy levels closer to human performance [19]. Architectural Enhancements - The Helix system's architecture has been improved with new memory and sensing modules, enhancing its ability to perceive environmental changes [20]. - Key components include: - **Visual Memory**: Allows the robot to recall previous frames to locate barcodes effectively [22][25]. - **State History**: Enables the robot to maintain context during actions, improving its ability to correct movements quickly [26][27]. - **Force Feedback**: Provides tactile feedback to adjust movements dynamically, enhancing control and adaptability [28]. Human Interaction - The Helix system can autonomously sort packages and establish human-robot interaction without separate programming, recognizing cues from humans to hand over items [31][33]. Community Response - The release of the unedited 60-minute video has generated significant interest and discussion among viewers, with varied opinions on the implications of robotics in logistics and the future of human jobs [34][37][38].
Figure自曝完整技术:60分钟不间断打工,我们的机器人如何做到?
量子位· 2025-06-13 05:07
Core Insights - The article highlights the advancements in robotics, particularly focusing on the capabilities of the Helix system developed by Figure, which showcases improved performance in handling various types of packages in logistics [1][7][19]. Technical Improvements - The Helix system has undergone significant enhancements due to the expansion of high-quality demonstration datasets and architectural improvements in its visuo-motor policy, leading to increased stability under high-speed workloads [7][19]. - The system can now handle a wider variety of package shapes and materials, including polyethylene bags and envelopes, demonstrating its adaptability [10][17]. - The introduction of real-time data observation allows the robot to learn and adjust its actions dynamically, improving its efficiency and accuracy [2][8]. Performance Metrics - The average processing speed for packages is approximately 4.05 seconds, with throughput increasing by 58% and barcode scanning success rates rising from 88.2% to 94.4% [17][30]. - The Helix system's new strategies have led to a success rate of 94% for barcode orientation and maintained an accuracy of over 92% [30]. System Architecture - The Helix system incorporates three main components: visual memory, state history, and force feedback, enhancing its ability to perceive and interact with its environment [20][22]. - Visual memory allows the robot to recall previous frames to locate barcodes effectively, while state history helps maintain context during operations [23][27]. - Force feedback enables the robot to adjust its movements based on tactile information, improving control and adaptability to different package weights and shapes [28]. Human Interaction - The Helix system can seamlessly engage in human-robot interaction without the need for separate programming, recognizing cues from humans to hand over packages [31][33]. Community Reactions - The release of the unedited 60-minute video showcasing the robot's capabilities has sparked discussions among viewers, with some praising the transparency and others questioning the implications for human labor in logistics [34][37][38].