FutureX
XPeng's latest work, FutureX, built on a latent chain-of-thought world model: ideas the on-vehicle side can borrow...
自动驾驶之心 · 2025-12-15 06:00
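Paper authors | Hongbin Lin et al. Editor | 自动驾驶之心

This is a recent and interesting piece of work from The Chinese University of Hong Kong (CUHK) in collaboration with XPeng. It uses a latent chain-of-thought world model to strengthen end-to-end driving capability, and it contains several improvements worth trying in industry:

1. Background

End-to-end (E2E) autonomous driving refers to a pipeline that uses a fully differentiable mapping to convert raw multimodal sensor streams directly into motion plans or low-level actuation commands. The field has advanced quickly in both algorithms and benchmarks, and existing methods have made notable progress despite inherent challenges.

Behind these successes, existing E2E driving systems map sensor inputs directly to control outputs with a single neural network, performing an efficient one-shot forward prediction without any further "thinking". As a result, they lack adaptability and interpretability in complex environments (Figure 1, second row). In human cognition, by contrast, a driver mentally simulates possible future scenarios before acting: predicting how surrounding vehicles will move, how the scene will evolve, and the likely outcome of each candidate action (Figure 1, first row). This internal reasoning lets humans make safe, context-appropriate decisions. For an E2E system, therefore, inferring future scenes in a highly dynamic traffic environment ...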
Chen Tianqiao's AI company MiroMind builds a world-class predictive large model that tops the industry benchmark
机器之心 · 2025-09-20 04:37
Core Viewpoint
- The article discusses the launch of FutureX, described as the world's first dynamic, real-time benchmark for future prediction by LLM agents. It aims to strengthen AI's predictive capabilities in uncertain environments, a capability Elon Musk has emphasized [2][5][4].

Group 1: FutureX Benchmark
- FutureX was developed by ByteDance's SEED team in collaboration with Stanford University, Fudan University, and Princeton University. It focuses on predicting future events such as stock price movements, sports outcomes, and political election results [5][6].
- The benchmark evaluates how well AI models analyze current information and make predictions through logical reasoning, trend analysis, and probability calculation, which tests their practical capabilities in complex real-world scenarios [5][6].

Group 2: MiroMind's Performance
- MiroMind's model, MiroFlow, ranked first on the FutureX leaderboard for two consecutive weeks in September, ahead of other international models [8][12].
- MiroMind correctly predicted complex outcomes, such as ATP men's singles rankings and cryptocurrency price movements, demonstrating robust modeling and risk management [10][11].

Group 3: MiroMind's Predictive Strategy
- MiroMind follows a systematic five-step prediction strategy: detailed planning, data acquisition, understanding the rules, dynamic information updates, and probability analysis [13][11].
- The model's core capabilities are information insight, logical reasoning, uncertainty management, and cross-domain integration, which let it make informed predictions across fields [11][13].

Group 4: MiroThinker Model
- MiroThinker, MiroMind's flagship foundation model, is designed for reasoning, decision-making, and multimodal understanding, and is set to be fully open-sourced for developers and researchers worldwide [15][17].
- The model aims to narrow the gap between open-source models and closed-source commercial models, encouraging collaboration and innovation in AI development [15][17].