Prophet Arena
Search documents
陈天桥旗下AI公司MiroMind打造全球顶尖预测型大模型,性能登顶行业基准
机器之心· 2025-09-20 04:37
Core Viewpoint - The article discusses the launch of FutureX, the world's first dynamic real-time LLM intelligence future prediction benchmark, which aims to enhance AI's predictive capabilities in uncertain environments, as emphasized by Elon Musk [2][5][4]. Group 1: FutureX Benchmark - FutureX was developed by ByteDance's SEED team in collaboration with Stanford University, Fudan University, and Princeton University, focusing on predicting future events such as stock price movements, sports outcomes, and political election results [5][6]. - The benchmark evaluates AI models based on their ability to analyze current information and make predictions using logical reasoning, trend analysis, and probability calculations, thus enhancing their practical capabilities in complex real-world scenarios [5][6]. Group 2: MiroMind's Performance - MiroMind's model, MiroFlow, achieved first place in the FutureX rankings for two consecutive weeks in September, showcasing its advanced predictive capabilities compared to other international models [8][12]. - MiroMind successfully predicted complex outcomes, such as ATP men's singles rankings and cryptocurrency price movements, demonstrating its robust modeling and risk management abilities [10][11]. Group 3: MiroMind's Predictive Strategy - MiroMind employs a systematic five-step strategy for predictions, which includes detailed planning, data acquisition, understanding rules, dynamic information updates, and probability analysis [13][11]. - The model's core capabilities include information insight, logical reasoning, uncertainty management, and cross-domain integration, allowing it to make informed predictions in various fields [11][13]. Group 4: MiroThinker Model - MiroThinker, MiroMind's flagship foundational model, is designed for reasoning, decision-making, and multi-modal understanding, and is set to be fully open-sourced for global developers and researchers [15][17]. - The model aims to bridge the gap between open-source and closed-source commercial models, enhancing collaboration and innovation in AI development [15][17].
AI版华尔街之狼,o3-mini靠「神之押注」狂赚9倍,DeepSeek R1最特立独行
3 6 Ke· 2025-08-18 06:58
AI能像科幻电影中的先知一样预测未来吗?一个名为「Prophet Arena」的全新基准测试,正通过预测真实世界事件来评估AI的「预言」能力。 AI能预测未来吗? 在《黑客帝国》里,先知能对Neo的未来做出预测。 以ChatGPT为代表的AI,则可以根据过去的语料来「预测下一个Token」。 那问题来了,AI能不能像先知一样,从全世界的杂乱信息里找出蛛丝马迹,准确地预测未来呢? 比如: AI监管今年能否成为联邦法律? 美国职业足球大联盟比赛中,谁会获胜? NBA今年的冠军会是谁? | 2025年降息次数? | | | | 今年经济衰退? | | 本月鸡蛋价格会上涨 | | --- | --- | --- | --- | --- | --- | --- | | | | | | | 吗? | | | 最佳预测: | | | 最佳预测: | | 最佳预测: | | | 精确地2次切割 | | | 开始 | | 高于 0% | | | GPT-5 | | 43% | o3 Mini | 27% | o3 Mini | 90% | | Grok 3 Mini | | 40% | GPT-5 | 19% | GPT-5 ...