CICC: Multimodal reasoning helps upgrade intelligent driving capabilities; related themes deserve attention
news flash·2025-06-03 00:32

Core Insights
- Google Gemini 2.5 was released in March, achieving multimodal fusion reasoning [1]
- Companies such as Starry Sky, SenseTime, and MiniMax released multimodal reasoning results in April and May, indicating significant technological progress [1]
- The integration of multimodal chain-of-thought is driving multimodal models and reasoning models toward a unified architecture, strengthening multimodal understanding [1]

Industry Developments
- Recent advances in multimodal reasoning are expected to broaden application scenarios, particularly in the automotive sector, where companies such as Li Auto and NIO are applying multimodal reasoning to user interactions [1]
- Continued innovation in technical architecture is likely to keep driving the expansion of application scenarios across the industry [1]
- Multimodal reasoning is becoming an increasingly important main line of development [1]