美团新独立APP,点不了菜只能点AI
猿大侠·2025-11-03 04:11

Core Viewpoint - Meituan has launched the LongCat-Flash-Omni model, which supports multi-modal capabilities and has achieved state-of-the-art (SOTA) performance in open-source benchmarks, surpassing competitors like Qwen3-Omni and Gemini-2.5-Flash [2][4][8]. Group 1: Model Performance - LongCat-Flash-Omni is capable of handling text, images, audio, and video inputs effectively, maintaining high performance across all modalities [3][27]. - The model features a total of 560 billion parameters, with only 27 billion activated, allowing for high inference efficiency while retaining a large knowledge base [4][40]. - It is the first open-source model to achieve real-time interaction across all modalities under current flagship model performance standards [8][42]. Group 2: User Experience - Users can experience the LongCat model through the LongCat APP and Web, which support various input methods including text, voice, and image uploads [9][10]. - The model demonstrates quick response times and smooth interactions, even in complex scenarios, enhancing user experience [27][28][30]. Group 3: Development Strategy - Meituan's iterative model development strategy focuses on speed, specialization, and comprehensive capabilities, aiming to create a robust "world model" that integrates digital and physical worlds [31][45]. - The company has invested in both software and hardware to achieve deep connections between the digital and physical realms, emphasizing the importance of hardware in extending software's impact [46][47]. Group 4: Future Outlook - Meituan's long-term vision includes advancing embodied intelligence and creating a comprehensive robotics framework that connects various service scenarios [57][62]. - The company aims to leverage AI and robotics to transform the retail industry, enhancing efficiency and user experience across its services [60][63].