Workflow
WAIC|商汤首席科学家林达华:多模态是通向AGI的必经之路

Core Insights - The essence of artificial intelligence (AI) is to create a form of genuine intelligence that can autonomously interact with the real world, which is the ultimate goal of intelligence [1] - The rapid evolution of large models, particularly language models, is seen as a stepping stone towards achieving AGI (Artificial General Intelligence), with a necessary focus on multimodal capabilities for real-world applications [1][2] Company Developments - SenseTime has officially launched the "Riri Xin" V6.5 "Awakening" world model and the "Wuneng" embodied intelligence platform during the WAIC [1] - The company has been a pioneer in multimodal integration, demonstrating that multimodal models outperform pure language models in language tasks after effective training [2] - The latest version, "Riri Xin" 6.5, has achieved advanced performance in both pure language and text tasks, showcasing the maturity of SenseTime's technology in this area [2] Industry Trends - The rise of ChatGPT has highlighted a new era in AI technology, presenting opportunities for companies like SenseTime to leverage this wave of transformation to create significant impact [3] - The shift from AI 1.0, which focused on specialized tasks, to general AI models that are more autonomous and versatile is a key development in the industry [3] - The future of software development is expected to become more accessible, allowing non-experts to create software simply by expressing their needs, which could reshape industry dynamics [3][4] Technological Advancements - The development of multimodal models is progressing through three critical stages, with the final goal being the connection between digital and physical spaces to achieve AGI [5] - SenseTime's experience in computer vision and collaboration with hardware companies has positioned it well to enhance its embodied intelligence platform [6] - The integration of world models with multimodal training data has proven effective in training autonomous driving modules, significantly improving efficiency compared to relying solely on real-world data [6] Strategic Focus - SenseTime emphasizes aligning research and development with its commercial vision, ensuring that scientific advancements translate into business value [6] - The company prioritizes projects that can achieve commercial viability, avoiding areas that do not align with its business goals [6] - Investments in embodied intelligence and foundational models are interconnected, allowing for a more efficient allocation of resources [6]