Workflow
MoE
icon
Search documents
2024 到 2025,《晚点》与闫俊杰的两次访谈,记录一条纯草根 AI 创业之路
晚点LatePost· 2026-01-09 02:38
Core Insights - MiniMax aims to contribute significantly to the improvement of AI in the industry, focusing on grassroots AI entrepreneurship despite challenges ahead [3][4] - The company has set ambitious goals for 2024 and 2025, including achieving technical capabilities comparable to GPT-4 and increasing user scale tenfold [4][36] - MiniMax emphasizes the importance of creating AI products that serve ordinary people, rather than focusing solely on large clients [5][9] Group 1: Company Vision and Strategy - MiniMax's vision is to create AI that is accessible to everyone, encapsulated in the phrase "Intelligence with everyone" [5][51] - The company believes that AGI should be a product used daily by ordinary people, rather than a powerful tool for a select few [9][51] - MiniMax's approach involves a dual focus on both technology and product development from the outset, contrary to the belief that startups should prioritize one over the other [14][15] Group 2: Technical Development and Challenges - The company has adopted a mixed expert (MoE) model for its large-scale AI, which is seen as a gamble compared to the more stable dense models used by competitors [10][20] - MiniMax faced significant challenges during the development of its MoE model, including multiple failures and the need for iterative learning [11][19] - The company recognizes that improving model performance is crucial and that many advancements come from the model itself rather than product features [19][34] Group 3: Market Position and Competition - MiniMax believes that the AI industry will see multiple companies capable of producing models similar to GPT-4, indicating a competitive landscape [41][37] - The company asserts that relying solely on funding for growth is not sustainable and emphasizes the importance of serving users and generating revenue [37][38] - MiniMax aims to differentiate itself by focusing on technical innovation and product development rather than merely increasing user numbers [57] Group 4: Future Outlook and Industry Trends - The company anticipates that the AI landscape will evolve rapidly, with significant advancements in model capabilities and user engagement [41][56] - MiniMax acknowledges the importance of open-sourcing technology to accelerate innovation and improve its technical brand [54][56] - The company is committed to continuous improvement in both technology and user experience, aiming to adapt to changing market demands [28][36]
即将开课!自动驾驶VLA全栈学习路线图分享~
自动驾驶之心· 2025-10-15 23:33
Core Insights - The focus of academia and industry has shifted towards VLA (Vision-Language Action) in autonomous driving, which provides human-like reasoning capabilities for vehicle decision-making [1][4] - Traditional methods in perception and lane detection have matured, leading to decreased attention in these areas, while VLA is now a critical area for development among major autonomous driving companies [4][6] Summary by Sections Introduction to VLA - VLA is categorized into modular VLA, integrated VLA, and reasoning-enhanced VLA, which are essential for improving the reliability and safety of autonomous driving [1][4] Course Overview - A comprehensive course on autonomous driving VLA has been designed, covering foundational principles to practical applications, including cutting-edge algorithms like CoT, MoE, RAG, and reinforcement learning [6][12] Course Structure - The course consists of six chapters, starting with an introduction to VLA algorithms, followed by foundational algorithms, VLM as an interpreter, modular and integrated VLA, reasoning-enhanced VLA, and a final project [12][20] Chapter Highlights - Chapter 1 provides an overview of VLA algorithms and their development history, along with benchmarks and evaluation metrics [13] - Chapter 2 focuses on the foundational knowledge of Vision, Language, and Action modules, including the deployment of large models [14] - Chapter 3 discusses VLM's role as an interpreter in autonomous driving, covering classic and recent algorithms [15] - Chapter 4 delves into modular and integrated VLA, emphasizing the evolution of language models in planning and control [16] - Chapter 5 explores reasoning-enhanced VLA, introducing new modules for decision-making and action generation [17][19] Learning Outcomes - The course aims to deepen understanding of VLA's current advancements, core algorithms, and applications in projects, benefiting participants in internships and job placements [24]