Large Model Deployment
Lenovo Group Launches Enterprise-Grade "Lobster Lake" Solution, Cutting Cost per Million Tokens to Under 1 Yuan
Ge Long Hui· 2026-03-26 10:24
Core Insights
- Lenovo Group has launched the "Lobster Lake" intelligent solution aimed at the enterprise market, integrating various computing platforms and servers to provide a high-performance, private deployment solution for businesses [1]

Group 1: Solution Features
- The "Lobster Lake" solution offers high intelligence, security, cost-effectiveness, and integrated delivery, with a focus on private deployment [1]
- It can achieve a cost of less than 1 yuan per million high-quality tokens, one-third that of comparable cloud service solutions, while maintaining a response latency of under 12 milliseconds [1]
- The solution supports up to 1,000 concurrent intelligent agents and can output up to 10 billion tokens daily with a 4K context length [1]

Group 2: Addressing Enterprise Challenges
- The solution addresses multiple challenges enterprises face when transitioning from personal to business applications, including data leakage risks, reasoning errors, context memory loss, insufficient high-concurrency performance, and high operational costs [2]
- Its private deployment architecture allows enterprises to fully control data flow and access permissions, ensuring data sovereignty [2]

Group 3: Security Measures
- The "Lobster Lake" solution includes a four-layer security mechanism to protect sensitive industries such as finance and government [2]
- Security measures include container-level logical isolation, self-developed core systems with high-intensity security scans, and protective technologies such as WAF and DDoS protection [2]

Group 4: Performance Optimization
- The solution supports deployment of large models with parameter scales ranging from 32 billion to 671 billion, allowing multiple models to operate simultaneously [3]
- It features a multi-instance QMD long-term memory system that overcomes single-instance memory limitations, enhancing retrieval accuracy while reducing token consumption [4]

Group 5: Deployment Options
- The "Lobster Lake" solution offers two deployment modes, centralized and distributed, catering to different enterprise efficiency needs [5]
- It supports one-click deployment and uninstallation, along with comprehensive upgrade and maintenance management [5]
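The throughput and cost figures above can be sanity-checked with simple arithmetic. The sketch below is illustrative only: the function names and the 10,000-yuan daily operating cost are hypothetical assumptions, while the concurrency and token figures come from the article.

```python
# Back-of-the-envelope check of the article's figures: 1,000 concurrent
# agents and up to 10 billion output tokens per day.

def daily_token_capacity(concurrent_agents: int, tokens_per_sec_per_agent: float) -> float:
    """Tokens per day implied by concurrency and per-agent decode rate."""
    return concurrent_agents * tokens_per_sec_per_agent * 86_400  # seconds per day

def cost_per_million_tokens(daily_cost_yuan: float, daily_tokens: float) -> float:
    """Amortized serving cost in yuan per million output tokens."""
    return daily_cost_yuan / (daily_tokens / 1_000_000)

# Per-agent decode rate implied by the claimed daily output:
implied_rate = 10_000_000_000 / (1_000 * 86_400)   # roughly 116 tokens/s per agent

# With a hypothetical 10,000 yuan/day operating cost, 10 billion tokens/day
# amortizes to 1 yuan per million tokens, matching the headline claim:
cost = cost_per_million_tokens(10_000, 10_000_000_000)
```

Under these assumptions the "under 1 yuan per million tokens" claim is consistent with a daily operating cost at or below the hypothetical figure used here.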
Changan Automobile and Fudan University File Patent on Large Model Deployment, Enabling On-Demand Dynamic Model Adjustment
Jin Rong Jie· 2025-11-08 06:26
Core Points
- Chongqing Changan Automobile Co., Ltd. and Fudan University have applied for a patent titled "Method, Device, Equipment, Storage Medium, and Program Product for Large Model Deployment", publication number CN120909607A, filed in August 2025 [1]

Summary by Categories
Patent Application
- The patented method has a cloud server receive current operational scenario information and resource usage information from terminals [1]
- The method determines whether the current operational scenario and resource usage are compatible with the specifications of the terminal's current large model [1]
- If they are not compatible, the method applies multi-objective constrained optimization to select a target large model from a model library based on the current operational scenario and resource usage information [1]
- The target model's information is then sent to the terminal for deployment, allowing inference through the target large model [1]
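The selection flow described in the abstract can be sketched in a few lines. The names (`ModelSpec`, `select_model`) and the pick-the-largest-feasible-model heuristic below are illustrative assumptions standing in for the patent's multi-objective optimization, not the patented algorithm itself.

```python
from dataclasses import dataclass

@dataclass
class ModelSpec:
    name: str
    params_b: int          # parameter count, in billions
    min_memory_gb: float   # memory required to run the model
    scenarios: set         # operational scenarios the model supports

def is_compatible(spec: ModelSpec, scenario: str, free_memory_gb: float) -> bool:
    """Check the terminal's current model against scenario and resource info."""
    return scenario in spec.scenarios and spec.min_memory_gb <= free_memory_gb

def select_model(library, scenario, free_memory_gb):
    """Pick the largest model satisfying both constraints (a stand-in
    for the patent's multi-objective constrained optimization)."""
    feasible = [m for m in library if is_compatible(m, scenario, free_memory_gb)]
    return max(feasible, key=lambda m: m.params_b, default=None)

# Hypothetical model library on the cloud server:
library = [
    ModelSpec("small", 7, 8.0, {"parking", "highway"}),
    ModelSpec("large", 32, 40.0, {"parking", "highway", "urban"}),
]
```

For example, a terminal reporting a highway scenario with 16 GB free would fall back to the 7B model, while an urban scenario with only 16 GB free yields no feasible model and the terminal keeps its current one.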
Course Starting Soon! A Full-Stack Learning Roadmap for Autonomous Driving VLA
自动驾驶之心· 2025-10-15 23:33
Core Insights
- The focus of academia and industry has shifted toward VLA (Vision-Language-Action) in autonomous driving, which provides human-like reasoning capabilities for vehicle decision-making [1][4]
- Traditional methods in perception and lane detection have matured, leading to decreased attention in those areas, while VLA is now a critical development area for major autonomous driving companies [4][6]

Summary by Sections
Introduction to VLA
- VLA is categorized into modular VLA, integrated VLA, and reasoning-enhanced VLA, which are essential for improving the reliability and safety of autonomous driving [1][4]

Course Overview
- A comprehensive course on autonomous driving VLA has been designed, covering foundational principles through practical applications, including cutting-edge techniques such as CoT, MoE, RAG, and reinforcement learning [6][12]

Course Structure
- The course consists of six chapters: an introduction to VLA algorithms, foundational algorithms, VLM as an interpreter, modular and integrated VLA, reasoning-enhanced VLA, and a final project [12][20]

Chapter Highlights
- Chapter 1 provides an overview of VLA algorithms and their development history, along with benchmarks and evaluation metrics [13]
- Chapter 2 covers the foundational knowledge of the Vision, Language, and Action modules, including the deployment of large models [14]
- Chapter 3 discusses VLM's role as an interpreter in autonomous driving, covering classic and recent algorithms [15]
- Chapter 4 delves into modular and integrated VLA, emphasizing the evolution of language models in planning and control [16]
- Chapter 5 explores reasoning-enhanced VLA, introducing new modules for decision-making and action generation [17][19]

Learning Outcomes
- The course aims to deepen understanding of VLA's current advancements, core algorithms, and project applications, benefiting participants in internships and job placements [24]
For a Bit Over 10,000 Yuan: 128GB RAM + 96GB VRAM in a Single Mini PC?
Hu Xiu· 2025-09-19 05:00
Core Insights
- AMD's Ryzen AI Max+ 395 is highlighted as a powerful platform for deploying large models and generating images, showcasing its capabilities in AI applications [1]

Group 1: Product Features
- The Ryzen AI Max+ 395 is equipped with 128GB of memory, enabling efficient handling of complex AI tasks [1]
- The product is designed to meet the demands of large model deployment, indicating its suitability for advanced AI workloads [1]

Group 2: Market Implications
- The introduction of the Ryzen AI Max+ 395 positions AMD competitively in the AI hardware market, potentially attracting businesses focused on AI development [1]
- The capabilities of the Ryzen AI Max+ 395 may influence industry standards for AI processing power and efficiency [1]
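A rough memory-footprint estimate shows what a 128GB RAM / 96GB VRAM machine like this can host. The sketch below is a generic rule of thumb, not an AMD specification: it assumes 2 bytes per parameter for FP16/BF16 weights (about 0.5 for 4-bit quantization) and ignores KV-cache and activation overhead, so real usage is higher.

```python
def weights_gb(params_billions: float, bytes_per_param: float = 2.0) -> float:
    """Approximate size of model weights in GiB.

    Assumes FP16/BF16 weights by default (2 bytes/param); pass
    bytes_per_param=0.5 for a rough 4-bit quantization estimate.
    """
    return params_billions * 1e9 * bytes_per_param / (1024 ** 3)

# A 70B-parameter model in FP16 needs ~130 GiB for weights alone,
# exceeding 96 GB of VRAM; a 4-bit quantization fits comfortably.
fp16_size = weights_gb(70)        # about 130 GiB
int4_size = weights_gb(70, 0.5)   # about 33 GiB
```

By this estimate, the machine could run a 70B model only in quantized form on the GPU, while smaller FP16 models fit without quantization.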
Opening Several Large Model Tech Discussion Groups (RAG / Agents / General LLMs, etc.)
自动驾驶之心· 2025-09-04 03:35
Group 1
- A tech communication group focused on large models has been established, inviting participants to discuss topics such as RAG, AI Agents, multimodal large models, and large model deployment [1]
- Interested individuals can join by adding the designated WeChat assistant and providing their nickname along with a request to join the large model discussion group [2]