盘古NLP大模型

Search documents
拿了火星图片的华为云盘古大模型,这样在地球落地
量子位· 2025-06-20 10:31
Core Viewpoint - The article discusses the advancements of Huawei Cloud's Pangu multimodal large model, highlighting its capabilities in generating 4D space images and videos from Mars images, and its unique ability to support both point cloud and video modalities simultaneously [1][7]. Group 1: Model Upgrades - Huawei Cloud has upgraded five foundational models, including Pangu NLP, multimodal, prediction, scientific computing, and CV models [8]. - The Pangu NLP model features two significant technologies: Pangu DeepDiver and a low hallucination new scheme, which enhance its capabilities [12][18]. Group 2: Pangu DeepDiver Technology - Pangu DeepDiver utilizes Search Intensity Scaling (SIS) to improve interaction between large language models (LLMs) and search engines, allowing dynamic adjustment of search frequency and depth based on problem complexity [13][14]. - The model has demonstrated performance comparable to a 671 billion parameter model in various benchmarks, indicating a qualitative leap in open-domain information retrieval capabilities [16][17]. Group 3: Low Hallucination New Scheme - The low hallucination scheme includes a multi-layered hallucination defense system and a closed-loop quality assurance system, focusing on data quality and diversity to reduce hallucination triggers [18][21]. - The model employs reinforcement learning to suppress hallucinations and enhance factual accuracy, logical consistency, and reliability [22][23]. Group 4: Industry Applications - The Pangu models have been applied in various industries, such as agriculture, where a model developed with the Chinese Academy of Agricultural Sciences can recommend gene editing targets, significantly reducing design time [28][34]. - The Pangu prediction model has been implemented in industries like cement and steel, providing process optimization solutions that enhance production efficiency [35][36]. Group 5: Model Development and Training - Huawei Cloud offers a comprehensive AI toolchain through its ModelArts Studio, facilitating the development of industry-specific models without the need for companies to start from scratch [42]. - The industry model training workflow reduces training time and costs by 60%, enabling clients to build high-quality proprietary models efficiently [45][46]. Group 6: Evaluation and Standards - Huawei Cloud has established an industry model evaluation center that provides a three-tier evaluation system across various sectors, helping clients optimize their models based on clear standards [47][48].