百度文心大模型5.0正式发布千帆平台累计开发Agent超130万个

Core Insights - Baidu officially launched the Wenxin large model version 5.0, featuring 2.4 trillion parameters and utilizing a native multimodal unified modeling technology that supports various forms of information input and output, including text, images, audio, and video [2] Group 1: Model Features - Wenxin 5.0 employs a unified autoregressive architecture for native multimodal modeling, allowing for joint training of text, images, video, and audio within the same model framework, enhancing the integration and optimization of multimodal features [2] - Despite having a high parameter count, Baidu implemented a sparse activation strategy, activating less than 3% of parameters to address inference cost issues associated with large models [2] - Official tests indicate that Wenxin 5.0's language and multimodal understanding capabilities are on par with leading international models such as Gemini-2.5-Pro and GPT-5-High [2] Group 2: Application and Development - Baidu has developed a matrix model and specialized models based on the Wenxin foundational large model, with the matrix model aimed at rapid deployment in product-level applications and general scenarios, including Wenxin Lite, video, and voice models [3] - Specialized models target industry applications and vertical scenarios, including search lightning models, e-commerce steam engine models, Wenxin digital human models, and industry-specific large models [3] - The Baidu Qianfan platform has developed over 1.3 million agents, indicating a significant push towards the industrial application of large models [4] Group 3: Industry Context - The global AI industry is entering a new fast track after several years of rapid development, with chatbots primarily based on dialogue or text input remaining the mainstream form in AI applications [4] - Baidu aims to support various intelligent applications with its models and continues to explore AI solutions that empower industries [4]