Workflow
千亿级推理大模型
icon
Search documents
单卡部署千亿大模型!江苏银行人工智能产能跃升
Zhong Jin Zai Xian· 2025-08-28 01:20
Core Insights - Jiangsu Bank has successfully implemented large-scale deployment of a trillion-level inference model using a hybrid computing architecture based on domestic chips [1][2] - The new technology framework, built through fully autonomous compilation and adaptation, has achieved a threefold increase in computing performance while reducing hardware resource usage by 75% compared to traditional solutions [1] - The intelligent agent, designed with a "human expertise first, AI as a supplement" philosophy, has been applied in business material entry and review scenarios, significantly improving efficiency and accuracy in document verification processes [1][2] Technology and Performance - The bank's model has demonstrated significant enhancements in inference capabilities, enabling efficient processing of business operations [2] - The deployment of the trillion-level model on a single GPU card has validated the feasibility of domestic computing power supporting core financial intelligent scenarios [2] - The intelligent agent autonomously matches identification rules and dynamically selects toolchains, improving the precision of image detail localization and metadata comparison [1] Future Directions - Jiangsu Bank plans to deepen research and application of artificial intelligence technologies, aiming to build a fully autonomous technology system [2] - The bank will continue to explore the application paths of intelligent agents across all business areas, promoting the integration of technology and business [2] - The focus will be on expanding the capabilities of large models in digital operations and risk control, creating a new ecosystem driven by artificial intelligence based on domestic computing power [2]