Cracking the Efficiency and Cost Challenge: Huawei's UCM Technology Drives an Upgrade in AI Inference Experience
Yang Guang Wang·2025-08-13 06:13

Group 1
- A forum on the application and development of financial AI inference was held in Shanghai, with speakers from China UnionPay and Huawei [1]
- Huawei introduced UCM, an inference memory data manager intended to improve the AI inference experience and its cost-effectiveness, and to accelerate a positive business cycle for AI [1][3]
- AI inference is entering a critical growth phase, with inference experience and cost becoming key metrics for evaluating a model's value [3]

Group 2
- UCM has three main components: inference engine plugins, a function library for multi-level KV Cache management, and high-performance KV Cache access adapters [3][4]
- UCM can reduce first-token latency by up to 90% and expand the inference context window tenfold, addressing long-text processing needs [3][4]
- UCM's intelligent caching speeds up processing significantly, achieving a 125-fold increase in inference speed in China UnionPay's "Voice of the Customer" scenario [4]

Group 3
- Huawei announced a plan to open-source UCM in September, so that it can be adapted to a range of inference engine frameworks and storage systems [4]
- The collaboration between Huawei and China UnionPay aims to build "AI + Finance" demonstration applications, moving the technology from laboratory validation to large-scale application [4]
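
The multi-level KV Cache management mentioned in Group 2 can be illustrated with a toy sketch. The article does not describe UCM's interfaces, so everything below is an assumption made for illustration: the `TieredKVCache` class, its method names, and the least-recently-used demotion policy are hypothetical and are not UCM's API. The sketch only shows the general idea of keeping recently used entries in a small fast tier and demoting older ones to larger, slower tiers instead of discarding them.

```python
from collections import OrderedDict


class TieredKVCache:
    """Toy multi-level cache (hypothetical, not UCM's API).

    Tier 0 stands for the fastest, smallest store; later tiers stand for
    slower, larger ones. Each tier evicts its least recently used entry
    into the next tier when it overflows.
    """

    def __init__(self, capacities):
        self.capacities = list(capacities)
        self.tiers = [OrderedDict() for _ in self.capacities]

    def _insert(self, key, value, level):
        # Entries pushed past the last tier are dropped.
        if level >= len(self.tiers):
            return
        tier = self.tiers[level]
        tier[key] = value
        tier.move_to_end(key)
        # Demote least recently used entries until the tier fits again.
        while len(tier) > self.capacities[level]:
            old_key, old_value = tier.popitem(last=False)
            self._insert(old_key, old_value, level + 1)

    def put(self, key, value):
        # Remove any stale copy so a key lives in exactly one tier.
        for tier in self.tiers:
            tier.pop(key, None)
        self._insert(key, value, 0)

    def get(self, key):
        """Return (value, tier_index_where_found), or None on a miss."""
        for level, tier in enumerate(self.tiers):
            if key in tier:
                value = tier.pop(key)
                self._insert(key, value, 0)  # promote to the fast tier
                return value, level
        return None


cache = TieredKVCache([2, 2])
for name in ["a", "b", "c"]:
    cache.put(name, f"kv-{name}")
hit = cache.get("a")  # "a" was demoted to tier 1, then promoted on access
```

In a real inference stack the keys would typically identify cached prefixes of a prompt and the values would be attention key/value tensors; here plain strings stand in for both.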