Workflow
京东750B深度思考大模型
icon
Search documents
深度思考大模型、高商业可用数字人、具身智能,京东AI正在产业端疯长
Zhong Jin Zai Xian· 2025-05-22 08:27
近日,京东全新推出750B深度思考大模型、行业首批高商业可用数字人,并发布了角色智能体与具身 智能融合的最新进展,展示出AI认知能力与商业应用的最近进展。 "大模型的能力光谱在变、参数尺寸在变,不变的是让产业用好大模型。只有深耕产业,让大模型在产 业里跑起来,才是最有价值的事情。"京东集团探索研究院副院长、京东科技人工智能业务部总裁何晓 冬表示。 技术拓宽大模型能力边界,支持企业构建专有模型 在AI进化的"马拉松"中,京东大模型展现出了独特的产业基因。 目前,京东已经具备全尺寸大模型,满足多样化的产业需求:3B和10B模型可提供极致响应效率;81B 主力模型可兼顾效果和性能。最新推出的750B超大规模模型,则兼具"深度思考"和"非深度思考"双通 道能力,能满足各行业对"即时响应"和"深度推理"的双重需求。 具体来说,750B超大规模模型在训练过程中,使用了动态分层蒸馏、跨领域数据治理等京东创新技 术,降低大模型的训练和部署成本他,同时兼顾大模型效果,保证大模型能力"大而精"。 数字人实现高商业可用,618面向商家全免费 在大模型"硬实力"的支撑下,数字人等"软实力"应用,也迅速实现了商业化可用。 基于通用数字 ...
京东大模型:加速深度思考,铺开产业级应用
Zhong Jin Zai Xian· 2025-05-22 01:32
Core Insights - JD.com has launched a new 750B deep thinking model and the first commercially viable digital humans in the industry, showcasing advancements in AI cognitive capabilities and commercial applications [1] - The company emphasizes the importance of integrating large models into industries to maximize their value, as stated by He Xiaodong, Vice President of JD Group Exploration Research Institute [1] Group 1: Large Model Capabilities - JD.com has developed a full-size large model that meets diverse industry needs, with models of 3B, 10B, and 81B providing varying levels of response efficiency and performance [2] - The newly introduced 750B model features both "deep thinking" and "non-deep thinking" capabilities, catering to industries requiring instant responses and complex decision-making [2] - Innovative techniques such as dynamic hierarchical distillation and cross-domain data governance have been employed to reduce training and deployment costs while maintaining model effectiveness [2] Group 2: Technological Innovations - Recent research published in a Nature journal addresses the efficiency of large model development in open environments, introducing four core innovative methods: model distillation, data governance, training optimization, and cloud-edge collaboration [3] - These innovations have improved inference efficiency by an average of 30% and reduced training costs by 70%, creating a reusable industrial-grade technology paradigm [3] - The JD JoyBuild platform supports over 100 algorithms and toolchains, enabling businesses to quickly adapt general models into specialized ones [3] Group 3: Commercialization of Digital Humans - JD.com has achieved high commercial viability for digital humans, launching a general digital human model 2.0 that supports fine-tuning and natural expressions [4] - During the 618 shopping festival, JD.com offered six industry-specific digital humans for free to merchants, enhancing their sales capabilities [4] - A collaboration with a well-known snack brand resulted in a customized AI live streaming experience, generating over 10 million yuan in sales [4] Group 4: Embodied Intelligence - JD.com is exploring the integration of its large models into the physical world through the Joy Inside initiative, aiming to develop "personified" robots [5] - The Joy Inside platform utilizes over 10 million daily intelligent conversations to embed conversational AI into hardware like robots and AI toys, creating emotional connections with users [6] - This initiative is expanding the presence of intelligent interactions in everyday life, enhancing user experiences [6]