NVIDIA H800 chip

[WAIC2025] StepFun (Jieyue Xingchen) releases foundation model Step 3, developed jointly with multiple domestic chip makers
Jing Ji Guan Cha Wang· 2025-07-26 05:14
Core Viewpoint
- Shanghai Jieyue Xingchen Intelligent Technology Co., Ltd. (StepFun) has launched Step3, its first full-size, native multimodal reasoning model, which will be open-sourced for enterprises and developers worldwide on July 31, 2025 [1]

Group 1: Model Development
- Step3 combines a new model architecture with joint algorithm and engineering design. It uses a Mixture of Experts (MoE) architecture with 321 billion total parameters and 38 billion active parameters, which cuts cost substantially while keeping decoding efficiency high [2]
- Step3 has visual perception and complex reasoning capabilities, allowing it to understand complex knowledge across domains and to analyze problems that combine mathematical and visual information [2]

Group 2: Performance and Efficiency
- In tests on NVIDIA H800 chips, Step3 delivered more than 70% higher throughput than the industry-leading model DeepSeek-R1, and it can reach up to 300% of DeepSeek-R1's inference efficiency on domestic chips [2]
- A core innovation alliance has been established with nearly 10 chip and infrastructure manufacturers to improve model adaptability and computing efficiency [2][3]

Group 3: Strategic Partnerships and Revenue Goals
- The alliance includes major domestic chip manufacturers such as Huawei Ascend, which have already deployed and run Step3 [3]
- Jieyue Xingchen has formed a strategic partnership with Shanghai State-owned Capital Investment Co., Ltd. It is targeting revenue of 1 billion RMB for the year, based on confirmed contract revenue and a strong gross profit margin in the first half of 2025 [3][4]
- The company works with more than half of domestic smartphone manufacturers on intelligent agents and is exploring applications in verticals such as finance and retail [4]
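To put the Step3 figures quoted above in perspective, the following minimal Python sketch works through the arithmetic: what share of the model's parameters is active per token, and what "more than 70% higher throughput" means as a multiplier. The function names are hypothetical and chosen only for illustration. The parameter counts and the 70% figure come from the summary; nothing here reflects how Step3 is actually implemented or benchmarked.

```python
# Figures quoted in the summary above, in billions of parameters.
TOTAL_PARAMS_B = 321
ACTIVE_PARAMS_B = 38


def active_fraction(active_b: float, total_b: float) -> float:
    """Share of parameters used per token in a sparse MoE model."""
    if total_b <= 0:
        raise ValueError("total_b must be positive")
    return active_b / total_b


def relative_throughput(improvement_pct: float) -> float:
    """Convert 'X% higher throughput' into a multiplier over the baseline."""
    return 1.0 + improvement_pct / 100.0


if __name__ == "__main__":
    frac = active_fraction(ACTIVE_PARAMS_B, TOTAL_PARAMS_B)
    # Roughly one parameter in eight is used for any given token.
    print(f"Active share per token: {frac:.1%}")
    # "More than 70% higher" means at least this multiple of the baseline.
    print(f"Throughput vs. baseline: at least {relative_throughput(70):.2f}x")
```

Under these numbers, about 12% of the parameters take part in each token's computation, which is the usual reason a MoE model can be cheaper to serve than a dense model of the same total size.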
Costs cut by 20%! Ant Group trains AI with domestic chips
国芯网· 2025-03-25 04:46
Core Viewpoint
- Ant Group has trained AI models on domestic chips, including chips from Alibaba and Huawei, using the Mixture of Experts (MoE) machine learning method, cutting costs by approximately 20% [1]

Group 1
- The results are comparable to those obtained with NVIDIA's H800 chip [1]
- Ant Group still uses NVIDIA chips for AI development, but its latest models rely mainly on alternatives from AMD and on domestic Chinese chips [1]
- Ant Group continues to optimize for different chips to reduce the cost of AI applications. It reports significant progress and plans to share its findings gradually through open-source releases [1]

Group 2
- The move is significant given U.S. export restrictions on high-end chips to China, and the article reads it as a sign that China has largely overcome U.S. semiconductor sanctions [1]
- Ant Group's open-source Ling series model framework and training strategies could make domestic AI technology more accessible, lowering the barrier to entry for small and medium-sized enterprises and research institutions [1]