Workflow
英伟达H800芯片
icon
Search documents
阿里自研AI芯片现身,部分性能参数比肩英伟达H20
Nan Fang Du Shi Bao· 2025-09-17 03:48
Core Insights - Alibaba's self-developed AI chip, PPU, has been showcased on CCTV, indicating its competitive performance against NVIDIA's H20 and H800 chips [1][3] - The PPU chip integrates HBM2e memory and has a memory capacity of 96G, with a bandwidth of 700GB/s, positioning it between NVIDIA's A800 and H20 chips [1] - Alibaba is investing at least $53 billion over the next three years to build its cloud and AI hardware infrastructure, emphasizing AI as a core growth driver alongside e-commerce [3] Performance Comparison - The PPU chip outperforms Huawei's Ascend 910B in all key performance metrics, although the latest model from Huawei is the 910C [3] - In terms of power consumption, the PPU matches the A800 at 400W, while the H20 consumes 550W [1] Strategic Focus - Alibaba's internal strategy highlights "AI + Cloud" as a primary growth engine, with significant investments planned in AI infrastructure, foundational models, and the transformation of existing business operations [3] - The company aims to leverage its self-developed chips for training smaller AI models, while still utilizing NVIDIA chips for certain applications [3]
【WAIC2025】阶跃星辰发布基座大模型Step 3 与多家国产芯片厂商实现联合开发
Jing Ji Guan Cha Wang· 2025-07-26 05:14
Core Viewpoint - Shanghai Jiyue Xingchen Intelligent Technology Co., Ltd. has launched its first full-size, native multimodal reasoning model, Step3, which will be open-sourced for global enterprises and developers on July 31, 2025 [1] Group 1: Model Development - Step3 features an innovative model architecture and algorithm engineering collaboration, utilizing a MoE architecture with a total parameter count of 321 billion and an active parameter count of 38 billion, achieving significant cost optimization while maintaining high decoding efficiency [2] - Step3 possesses visual perception and complex reasoning capabilities, enabling it to accurately perform cross-domain complex knowledge understanding and analyze the intersection of mathematical and visual information [2] Group 2: Performance and Efficiency - Step3 demonstrated over 70% throughput improvement compared to the industry-leading model DeepSeek-R1 when tested on NVIDIA H800 chips, and can achieve up to 300% efficiency on domestic chips [2] - A core innovation alliance has been established with nearly 10 chip and infrastructure manufacturers to enhance model adaptability and computing efficiency [2][3] Group 3: Strategic Partnerships and Revenue Goals - The alliance includes major domestic chip manufacturers such as Huawei Ascend and others, which have successfully implemented and run Step3 [3] - Jiyue Xingchen has formed a strategic partnership with Shanghai State-owned Capital Investment Co., Ltd., aiming for a revenue target of 1 billion RMB for the year, based on confirmed contract revenues and a strong gross profit margin achieved in the first half of 2025 [3][4] - The company has collaborated with over half of domestic smartphone manufacturers to develop intelligent agents and is exploring applications in various verticals, including finance and retail [4]
成本降低20%!蚂蚁集团用国产芯片训练AI
国芯网· 2025-03-25 04:46
Core Viewpoint - Ant Group has successfully utilized domestic chips, including those from Alibaba and Huawei, in conjunction with the mixed expert (MoE) machine learning method to train AI models, achieving a cost reduction of approximately 20% [1] Group 1 - The performance of the new technology is comparable to NVIDIA's H800 chip [1] - Ant Group continues to use NVIDIA chips for AI development but has shifted its latest models to primarily rely on alternatives from AMD and domestic Chinese chips [1] - Ant Group is continuously optimizing for different chips to reduce AI application costs and has made significant progress, with plans to gradually share its findings through open-source initiatives [1] Group 2 - The move is significant in the context of U.S. export restrictions on high-end chips to China, indicating that China has largely overcome U.S. semiconductor sanctions [1] - Ant Group's open-source Ling series model framework and training strategies could promote the accessibility of domestic AI technology, lowering the entry barriers for small and medium-sized enterprises and research institutions [1]