Workflow
DeepSeek一体机“褐蚁”
icon
Search documents
对话季宇:大模型非必须在GPU跑,CPU内存带宽已足够
虎嗅APP· 2025-05-18 13:51
Core Viewpoint - The article discusses the innovative approach of a company, 行云集成电路, led by its founder, 季宇, in developing a cost-effective AI computing solution through the integration of CPU and memory technologies, challenging the traditional reliance on GPUs for large model deployments [5][10][19]. Group 1: Company Overview - 行云集成电路 was founded by 季宇, a former Huawei expert, focusing on self-developed GPU technology [5]. - The company aims to create a DeepSeek integrated machine, which is a high-performance computing device designed for local deployment of AI models [8][19]. Group 2: Technology and Innovation - The DeepSeek integrated machine, referred to as "组装机," combines various hardware components, including Intel or domestic CPUs and NVIDIA GPUs, but aims to reduce costs significantly [9][19]. - 季宇 argues that modern large models can run efficiently on CPUs, leveraging their high memory bandwidth, which can exceed that of high-end GPUs like the RTX 4090 [10][13]. - The company plans to design a custom chip that optimizes CPU performance for AI applications, moving away from traditional GPU reliance [13][24]. Group 3: Market Strategy - The goal is to make AI technology accessible at consumer electronics price points, transforming the market from supercomputing to widespread use [18][25]. - By lowering the cost of AI computing solutions to around 100,000 yuan, the company aims to enable more startups to enter the AI space [19][25]. - The strategy includes using common components to promote widespread adoption and avoid creating high barriers to entry for other players in the industry [22][23]. Group 4: Competitive Landscape - 季宇 believes that simply following NVIDIA's path will not lead to success, emphasizing the need for innovative approaches to challenge established players [17]. - The company seeks to demonstrate the feasibility of its approach through proof-of-concept products, aiming to gain acceptance from industry players [14][18].
对话季宇:大模型非必须在GPU跑,CPU内存带宽已足够
Hu Xiu· 2025-05-18 06:54
Core Viewpoint - The conversation highlights the innovative approach of the company in utilizing CPU memory bandwidth for large model deployment, challenging the traditional reliance on GPUs for such tasks [4][8][12]. Group 1: Company Overview - The company, founded by a former Huawei expert, focuses on developing self-researched GPUs and integrated computing devices known as DeepSeek [1][4]. - The DeepSeek integrated machine, referred to as "褐蚁" (Brown Ant), is designed to be a cost-effective solution for deploying large models, with a target price significantly lower than traditional setups [5][18]. Group 2: Technology Insights - The company argues that modern server-grade CPUs can achieve memory bandwidth exceeding that of high-end GPUs, making them suitable for running large models [10][18]. - The proposed architecture aims to leverage CPU memory capabilities, which are cheaper and more efficient than traditional GPU setups, potentially reducing costs from millions to hundreds of thousands [6][18]. Group 3: Market Positioning - The company seeks to democratize access to high-performance computing by lowering the cost barrier, allowing smaller teams to engage in AI development [23]. - The strategy involves creating a product that can compete with supercomputers at a consumer electronics price point, thus fostering broader industry adoption [17][23]. Group 4: Competitive Landscape - The founder emphasizes that simply replicating NVIDIA's approach will not lead to success; instead, innovation in design and application is crucial [15][21]. - The company aims to differentiate itself by focusing on optimizing software to fully utilize CPU memory bandwidth, challenging the industry's conventional wisdom [19][22].