Workflow
后摩漫界M50
icon
Search documents
对话「后摩智能」吴强:从科学家到创业者的惊险一跃
3 6 Ke· 2025-08-06 00:02
Core Insights - The article highlights the significant advancements in China's computing power sector, particularly focusing on "super nodes" and edge AI chips as key trends in the AI landscape [1][2] - The emergence of edge computing is seen as a potential larger market than cloud computing, with companies like Houmo Intelligence positioned to capitalize on this opportunity [2][3] - Houmo Intelligence's M50 chip, based on in-memory computing technology, represents a breakthrough in efficiency and performance for edge AI applications [3][6] Group 1: Industry Trends - The development of large AI models has created a strong demand for cloud computing, while edge computing is gaining traction due to its ability to reduce computational needs for generative AI applications [1][2] - The CEO of Houmo Intelligence predicts that 90% of data processing for generative AI will occur at the edge, with only 10% requiring cloud resources [1][2] - The market for edge computing is expected to accommodate more players, potentially leading to the emergence of the "next Nvidia" [2] Group 2: Company Overview - Houmo Intelligence, founded by CEO Wu Qiang, focuses on in-memory computing technology to enhance AI chip efficiency, having transitioned from an initial focus on smart driving chips to general-purpose edge AI applications [2][8] - The M50 chip features significant performance metrics, including 160 TOPS@INT8 and 100 TFLOPS@bFP16, with a typical power consumption of only 10W, making it suitable for various smart devices [6][7] - The company has established partnerships with notable clients, including Lenovo and iFlytek, to expand its market presence in edge AI applications [7][10] Group 3: Technological Innovations - The M50 chip utilizes a new architecture called "Tianxuan" IPU, which allows floating-point models to run directly on the in-memory computing architecture, enhancing application efficiency [6][7] - The in-memory computing approach addresses the "memory wall" and "power wall" issues associated with traditional computing architectures, making it a promising solution for future AI applications [2][3] - The company has developed a new compiler toolchain, "Houmo Dadao," to facilitate easy adaptation of its chips to mainstream deep learning frameworks [6][15] Group 4: Market Dynamics - The edge AI chip market is characterized by cost sensitivity, power efficiency, and compact design requirements, which are critical for successful product deployment [11][12] - The transition from cloud to edge computing is driven by the need for high efficiency and low power consumption in AI applications, particularly in consumer electronics and smart devices [10][11] - The competitive landscape is evolving, with various companies exploring in-memory computing, leading to a diverse range of approaches and technologies in the market [12][13]
对话「后摩智能」吴强:从科学家到创业者的惊险一跃
36氪· 2025-08-05 13:49
Core Viewpoint - The article emphasizes the significance of "storage-compute integration" as a key technology for edge AI chips, which is expected to revolutionize the last mile of large model computing, enabling efficient local processing and reducing reliance on cloud computing [2][4][6]. Group 1: Industry Trends - The AI model development has led to a two-tiered growth in computing power, with cloud computing expanding for model training and edge AI chips gaining traction for inference applications [4][5]. - The emergence of "super nodes" and edge AI chips was highlighted at WAIC 2025, showcasing the growing importance of localized computing solutions [3][4]. - The market for edge computing is anticipated to be larger than cloud computing, presenting opportunities for new players to emerge, potentially creating the "next Nvidia" [4][5]. Group 2: Company Insights - The company, Houmo Intelligent, founded by CEO Wu Qiang, focuses on developing AI chips based on storage-compute integration technology, aiming to address the challenges of traditional computing architectures [5][6]. - The newly launched M50 chip utilizes innovative architecture and compiler tools to enhance efficiency and ease of use, supporting mainstream deep learning frameworks [8][10]. - The M50 chip boasts impressive specifications, achieving 160 TOPS@INT8 and 100 TFLOPS@bFP16 with a power consumption of only 10W, making it suitable for various smart devices without cloud dependency [8][10]. Group 3: Market Strategy - The company is targeting multiple application areas, including consumer electronics, smart voice systems, and edge computing for telecom operators, with notable interest from clients like Lenovo and China Mobile [14][15]. - The transition from a focus on smart driving chips to general-purpose edge AI chips reflects a strategic pivot in response to market demands and opportunities in large model applications [11][13]. - The company aims to leverage its expertise in storage-compute integration to meet the growing needs for efficient AI processing in diverse sectors [17][18].
AI算力集群迈进“万卡”时代,超节点为什么火了?
Di Yi Cai Jing· 2025-07-30 07:59
Core Insights - The recent WAIC highlighted the growing interest in supernodes, with companies like Huawei, ZTE, and H3C showcasing their advancements in this technology [3][4][5] - Supernodes are essential for managing large-scale AI models, enabling efficient resource utilization and high-performance computing [3][4][5] - The shift from traditional AI servers to supernode architectures is driven by the increasing complexity and size of AI models, which now reach trillions of parameters [4][5][9] Group 1: Supernode Technology - Supernodes integrate computing resources to create low-latency, high-bandwidth computing entities, enhancing the efficiency of AI model training and inference [3][4] - The technology allows for performance improvements even when individual chip manufacturing processes are limited, making it a crucial development in the industry [4][9] - Companies are exploring both horizontal (scale out) and vertical (scale up) expansion strategies to optimize supernode performance [5][9] Group 2: Market Dynamics - Domestic AI chip manufacturers are increasing their market share in AI servers, with the proportion of externally sourced chips expected to drop from 63% to 49% this year [10] - Companies like墨芯人工智能 are adopting strategies that focus on specific AI applications, such as inference optimization, to compete with established players like NVIDIA [10][11] - The competitive landscape is shifting, with firms like云天励飞 and后摩智能 targeting niche markets in edge computing and AI inference, avoiding direct competition with larger chip manufacturers [11][12][13] Group 3: Technological Innovations - The introduction of optical interconnects in supernode technology is a significant advancement, providing high bandwidth and low latency for AI workloads [6][9] - Companies are developing solutions that leverage optical communication to enhance the performance of AI chip clusters, addressing the limitations of traditional electrical interconnects [6][9] - The focus on sparse computing techniques allows for lower manufacturing process requirements, enabling more efficient AI model computations [11][12]
南京经开区元素闪耀世界人工智能大会
Jiang Nan Shi Bao· 2025-07-28 13:55
Group 1: Event Overview - The 2025 World Artificial Intelligence Conference and High-Level Meeting on Global AI Governance opened in Shanghai on July 26, focusing on the theme "Intelligent Era, Global Cooperation" [1] - The exhibition area exceeded 70,000 square meters for the first time, attracting over 800 companies and showcasing more than 3,000 cutting-edge exhibits, including over 100 "global debuts" and "China premieres," marking the largest scale in history [1] Group 2: Company Highlights - Out of the Door - Out of the Door showcased its latest Agentic AI hardware product matrix, including the TicNote AI recording pen, and launched the "Voices of the Hutong" AI art exhibition, highlighting the integration of AI and humanities [2] - The exhibition utilized multi-dimensional presentations, allowing visitors to "hear" and "see" the stories behind community life, emphasizing the importance of human memory in the AI era [3] - TicNote is designed to serve as a "thinking partner" for users, applicable in various scenarios such as meetings, interviews, and academic lectures, showcasing the company's full-stack soft and hard integration capabilities [3][4] Group 3: Company Highlights - Aftermo Intelligent - Aftermo Intelligent unveiled its self-developed edge AI chip "Aftermo M50," which enables local large model inference capabilities without relying on cloud services, addressing the growing demand for intelligent interaction in offline scenarios [5][6] - The M50 chip achieves 160 TOPS@INT8 and 100 TFLOPS@bFP16 physical computing power, with a typical power consumption of only 10W, significantly improving energy efficiency by 5 to 10 times compared to traditional architectures [6][7] - The company aims to create a new ecosystem for edge intelligence that is low-power, secure, and user-friendly, enabling local processing of data to mitigate risks associated with cloud transmission [7]