Workflow
KernelCAT
icon
Search documents
【新晋会员风采】智子芯元加入深圳市人工智能行业协会
Sou Hu Cai Jing· 2026-02-06 08:27
Company Overview - Zhizi Xinyuan (Shenzhen) Technology Co., Ltd. was incubated by the Shenzhen Big Data Research Institute and established only five months ago. The company focuses on optimizing AI computing through "mathematics + AI," generating high-performance operators through automation to continuously drive innovation at the computing power level [2] - The team at Zhizi Xinyuan reflects the unique advantages of the integration of industry, academia, and research in Shenzhen. The founder, Ding Tian, was a former technical leader in Huawei's 2012 Lab focusing on "black box optimization." The core members include the former CTO of a leading large model company and experts in operators from top chip manufacturers, with nearly half of the R&D team holding gold medals from international mathematics and computer competitions [2] Industry Positioning - Zhizi Xinyuan positions itself as a "bridge" in the domestic computing power ecosystem. One end of the bridge connects to domestic computing power chips such as Huawei Ascend, Zhongke Haiguang, and Cambricon, while the other end connects to the "rapidly iterating and extremely hungry" AI application scenarios, ranging from general large models like DeepSeek and Qwen to vertical applications in autonomous driving, embodied intelligence, and biomedicine [3] - The goal is to allow software developers to focus solely on algorithm innovation, with underlying adaptations handled by automated tools. When this bridge is wide and stable enough, it will evolve from a tool into a standard [3] Product Introduction - KernelCAT is an expert-level agent operator that acts as a "translator" between AI algorithms and computing chips. It converts algorithms into hardware-executable instructions, determining the inference speed, energy consumption, and compatibility of AI models. The development of operators is currently seen as a "handcrafted workshop" process, heavily reliant on the experience of top engineers and trial-and-error, often taking months [6] - KernelCAT aims to address the challenges of operator development by utilizing AI to automate the process. Unlike traditional large models or knowledge-enhanced agents that struggle with complex computational tasks, KernelCAT is designed to understand the essence of the problem and achieve "intelligent-level" optimization [6] - The terminal version of KernelCAT is a locally running AI agent capable of both deep operator development and general full-stack development tasks. It offers both a command-line interface and a simplified desktop version for developers, providing robust programming capabilities beyond just specific tasks [6]
天下苦CUDA久矣,又一国产方案上桌了
量子位· 2026-01-30 13:34
Core Viewpoint - The article emphasizes that while domestic computing infrastructure has improved, the real challenge for developers lies in the usability of these systems, particularly in the context of AI development, where the existing software ecosystem remains heavily reliant on established foreign tools and frameworks [1][2]. Group 1: Current State of AI Development - The AI landscape is vibrant with numerous models being released, yet the underlying software ecosystem's maturity is a significant bottleneck for deployment efficiency [11][12]. - The development of high-performance operators (算子) is crucial as they serve as the "translators" between AI algorithms and hardware, impacting inference speed, energy consumption, and compatibility [13][14]. Group 2: KernelCAT Introduction - KernelCAT is introduced as a local AI agent designed to accelerate computing and facilitate model migration, capable of handling both specialized tasks and general software engineering duties [17]. - Unlike traditional tools, KernelCAT combines intelligent code understanding and optimization with operational research algorithms to automate parameter tuning, significantly reducing the time and effort required for optimization [21][22]. Group 3: Performance and Competitive Edge - In tests, KernelCAT demonstrated superior performance compared to both open-source and commercial operators, achieving execution times as low as 0.0077 ms for 1M scale tasks, which translates to acceleration ratios exceeding 200% [26]. - KernelCAT's unique approach allows it to optimize operators effectively, showcasing its potential to compete with established solutions in the market [25][27]. Group 4: Ecosystem Challenges - The article highlights that over 90% of significant AI training tasks currently run on NVIDIA GPUs, with a developer ecosystem that includes over 5.9 million users and more than 400 operators, indicating a substantial barrier for domestic alternatives [28][30]. - The success of NVIDIA is attributed to its comprehensive control over software and algorithms, underscoring the importance of a mature ecosystem for hardware performance to be fully realized [32]. Group 5: Future Directions - KernelCAT represents a shift towards building self-evolving computational foundations, moving away from reliance on existing ecosystems to developing capabilities that can adapt and grow independently [39]. - The article concludes with an invitation for users to experience KernelCAT, indicating its ongoing development and potential for broader adoption in the industry [40].