Workflow
Llama 3.1 8B模型
icon
Search documents
24人团队硬刚英伟达,AMD前高管梦之队出手,新芯片每秒17000个token
3 6 Ke· 2026-02-21 05:47
Core Insights - Taalas, a startup founded two years ago with a team of 24, has launched a new chip, HC1, which achieves a peak inference speed of 17,000 tokens per second, significantly outperforming competitors like Cerebras at 2,000 tokens per second [1][3][5] - The HC1 chip reduces costs by 20 times and power consumption by 10 times compared to existing solutions, enabling real-time response speeds for large language models (LLMs) [1][3] - Taalas's innovative approach involves embedding the model directly onto the silicon chip, which allows for a drastic increase in performance and efficiency [3][6] Company Overview - Taalas was founded by a team of former AMD executives, including Ljubiša Bajić, who has a strong background in high-performance GPU design [11][13] - The company focuses on developing a new architecture specifically for AI inference and training, emphasizing layered design and lattice networks [11][13] Technology and Performance - The HC1 chip utilizes TSMC's N6 process technology, with a compact size of 815mm² and a typical power consumption of 250W per chip [5][6] - By adopting a structured ASIC design philosophy, HC1 can quickly produce specialized AI inference chips at a lower cost, reducing the production cycle from six months to two months [6][8] - The chip's architecture allows for the storage of models and weights directly on the chip, enhancing speed and efficiency while maintaining some flexibility for model updates [8][10] Market Position and Future Plans - Taalas has raised $200 million in funding and plans to release a second-generation variant of HC1 in the spring, which will integrate a medium-sized inference model [13] - The company aims to deploy HC2 in the winter, which will feature higher density and faster operation [13] - Despite the impressive speed of HC1, there are concerns regarding its depth of inference and potential obsolescence due to rapid model iteration cycles [15][17]