Workflow
Model Architecture
icon
Search documents
X @Polyhedra
Polyhedra· 2025-09-25 12:00
6/Currently working on Gemma3 quantization, focusing on:- Learning the new model architecture- Adding KV cache support (which accelerates inference)- Implementing quantization support for some new operators-- Full operator support will require 1+ additional day, plus more time for accuracy testingStay tuned for more updates 🔥 ...
X @xAI
xAI· 2025-08-28 18:12
We built Grok Code Fast 1 from scratch, starting with a brand-new lightweight model architecture.Combined with novel improvements to accelerate serving efficiency, Grok Code Fast 1 sets a new standard for both speed and affordability. https://t.co/p04xX7uf8w ...