编码模型
Search documents
马斯克首个编码模型上线,编程飙进Top5!这9位华人天团爆肝打造
Sou Hu Cai Jing· 2025-08-29 10:21
Core Insights - xAI has launched its first coding model, Grok Code Fast 1, which has achieved impressive performance in coding benchmarks, ranking among the top five models in SWE-bench [2][3][13] - The model is designed for speed and cost-effectiveness, with a unique architecture and a focus on programming tasks [9][11][12] - Grok Code is currently available for free for a limited time on major coding platforms [8] Performance Metrics - Grok Code scored 70.8% in the SWE-bench Verified benchmark, placing it just behind OpenAI's Codex-1 and Claude 4 Opus [3][13] - In the LiveCode Bench, it achieved a score of 62%, and a score of 4.3% in the mathematical IOI [3] - The model is reported to be five times faster than GPT-5 in coding tasks [9] Cost Structure - Grok Code is the most cost-effective coding model, with input pricing at $0.20 per million tokens, output at $1.5 per million tokens, and cached input at $0.02 per million tokens [6] Development and Team Composition - The development of Grok Code involved a diverse team, with a significant representation of Chinese scholars [16][21][40] - The project has evolved from a two-person team to a larger group of skilled researchers over a few months [20] Technical Innovations - The model utilizes a new architecture and a carefully curated dataset focused on real-world coding tasks, enhancing its performance [11][14] - xAI has implemented caching optimizations for prompt words, achieving a cache hit rate of over 90% during collaborative programming [12] User Experience and Applications - Grok Code demonstrates strong full-stack development capabilities, excelling in languages such as TypeScript, Python, Java, Rust, C++, and Go [15] - Users have reported rapid development times, with one developer creating a game prototype in just one day [6][15] Future Developments - Following the launch of Grok Code, xAI plans to release a multimodal agent in September and a video generation model in October [51][52]