Workflow
Super Grok Heavy订阅服务
icon
Search documents
马斯克xAI发布Grok 4:训练算力提升100倍,多项测试中领先第二名一倍
Feng Huang Wang· 2025-07-10 06:20
Core Insights - xAI has launched its latest large language model, Grok 4, which shows significant performance improvements over its predecessor, Grok 3, with a 100-fold increase in training computational power [1] - Grok 4 achieved a 25% problem-solving rate in the "Humanities Last Exam" benchmark, while the multi-agent version, Grok 4 Heavy, exceeded 50% [1] - The company is focusing on enhancing multi-modal understanding capabilities and has released an API for Grok 4, supporting a context length of 256K [2] Model Performance - Grok 4 demonstrates superior reasoning capabilities in standardized tests, including GPQA and AIME, and achieved a perfect score in the Live Coding Bench test [2] - The model integrates tool usage directly into its training process, improving reliability in complex task handling [2] Commercialization Efforts - xAI has introduced a subscription service, Super Grok Heavy, allowing users to access both Grok 4 and Grok 4 Heavy [3] - The company plans to develop a dedicated programming model and initiate video generation model training using over 100,000 H200 GPUs in the coming weeks [3] - The release of Grok 4 marks a significant breakthrough in the competitive landscape of large language models, particularly in reasoning and multi-agent collaboration [3]