Grok 4 Heavy

X @Elon Musk
Elon Musk· 2025-09-29 05:25
Model Performance
- Grok 4 Heavy utilizes multiple agents working simultaneously on the same task to improve performance [1]
- Agents compare findings and agree on a final answer, enhancing accuracy [1]
- Grok 4 Heavy is designed for tackling the toughest problems [1]
- The model is continuously improving on a weekly basis [1]
Product Information
- Grok 4 Heavy is a version of Grok 4 [1]
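The post describes several agents working on the same task in parallel and then agreeing on one answer. A minimal sketch of that pattern is shown below. It assumes a simple majority vote as the agreement step, which the post does not specify, and the names `solve_with_agents`, `agent_a`, `agent_b`, and `agent_c` are hypothetical stand-ins, not part of any xAI API.

```python
from collections import Counter
from concurrent.futures import ThreadPoolExecutor
from typing import Callable, List

# An "agent" here is any function that maps a task to an answer (hypothetical).
Agent = Callable[[str], str]


def solve_with_agents(task: str, agents: List[Agent]) -> str:
    """Run every agent on the same task in parallel and return the majority answer."""
    if not agents:
        raise ValueError("at least one agent is required")
    with ThreadPoolExecutor(max_workers=len(agents)) as pool:
        # pool.map keeps the answers in the same order as the agents.
        answers = list(pool.map(lambda agent: agent(task), agents))
    # Assumed agreement rule: majority vote; ties go to the answer seen first.
    winner, _count = Counter(answers).most_common(1)[0]
    return winner


# Hypothetical stand-in agents that return fixed answers.
def agent_a(task: str) -> str:
    return "42"


def agent_b(task: str) -> str:
    return "42"


def agent_c(task: str) -> str:
    return "41"


final_answer = solve_with_agents("What is 6 * 7?", [agent_a, agent_b, agent_c])
print(final_answer)  # prints 42
```

A real system would likely let agents exchange intermediate findings rather than only vote on final answers; the vote is used here only because it is the simplest way to show the parallel-then-agree structure.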
X @Elon Musk
Elon Musk· 2025-08-25 06:39
Technology & Innovation
- Grok 4 Heavy was used to build an oscilloscope [1]
Multimedia Resources
- Uses music by Jerobeam Fenderson [1]
X @Elon Musk
Elon Musk· 2025-08-23 22:45
Technology Comparison
- Grok 4 Heavy's performance is comparable to the power of the Starship Super Heavy rocket booster in space [1]
Product Naming
- The SuperGrok Heavy name was inspired by its being designed for large and bold tasks [1]
OpenAI's most powerful model, GPT-5, is here! Free to use, Altman hails a major step toward AGI, and Microsoft is first to integrate it
硬AI· 2025-08-08 01:36
Core Viewpoint
- OpenAI has launched GPT-5, which integrates reasoning capabilities from the o-series models with the rapid response abilities of the GPT series, marking a significant upgrade and a step towards achieving Artificial General Intelligence (AGI) [2][3][24].
Group 1: Model Performance
- GPT-5 achieved a 74.9% accuracy rate in the SWE-bench Verified coding test, surpassing the previous models and slightly exceeding Anthropic's Claude Opus 4.1 [3][7].
- The hallucination issue has significantly improved, with an error rate of only 1.6% in health-related information, compared to 15.8% for GPT-4o [14][19].
- In general, GPT-5's responses contain factual errors 45% less frequently than GPT-4o and 80% less than o3, with an overall error rate of 4.8% [17][19].
Group 2: Key Advantages
- GPT-5 excels in three main areas: programming, creative writing, and health consultation [5].
- In programming, GPT-5 can generate complex applications and websites from simple prompts, showing improvements in design choices [6][12].
- For creative writing, GPT-5 demonstrates better taste and can handle complex writing tasks more naturally [12].
- In health consultation, GPT-5 actively identifies potential health issues and interprets medical results, although it is not a substitute for professional medical advice [14].
Group 3: User Access and Integration
- Starting from August 7, GPT-5 is available to all ChatGPT users, with different usage limits for free and paid subscribers [4].
- Microsoft has integrated GPT-5 into its product suite, including Microsoft 365 Copilot and Azure AI Foundry, allowing users to leverage its advanced reasoning capabilities [26][27].
Group 4: Safety and Customization
- OpenAI has introduced a new safety training mode called "safe completions," which helps the model provide useful answers while minimizing risks [19][20].
- GPT-5 offers four customizable personality presets for ChatGPT interactions, enhancing user experience by matching communication styles [21][22].
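The error-rate figures in this summary can be put on a common footing with a little arithmetic. The sketch below computes the relative reduction in health-related errors from the two quoted rates (15.8% and 1.6%). It also back-calculates the baselines implied by the "45% less" and "80% less" claims, under the assumption (not stated in the summary) that those claims and the 4.8% overall rate refer to the same measurement. The function names are illustrative.

```python
def relative_reduction(old_rate: float, new_rate: float) -> float:
    """Fraction by which new_rate is lower than old_rate."""
    return (old_rate - new_rate) / old_rate


def implied_baseline(new_rate: float, reduction: float) -> float:
    """Baseline rate implied by a new rate and a claimed relative reduction."""
    return new_rate / (1.0 - reduction)


# Health-related error rates quoted in the summary (percent).
health_reduction = relative_reduction(15.8, 1.6)
print(f"Health error reduction vs GPT-4o: {health_reduction:.1%}")  # 89.9%

# ASSUMPTION: 4.8% is the rate to which the 45% and 80% reductions apply.
gpt4o_implied = implied_baseline(4.8, 0.45)  # about 8.7 percent
o3_implied = implied_baseline(4.8, 0.80)     # about 24 percent
print(f"Implied GPT-4o overall error rate: {gpt4o_implied:.1f}%")
print(f"Implied o3 overall error rate: {o3_implied:.1f}%")
```

The implied baselines are only as good as that assumption; the summary does not report GPT-4o's or o3's overall error rates directly.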
X @外汇交易员
外汇交易员· 2025-08-08 00:58
Model Performance
- OpenAI releases the GPT-5 series in four versions: GPT-5, GPT-5 mini, GPT-5 nano, and GPT-5 Pro (enterprise and the $200/month premium tier) [1]
- GPT-5 achieves 74.9% first-try accuracy on the SWE-bench Verified benchmark, surpassing o3's 69.1% and GPT-4o's 30.8% [1]
- GPT-5 Pro scores 42% on Humanity's Last Exam with tools, slightly below Grok 4 Heavy's 44.4% [2]
- OpenAI claims GPT-5 is more reliable and practical, and hallucinates less [2]
Programming Capabilities
- GPT-5 excels in complex front-end generation and large codebase debugging, creating responsive websites, apps, and games from a single prompt [1]
Safety and Features
- OpenAI introduces "safe completions," a new safety training method for GPT-5 [2]
- OpenAI will launch four new research preview personalities for all ChatGPT users: Cynic, Robot, Listener, and Nerd [2]
The whole internet is testing GPT-oss! Its technical architecture has been dissected too
量子位· 2025-08-07 00:56
Core Insights
- The article highlights the impressive performance of GPT-oss, which surpasses many existing open-source models and is poised to lead in the SaaS fast-fashion era [1][3][4].
Performance Testing
- GPT-oss has successfully passed multiple performance tests, achieving top rankings in various benchmarks, including GPQA Diamond, AIME 2024, AIME 2025, and Codeforces, outperforming models like DeepSeek R1, Qwen3, and Llama 4 [5][6].
- In the MMLU benchmark, GPT-oss achieved scores of 85.9 for the low 120B model and 88 for the medium model, while Qwen3-235B performed slightly better in MMLU [6][7].
Model Architecture
- The architecture of GPT-oss is noted for its wider structure, more attention heads, and higher hidden dimensions compared to similar models, incorporating advanced techniques such as attention bias units [22][24][26].
- The model retains the core MoE Transformer architecture while optimizing performance and reducing complexity, making it suitable for open-source applications [26][28].
Cost and Training
- The estimated cost for training the GPT-oss-120B model is between $4.2 million and $23.1 million, while the 20B model costs between $420,000 and $2.3 million [30].
- There are indications that the model may have limitations in non-English text performance, with a significant portion of responses containing grammatical or spelling errors [30].
User Applications
- Users have begun exploring various applications for GPT-oss, including its integration into platforms for academic paper understanding and data transformation [17][19][20].
- The model can be easily accessed and utilized through platforms like LM Studio and AWS, facilitating rapid development of AI applications [33][34].
Community Engagement
- The article encourages users to test GPT-oss and share their experiences, indicating a growing community interest in the model's capabilities [39].
X @Elon Musk
Elon Musk· 2025-08-02 09:09
Product Offering
- Grok 4 Heavy provides instant access to various features, with more to come [1]
X @Elon Musk
Elon Musk· 2025-07-30 17:49
Product Features
- Grok Heavy subscribers receive priority access to all Grok beta releases for other products [1]
- Grok 4 Heavy is considered superior to other $200/month plans, such as those from Anthropic, OpenAI, or Gemini, due to its unique features [1]
- Grok Heavy utilizes multiple specialized agents [1]
Grok 4 is far ahead, but Musk wants more
首席商业评论· 2025-07-21 03:34
Core Viewpoint
- The article discusses the ambitious and high-risk strategy of Elon Musk's xAI, focusing on the launch of the Grok 4 AI model, its capabilities, and the financial challenges the company faces in the competitive AI landscape [5][10][25].
Group 1: Grok 4 AI Model
- Grok 4 is touted as the "world's strongest AI model," achieving graduate-level performance across various disciplines and outperforming competitors in standardized tests [5][8].
- Grok 4's training volume is 100 times that of Grok 2, with a computational power investment in reinforcement learning exceeding that of any other model on the market [8].
- The subscription fee for Grok 4 is set at $30 per month, while the more advanced Grok 4 Heavy version costs $300 per month [8].
Group 2: Financial Challenges
- xAI's monthly expenditure is reported to be around $1 billion, with total expenses projected to reach $13 billion in 2024 against an expected revenue of only $500 million [11][22].
- The company is pursuing an aggressive hardware strategy, planning to build a supercomputer with 1 million NVIDIA GPUs, which could cost between $5 billion and $62.5 billion [13].
- xAI's financial needs are compounded by debts incurred from Musk's acquisition of Twitter, which adds to the company's overall financial burden [15].
Group 3: Competitive Landscape
- The AI industry is shifting from a "scale race" to a focus on efficiency and application depth, with competitors like OpenAI and Anthropic making significant advancements [16][20].
- xAI's current revenue heavily relies on the X Premium subscription service, with only 20 million active users compared to OpenAI's 100 million [23].
- Analysts predict that xAI could achieve profitability by 2027, potentially outpacing OpenAI, but this is contingent on several optimistic assumptions [22].
Group 4: Strategic Positioning
- Musk's strategy involves leveraging the data and computational resources from his other ventures, such as Tesla and Twitter, to enhance Grok's capabilities and reduce costs [20].
- The article highlights the ongoing debate about the definition of AGI, with Musk positioning Grok as a potential leader in this space, despite skepticism about its current capabilities [19].
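The financial figures quoted in this summary are easier to compare after some basic arithmetic. The sketch below derives the implied cost per GPU from the quoted supercomputer range, and the 2024 expense-to-revenue ratio and shortfall. All inputs are taken from the summary above; the variable names are illustrative, and the per-GPU figure assumes the quoted range covers exactly 1 million GPUs and nothing else.

```python
# Figures quoted in the summary (US dollars).
GPU_COUNT = 1_000_000
SUPERCOMPUTER_COST_LOW = 5_000_000_000
SUPERCOMPUTER_COST_HIGH = 62_500_000_000
EXPENSES_2024 = 13_000_000_000
REVENUE_2024 = 500_000_000

# ASSUMPTION: the whole quoted range is attributed to the GPUs themselves.
per_gpu_low = SUPERCOMPUTER_COST_LOW / GPU_COUNT
per_gpu_high = SUPERCOMPUTER_COST_HIGH / GPU_COUNT

# How many dollars are spent per dollar of revenue, and the gap between them.
expense_to_revenue = EXPENSES_2024 / REVENUE_2024
shortfall = EXPENSES_2024 - REVENUE_2024

print(f"Implied cost per GPU: ${per_gpu_low:,.0f} to ${per_gpu_high:,.0f}")
print(f"Expenses are {expense_to_revenue:.0f}x revenue")
print(f"Projected shortfall: ${shortfall / 1e9:.1f} billion")
```

The wide per-GPU range reflects how loosely the total cost is specified in the source, so it should be read as a bound rather than an estimate.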
Grok 4 is far ahead, but Musk wants more
Sou Hu Cai Jing· 2025-07-19 13:37
Core Insights
- Elon Musk has introduced "the world's strongest AI model," Grok 4, which is claimed to achieve graduate-level proficiency across all subjects and outperform many PhD candidates [3][6]
- Grok 4 has demonstrated near-perfect scores on standardized tests like the SAT and GRE, and has outperformed competitors on "Humanity's Last Exam" [3]
- The financial demands of xAI are significant, with monthly expenditures reaching $1 billion, and projected total expenses of $13 billion in 2024 against revenues of only $500 million [7][10]
Group 1: Model Performance and Features
- Grok 4 and Grok 4 Heavy are reasoning models, with the latter supporting four agents working simultaneously [3]
- Grok 4 excels in multi-language support, particularly in English and coding tasks, surpassing competitors like Kimi [3]
- The model's deep thinking capabilities make it more reliable in research scenarios compared to faster alternatives [3][6]
Group 2: Financial Strategy and Challenges
- xAI's aggressive hardware strategy involves building its own infrastructure rather than relying on rented computing power, with plans for a supercomputer costing between $5 billion and $62.5 billion [10]
- The company is employing a "burn money for leadership" strategy, aiming for rapid advancements through significant financing and hardware investments [10][12]
- xAI's financial situation is compounded by debts from Musk's acquisition of Twitter, which adds pressure to its funding needs [11]
Group 3: Market Position and Competition
- The AI landscape is shifting from a "scale race" to a focus on efficiency and application depth, with competitors like Claude 4 and Gemini 2.5 Pro making significant strides [13]
- Musk's vision for Grok includes potential breakthroughs in technology and physics, but the definition of AGI remains contentious [15]
- xAI aims to achieve profitability by 2027, with Morgan Stanley predicting revenues could exceed $13 billion by 2029, potentially outpacing OpenAI [17]