Workflow
Grok 4 Heavy
icon
Search documents
X @Elon Musk
Elon Musk· 2025-08-25 06:39
RT Tetsuo (@tetsuoai)Build an oscilloscope with Grok 4 Heavy.Music Credit: Jerobeam Fenderson https://t.co/rJSH0Qi6EF ...
X @Elon Musk
Elon Musk· 2025-08-23 22:45
RT X Freeze (@amXFreeze)Did you know Grok 4 Heavy is as powerful as the Starship Super Heavy booster is in space?That’s where SuperGrok Heavy gets its name fromit's built for big & bold missions https://t.co/4wFI2AsU82 ...
OpenAI最强模型GPT-5来了!免费可用,Altman高呼迈向AGI一大步,微软抢先接入
硬AI· 2025-08-08 01:36
Core Viewpoint - OpenAI has launched GPT-5, which integrates reasoning capabilities from the o-series models with the rapid response abilities of the GPT series, marking a significant upgrade and a step towards achieving Artificial General Intelligence (AGI) [2][3][24]. Group 1: Model Performance - GPT-5 achieved a 74.9% accuracy rate in the SWE-bench Verified coding test, surpassing the previous models and slightly exceeding Anthropic's Claude Opus 4.1 [3][7]. - The hallucination issue has significantly improved, with an error rate of only 1.6% in health-related information, compared to 15.8% for GPT-4o [14][19]. - In general, GPT-5's responses contain factual errors 45% less frequently than GPT-4o and 80% less than o3, with an overall error rate of 4.8% [17][19]. Group 2: Key Advantages - GPT-5 excels in three main areas: programming, creative writing, and health consultation [5]. - In programming, GPT-5 can generate complex applications and websites from simple prompts, showing improvements in design choices [6][12]. - For creative writing, GPT-5 demonstrates better taste and can handle complex writing tasks more naturally [12]. - In health consultation, GPT-5 actively identifies potential health issues and interprets medical results, although it is not a substitute for professional medical advice [14]. Group 3: User Access and Integration - Starting from August 7, GPT-5 is available to all ChatGPT users, with different usage limits for free and paid subscribers [4]. - Microsoft has integrated GPT-5 into its product suite, including Microsoft 365 Copilot and Azure AI Foundry, allowing users to leverage its advanced reasoning capabilities [26][27]. Group 4: Safety and Customization - OpenAI has introduced a new safety training mode called "safe completions," which helps the model provide useful answers while minimizing risks [19][20]. - GPT-5 offers four customizable personality presets for ChatGPT interactions, enhancing user experience by matching communication styles [21][22].
X @外汇交易员
外汇交易员· 2025-08-08 00:58
OpenAI周四发布全新GPT5系列,共4个版本,分别为GPT-5 、 GPT-5 mini、 GPT-5 nano,以及只对企业版和每月200美元高级版开放的GPT-5 Pro模式。在编程能力方面,GPT-5是OpenAI目前最强大的编程模型,在复杂的前端生成和大型代码库调试方面表现突出,能够仅凭一个提示就创建美观响应式的网站、应用程序App和游戏。在从GitHub获取现实世界编码任务的基准测试SWE-bench Verified中,GPT-5思考后首次尝试的准确率达74.9%,高于OpenAI推理模型o3的69.1%和GPT-4o的30.8%(略胜于Claude Opus 4.1和谷歌Gemini 2.5 Pro,后两者得分分别为74.5%和59.6%)。不过在前沿数学测试集方面,GPT-5的效果还不如ChatGPT Agent,只有使用Pro模式下才会更强。在衡量数学、人文和自然科学领域模型表现的各学科专家级能力Humanity’s Last Exam测试方面,带有扩展推理功能的GPT-5增强版本GPT-5 pro在使用工具的情况下得分42%。略低于得分44.4%的xAI模型Grok 4 Heavy。O ...
全网开测GPT-oss!技术架构也扒明白了
量子位· 2025-08-07 00:56
Core Insights - The article highlights the impressive performance of GPT-oss, which surpasses many existing open-source models and is poised to lead in the SaaS fast-fashion era [1][3][4]. Performance Testing - GPT-oss has successfully passed multiple performance tests, achieving top rankings in various benchmarks, including GPQA Diamond, AIME 2024, AIME 2025, and Codeforces, outperforming models like DeepSeek R1, Qwen3, and Llama 4 [5][6]. - In the MMLU benchmark, GPT-oss achieved scores of 85.9 for the low 120B model and 88 for the medium model, while Qwen3-235B performed slightly better in MMLU [6][7]. Model Architecture - The architecture of GPT-oss is noted for its wider structure, more attention heads, and higher hidden dimensions compared to similar models, incorporating advanced techniques such as attention bias units [22][24][26]. - The model retains the core MoE Transformer architecture while optimizing performance and reducing complexity, making it suitable for open-source applications [26][28]. Cost and Training - The estimated cost for training the GPT-oss-120B model is between $4.2 million and $23.1 million, while the 20B model costs between $420,000 and $2.3 million [30]. - There are indications that the model may have limitations in non-English text performance, with a significant portion of responses containing grammatical or spelling errors [30]. User Applications - Users have begun exploring various applications for GPT-oss, including its integration into platforms for academic paper understanding and data transformation [17][19][20]. - The model can be easily accessed and utilized through platforms like LM Studio and AWS, facilitating rapid development of AI applications [33][34]. Community Engagement - The article encourages users to test GPT-oss and share their experiences, indicating a growing community interest in the model's capabilities [39].
X @Elon Musk
Elon Musk· 2025-08-02 09:09
RT X Freeze (@amXFreeze)Grok 4 Heavy unlocks you instant access to all of this and there’s much more on the way!... https://t.co/l4xhoowZkW ...
X @Elon Musk
Elon Musk· 2025-07-30 17:49
Product Features - Grok Heavy subscribers receive priority access to all Grok beta releases for other products [1] - Grok 4 Heavy is considered superior to other $200/monthly plans like Anthropic, OpenAI, or Gemini due to its unique features [1] - Grok Heavy utilizes multiple specialized agents [1]
Grok 4遥遥领先,但马斯克想要得更多
首席商业评论· 2025-07-21 03:34
Core Viewpoint - The article discusses the ambitious and high-risk strategy of Elon Musk's xAI, focusing on the launch of the Grok 4 AI model, its capabilities, and the financial challenges the company faces in the competitive AI landscape [5][10][25]. Group 1: Grok 4 AI Model - Grok 4 is touted as the "world's strongest AI model," achieving graduate-level performance across various disciplines and outperforming competitors in standardized tests [5][8]. - Grok 4's training volume is 100 times that of Grok 2, with a computational power investment in reinforcement learning exceeding that of any other model on the market [8]. - The subscription fee for Grok 4 is set at $30 per month, while the more advanced Grok 4 Heavy version costs $300 per month [8]. Group 2: Financial Challenges - xAI's monthly expenditure is reported to be around $1 billion, with total expenses projected to reach $13 billion in 2024 against an expected revenue of only $500 million [11][22]. - The company is pursuing an aggressive hardware strategy, planning to build a supercomputer with 1 million NVIDIA GPUs, which could cost between $5 billion and $62.5 billion [13]. - xAI's financial needs are compounded by debts incurred from Musk's acquisition of Twitter, which adds to the company's overall financial burden [15]. Group 3: Competitive Landscape - The AI industry is shifting from a "scale race" to a focus on efficiency and application depth, with competitors like OpenAI and Anthropic making significant advancements [16][20]. - xAI's current revenue heavily relies on the X Premium subscription service, with only 20 million active users compared to OpenAI's 100 million [23]. - Analysts predict that xAI could achieve profitability by 2027, potentially outpacing OpenAI, but this is contingent on several optimistic assumptions [22]. Group 4: Strategic Positioning - Musk's strategy involves leveraging the data and computational resources from his other ventures, such as Tesla and Twitter, to enhance Grok's capabilities and reduce costs [20]. - The article highlights the ongoing debate about the definition of AGI, with Musk positioning Grok as a potential leader in this space, despite skepticism about its current capabilities [19].
Grok 4遥遥领先,但马斯克想要得更多
Sou Hu Cai Jing· 2025-07-19 13:37
Core Insights - Elon Musk has introduced "the world's strongest AI model," Grok 4, which is claimed to achieve graduate-level proficiency across all subjects and outperform many PhD candidates [3][6] - Grok 4 has demonstrated near-perfect scores on standardized tests like SAT and GRE, and has outperformed competitors in the "Human Ultimate Exam" [3] - The financial demands of xAI are significant, with monthly expenditures reaching $1 billion, and projected total expenses of $13 billion in 2024 against revenues of only $500 million [7][10] Group 1: Model Performance and Features - Grok 4 and Grok 4 Heavy are reasoning models, with the latter supporting four agents working simultaneously [3] - Grok 4 excels in multi-language support, particularly in English and coding tasks, surpassing competitors like Kimi [3] - The model's deep thinking capabilities make it more reliable in research scenarios compared to faster alternatives [3][6] Group 2: Financial Strategy and Challenges - xAI's aggressive hardware strategy involves building its own infrastructure rather than relying on rented computing power, with plans for a supercomputer costing between $5 billion and $62.5 billion [10] - The company is employing a "burn money for leadership" strategy, aiming for rapid advancements through significant financing and hardware investments [10][12] - xAI's financial situation is compounded by debts from Musk's acquisition of Twitter, which adds pressure to its funding needs [11] Group 3: Market Position and Competition - The AI landscape is shifting from a "scale race" to a focus on efficiency and application depth, with competitors like Claude 4 and Gemini 2.5 Pro making significant strides [13] - Musk's vision for Grok includes potential breakthroughs in technology and physics, but the definition of AGI remains contentious [15] - xAI aims to achieve profitability by 2027, with Morgan Stanley predicting revenues could exceed $13 billion by 2029, potentially outpacing OpenAI [17]
X @Elon Musk
Elon Musk· 2025-07-18 18:15
Model Performance - Grok 4 Heavy excels in coding compared to other models [1] - Grok 4 can generate code that Gemini struggles with, even with multiple prompts [1]