Grok 4.1
Search documents
马斯克:Grok 4.20下周发布,较4.1版改进重大
Sou Hu Cai Jing· 2026-02-15 09:41
更引人注目的是,当时 Grok 4.1 无需深度思考的"即时响应"版本也以 1465 的 Elo 分数位列第二,性能甚至超越了其他所有模型的"全推理"模式。这一成绩 相较于前代 Grok 4(IT之家注:排名第 33 位)实现了巨大飞跃,也印证了其在底层能力上的绝对优势。 作为参考,Grok 4.1 发布于去年 11 月,继承前代模型敏锐的智能与高可靠性,在创造性、情感理解和协作互动方面实现了重大改进,当时在 LMArena 文本 能力排行榜以 1483 的 Elo 分数高居榜首,领先第二名达 31 分。 IT之家 2 月 15 日消息,xAI CEO 埃隆 · 马斯克今天在 X 平台表示,Grok 4.20 将在下周发布,相比 4.1 版改进重大。 值得注意的是,Grok 4.1 当时还改进了"幻觉"出现率,为用户提供更可靠、更准确的信息。 ...
又见印奇
3 6 Ke· 2026-01-27 00:25
Core Insights - The article discusses the evolution of AI commercialization, focusing on the experiences and insights of Yin Qi, founder of Megvii Technology, and his current role at StepFun. It highlights the challenges faced in the AI 1.0 era and the shift towards more viable business models in the AI 2.0 landscape. Group 1: AI Commercialization Challenges - Yin Qi reflects on the difficulties of closing the commercial loop during the AI 1.0 era, which significantly impacted his ventures [3] - He emphasizes that once a business model fails, it is challenging to revert, leading to a lack of scalable profits and viable products [4] - The majority of the "Six Little Tigers" in the AI sector are still in the early stages of commercialization, struggling to find effective business models [4] Group 2: Insights on Competitors and Market Dynamics - Yin Qi expresses skepticism about the commercialization strategies of many AI startups in Silicon Valley, noting that Google has an advantage due to its established revenue streams [4] - He identifies xAI, associated with Tesla, as having a potentially successful commercial model due to its strong integration of software and hardware capabilities [5] Group 3: StepFun's Strategic Direction - StepFun has recently secured over 5 billion RMB in funding, setting a record for single financing rounds in the domestic large model sector [6] - The company aims to combine AI with smart terminals, focusing on hardware development alongside foundational model research [7][10] - StepFun's recent release of the Step3-VL-10B model demonstrates superior performance in benchmarks compared to larger models, indicating a strong position in the market [8] Group 4: Talent and Team Composition - StepFun's team comprises top talents from Megvii and Microsoft, maintaining a high density of expertise and a balanced skill set [12] - Yin Qi hopes to attract back some of the talent that has left for other companies in the sector, emphasizing the importance of a strong team for future success [13] Group 5: Long-term Vision and Philosophy - Yin Qi advocates for a long-term approach to business, focusing on delivering tangible commercial results rather than merely pursuing theoretical advancements [15] - He acknowledges a shift from a passionate to a more pragmatic mindset, prioritizing clear customer and commercial value in AI developments [15]
马斯克旗下AI企业,斥资超200亿美元加码算力基建
财联社· 2026-01-09 16:07
Core Viewpoint - The article highlights xAI's ambitious plans to invest over $20 billion in building a data center in South Haven, Mississippi, driven by the increasing demand for computational power due to the generative AI boom [2][4]. Group 1: Investment and Expansion Plans - xAI plans to commence operations at the new data center in February 2026 [3]. - The data center will be located near a recently acquired power plant and close to xAI's existing data center in Memphis, Tennessee [4]. - The company has already established a supercomputer cluster named "Colossus" in Memphis, which is touted as the largest of its kind globally [4]. Group 2: Competitive Landscape - xAI's expansion reflects its ambition to compete more effectively with industry leaders like OpenAI's ChatGPT and Anthropic's Claude by training more advanced models [4]. - The latest version of xAI's flagship chatbot, Grok 4.1, was released in November, with Grok 5 expected in Q1 of the following year, which has a 10% chance of achieving Artificial General Intelligence (AGI) [4]. Group 3: Financial Performance and Projections - In the first nine months of the previous year, xAI consumed $7.8 billion in cash, with a net loss of $1.46 billion recorded in Q3 [5]. - The company has raised $20 billion in its latest funding round, surpassing its initial target of $15 billion, leading to a valuation increase to $230 billion since last spring [5]. - Elon Musk expressed confidence that if xAI can navigate the next two to three years successfully, it could outperform competitors, with rapid expansion in computational power and data capacity being crucial [5].
X @Tesla Owners Silicon Valley
Tesla Owners Silicon Valley· 2025-12-23 06:55
RT Tesla Owners Silicon Valley (@teslaownersSV)Grok 4.1 outperforms Gemini Pro 3 and GPT-5.1 on real-world codebases, APIs, and complex algorithms with an end-to-end accuracy of 85.6% on DeepCodeBench.designed with developers in mind. tested in real-world situations. https://t.co/G7SzCxNtyu ...
X @Tesla Owners Silicon Valley
Tesla Owners Silicon Valley· 2025-12-23 06:35
Highest precision on DeepCodeBench.When it comes to understanding, generating, and reasoning real-world code, Grok 4.1 outperforms Gemini Pro 3 and GPT-5.1.Grok is ahead; this is measurable performance, not hype. https://t.co/CP2djwhRxa ...
X @Tesla Owners Silicon Valley
Tesla Owners Silicon Valley· 2025-12-23 04:22
Grok 4.1 outperforms Gemini Pro 3 and GPT-5.1 on real-world codebases, APIs, and complex algorithms with an end-to-end accuracy of 85.6% on DeepCodeBench.designed with developers in mind. tested in real-world situations. https://t.co/G7SzCxNtyu ...
X @Tesla Owners Silicon Valley
Tesla Owners Silicon Valley· 2025-12-22 07:33
RT Tesla Owners Silicon Valley (@teslaownersSV)🚨 BREAKING: Grok 4.1 Fast dominates OpenRouter – #1 in token usage with trillions processed, fastest responses, top intelligence, and unmatched cost-performance.The most used, most efficient, most powerful model out there. https://t.co/qC10HD0Eur ...
X @Tesla Owners Silicon Valley
Tesla Owners Silicon Valley· 2025-12-22 06:30
Model Performance - Grok 4.1 is the 1 model on OpenRouter in token usage, processing trillions of tokens [1] - Grok 4.1 offers the fastest responses [1] - Grok 4.1 provides top intelligence [1] - Grok 4.1 delivers unmatched cost-performance [1] Market Position - Grok 4.1 is the most used model [1] - Grok 4.1 is the most efficient model [1] - Grok 4.1 is the most powerful model [1]
X @Elon Musk
Elon Musk· 2025-12-22 06:25
AI Model Development Roadmap - Grok 3 was released in February [1] - Grok 4 was released in July [1] - Grok Imagine was released in July [1] - Grok Code Fast 1 was released in August [1] - Grok 4 Fast was released in September [1] - Grokipedia was released in October [1] - Grok 4.1 was released in November [1] - Grok 4.1 Fast was released in November [1] - Grok Voice Agent API is scheduled for release in December [1]
X @Tesla Owners Silicon Valley
Tesla Owners Silicon Valley· 2025-12-22 04:17
🚀 BREAKING Grok 4.1 – Dominating the Frontier! 🚀🥇 #1 on LMArena Text Arena (Thinking mode: 1483 Elo)🥇 #1 in Emotional Intelligence (EQ-Bench v3)🥇 #1 in Creative Writing (v3 benchmark)🥇 #1 in Agentic Tool Use (τ²-Bench)The most human-like, reliable, and capable AI yet. Built by xAI to push the boundaries of intelligence.Try Grok 4.1 now on https://t.co/KaH5w8JGff or the X app! ...