Grok 4.1
X @Tesla Owners Silicon Valley
Tesla Owners Silicon Valley· 2025-12-23 06:55
RT Tesla Owners Silicon Valley (@teslaownersSV): Grok 4.1 outperforms Gemini Pro 3 and GPT-5.1 on real-world codebases, APIs, and complex algorithms with an end-to-end accuracy of 85.6% on DeepCodeBench. Designed with developers in mind. Tested in real-world situations. https://t.co/G7SzCxNtyu ...
X @Tesla Owners Silicon Valley
Tesla Owners Silicon Valley· 2025-12-23 06:35
Highest precision on DeepCodeBench. When it comes to understanding, generating, and reasoning about real-world code, Grok 4.1 outperforms Gemini Pro 3 and GPT-5.1. Grok is ahead; this is measurable performance, not hype. https://t.co/CP2djwhRxa ...
X @Tesla Owners Silicon Valley
Tesla Owners Silicon Valley· 2025-12-23 04:22
Grok 4.1 outperforms Gemini Pro 3 and GPT-5.1 on real-world codebases, APIs, and complex algorithms with an end-to-end accuracy of 85.6% on DeepCodeBench. Designed with developers in mind. Tested in real-world situations. https://t.co/G7SzCxNtyu ...
X @Tesla Owners Silicon Valley
Tesla Owners Silicon Valley· 2025-12-22 07:33
RT Tesla Owners Silicon Valley (@teslaownersSV): 🚨 BREAKING: Grok 4.1 Fast dominates OpenRouter – #1 in token usage with trillions processed, fastest responses, top intelligence, and unmatched cost-performance. The most used, most efficient, most powerful model out there. https://t.co/qC10HD0Eur ...
X @Tesla Owners Silicon Valley
Tesla Owners Silicon Valley· 2025-12-22 06:30
🚨 BREAKING: Grok 4.1 Fast dominates OpenRouter – #1 in token usage with trillions processed, fastest responses, top intelligence, and unmatched cost-performance. The most used, most efficient, most powerful model out there. https://t.co/qC10HD0Eur ...
X @Elon Musk
Elon Musk· 2025-12-22 06:25
Rapid evolution. Quoting Testlabor (@testerlabor): Grok progress 2025: • Grok 3 – February • Grok 4 – July • Grok Imagine – July • Grok Code Fast 1 – August • Grok 4 Fast – September • Grokipedia – October • Grok 4.1 – November • Grok 4.1 Fast – November • Grok Voice Agent API – December https://t.co/GusvrTwIkX ...
X @Tesla Owners Silicon Valley
Tesla Owners Silicon Valley· 2025-12-22 04:17
🚀 BREAKING: Grok 4.1 – Dominating the Frontier! 🚀 🥇 #1 on LMArena Text Arena (Thinking mode: 1483 Elo) 🥇 #1 in Emotional Intelligence (EQ-Bench v3) 🥇 #1 in Creative Writing (v3 benchmark) 🥇 #1 in Agentic Tool Use (τ²-Bench). The most human-like, reliable, and capable AI yet. Built by xAI to push the boundaries of intelligence. Try Grok 4.1 now on https://t.co/KaH5w8JGff or the X app! ...
X @Elon Musk
Elon Musk· 2025-12-21 15:33
RT Testlabor (@testerlabor): Did you know that Grok 3, Grok 4 and Grok 4.1 were all released within one year? https://t.co/jBFi0bUMf2 ...
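Year-End Sprint: Chinese and US Tech Giants Are All Going Flat Out
商业洞察 (Business Insight)· 2025-12-19 09:58
Source: 华商韬略 (Huashang Taolue). Author: 杨彼得 (Yang Bide). Then the closely watched DeepSeek released a new model, and ByteDance's Doubao not only upgraded its AI assistant substantially but also entered the smartphone ecosystem as a system-level service, directly touching the "boundary of operating control" held by existing apps and platforms…… With the year's end approaching, the big tech companies are making a collective push in AI. A year-end AI battle among the giants has officially begun. 01 The "white-hot" sprint. In mid-November, Alibaba Group and Ant Group each released a major AI application product, one after the other. Alibaba officially launched the all-new Qianwen (千问) app, a consumer-facing application built on its large model Tongyi Qianwen (通义千问) and widely seen as aimed squarely at ChatGPT. According to public reports, Alibaba pulled at least a hundred engineers onto the project and set aside two floors of office space at its Hangzhou headquarters to develop it in secret. The Qianwen app's core strengths are its strong multilingual capability and its potential to integrate everyday services. Only three days after launch, it introduced real-time translation covering 119 languages, which span the commonly used languages of more than 98% of the world's population, and it supports four major scenarios including text, images, and simultaneous interpretation. Ant's Lingguang (灵光) app, by contrast, emphasizes an efficiency philosophy of "making the complex simple." It was the first on mobile to "generate a small app from natural language in 30 seconds," supports editing, interaction, and sharing, and is positioned as a high-efficiency creation tool. In mid-November, Alibaba ...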
X @Tesla Owners Silicon Valley
Tesla Owners Silicon Valley· 2025-12-18 19:15
BREAKING: 🧠 Grok 4.1 & 4.1 Fast, Best Human-Like Intelligence. #1 Emotional Intelligence (EQ-Bench3). #1 in Agentic Tool Use & Function Calling. Lowest factual error rate in class. Top performer on LMArena (human preference benchmark). This is the most human-aligned frontier model, ideal for assistants, agents, and complex decision-making. ...