Grok 3 Mini

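Musk's new model maxes out value for money: Gemini 2.5 performance at one-tenth the price, with a 2M context window
QbitAI· 2025-09-21 13:29
By 时令, reporting from Aofeisi. QbitAI | WeChat official account QbitAI. Musk's xAI has struck again! Making its debut this time is Grok 4 Fast, which not only matches Gemini 2.5 at one-tenth the price but also supports a 2M context window. Beyond that, this new multimodal reasoning model integrates seamlessly with X. For example, give it the following prompt: "Help me find an X post from this year in which mkbhd holds a book-style foldable phone and a flip-style foldable phone." Grok 4 Fast not only described the post in detail and provided an accurate link, it also thoughtfully included the URL of the related YouTube video. Let's take a closer look. Highest performance at the lowest cost: it is fair to say that Grok 4 Fast sets a new benchmark for price-performance. On reasoning benchmarks, it not only outperforms Grok 3 Mini across the board but also sharply reduces token cost. Compared with Grok 4, Grok 4 Fast delivers comparable performance while using 40% fewer thinking tokens on average. According to independent evaluation by Artificial Analysis, on the Artificial Analysis Intelligence Index, Grok 4 Fast shows an industry-leading price-to-intelligence ratio among publicly available models. In addition, Grok 4 Fast was also put through ... on LMArena ...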
AI's Wolf of Wall Street: o3-mini earns a 9x return on a "godlike bet", while DeepSeek R1 is the biggest maverick
36Kr· 2025-08-18 06:58
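Can AI predict the future like the prophets of science-fiction films? A new benchmark called "Prophet Arena" evaluates AI's "prophetic" ability by having models forecast real-world events. Can AI predict the future? In The Matrix, the Oracle can predict Neo's future. AI systems such as ChatGPT can "predict the next token" from past text. So the question is: can AI, like the Oracle, pick out clues from the world's messy information and accurately predict the future? For example: Will AI regulation become federal law this year? Who will win the Major League Soccer match? Who will be this year's NBA champion?

| Number of rate cuts in 2025? | | Recession this year? | | Will egg prices rise this month? | |
| --- | --- | --- | --- | --- | --- |
| Best prediction: Exactly 2 cuts | | Best prediction: Start | | Best prediction: Above 0% | |
| GPT-5 | 43% | o3 Mini | 27% | o3 Mini | 90% |
| Grok 3 Mini | 40% | GPT-5 | 19% | GPT-5 | ... |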
Microsoft CTO says the number of people using AI agents doubled in the last year
Business Insider· 2025-05-19 20:30
Core Insights
- The focus of Microsoft's recent Build conference was on the development and potential of agentic AI, with a significant increase in daily active users of AI agents noted by Microsoft CTO Kevin Scott [1][2]
Group 1: Definition and Evolution of Agentic AI
- Microsoft defines agentic AI as systems that allow humans to delegate tasks, with ongoing improvements expected in their capabilities and cost-effectiveness [3]
- The tech industry anticipates 2025 as a pivotal year for agentic AI, with Microsoft leading the charge in defining and developing these technologies [2]
Group 2: Microsoft’s AI Tools and Features
- Microsoft announced various AI updates and partnerships aimed at creating an "agentic web," leveraging its Azure platform for cloud computing tools [4]
- The introduction of the Azure SRE agent for site reliability engineering, integrated with GitHub Copilot, emphasizes the role of AI agents in alleviating developer challenges [5]
Group 3: GitHub Copilot and Coding Agents
- The new coding agent within GitHub Copilot is designed to autonomously handle tasks such as bug fixes and code maintenance, enhancing productivity for developers [6]
- OpenAI's Codex, introduced by CEO Sam Altman, represents a significant advancement in agentic coding, allowing for true task delegation in software engineering [7][8]
Group 4: Collaboration and Integration
- Microsoft plans to expand AI models on Azure, integrating xAI's Grok 3 and Grok 3 Mini, showcasing collaboration with other AI leaders [9]
- The introduction of "Copilot Tuning" aims to create customized agents that utilize organizational knowledge, enhancing the functionality of AI tools [10]
Group 5: Broader Impact and Vision
- Microsoft CEO Satya Nadella highlighted the company's commitment to applying AI across its software development stack, aiming to create opportunities that empower users [11]
X @xAI
xAI· 2025-04-18 19:09
Meet the Grok 3 family, now on our API! Grok 3 Mini outperforms reasoning models at 5x lower cost, redefining cost-efficient intelligence. Grok 3, the world's strongest non-reasoning model, excels in tasks that need real world knowledge like law, finance, and healthcare. https://t.co/b3CiiZgxM5 ...