Grok 3 Mini
Search documents
马斯克新模型性价比拉满:1折价格实现Gemini 2.5性能,支持2M上下文
量子位· 2025-09-21 13:29
Core Viewpoint - The article discusses the launch of Grok 4 Fast by Elon Musk's xAI, highlighting its competitive pricing and advanced capabilities in multimodal reasoning and context handling [1][3]. Group 1: Product Features and Performance - Grok 4 Fast achieves a price-performance benchmark by matching the price of Gemini 2.5 while offering a 2 million token context window [1][3]. - It significantly reduces token costs, using 40% fewer tokens on average compared to Grok 4 while maintaining similar performance levels [11][12]. - In benchmark tests, Grok 4 Fast outperformed Grok 3 Mini and ranked 8th in text arena competitions, demonstrating superior performance among similarly sized models [17][18]. Group 2: Competitive Advantage - Grok 4 Fast leads the "price-intelligence" ratio in the industry, as verified by independent assessments [14]. - It scored 1163 points in the search arena, outperforming the second-place model by 17 points, showcasing its competitive edge [18]. Group 3: Technological Innovations - The model employs end-to-end reinforcement learning to enhance its tool usage, excelling in determining when to invoke tools like code execution or web browsing [20]. - Grok 4 Fast integrates advanced search capabilities, allowing seamless web browsing and real-time data enhancement for queries [21][22]. - It features a unified architecture that reduces end-to-end latency and token costs, making it suitable for real-time applications [25]. Group 4: Market Position and Future Developments - Grok 4 Fast is now available to all users, with complex queries automatically utilizing its capabilities in Auto mode [26]. - Two new models are set to be launched, with specific pricing for input and output tokens detailed [27]. - The recruitment of Dustin Tran from Google, a key figure in the development of Gemini models, strengthens the team behind Grok 4 Fast [28][30].
AI版华尔街之狼,o3-mini靠「神之押注」狂赚9倍,DeepSeek R1最特立独行
3 6 Ke· 2025-08-18 06:58
AI能像科幻电影中的先知一样预测未来吗?一个名为「Prophet Arena」的全新基准测试,正通过预测真实世界事件来评估AI的「预言」能力。 AI能预测未来吗? 在《黑客帝国》里,先知能对Neo的未来做出预测。 以ChatGPT为代表的AI,则可以根据过去的语料来「预测下一个Token」。 那问题来了,AI能不能像先知一样,从全世界的杂乱信息里找出蛛丝马迹,准确地预测未来呢? 比如: AI监管今年能否成为联邦法律? 美国职业足球大联盟比赛中,谁会获胜? NBA今年的冠军会是谁? | 2025年降息次数? | | | | 今年经济衰退? | | 本月鸡蛋价格会上涨 | | --- | --- | --- | --- | --- | --- | --- | | | | | | | 吗? | | | 最佳预测: | | | 最佳预测: | | 最佳预测: | | | 精确地2次切割 | | | 开始 | | 高于 0% | | | GPT-5 | | 43% | o3 Mini | 27% | o3 Mini | 90% | | Grok 3 Mini | | 40% | GPT-5 | 19% | GPT-5 ...
Microsoft CTO says the number of people using AI agents doubled in the last year
Business Insider· 2025-05-19 20:30
Core Insights - The focus of Microsoft's recent Build conference was on the development and potential of agentic AI, with a significant increase in daily active users of AI agents noted by Microsoft CTO Kevin Scott [1][2] Group 1: Definition and Evolution of Agentic AI - Microsoft defines agentic AI as systems that allow humans to delegate tasks, with ongoing improvements expected in their capabilities and cost-effectiveness [3] - The tech industry anticipates 2025 as a pivotal year for agentic AI, with Microsoft leading the charge in defining and developing these technologies [2] Group 2: Microsoft’s AI Tools and Features - Microsoft announced various AI updates and partnerships aimed at creating an "agentic web," leveraging its Azure platform for cloud computing tools [4] - The introduction of the Azure SRE agent for site reliability engineering, integrated with GitHub Copilot, emphasizes the role of AI agents in alleviating developer challenges [5] Group 3: GitHub Copilot and Coding Agents - The new coding agent within GitHub Copilot is designed to autonomously handle tasks such as bug fixes and code maintenance, enhancing productivity for developers [6] - OpenAI's Codex, introduced by CEO Sam Altman, represents a significant advancement in agentic coding, allowing for true task delegation in software engineering [7][8] Group 4: Collaboration and Integration - Microsoft plans to expand AI models on Azure, integrating xAI's Grok 3 and Grok 3 Mini, showcasing collaboration with other AI leaders [9] - The introduction of "Copilot Tuning" aims to create customized agents that utilize organizational knowledge, enhancing the functionality of AI tools [10] Group 5: Broader Impact and Vision - Microsoft CEO Satya Nadella highlighted the company's commitment to applying AI across its software development stack, aiming to create opportunities that empower users [11]
X @xAI
xAI· 2025-04-18 19:09
Meet the Grok 3 family, now on our API!Grok 3 Mini outperforms reasoning models at 5x lower cost, redefining cost-efficient intelligence.Grok 3, the world's strongest non-reasoning model, excels in tasks that need real world knowledge like law, finance, and healthcare. https://t.co/b3CiiZgxM5 ...