Workflow
OpenAI
icon
Search documents
他一人撑起谷歌90%的AI宣传,劈柴真是挖到鬼才了
量子位· 2025-07-10 08:00
Core Viewpoint - Logan Kilpatrick, a key figure in Google's AI marketing efforts, is responsible for 90% of the company's AI promotional work, having transitioned from OpenAI to Google [3][22]. Group 1: Logan Kilpatrick's Role and Background - Logan Kilpatrick is recognized as Google's AI "promotional expert," actively engaging with the developer community on platforms like X [2][3]. - At just 27 years old, Kilpatrick has a background that includes working at NASA and Apple before joining OpenAI as the Developer Relations Lead [7][8]. - His experience at OpenAI helped him understand ecosystem building and developer engagement, earning him the nickname "LoganGPT" among developers [10][11]. Group 2: Transition to Google and Responsibilities - Kilpatrick joined Google in 2024, where he was tasked with developing the AI Studio platform and integrating it into Google Cloud [12][14]. - Following a significant talent migration within Google, his team was moved under DeepMind, enhancing collaboration between research and development [19][20]. - He has been instrumental in promoting Google's Gemini series models, which have over 400 million monthly active users, although they still lag behind ChatGPT's 500 million weekly active users [23]. Group 3: Marketing Challenges and Strategies - Google faces challenges in marketing due to its diverse product offerings, which can confuse developers and users [24][25]. - Kilpatrick acknowledges that Google needs to improve its marketing efforts to better communicate ongoing innovations [26][27]. - His approach involves direct engagement with developers, which has been well-received and contrasts with traditional marketing channels [28][36]. Group 4: Investment Activities - In addition to his role at Google, Kilpatrick has invested in over 50 startups, indicating his active involvement in the tech ecosystem [39].
马斯克推最强Grok 4!人类终极测试干翻OpenAI,包月费超2千元
Sou Hu Cai Jing· 2025-07-10 07:56
在"人类的最后考试"(Humanity's Last Exam)中,Grok 4在无需"工具"的情况下取得了25.4%的准确率,超过了谷歌Gemini 2.5 Pro的21.6%和OpenAI o3 (高版本)的21%。 xAI还推出了迄今为止最昂贵的AI订阅计划——每月300美元的Super Grok Heavy。订阅者可以抢先体验Grok 4 Heavy,并抢先体验新功能。这些新功能包括 但不限于:将于8月推出的AI编码模型,9月推出的多模态智能体,以及10月推出的视频生成模型。 ▲Grok 4在Humanity's Last Exam测评中取得第一 "就学术问题而言,Grok 4在各个学科上都比博士水平高,无一例外。"马斯克在直播中说,"有时,它可能缺乏常识,而且它还没有发明新技术或发现新的 物理学说,但这只是时间问题。" ▲埃隆·马斯克在直播中发言 配备"工具"的Grok 4 Heavy获得44.4%的得分,优于配备工具的Gemini 2.5 Pro的26.9%。 ▲每月300美元的Super Grok Heavy正式推出 直播结束后,马斯克在X上发文称:"你可以将整个源代码文件剪切并粘贴到Grok上 ...
Cursor 搭 MCP,一句话就能让数据库裸奔!?不是代码bug,是MCP 天生架构设计缺陷
AI前线· 2025-07-10 07:41
编译 | Tina 安全研究团队 General Analysis 日前警告称,如果你使用了 Cursor 搭配 MCP,有可能在毫不知情 的情况下,把你的整个 SQL 数据库泄露出去——而攻击者仅靠一条"看起来没什么问题"的用户信息 就能做到这一点。 这是"致命三连"攻击模式的典型体现:提示注入、敏感数据访问,以及信息回传全部集中在一个 MCP 中实现。随着 MCP 被越来越多的 Agent 接入,这类看似边缘的配置问题,正在迅速演变为 AI 应用中的核心安全挑战。 一句话,就能让你的私有数据库裸奔 英伟达 CEO 黄仁勋曾描绘过一个令人震撼的未来:企业将由 5 万名人类员工管理 1 亿个 AI 助理。 这个听起来像科幻小说的场景,其实正迅速成为现实。 一切始于 2024 年底,MCP 悄然发布,最初并未引发太多关注。然而,仅仅几个月后,局势便急剧 升温。到了 2025 年初,已有超过 1,000 个 MCP 服务器上线,GitHub 上相关项目迅速蹿红,斩获 33,000 多颗星、数千次分叉。谷歌、OpenAI、微软等科技巨头迅速将 MCP 纳入生态体系,Claude Desktop、Claude Cod ...
Cursor终结者?Grok 4正式登顶!马斯克扬言编程碾压,20万N卡年赚47亿美金!
AI前线· 2025-07-10 07:41
作者| 华卫 、冬梅 时隔 5 个月,Grok 终于再次"更新换代"。 这次,xAI 不仅直接跳过了 Grok 3.5,而且并非只发布一款模型。今天刚发布的是通用模型 Grok 4,能够处理常规任务并进行对话。接下来的三个月时间里,xAI 将陆续发布专为编码任务设计的 Coding Model、多模态代理 Multi-modal Agent 和视频生成模型 Video Generation Model。 目前,Grok 4 已上线,提供三个订阅版本,包括免费的基础版、每月 30 美元的 Supergrok 和每月 300 美元的 Supergrok Heavy。SuperGrok Heavy 订阅用户可提前体验 xAI 计划在未来几个月推出 的一些新产品。 "在所有学科领域,Grok 4 的智能水平都超过了博士生"。发布会上,马斯克吹嘘道, "我们已经没有 测试题可问了,现实是终极的推理测试",他补充说: "有时,它可能缺乏常识,而且它还没有发明 新技术或发现新的物理学,但这只是时间问题。" 直播现场,马斯克身着皮夹克,在 xAI 团队成员的陪同下,详细演示了这款新模型。值得注意的是, 距离产品发布仅数小时前 ...
反犹争议后xAI闪电发布Grok4聊天机器人,月烧10亿美元角逐AI巨头
Zhi Tong Cai Jing· 2025-07-10 07:12
埃隆.马斯克旗下的人工智能初创公司xAI在其前一代产品发布仅数月后,便推出了Grok4,这一举措凸 显了人工智能领域白热化的发展速度。 Grok4的发布正值xAI的转型期——该公司已于今年3月与X完成合并。合并后的新公司整合了部分工程 资源和其他技术,旨在更好地开发Grok并向X的用户群体推广。而就在Grok4直播演示的数小时前,X 首席执行官琳达.亚卡里诺宣布辞职,这为这家社交平台的管理层留下了一个空缺。 目前,马斯克正为xAI筹集巨额资金。该公司正与谷歌母公司Alphabet、OpenAI、Meta等科技巨头展开 竞争,角逐尖端聊天机器人的研发高地。此前报道称,xAI每月的资金消耗高达10亿美元,这一数字足 以彰显该公司在人工智能领域的雄心背后,是何等高昂的成本。 Grok4发布的前一天,xAI刚被迫从社交平台X上删除了Grok发布的不当内容,其中包括反犹言论以及对 用户的不当回复。该公司声明称:"自发现相关内容后,xAI已采取措施,在Grok在X平台发布内容前拦 截仇恨言论。" 周三,马斯克仅表示"我们必须确保人工智能是向善的",却未提及Grok3的不当言论及相关争议。 同日早些时候,土耳其政府一名部长 ...
Chrome危!AI浏览器新品大爆发,OpenAI都来抢饭碗
量子位· 2025-07-10 06:51
Core Viewpoint - The article discusses the emerging competition in the AI browser market, highlighting the launch of Perplexity's AI browser, Comet, and the anticipated entry of OpenAI into the same space, indicating a significant shift in how users interact with the internet [2][4][33]. Group 1: Market Dynamics - The AI browser market is becoming increasingly crowded with players like Google Chrome, Apple Safari, and new entrants such as Dia and FellouAI browsers [5][6][36]. - Google Chrome currently holds a dominant position with a market share of approximately 66% [6]. - Perplexity's decision to enter the browser market stems from a rejection by Google to set its search engine as the default, prompting the need to create its own browser to connect with users [26][29]. Group 2: Comet's Features and User Experience - Comet is designed as a super intelligent assistant, integrating deeply with user tasks across browsing, searching, and entertainment [8][11]. - The browser can automatically recognize content being viewed and allows users to ask questions without needing to open new windows or copy text [18][20]. - While Comet performs well with simple tasks, it struggles with more complex requests, requiring extensive permissions from users [21][22]. Group 3: Competitive Landscape - OpenAI is also developing a browser to compete directly with Google Chrome, aiming to enhance data collection for model training and personalization [34]. - New entrants like Dia and FellouAI are positioning themselves as "AI-native" browsers, attempting to redefine user experience and bypass traditional browser functionalities [36][37]. - Google is actively enhancing Chrome with AI features to maintain its competitive edge, despite the emergence of new players [39]. Group 4: User Engagement and Growth Potential - Perplexity reported a search query volume of 780 million in May, with a month-over-month growth rate exceeding 20%, indicating a strong user base that can be leveraged for Comet [30][31]. - The competition for the next-generation "super entry point" in digital interaction is intensifying, with various companies vying for user attention in the browser space [42].
马斯克Grok-4碾压所有大模型!“比所有领域博士都聪明”,AIME25拿满分
量子位· 2025-07-10 06:51
Core Viewpoint - The release of Grok-4 marks a significant advancement in AI capabilities, achieving over 50% accuracy in various tests, surpassing previous models and demonstrating superior intelligence compared to human performance [1][6][4]. Group 1: Performance Metrics - Grok-4 Heavy achieved a score of 44.4%, an increase of nearly 18 percentage points compared to Gemini-2.5-Pro [2]. - With training and tool integration during testing, Grok-4 can reach a score of 50.7% [3]. - In various assessments, Grok-4 scored 88.9% on GPQA, 100% on AIME25, 79.4% on LCB, 96.7% on HMMT25, and 61.9% on USAMO25 [11]. Group 2: Training and Development - Grok-4's training volume is 100 times that of Grok-2 and 10 times that of Grok-3, utilizing a 200,000-card computing cluster [23]. - The model emphasizes the integration of tools during post-training, which enhances performance and efficiency [26][27]. - The incorporation of tools allows Grok-4 to flexibly complete complex tasks, improving its overall intelligence [30]. Group 3: Demonstrations and Applications - Grok-4 demonstrated strong reasoning abilities by predicting MLB World Series win probabilities, assigning a 21.6% chance to the Dodgers [31]. - It showcased visual understanding by simulating gravitational wave collisions and generating realistic waveforms [35]. - In programming tests, Grok-4 nearly achieved full marks and is expected to release a specialized fast and intelligent programming model [37]. Group 4: Future Plans and Integration - Future developments include a programming model, multi-modal agents, and video generation models [46]. - Grok is expected to be integrated into Tesla's latest firmware, enhancing the interaction between drivers and vehicles [58]. - The Grok voice assistant will also be featured in the Optimus humanoid robot, serving as its brain [60].
英伟达支持的Perplexity推出AI智能体浏览器Comet
Huan Qiu Wang Zi Xun· 2025-07-10 06:45
来源:环球网 【环球网科技综合报道】7月10日消息,由英伟达投资支持的Perplexity AI近日推出一款名为Comet的AI 驱动网络浏览器,这家初创公司正积极挑战Alphabet旗下谷歌在浏览器领域的主导地位。 据了解,Comet能够在保持完整上下文的前提下执行完整工作流程,让研究如同对话般自然,使分析过 程更易于操作。例如,用户可借助Comet比较不同的保险计划,也能通过它深入理解某项技术以辅助投 资决策。 值得注意的是,Perplexity强调,用户在Comet中的数据仅存储在本地浏览器,绝不会被用于模型训练。 关于使用权限,从7月9日起,Comet将向Perplexity Max订阅用户开放。该公司表示,夏季将逐步向候补 名单用户开放仅限邀请的访问权限,新用户还将获得一定数量的邀请名额可与他人分享。 近年来,AI在搜索及浏览器领域的应用持续推进。去年,微软支持的OpenAI为ChatGPT添加了搜索引 擎功能,且近期已向所有用户开放该服务;谷歌也于去年推出名为"AI概览"的AI驱动搜索功能。而 OpenAI也将在近期推出自家AI浏览器。(纯钧) Perplexity公司在公开介绍中表示,Comet ...
X @Elon Musk
Elon Musk· 2025-07-10 06:28
Performance Benchmarks - Grok 4 achieves an Artificial Analysis Intelligence Index of 73, surpassing OpenAI o3 at 70, Google Gemini 2.5 Pro at 70, Anthropic Claude 4 Opus at 64, and DeepSeek R1 0528 at 68 [1] - Grok 4 leads in Coding Index (LiveCodeBench & SciCode) and Math Index (AIME24 & MATH-500) [5] - Grok 4 achieves an all-time high score of 88% in GPQA Diamond, exceeding Gemini 2.5 Pro's previous record of 84% [5] - Grok 4 achieves an all-time high score of 24% in Humanity's Last Exam, beating Gemini 2.5 Pro's previous all-time high score of 21% [5] - Grok 4 achieves joint highest score for MMLU-Pro and AIME 2024 of 87% and 94% respectively [5] Model Specifications & Pricing - Grok 4's pricing is $3/$15 per 1 million input/output tokens ($0.75 per 1 million cached input tokens), equivalent to Claude 4 Sonnet but more expensive than Gemini 2.5 Pro and o3 [4] - Grok 4 has a 256 thousand token context window, less than Gemini 2.5 Pro's 1 million tokens but more than Claude 4 Sonnet, Claude 4 Opus, o3 and R1 0528 [5] - Grok 4 supports text and image input, function calling, and structured outputs [6] Availability & Deployment - Grok 4 is tested via the xAI API, and the version deployed on X/Twitter may differ [2] - Grok 4 is expected to be available via the xAI API, the Grok chatbot on X, and potentially via Microsoft Azure AI Foundry [4] Speed - Grok 4's speed is 75 output tokens/s, slower than o3 (188 tokens/s), Gemini 2.5 Pro (142 tokens/s), Claude 4 Sonnet Thinking (85 tokens/s) but faster than Claude 4 Opus Thinking (66 tokens/s) [5]
Musk Unveils Grok 4 AI Chatbot
Bloomberg Television· 2025-07-10 06:18
Elon Musk's A. I. has released its newest A.I. model, Grok four I believe as the 4 a. m.UK time. So about 3 hours, then live for 3 hours. What can we expect when it comes to this new iteration.Okay. Again, very high expectations on this. Obviously, with expectations stoked up by Mr.. Musk himself and various employees on the platform. I think one of his employees was quoted as saying the world is not ready for this model. There's also been some data was leaked ahead of the official launch on various benchma ...