OpenAI

Search documents
Grok 4号称“全球最强AI”?其实是马斯克的自吹自擂
3 6 Ke· 2025-07-10 11:46
「这是世界上最聪明的人工智能。」 尽管比原定发布会时间推迟了近一个小时,但在今天中午(北京时间 9 月 10 日),xAI 创始人马斯克还是发布了新一代大模型 Grok 4。 纸面上,Grok 4 已经全面超越了所有竞争对手,包括 OpenAI o3、Gemini 2.5 Pro 以及 Claude 4 等当前的顶级大模型,不管是传统的基准测试,还是 SAT 考 试(美国高考)以及各个学科的 GRE 水平测试。 但比起这些已经有点乏味的传统基准测试,更有意思的是,Grok 4 还跑了被成为「人类最后一场闭卷考试」的 Humanity's Last Exam(简称 HLE 测试),并 超越此前一众模型,实现了最高 44.4% 的准确率。 图/ xAI 马斯克在直播中也指出,Grok 4 比几乎所有学科的所有研究生都更聪明,而至少在学术问题上,也优于所有学科的博士水平,「没有例外。」 这还不是 Grok 4 全部潜力。按照马斯克的说法,Grok 4 基础模型的第七版将在本月完成,然后将进行后训练 RL(强化学习)等,最终也会拥有出色的视 频理解能力和工具调用能力。按照路线图,接下来几个月 xAI 还会推出代码模型 ...
马斯克发布Grok 4!号称“世界上最强AI模型”
Zheng Quan Shi Bao Wang· 2025-07-10 11:44
左手刚刚融资,右手就发大模型,马斯克重金打造的Grok 4,正式面世! 7月10日,特斯拉创始人兼首席执行官马斯克旗下的人工智能公司xAI正式发布了Grok 4。在将近1小时 的发布会直播中,xAI发布了这个系列的两款模型,分别是Grok 4(单智能体版本)和Grok 4 Heavy (多智能体版本),其中后者支持4个智能体并行思考,在推理过程中横向比对、纵向协同,调用更大 规模的计算资源以完成更复杂、更精密的任务。 作为xAI在2023年推出首代大模型以来的第四次重要更新,Grok 4在"人类的最后考试"(Humanity's Last Exam)取得了25.4%的准确率,超过了谷歌Gemini 2.5 Pro的21.6%和OpenAI o3(高版本)的21%,被称 为"世界上最强AI模型"。 据xAI的研究人员介绍,Humanity's Last Exam测试总共有2500个问题,包括数学、自然科学、工程以及 所有人文学科,问题广泛且都是博士甚至高级研究水平,极具挑战性,但Grok 4在这些问题上都可以得 到很好的分数。 此外,据发布会披露,在GPQA、AIME25、LCB(Jan-May)、HMMT25 ...
前瞻全球产业早报:我国连续15年稳坐全球制造业首位
Qian Zhan Wang· 2025-07-10 11:29
Group 1 - China has maintained its position as the world's largest manufacturing country for 15 consecutive years, with annual manufacturing value added exceeding 30 trillion yuan since the start of the 14th Five-Year Plan [2] - The National Development and Reform Commission (NDRC) has planned 102 major projects during the 14th Five-Year Plan, all of which are expected to be completed by the end of the year [2] - The NDRC projects that China's GDP will reach approximately 140 trillion yuan by 2025, continuing to surpass previous milestones of 110, 120, and 130 trillion yuan [3] Group 2 - The State Administration for Market Regulation and the Ministry of Industry and Information Technology have issued a plan to establish a risk assessment system for artificial intelligence, focusing on key technologies and measurement capabilities [4] - Shanghai has included unicorn companies in its listing cultivation database and is developing measures to support their listing and fundraising projects [5] Group 3 - China's largest green hydrogen and ammonia project has officially commenced production, with an annual output of 320,000 tons of green synthetic ammonia, powered entirely by renewable energy [6] - The project is expected to reach a total capacity of 1.52 million tons upon full completion [6] Group 4 - Starbucks' stake sale in China has attracted nearly 30 investment institutions, with a valuation of around 10 billion USD, although the company may retain a 30% stake [7] - The first domestic nine-valent HPV vaccine has been priced at 499 yuan per dose, significantly lower than imported alternatives [7] Group 5 - The U.S. is set to implement "reciprocal tariffs" starting August 1, 2025, as announced by President Trump [8] - Nissan has suspended production of certain models in two U.S. factories due to tariffs imposed on imported vehicles between the U.S. and Canada [8] Group 6 - Nvidia's first desktop chip is reported to have performance close to Apple's M3, indicating competitive advancements in the AI PC processor market [9] - New Zealand has launched its first national AI strategy, aiming to enhance productivity and competitiveness, with potential contributions to GDP estimated at 76 billion NZD by 2038 [10] Group 7 - Merck is nearing a deal to acquire Verona for approximately 10 billion USD, indicating ongoing consolidation in the pharmaceutical sector [11] - Meta has invested 3.5 billion USD in EssilorLuxottica to advance its AI glasses strategy, reflecting the growing interest in smart eyewear [12] Group 8 - OpenAI has successfully recruited top engineers from Tesla, xAI, and Meta, intensifying competition in the AI talent market [13] - The company Extreme Robotics has recently gone public in Hong Kong, achieving a market valuation exceeding 21.5 billion HKD [13]
马斯克最新访谈:10个问题告诉你,第一性原理是超能力
混沌学园· 2025-07-10 11:14
前段时间,在 YC 举办的 AI Startup School 上, YC 首席执行官 Garry Tan 邀请到马斯克连线对 谈。这次对谈非常特别,台下围观的是一群非常年轻的创业者,年龄差不多在 18 到 25 岁。其中一些 人,已经在 AI 领域崭露头角。 面对这些明日之星,马斯克前所未有的坦诚。他讲到的内容非常细节、非常接地气儿,比如 1995 年, 他在 Zip2 地板上钻洞接网线; 2001 年,他飞到俄罗斯买洲际导弹; 2008 年, SpaceX 和特斯拉差 点双双破产…… 再比如,他讲到怎么"简单粗暴"地用第一性原理思维扭转局面,把火箭的成本拉到极限,在 6 个月建成 10 万块芯片的计算中心。还分享了对 AI 和人类未来的种种预测期盼。 这些内容打破了一个"天才创业者"的叙事,也展现了马斯克一贯的颠覆性思维和务实精神,非常具有借 鉴意义。 马斯克: 我们在 Zip2 倾注了心血,开发了非常厉害的软件技术。但从我的角度,这些技术从未真正发 挥作用。当时 《纽约时报》和赫斯特集团等等媒体公司,是投资人也是客户,还是董事会成员。 他们 会用传统媒体的视角看问题,让你做看似合理、但和新技术格格不入的 ...
14亿天价薪酬!华人AI大佬被挖走
Zhong Guo Ji Jin Bao· 2025-07-10 11:06
Meta挖角苹果工程师,薪酬包超2亿美元 Meta(原Facebook)为其"超级智能"团队的新成员提供了异常高额的薪酬,包括为一名前苹果杰出工程师开出的超过2亿美元(约人民币14亿元)的薪酬 包。 据悉,Meta聘用了曾负责苹果AI模型团队的Ruoming Pang,提供的是数亿美元规模、跨越数年的薪酬方案。苹果并未尝试匹配该报价,因为除了首席执 行官蒂姆·库克外,这远超苹果公司任何高管的薪酬水平。 这些薪酬方案与Meta"超级智能"新团队的其他重要招募保持一致。据称,该团队的目标是打造能够与人类一样好甚至更好的AI系统。目前团队成员还包括 前GitHub首席执行官Nat Friedman,以及AI初创企业创始人Daniel Gross。Meta还通过收购Scale AI公司49%的股份(估值143亿美元),任命其联合创始人 Alexandr Wang为Meta的首席AI官。 从纯数字角度来看,这支超级智能团队的薪酬在全球所有企业职位中都名列前茅,甚至超过世界各大银行的首席执行官水平。但大部分薪酬与绩效目标挂 钩,并需要通过多年留任逐步兑现,这意味着如果员工提前离职或股价表现不佳,他们可能无法拿到全部报酬。 ...
14亿天价薪酬!华人AI大佬被挖走
中国基金报· 2025-07-10 10:48
【导读】曾领导苹果公司基础模型团队的Ruoming Pang从苹果跳槽至Meta,扎克伯格不惜重金为Meta的新部门招兵买马 中国基金报记者 泰勒 大家好,关注一下AI圈的大消息。 Meta挖角苹果工程师,薪酬包超2亿美元 Meta(原Facebook)为其"超级智能"团队的新成员提供了异常高额的薪酬,包括为一名前苹果杰出工程师开出的超过2亿美元(约人民币 14亿元)的薪酬包。 据悉,Meta聘用了曾负责苹果AI模型团队的Ruoming Pang,提供的是数亿美元规模、跨越数年的薪酬方案。苹果并未尝试匹配该报价, 因为除了首席执行官蒂姆·库克外,这远超苹果公司任何高管的薪酬水平。 这些薪酬方案与Meta"超级智能"新团队的其他重要招募保持一致。据称,该团队的目标是打造能够与人类一样好甚至更好的AI系统。目前 团队成员还包括前GitHub首席执行官Nat Friedman,以及AI初创企业创始人Daniel Gross。Meta还通过收购Scale AI公司49%的股份 (估值143亿美元),任命其联合创始人Alexandr Wang为Meta的首席AI官。 从纯数字角度来看,这支超级智能团队的薪酬在全球所有企 ...
周跟踪(20250616-20250620):MWC上海展示低轨卫星地面基建新机遇,AMDHelios机柜或使用更多光模块与铜缆
Shanxi Securities· 2025-07-10 10:48
Investment Rating - The report maintains an investment rating of "Outperform the Market" for the telecommunications industry [1][42]. Core Insights - The MWC Shanghai showcased new opportunities in low-orbit satellite ground infrastructure, with AMD Helios cabinets potentially utilizing more optical modules and copper cables [2][5]. - CoreWeave has initiated the first batch of shipments for the GB300 NVL72 system, expected to significantly enhance AI computing capabilities, with a projected shipment of over one million units of GB200 by 2025 [5][16]. - Oracle has signed a substantial cloud computing agreement worth $30 billion, anticipated to contribute over $30 billion annually starting from the 2028 fiscal year, indicating strong demand for AI computing resources [6][17]. - The new leadership at China Star Network is expected to accelerate the construction and commercialization of low-orbit satellite internet, with a focus on market-oriented operations and ecosystem breakthroughs [7][18][19]. Summary by Sections Industry Dynamics - The telecommunications industry is experiencing a shift towards AI computing and satellite internet, driven by significant investments and technological advancements [5][6][7]. - The demand for AI computing is expected to remain robust, with a favorable outlook for the second half of the year and into the next [5][16]. Market Performance - The overall market showed mixed performance during the week of June 30 to July 6, 2025, with the Shanghai Composite Index rising by 1.40% and the Shenzhen Component Index increasing by 1.25% [9][20]. - The telecommunications sector saw a slight decline, with the Shenwan Communications Index down by 0.10% [9][20]. Key Companies to Watch - Recommended companies in the overseas computing sector include Zhongji Xuchuang, Xinyi Sheng, Tianfu Communication, and others [8][20]. - In the satellite internet space, companies such as Shanghai Huanxun and Xinke Mobile are highlighted for potential investment [8][20].
Kimi新功能Deep Researcher海外引发热议 还被马斯克直播点名
Sou Hu Cai Jing· 2025-07-10 10:15
是Kimi上月发布的首款Agent产品,在HLE测试中超过了Gemini2.5Pro,略高于OpenAI Deep Research,并与Gemini-Pro的Deep Research Agent打平,是目 前已知的最高水平之一。 当地时间9日晚,马斯克旗下公司xAI举办直播发布会,正式发布其最新旗舰模型Grok 4。 直播中提到HLE(Humanities Last Exam,人类最后的考试)进行对比时,分别介绍了OpenAI、谷歌旗下Gemini以及月之暗面Kimi三家公司,而 DeepResearcher正 资料显示,Kimi DeepResearcher功能在执行每个研究任务时,会平均进行23次推理,由模型判断并筛选出信息质量最高的内容后,剔除冗余及低质信息, 自动生成分析结论,拥有文献的严谨性,可有效告别模型幻觉。 在海外社交媒体上,AI从业者纷纷表达着对这款来自中国AI产品的喜爱,有网友表示,Kimi Deep Researcher可能是用过的最好的深度研究模型,视觉效 果出色。也有博主表示,对深度研究的能力和准确性印象深刻。 | February 3. | OpenAl Deep | A ma ...
OpenAI即将推出AI浏览器 直接挑战谷歌Chrome霸主地位
硬AI· 2025-07-10 08:30
据报道,OpenAI的浏览器有望在未来数周内上线,集成聊天界面和AI代理功能。若能获得其4亿每周活跃ChatGPT用户 的拥护,OpenAI或将对谷歌广告生态、Web数据流和搜索流量产生实质冲击。 硬·AI 作者 | 鲍奕龙 编辑 | 硬 AI OpenAI即将推出AI浏览器,旨在利用人工智能技术从根本上改变消费者的网络浏览方式,直接挑战占据 市场主导地位的谷歌Chrome。 7月9日据媒体报道,OpenAI的浏览器有望在未来数周内上线,集成聊天界面和AI代理功能。 若能获得其 4亿每周活跃ChatGPT用户的拥护,OpenAI或将对谷歌广告生态、Web数据流和搜索流量产生实质冲 击。 谷歌Chrome长期作为Alphabet广告业务的支柱,为广告精准投放和流量导向自有搜索引擎提供基础 数据。 01 AI驱动浏览器: 重新定义互联网入口 报道指出,OpenAI浏览器最大特色,是让用户在ChatGPT式本地界面完成部分交互,减少传统跳转网站 的行为。同时,浏览器将深度整合AI"代理人"(agent),可代表用户完成如预订、表单填写等操作。此举 旨在推动AI服务更深入个人与工作场景,加快AI与用户日常行为的天然融 ...
马斯克带领xAI团队发布Grok 4,“全球最强模型”含金量如何?
Di Yi Cai Jing· 2025-07-10 08:19
Core Insights - The release of Grok 4 was delayed by about an hour, with Elon Musk appearing somewhat fatigued, indicating extensive preparation by the xAI team [1][8] - Grok 4 is touted as the "most powerful AI model globally," outperforming existing top models in various benchmark tests, including achieving a perfect score in the AIME25 math competition and a high score of 26.9% in the "Human Last Exam" (HLE) [3][6] - Grok 4's AI analysis index reached 73, surpassing competitors like OpenAI's o3 (70) and Google's Gemini 2.5 Pro (70) [3][6] Model Performance - Grok 4 achieved a historical high score of 24% in the HLE, exceeding Google's previous high of 21% [6] - The model's training volume is 100 times that of Grok 2, with over 10 times the computational power invested in the reinforcement learning phase compared to other models [6] - Subscription fees for Grok 4 are set at $30 per month, with a more advanced version, Grok 4 Heavy, priced at $300 per month [6] Funding and Financials - xAI has raised a total of $10 billion in a recent funding round, including $5 billion in debt and $5 billion in equity, bringing its total funding in 2024 to $22 billion [11][12] - The company reportedly incurs monthly expenses of $1 billion, with cash reserves projected to last until March 2025 [12] - xAI's revenue is significantly lower than its costs, with expected revenues of $5 billion in 2025, compared to OpenAI's projected $12.7 billion [13] Future Developments - xAI plans to release a coding model in August, a multi-modal agent in September, and a video generation model in October [14][15] - The company aims to leverage the vast data archives from X to train its models, potentially reducing data acquisition costs [13] Competitive Landscape - Despite Grok 4's initial success, the competitive landscape remains intense, with OpenAI set to release its GPT-5 model this summer [13] - Major tech companies like Microsoft, Amazon, Google, and Meta are significantly increasing their investments in AI technologies, with a combined capital expenditure of $320 billion planned for 2025 [13]