Alphabet(GOOG)

Search documents
谷歌深夜放出「创世引擎」Genie 3,一句话秒生宇宙,终极模拟器觉醒
3 6 Ke· 2025-08-06 07:32
全球最强「世界AI模拟器」今夜诞生! 刚刚,谷歌DeepMind祭出新一代通用世界模型——Genie 3,能模拟出史无前例的丰富交互环境。 总有一天,UE5所有复杂功能,都能被一个数据驱动的「注意力权重」吸纳。 未来,只需要将手柄指令作为输入,即可渲染一段时空中的像素画面。 一句话,Genie 3即可生成一个动态世界。 令人惊艳的是,它能以每秒20-24帧速度,实时生成720p画面,还能持续数分钟一致性。 相比于前代,Genie 3在生成时长方面也得到了史诗级的加强——一口气能搞定长达数分钟,且内容连贯的可交互世界。 英伟达Jim Fan高度评价,「这就是游戏引擎2.0时代」! 如今,Genie 3的问世,标志着世界模拟AI迈向了全新高度,加速了人类通向AGI/ASI的终极目标。 AI实时交互模拟,真·矩阵世界 一直以来,「世界模型」被业界看作是通往AGI道路上的关键基石。 因为,它能让AI智能体在无限丰富的模拟环境中接受训练。 十多年来,谷歌DeepMind一直在模拟环境领域引领前沿研究,从训练AI智能体玩转即时战略游戏,到为开放式学习和机器人技术开发模拟环境。 正是在这些研究的推动下,他们开发出了「世界模 ...
深夜,OpenAI、谷歌等更新多款模型
第一财经· 2025-08-06 07:17
Core Insights - The article discusses the recent product launches by major AI model companies, highlighting shifts in product strategies and advancements in AI capabilities [3][11]. Group 1: OpenAI Developments - OpenAI has released two new open-source models, gpt-oss-120b with 117 billion parameters and gpt-oss-20b with 21 billion parameters, both utilizing the MoE architecture [4][5]. - The gpt-oss-120b model can run on a single 80GB GPU, while gpt-oss-20b can operate on consumer devices with 16GB memory, allowing for local deployment on laptops and smartphones [5][6]. - OpenAI's new models have shown competitive performance in benchmark tests, with gpt-oss-120b scoring close to or exceeding the closed-source o4-mini model [5][6]. Group 2: Anthropic's Strategy - Anthropic has shifted to a strategy of more frequent incremental updates, exemplified by the release of Claude Opus 4.1, which improves upon its predecessor in areas like coding and data analysis [6][7]. - In benchmark tests, Claude Opus 4.1 scored 74.5%, surpassing Opus 4's 72.5%, indicating enhanced coding capabilities [7]. Group 3: Google's Innovations - Google introduced Genie 3, its first world model that supports real-time interaction, building on previous models like Genie 1 and 2 [8][9]. - Genie 3 can simulate complex environments and interactions, generating consistent visuals for several minutes, a significant improvement over Genie 2 [9][11]. - Despite its advancements, Genie 3 still faces limitations, such as restricted action spaces and challenges in simulating multiple agents in shared environments [11].
图解丨过去25年全球前十大公司变迁
Ge Long Hui A P P· 2025-08-06 06:43
2025年全球前十大公司已变成:英伟达、微软、苹果、谷歌、亚马逊、Meta、沙特阿美、博通、台积 电、伯克希尔哈萨维。 格隆汇8月6日|2000年全球前十大公司分别是:通用电气、微软、思科、埃克森美孚、沃尔玛、英特 尔、花旗、NTT DOCOMo、辉瑞、沃达丰。 25年来没变的是微软,依然留在全球前十大公司的榜单中。 ...
DeepMind独家访谈实录,解密Genie 3世界模型,将颠覆游戏与机器人行业未来
3 6 Ke· 2025-08-06 06:14
当地时间8月5日,谷歌DeepMind最新研发的AI技术"Genie 3"被誉为一项革命性的突破,有望彻底改变 虚拟世界生成、机器人训练以及娱乐产业的未来。这项技术能够通过简单的文本提示,在约3秒内生成 一个可交互的、逼真的3D虚拟世界,达到720p分辨率,且具备实时交互和环境一致性等特性。Genie 3 不仅适用于游戏和虚拟现实(VR)领域,还为机器人和自动驾驶汽车的训练提供了无限可能的模拟环 境。 Youtube人气大V蒂姆·斯卡夫(Tim Scarfe)通过对DeepMind研究团队的独家采访,详细介绍了Genie 3 的创新功能、潜在应用以及未来前景。以下是采访全文内容摘要: 主持人:大家好,今天我们带来一项全球独家报道,我认为这是我见过的最令人震撼的技术,简直让人 兴奋不已!上周,我在伦敦谷歌DeepMind的办公室亲眼见证了这项技术的演示。这项技术可能成为下 一个价值万亿美元的产业,也可能是虚拟现实的杀手级应用。谷歌DeepMind近期表现极为出色,甚至 连Gemini Deepthink都无法统计其成功次数。 今天,我们将讨论一类全新的AI模型——生成式交互环境。它们不同于传统游戏引擎、模拟器或 ...
OpenAI、谷歌等深夜更新多款模型 展示开源、智能体、世界模型进展
Di Yi Cai Jing· 2025-08-06 04:59
Core Insights - Major AI companies released new products, showcasing shifts in product strategies, particularly OpenAI's transition to open-source models and Anthropic's focus on incremental updates [1][3] OpenAI - OpenAI launched two open-source models: gpt-oss-120b with 117 billion parameters and gpt-oss-20b with 21 billion parameters, both utilizing MoE architecture [2] - The gpt-oss-120b model can run on an 80GB GPU, while gpt-oss-20b can operate on consumer devices with 16GB memory, allowing local deployment on laptops and mobile phones [2] - These models achieved top-tier performance in benchmark tests, with gpt-oss-120b scoring close to or exceeding the closed-source o4-mini model [2] Anthropic - Anthropic introduced Claude Opus 4.1, marking a shift towards more frequent, incremental updates rather than focusing solely on major version releases [3] - The new model demonstrated improved capabilities in complex multi-step problem-solving and coding tasks, with a SWE-bench Verify score of 74.5%, surpassing the previous version [4] Google - Google launched Genie 3, its first world model allowing real-time interaction, building on previous models Genie 1 and Genie 2 [5] - Genie 3 can simulate diverse environments and natural phenomena, maintaining visual consistency for up to several minutes at 720p resolution [6] - Despite advancements, Genie 3 has limitations, such as restricted action space and challenges in simulating multiple agents in shared environments [9]
OpenAI、谷歌等深夜更新多款模型,展示开源、智能体、世界模型进展
Di Yi Cai Jing· 2025-08-06 04:49
Core Insights - The recent product launches by OpenAI, Anthropic, and Google indicate a shift in product strategies among major AI model developers, with a focus on open-source models and incremental updates [1][3][5] OpenAI - OpenAI has released two open-source models, gpt-oss-120b with 117 billion parameters and gpt-oss-20b with 21 billion parameters, both utilizing the MoE architecture [2] - The gpt-oss-120b model can run on a single 80GB GPU, while gpt-oss-20b can operate on consumer devices with 16GB memory, allowing for local deployment on laptops and mobile devices [2] - OpenAI's CEO, Sam Altman, emphasized the importance of releasing powerful open-source models, which are the result of billions of dollars in research [1][2] Anthropic - Anthropic has shifted its strategy to focus on more frequent incremental updates rather than solely major version releases, exemplified by the launch of Claude Opus 4.1 [3] - Claude Opus 4.1 shows improvements in coding capabilities, scoring 74.5% on the SWE-bench Verify benchmark, surpassing its predecessor [4] - The new model is designed to handle complex multi-step problems more effectively, positioning it as a more capable AI agent [3][4] Google - Google introduced Genie 3, its first world model that supports real-time interaction, building on previous models like Genie 1 and Genie 2 [5] - Genie 3 can simulate diverse interactive environments and model physical properties, allowing for realistic navigation and interaction within generated worlds [5][6] - Despite its advancements, Google acknowledges limitations in Genie 3, such as restricted action spaces and challenges in simulating multiple agents in shared environments [9]
AI混战日
Hu Xiu· 2025-08-06 04:37
Core Insights - August 5 marks a significant moment in the evolution of AI technology and business competition, with Google, Anthropic, and OpenAI releasing important models within 24 hours [2][22] - The competition is shifting from a singular focus on "which model is stronger" to a more complex and diverse landscape, showcasing different evolutionary directions in AI [2][22] OpenAI GPT-oss - OpenAI released GPT-oss, a 13 billion parameter dense model, which is not a state-of-the-art model but is positioned against Llama 3 8B or Qwen2 7B [3] - The significance of GPT-oss lies not in its performance but in the OpenAI brand and its licensing terms, which restrict usage by large commercial entities [5][6] - This release marks a strategic shift for OpenAI, aiming to attract developers to its ecosystem through a usable open model while allowing seamless migration to more powerful closed models [6][8] - The launch of GPT-oss is a defensive move against open-source competitors like DeepSeek and Qwen, which threaten OpenAI's developer base [8] Google Genie 3 - Google introduced Genie 3, a world model that allows real-time interaction within a generated 3D environment, showcasing significant technological imagination [9][10] - Genie 3 can create a cohesive 3D world based on a single image or text description, enabling users to interact with it using natural language commands [12][14] - This model represents a major advancement in AI's ability to create interactive virtual spaces, potentially impacting game development, simulation, and robotics [15][18] - Genie 3 has received widespread acclaim, with experts describing it as a "quantum leap" in AI capabilities [16] Anthropic Claude 4.1 Opus - Anthropic launched Claude 4.1 Opus, aimed at becoming the leading programming assistant, achieving an impressive score of 85.2% on the HumanEval+ benchmark [19][20] - Claude 4.1 outperforms its predecessor and other competitors in various coding and reasoning tasks, indicating a significant enhancement in its capabilities [20][21] - The model is designed to be faster and more cost-effective, improving efficiency and value for developers and enterprise users [21] Industry Dynamics - The simultaneous releases of these models indicate a shift in strategy among major companies, moving away from direct competition to a more collaborative focus on maximizing attention [22][24] - Companies are beginning to define their roles more clearly, with Anthropic focusing on programming, OpenAI building an ecosystem, and Google investing in next-generation breakthroughs [25] - The advancements in AI models signal a more vibrant and competitive landscape in the industry [26]
御三家打起来了:OpenAI 开源、谷歌发布可交互的世界模型、Claude 4.1 成了编程新旗舰
Founder Park· 2025-08-06 03:43
同一天,硅谷模型三巨头连续发布了新的模型(到底也不知道谁截胡谁了)。 OpenAI 终于发布了新的开源模型,gpt-oss-120b 和 gpt-oss-20b,上次开源 GPT-2 已经是 6 年前的事情了。从目前的评测成绩来看,两款模型能力接近 o4- mini,虽然编程能力略弱,但这个 SOTA 级别的能力表现,很期待接下来的开源生态的发展。 DeepMind 也发了个大招,一个看起来基本进入可用阶段的世界模型 Genie 3,一句话直接生成可交互的 3D 世界、角色和道具,目前尚未对外开放,但演 示片很震撼。 Claude 发布了旗舰模型 Opus 的小版本升级——Claude Opus 4.1,编程能力依旧没得说,这次强化了 Agent 能力。 接下来,该期待 DeepSeek R2 了。 文章内容编译自「机器之心」、部分官博文章。 超 10000 人的「AI 产品市集」社群!不错过每一款有价值的 AI 应用。 邀请从业者、开发人员和创业者,飞书扫码加群: 进群后,你有机会得到: 01 OpenAI 开源两个推理模型, o4-mini 水平 最新、最值得关注的 AI 新品资讯; 不定期赠送热门新品的 ...
震撼,世界模型第一次超真实地模拟了真实世界:谷歌Genie 3昨晚抢了OpenAI风头
3 6 Ke· 2025-08-06 03:17
昨晚十点,谷歌 DeepMind 重磅宣布其 Genie 世界模型系列正式来到了第 3 代。 「Genie 3是我们突破性的世界模型,可以通过单个文本提示词创建交互式、可玩的环境。从照片般逼真的风景到奇幻的境界,可能性无穷无尽。」 相比于前一代 Genie 2 世界模型、使用扩散模型的游戏生成引擎 GameNGen 以及视频生成模型 Veo,最新的 Genie 3 在多个特性上都具有明显优势。 | | GameNGen | Genie 2 | Veo | Genie 3 | | --- | --- | --- | --- | --- | | Resolution | 320p | 360p | 720p to 4K | 720p | | Domain | Game-specific | 3D Environments | General | General | | Control | Game-specific | Limited keyboard / mouse actions | Video-level description* | Navigation; Promptable world events ...
散户疯狂、科技巨头分化,AI推动的美股牛市到顶了吗?
Hu Xiu· 2025-08-06 03:12
美股科技巨头的这一轮疯涨还能持续多久? 8月1日,美股大幅下跌、万亿美元市值蒸发。这场恐慌背后,是特朗普因就业数据"断崖式下修"怒炒劳工统计局局长,进而引发各界对官方数据的信任危 机。 与此同时,科技巨头们正值财报发布季,AI军备竞赛也不断推进:微软、Meta、谷歌、亚马逊豪掷近4000亿美金用于AI基础设施。 市场恐慌、分化加剧的大背景下,疯狂的AI投入能否撑起高估值?M7公司形成新平衡,谁又能笑到最后? 本期我们将深扒财报季背后的真相:AI资本狂潮是蜜糖还是毒药?巨头们的分化背后,藏着怎样的投资逻辑?接下来,华尔街关注什么指标? 美股出现恐慌性抛售,就业数据崩盘与信任危机 8月1号大跌最直接的导火索,是美国的就业数据"崩了"。 非农业就业数据一直都对股市非常重要。它是衡量美国经济活力的关键指标,也是美联储加/降息的重要决策依据。比如之前,美联储就一直坚称劳动力 市场很好,所以不需要降息。 而8月1号公布的数据相当糟糕:7月非农就业人数增加7.3万人,大幅低于预期的10.4万人。 更重要的是,美国劳工统计局,还对之前已经公布的5月和6月的数据,进行了大幅下调:5月的新增就业人数,从原来公布的14.4万人修改 ...