Imagen 4

Search documents
X @Demis Hassabis
Demis Hassabis· 2025-07-25 22:15
Model Performance - Imagen 4 模型与 Ultra 在 Arena 排行榜上并列第一 [1] Product Updates - Google 更新了 Imagen 4 模型 [1] - 这些模型已在 Google AI Studio 和 Gemini API 中提供 [1]
X @Demis Hassabis
Demis Hassabis· 2025-07-23 00:59
AI Image Generation Capabilities - Imagen 4 is designed for rendering clear and readable text in AI-generated images [1] - The technology supports the creation of comics, cards, and custom memes with AI-generated text [1] Product Focus - Google Gemini App promotes its AI image generation feature [1] - The app encourages users to prompt their ideas for AI generation [1]
The sky’s the limit with Imagen 4 in the Gemini app. 🎈
Google· 2025-07-18 18:45
Product Update - Google Gemini 应用使用 Imagen 4 将 Super G 提升到新高度 [1] - 用户可以使用提示词 "Create an image of several crochet hot air balloons flying on a blue sky with sparse clouds" 来生成图像 [1] Technology Focus - 该技术使用了 Imagen 4 和 GenAI (Generative AI) [1]
小扎千亿挖人名单下一位:硅谷华人AI高管第一人
量子位· 2025-06-28 04:42
Core Insights - Meta, led by Mark Zuckerberg, is aggressively recruiting AI talent, including those previously poached by competitors like OpenAI and Google [1][2] - Zuckerberg is reaching out to former Meta AI executives and researchers to encourage their return to the company [3][4] - The urgency in Meta's recruitment efforts is highlighted by the recent struggles of its AI projects, particularly the Llama 4 model [18][22] Recruitment Strategy - Meta has restructured its AI teams into two main groups: an AI product team and an AGI Foundations team [25][28] - A new superintelligence lab has been established to develop AI systems that surpass human cognitive abilities [29] - The company is willing to offer substantial compensation packages, reportedly reaching up to $100 million for top talent [33][34] Competitive Landscape - Bill Jia, a prominent AI figure who left Meta for Google, has been instrumental in Google's AI advancements, making his return to Meta uncertain [8][10][17] - Google has made significant strides with its Gemini models, contrasting with Meta's recent setbacks [11][18] - Meta's AI department has expanded to over a thousand employees, reflecting its commitment to rebuilding its capabilities [32] Financial Moves - Meta has made substantial investments, including a $14.3 billion acquisition of a stake in Scale AI and attempts to acquire other AI startups [37] - The company is actively pursuing high-profile AI talent, with reports of multiple recruitment efforts targeting OpenAI researchers [38][40] Future Outlook - Despite recent challenges, Meta remains committed to its open-source strategy and plans to continue developing the Llama series [44] - The competitive landscape in AI is intensifying, with both Meta and Google focusing on innovative models and talent acquisition [45]
AI News: DeepSeek R2 Delayed, Meta Poaches from OpenAI, OpenAI Sued, Imagen 4, and more!
Matthew Berman· 2025-06-27 01:55
AI Model Development & Performance - Deepseek R2的发布因美国出口管制和CEO对其性能不满而被推迟[1] - Meta积极招募AI研究人员,包括从OpenAI挖走三名在苏黎世工作的研究员,他们之前曾在Google DeepMind工作[1] - Meta收购Scale AI,主要目的是为了获得其团队,此前Google和OpenAI已经取消了与Scale AI的合同[1] - Google发布了Imagine 4和Imagine 4 Ultra,这是其新的文本到图像模型,Imagine 4 Ultra的价格为每个输出图像 6 美分[6] - Google发布了Gemma 3N,这是一款高性能的小型开源模型,有两个版本,大小分别为 2 GB和 3 GB[10] - Google发布了Alpha Genome,这是一种新的统一DNA序列模型,可通过API使用,旨在预测人类DNA序列中突变对生物过程的影响[12][13] AI Industry Legal & Business Landscape - OpenAI计划转变为营利性公司以进行IPO,但需要获得微软的批准,微软拥有OpenAI模型到 2030 年的IP权利和 20% 的收入分成[1] - OpenAI考虑采取“核选项”,指控微软存在反竞争行为,如果微软在 6 个月内没有改进,OpenAI的投资将转为债务,软银承诺的 300 亿美元将减少到 100 亿美元[2] - OpenAI与Johnny Ive合作的硬件项目IO因商标投诉而暂停[2] - 一名联邦法官裁定,Anthropic使用书籍训练Claude的行为属于合理使用[16][17] AI Applications & Tools - 11 Labs推出了11 AI,这是一个完整的语音AI助手,旨在探索11 Labs会话AI技术的潜力[4] - Replet的年度经常性收入(ARR)达到了 1 亿美元,在 6 个月内从 1000 万美元增长到 1 亿美元[5] - Google发布了Gemini CLI,这是一个开源AI代理,类似于Claude Code,完全免费,提供每分钟 60 个请求,每天 1000 个模型请求的配额[14][15] - Anthropic发布了一篇关于人们如何使用AI模型进行情感支持的论文,其中 2.9% 的Claude使用案例用于人际关系建议、心理辅导、陪伴等[20][22]
X @Demis Hassabis
Demis Hassabis· 2025-06-25 23:57
Product Release - Google is launching Imagen 4 and Imagen 4 Ultra in the Gemini API + Google AI Studio [1] - Imagen 4 is available for free trial in AI Studio and in paid preview in the API [1]
刚刚,首个能在机器人上本地运行的具身Gemini来了
机器之心· 2025-06-25 00:46
Core Viewpoint - The article discusses the launch of Gemini Robotics On-Device, a new visual-language-action (VLA) model by Google DeepMind, designed for robots to operate efficiently without continuous internet connectivity [1][2]. Group 1: Product Overview - Gemini Robotics On-Device is the first VLA model that can be directly deployed on robots, enhancing their ability to adapt to new tasks and environments [2][4]. - The model is optimized for efficient operation on robotic hardware, showcasing strong general flexibility and task generalization capabilities [4][12]. - It can operate in environments with no data network, making it suitable for latency-sensitive applications [5]. Group 2: Developer Tools - Google will release the Gemini Robotics SDK, allowing developers to evaluate the model's performance in their specific tasks and environments [7]. - Developers can test the model in DeepMind's MuJoCo physics simulator, requiring only 50 to 100 demonstrations to adapt to new tasks [7][21]. Group 3: Performance and Adaptability - Gemini Robotics On-Device has demonstrated strong performance in various dexterous tasks, such as unzipping bags and folding clothes, all executed directly on the robot [12][16]. - The model shows significant advantages over previous local robot models, especially in challenging out-of-distribution tasks and complex multi-step instructions [15][16]. - It can be fine-tuned for improved performance and can adapt to different robotic platforms, including the Franka FR3 and Apollo humanoid robots [25][26]. Group 4: Updates and Changes - Alongside the new model, Google DeepMind has reduced the free usage limits for its Gemini 2.5 Flash and Gemini 2.0 Flash models, which may not be well-received by free users [30][32]. - The company has also announced the launch of new image generation models, Imagen 4 and Imagen 4 Ultra, in its AI Studio and Gemini API [33].
冠军队独享200w?这波是冲大学生来的,超千支队伍已组队报名
量子位· 2025-06-23 08:11
有,你别说还真有。 那就是 大模型变现 。而且更细分的赛道已经很明确了—— 这不最近硅谷大厂都盯上了用 AI打广告 这门生意。 ChatGPT聊着聊着开始带货: 谷歌劈柴哥在IO大会宣布要用AI将内容和广告深度融合。Meta已经披露了实打实的数据,2024第四季度广告营收 增长21% ,都是得益于AI 的优化。 生成式AI一来,打广告的姿势变了,商业模式底层技术的探索空间,空前巨大。 普通人有机会吗?有,而且是专门面向 在校学生 的那种。 明敏 发自 凹非寺 量子位 | 公众号 QbitAI 就说当今之势,还有比搞大模型 更有前途 的吗? 不仅有业内资深专家指导、接触实际工业数据,从小白直接变成领域内小专家,还能有奖金以及直通offer。 用大模型打广告搞钱,有啥机遇? 用大模型搞钱姿势千千万,为啥生成式AI+广告这条路值得关注? 最首要的,有人已经赚到钱了,实打实的营收增长正在发生。 Meta的2024年Q4财报数据显示, 广告收入占整体营收的96.7%,约468亿美元,同比增长21% 。 背后核心驱动因素是 AI 。 2024年12月,Meta官方披露了与英伟达合作的广告投放系统Andromeda。这是一 ...
A surreal slice of life, made with Imagen 4 in the Gemini app 🍊
Google· 2025-06-17 17:16
Social Media Presence - Google encourages users to subscribe to its YouTube channel [1] - Google maintains active accounts on X (formerly Twitter) [1] - Google utilizes TikTok for content sharing [1] - Google engages with users on Instagram [1] - Google connects with its audience on Facebook [1]
3个趋势,看AI到底是怎么重构广告行业的?
3 6 Ke· 2025-06-11 09:42
Core Insights - Google's AI strategy is undergoing a significant transformation, moving towards a new phase of AI platform integration that fundamentally redefines its advertising and content generation models [1][2][5] Group 1: Advertising Evolution - The evolution of Google's advertising system has transitioned from keyword bidding (AdWords) to automated content generation and multi-channel advertising with the introduction of Performance Max in 2021 [2][3] - The recent I/O 2025 conference showcased AI tools that automate the creative process, allowing brands to reduce costs and enhance efficiency while fostering innovation in content production [2][4] Group 2: Personalization Shift - The advertising paradigm is shifting from "mass personalization" to "hyper-personalization," where ads are tailored to individual users rather than demographic groups [3][5] - Google's AI capabilities, integrated into search interfaces, enable personalized product recommendations based on user intent, enhancing the relevance of advertisements [3][6] Group 3: Integration of Ads and Content - Ads are becoming an integral part of the search experience, merging with AI-generated content to provide users with useful information rather than standalone advertisements [6][8] - This integration challenges traditional SEO and alters the advertising value assessment, as AI improves content matching and user intent understanding [6][8] Group 4: Future of Advertising - Brands need to adapt by creating proprietary AI agents that align with their marketing strategies, ensuring consistency in automated content generation and ad placements [7][9] - The focus for advertisers is shifting towards being featured in AI-generated responses, emphasizing the importance of discoverability and authority in the AI ecosystem [8][9]