Gemma

Search documents
X @Polyhedra
Polyhedra· 2025-10-02 12:00
5/Gemma – Model Execution & ValidationFixed shape inference errors in quantized model GPU execution.Added graph information completion interfaces for quantized models.Validated MobileNet circuit memory usage and correctness.Stay tuned for more updates 🚀 ...
X @Polyhedra
Polyhedra· 2025-10-02 12:00
This week’s work focused on Polyhedra i-D Mobile Integration and Gemma improvements. Here’s a breakdown of what we shipped. https://t.co/v7gu3lneUM ...
X @Avi Chawla
Avi Chawla· 2025-09-29 06:33
You're in a Research Scientist interview at OpenAI.The interviewer asks:"Our investors want us to contribute to open-source.o3 crushed benchmarks.But we can lose a competitive edge by open-sourcing it.What do we do?"You: "Release the research paper."Interview over.You forgot that LLMs don't just learn from raw text; they also learn from each other.For example:- Llama 4 Scout & Maverick were trained using Llama 4 Behemoth.- Gemma 2 and 3 were trained using Gemini.Distillation helps us do so, and the visual e ...
Meta(META.US)自研AI落后信号?据传拟采用竞争对手谷歌(GOOGL.US)Gemini模型优化广告业务
智通财经网· 2025-09-26 00:17
智通财经APP获悉,据《The Information》报道,Meta(META.US)已与谷歌(GOOGL.US)展开磋商,计 划采用谷歌的Gemini人工智能模型,以增强自身的广告定向精准度。 消息公布后,Meta股价在盘后交易中下跌,截至发稿,Meta夜盘下跌0.3%,而谷歌股价则上涨近1%。 该媒体援引知情人士补充称,这场磋商发生在Meta部分员工与谷歌云(Google Cloud)团队之间——值得 注意的是,Meta与谷歌在数字广告领域本是直接竞争对手。目前磋商已取得进展,这一动态也成为 Meta自研AI模型落后于竞品的最新信号:在自身技术追赶不及的情况下,这家由马克·扎克伯格领导的 公司至少已开始探讨"采用竞争对手技术"的可能性。 一名知情人士透露,Meta部分员工提议,利用Meta的广告数据对谷歌的Gemini及Gemma模型进行优化 调整;另有消息人士表示,Meta正评估是否通过Gemini模型提升内容理解能力。 值得关注的是,上月已有报道称,Meta与谷歌云签署了一项为期六年的重大云计算合作协议,价值超 100亿美元。 ...
X @Avi Chawla
Avi Chawla· 2025-09-12 20:01
RT Avi Chawla (@_avichawla)- All Meta Llama models use Attention- All OpenAI GPT models use Attention- All Alibaba Qwen models use Attention- All Google Gemma models use AttentionLet's learn how to implement it from scratch: ...
X @Avi Chawla
Avi Chawla· 2025-09-12 06:30
模型架构 - Meta Llama 模型全部使用 Attention 机制 [1] - OpenAI GPT 模型全部使用 Attention 机制 [1] - Alibaba Qwen 模型全部使用 Attention 机制 [1] - Google Gemma 模型全部使用 Attention 机制 [1]
GoogleI/OConnectChina2025:智能体加持,开发效率与全球化双提升
Haitong Securities International· 2025-08-22 06:30
Investment Rating - The report does not explicitly provide an investment rating for the industry or specific companies discussed Core Insights - The Google I/O Connect China 2025 event highlighted advancements in AI model innovation, developer tool upgrades, and the globalization of the ecosystem, particularly focusing on the Gemini 2.5 series and the Gemma open model series [1][16] - Gemini 2.5 architecture enhances multimodal and reasoning capabilities, achieving unified embeddings and cross-modal attention across various modalities, significantly improving understanding and generation accuracy [2][17] - Gemma offers openness and extensibility, allowing developers to fine-tune models for specific domains such as healthcare and education, with derivative models showcasing broad applicability [3][18] - AI-driven development tools have been integrated into core workflows, enhancing productivity through features like task decomposition and code synthesis in Firebase Studio, and semantic code analysis in Chrome DevTools [4][19] - Generative content models, including Lyria, Veo3, and Imagen 4, are designed to strengthen the creative ecosystem, particularly for content-focused teams looking to expand globally [4][20] Summary by Sections AI Model Innovation - The Gemini 2.5 series features enhanced cross-modal processing and faster response times, improving the overall efficiency of AI applications [1][16] - The architecture integrates Chain-of-Thought reasoning and structured reasoning modules, enhancing logical consistency and multi-step reasoning performance [2][17] Developer Tool Upgrades - Firebase Studio's agent mode allows for automatic prototype generation from natural language prompts, while Android Studio introduces BYOM (Bring Your Own Model) for flexible model selection [4][19] - Chrome DevTools now includes a Gemini assistant for semantic code analysis and automatic fixes, significantly improving front-end debugging efficiency [4][19] Global Expansion of AI Ecosystem - The report emphasizes the appeal of Google's generative multimedia models for content creation, particularly in enhancing productivity for short-video production, e-commerce marketing, and game exports [4][20]
X @Demis Hassabis
Demis Hassabis· 2025-08-06 00:38
Adoption Rate - Gemma model downloads surpassed 200 million [1] - The adoption speed is considered incredible [1] Community & Future Development - The community is actively building exciting use cases and projects with Gemma [1] - The company is just getting started and seeking community input on future development [1]
What’s New in Google Accessibility | Episode 9 | American Sign Language
Google· 2025-07-16 14:03
Accessibility Innovations - Google is releasing SignGemma, an open model for sign language understanding, focusing on American Sign Language (ASL) and English, with plans to translate other sign languages into spoken language text [1][2] - Android expands Gemini integration into TalkBack screen reader, providing AI-generated descriptions for images and the entire screen, enabling conversational questions and responses [4] - Expressive Captions on Android now capture the intensity and nuance of speech, including emphasis and sounds like whispering or yawning [5][6] - Pixel's Magnifier app introduces live search, highlighting matches on the screen and vibrating when something is found, aiding blind and low vision users [6][7] - Project Astra Visual interpreter, in collaboration with Aira, is being tested to provide real-time descriptions of surroundings for blind and low-vision users, supervised by live Aira agents [8][9][10] Chrome and Chromebook Updates - Chrome now supports Optical Character Recognition (OCR) for scanned PDFs, allowing screen readers to interact with them [11][12] - Chromebooks now offer the ability to turn off the touchpad and flash the screen for new notifications [12] - New Chromebook features cater to users with limited dexterity and/or tremors, including Bounce Keys, Slow Keys, and Mouse Keys [13] Workspace Enhancements - Workspace allows users to embed interactive Google Calendars into websites, with screen-reader compatibility, improved spacing, and responsive layout [14]
What’s New in Google Accessibility | Episode 9
Google· 2025-07-16 14:02
Accessibility Innovations - Google is releasing SignGemma, an open model for sign language understanding, initially focusing on American Sign Language (ASL) and English, with the potential for community-driven adaptation to other sign languages [1][2] - Android's TalkBack screen reader now integrates Gemini to provide AI-generated descriptions of the entire screen, enabling conversational follow-up questions [4] - Expressive Captions on Android now capture the intensity and nuance of speech, including drawn-out sounds and subtle vocalizations like whispering and yawning [5][6] - The Pixel's Magnifier app introduces live search, allowing blind and low-vision users to type what they're looking for and receive real-time highlights and vibrations when matches are found [6][7] - Project Astra Visual Interpreter, in collaboration with Aira, is being tested to provide real-time descriptions of surroundings for blind and low-vision users, supervised by live Aira agents [8][9][10] Chrome and Chromebook Updates - Chrome now supports Optical Character Recognition (OCR) for scanned PDFs, enabling screen readers to interact with the text [11][12] - Chromebooks now offer the ability to turn off the touchpad, flash notifications for new alerts, and features like Bounce Keys, Slow Keys, and Mouse Keys to assist users with limited dexterity and/or tremors [12][13] Workspace Enhancements - Google Workspace allows users to embed interactive, screen-reader compatible Google Calendars into websites, featuring improved spacing, responsive layouts, and keyboard shortcut navigation [14]