Workflow
Open Models
icon
Search documents
X @Demis Hassabis
Demis Hassabis· 2025-08-15 23:45
Model Performance & Efficiency - Gemma 3 270M establishes a new benchmark for instruction-following among compact models [1] - The model is highly efficient for specialized tasks [1] - It is compact and power efficient, facilitating the deployment of fine-tuned systems on edge devices [1] Model Size & Capabilities - Gemma 3 270M is a new hyper-efficient addition to the Gemma open model family [1] - The model packs a real punch for its tiny size [1]
The Rise of Open Models in the Enterprise — Amir Haghighat, Baseten
AI Engineer· 2025-07-24 15:30
AI Adoption in Enterprises - Enterprises' adoption of AI is crucial for realizing AI's full potential and impact [2] - Enterprises initially experiment with OpenAI and Anthropic models, often deploying them on Azure or AWS for security and privacy [7] - In 2023, enterprises were "toying around" with AI, but by 2024, 40-50% had production use cases built on closed models [9][10] Challenges with Closed Models - Vendor lock-in is not a primary concern for enterprises due to the increasing number of interoperable models [12][13] - Ballooning costs, especially with agentic use cases involving potentially 50 inference calls per user action, are becoming a significant concern [20] - Enterprises are seeking differentiation at the AI level, not just at the workflow or application level, leading them to consider in-house solutions [21] Reasons for Open Source Model Adoption - Frontier models may not be the right tool for specific use cases, such as medical document extraction, where enterprises can leverage their labeled data to build better models [16][17] - Generic API-based models may not suffice for tasks requiring low latency, such as AI voices or AI phone calls [18] - Enterprises aim to reduce costs and improve unit economics by running models themselves and controlling pricing [20][21] Inference Infrastructure Challenges - Optimizing models for latency requires both model-level and infrastructure-level optimizations, such as speculative decoding techniques like Eagle 3 [23][24][25][26] - Guaranteeing high availability (four nines) for mission-critical inference requires robust infrastructure to handle hardware failures and VLM crashes [27][28] - Scaling up quickly to handle traffic bursts is challenging, with some enterprises experiencing delays of up to eight minutes to bring up a new replica of a model [29]
The State of Agentic AI
DDN· 2025-05-15 19:50
AI Development & Trends - The modern AI movement significantly accelerated with the advent of ChatGPT two and a half years ago [1] - Open models like LLaMA and Mistral are democratizing AI, making it accessible for broader deployment on-premise or in the cloud [1] - The industry is evolving from Retrieval Augmented Generation (RAG) models to agentic AI, focusing on making AI more actionable and integrated with enterprise data [1] - DeepSeek, an open model, has emerged as a significant advancement in AI, demonstrating reasoning capabilities comparable to proprietary models [1] Enterprise AI Adoption & Impact - AI is increasingly integrated into applications used by a billion knowledge workers, enhancing productivity [1] - 50% of organizations are expected to leverage AI agents to derive better value by 2025 [1] - AI is facilitating quicker content creation, exemplified by the ease of producing videos for platforms like YouTube [1] - NVIDIA focuses on providing a platform for partners to build AI-powered solutions, rather than developing end-user applications [1] Data & Infrastructure - Enterprise data is growing massively, with 11 zettabytes created, highlighting the need for AI to leverage this unstructured data [2] - NVIDIA emphasizes high-performance ingestion tools and data efficiency improvements (35% data improvement) to ensure accurate and reliable AI outputs [2] - NVIDIA Enterprise provides enterprise-ready AI solutions with constant CVEs, tech support, and ABI stability, ensuring solutions built today will continue to work [2] Customer Solutions & Partnerships - NVIDIA partners with companies like DDN to deliver AI solutions to customers, focusing on solving customer problems and driving revenue [1][2] - NVIDIA's Nemo solutions and custom models have enabled partners like Justt.ai to achieve rapid growth and customer adoption [2] - SAP is leveraging NVIDIA's AI to improve its ABAP programming language, helping customers clean up code and solve 80% of coding problems [2]
NVIDIA, Alphabet and Google Collaborate on the Future of Agentic and Physical AI
Globenewswire· 2025-03-18 19:27
Core Insights - NVIDIA, Alphabet, and Google are launching new initiatives to enhance AI, democratize access to AI tools, and transform various industries including healthcare, manufacturing, and energy [1][20] AI and Robotics Development - Engineers and researchers from Alphabet and NVIDIA are collaborating to develop robots with advanced grasping skills and optimize energy grids using AI and simulation technologies [2] - The partnership includes the use of NVIDIA platforms such as Omniverse™, Cosmos™, and Isaac™ to facilitate these developments [2] AI Infrastructure and Tools - Google Cloud will adopt NVIDIA's GB300 NVL72 rack-scale solution and RTX PRO™ 6000 Blackwell Server Edition GPU to enhance AI research and production capabilities [3][15] - NVIDIA will implement Google DeepMind's SynthID technology for watermarking AI-generated content, ensuring intellectual property protection [3][6] Open Models and Innovation - Google DeepMind and NVIDIA are optimizing Gemma, a family of lightweight open models, to run efficiently on NVIDIA GPUs, enhancing accessibility for developers [7] - The collaboration aims to improve the performance of AI models and facilitate their integration into various applications [7] Robotics and Manufacturing - Intrinsic, an Alphabet company, is focused on creating adaptive AI for robotics, aiming to simplify the programming of industrial robots [9] - The partnership with NVIDIA aims to enhance developer workflows and support universal robot grasping capabilities, significantly reducing application development time [10] Drug Discovery and Healthcare - Isomorphic Labs is leveraging AI for drug discovery, utilizing a state-of-the-art drug design engine on Google Cloud with NVIDIA GPUs to advance human health [12] Energy Solutions - Tapestry, a project under Alphabet's X, is working with NVIDIA to develop AI-powered solutions for optimizing electric grid simulations and integrating new energy sources [13][14] Advanced AI Infrastructure - Google Cloud is among the first to offer NVIDIA's latest Blackwell GPUs, which provide significant performance improvements for AI applications [15][16] - The GB300 NVL72 delivers 1.5 times more AI performance compared to its predecessor, enhancing revenue opportunities for AI factories [16] Collaboration and Future Directions - The ongoing partnership between NVIDIA and Alphabet is set to advance agentic AI and physical AI applications across various sectors [20]