NVIDIA NIM™

Search documents
NVIDIA Partners With Novo Nordisk and DCAI to Advance Drug Discovery
Globenewswire· 2025-06-11 10:52
Core Viewpoint - NVIDIA collaborates with Novo Nordisk to enhance drug discovery through advanced AI technologies, utilizing the Gefion supercomputer for innovative research and development applications [1][3][13] Group 1: Collaboration and Technology - The partnership aims to create customized AI models for early research and clinical development, leveraging advanced simulation and physical AI technologies [2] - Gefion, powered by NVIDIA DGX SuperPOD™, serves as an AI factory for Novo Nordisk, enabling the execution of drug discovery and agentic AI workloads [3][5] - NVIDIA's tools, including BioNeMo™, NIM™, and NeMo™, will facilitate generative AI-powered drug discovery and the development of customized workflows [3] Group 2: Research Focus - Novo Nordisk researchers will utilize single-cell models to predict cellular responses to drug candidates and design models for drug-like molecules [4] - The collaboration will also focus on building biomedical large language models from Novo Nordisk's extensive scientific literature to identify correlations between genes, proteins, and diseases [4] Group 3: Impact on Healthcare - Gefion's computational power is expected to address significant R&D challenges, aiming to unlock new possibilities in pharmaceutical research and development [6] - Danish startup Teton is using Gefion to develop an AI care companion for hospitals, which has shown a 25% reduction in nightshift duties for nurses [7] - Gefion will support efforts to unify health data across Danish health organizations, facilitating secure access to interconnected health data for research [9]
NVIDIA DGX Cloud Lepton Connects Europe's Developers to Global NVIDIA Compute Ecosystem
Globenewswire· 2025-06-11 10:09
Core Insights - NVIDIA announced the expansion of its DGX Cloud Lepton, an AI platform that connects developers with a global compute marketplace for building AI applications [1][5] - The platform now includes contributions from various cloud providers, enhancing access to high-performance computing resources [2][8] - Hugging Face introduced Training Cluster as a Service, integrating with DGX Cloud Lepton to facilitate AI model training for researchers [3][10] Company Developments - NVIDIA collaborates with European venture capital firms to provide marketplace credits to startups, promoting regional development in AI [4][11] - The DGX Cloud Lepton platform simplifies access to GPU resources, supporting data governance and sovereign AI requirements [5][6] - The platform integrates with NVIDIA's software suite, streamlining AI application development and deployment [6][7] Industry Impact - The DGX Cloud Lepton marketplace aims to meet the growing demand for AI compute resources, with major cloud providers like AWS and Microsoft Azure participating [2][8] - Early-access customers include various AI companies leveraging the platform for strategic initiatives [8][9] - The integration with Hugging Face allows for scalable AI training, enhancing the capabilities of researchers in various scientific fields [10][11]
NVIDIA Partners With Europe Model Builders and Cloud Providers to Accelerate Region's Leap Into AI
Globenewswire· 2025-06-11 09:57
Core Insights - NVIDIA is collaborating with model builders and cloud providers in Europe and the Middle East to enhance sovereign large language models (LLMs), aiming to boost enterprise AI adoption across various industries in the region [1][5][16] Group 1: Partnerships and Collaborations - Key partnerships include organizations such as Barcelona Supercomputing Center, Bielik.AI, Dicta, and several universities, which will utilize NVIDIA's Nemotron techniques to improve model efficiency and accuracy for enterprise AI workloads [2][6][16] - The collaboration aims to create an integrated regional AI ecosystem that reflects local languages and cultures, supporting Europe's 24 official languages [6][16] Group 2: Technology and Infrastructure - The LLMs will be optimized using NVIDIA's Nemotron model-building techniques, which include neural architecture search and reinforcement learning, to enhance operational efficiency and user experience [7][16] - Post-training and inference will be conducted on AI infrastructure provided by NVIDIA Cloud Partners, ensuring localized support for the models [3][7] Group 3: Application and Impact - The sovereign models will be integrated into Perplexity, an AI-powered answer engine that processes over 150 million questions weekly, enhancing the accuracy of search queries and AI outputs for European enterprises [4][9][10] - The initiative is expected to empower innovation in Europe by providing AI solutions that are developed and operated locally, thereby transforming various industries [5][10]
NVIDIA Announces DGX Cloud Lepton to Connect Developers to NVIDIA's Global Compute Ecosystem
GlobeNewswire News Room· 2025-05-19 04:43
Core Insights - NVIDIA announced the launch of NVIDIA DGX Cloud Lepton™, an AI platform that connects developers with a global network of cloud providers offering tens of thousands of GPUs [1][3] - The platform aims to meet the increasing demand for AI by providing access to GPU compute capacity for both on-demand and long-term computing needs [2][3] - NVIDIA's CEO, Jensen Huang, emphasized the platform's role in building a planetary-scale AI factory by unifying access to cloud AI services and GPU resources [3] Platform Features - DGX Cloud Lepton integrates with NVIDIA's software stack, including NVIDIA NIM™ and NeMo™ microservices, to facilitate the development and deployment of AI applications [3] - The platform offers management software for cloud providers that includes real-time GPU health diagnostics and automates root-cause analysis, reducing downtime [4] - Key benefits include improved productivity and flexibility, frictionless deployment across multi-cloud environments, and predictable performance for enterprise-grade applications [8] Partnerships and Market Impact - NVIDIA Cloud Partners (NCPs) such as CoreWeave, Foxconn, and Softbank Corp. will provide NVIDIA Blackwell and other GPUs on the DGX Cloud Lepton marketplace [2][7] - Yotta Data Services is the first NCP in the Asia-Pacific region to join the NVIDIA Exemplar Cloud initiative, which aims to enhance security, usability, and performance for cloud partners [5][7] - The platform is expected to attract leading cloud service providers and GPU marketplaces, further expanding its reach and capabilities [3][8]
NVIDIA Dynamo Open-Source Library Accelerates and Scales AI Reasoning Models
Globenewswire· 2025-03-18 18:17
Core Insights - NVIDIA has launched NVIDIA Dynamo, an open-source inference software aimed at enhancing AI reasoning models' performance and cost efficiency in AI factories [1][3][13] - The software is designed to maximize token revenue generation by orchestrating inference requests across a large fleet of GPUs, significantly improving throughput and reducing costs [2][3][4] Performance Enhancements - NVIDIA Dynamo doubles the performance and revenue of AI factories using the same number of GPUs when serving Llama models on the NVIDIA Hopper platform [4] - The software's intelligent inference optimizations can increase the number of tokens generated by over 30 times per GPU when running the DeepSeek-R1 model [4] Key Features - NVIDIA Dynamo includes several innovations such as a GPU Planner for dynamic GPU management, a Smart Router to minimize costly recomputations, a Low-Latency Communication Library for efficient data transfer, and a Memory Manager for cost-effective data handling [14][15] - The platform supports disaggregated serving, allowing different computational phases of large language models to be optimized independently across various GPUs [9][14] Industry Adoption - Major companies like Perplexity AI and Together AI are planning to leverage NVIDIA Dynamo for enhanced inference-serving efficiencies and to meet the compute demands of new AI reasoning models [8][10][11] - The software supports various frameworks including PyTorch and NVIDIA TensorRT, facilitating its adoption across enterprises, startups, and research institutions [6][14]