NVIDIA Llama Nemotron

Search documents
 NVIDIA Launches Family of Open Reasoning AI Models for Developers and Enterprises to Build Agentic AI Platforms
 Globenewswire· 2025-03-18 19:10
 Core Insights - NVIDIA has launched the Llama Nemotron family of models, which are designed to provide advanced AI reasoning capabilities for developers and enterprises [1][4] - The new models enhance multistep math, coding, reasoning, and complex decision-making through extensive post-training, improving accuracy by up to 20% and optimizing inference speed by 5x compared to other leading models [2][3]   Model Features - The Llama Nemotron model family is available in three sizes: Nano, Super, and Ultra, each tailored for different deployment needs, with the Nano model optimized for PCs and edge devices, the Super model for single GPU throughput, and the Ultra model for multi-GPU servers [5] - The models are built on high-quality curated synthetic data and additional datasets co-created by NVIDIA, ensuring flexibility for enterprises to develop custom reasoning models [6]   Industry Collaboration - Major industry players such as Microsoft, SAP, and Accenture are collaborating with NVIDIA to integrate Llama Nemotron models into their platforms, enhancing AI capabilities across various applications [4][7][8][10] - Microsoft is incorporating these models into Azure AI Foundry, while SAP is using them to improve its Business AI solutions and AI copilot, Joule [7][8]   Deployment and Accessibility - The Llama Nemotron models and NIM microservices are available as hosted APIs, with free access for NVIDIA Developer Program members for development, testing, and research [12] - Enterprises can run these models in production using NVIDIA AI Enterprise on accelerated data center and cloud infrastructure, with additional tools and software to facilitate advanced reasoning in collaborative AI systems [16]

