NVIDIA Unveils Llama Nemotron: Open-Source Agentic AI for the Enterprise

May 26, 2025

Artificial intelligence is entering a new era—agentic AI—where teams of specialized agents can work together to solve complex problems and automate tasks across the enterprise. To drive this transformation, NVIDIA has announced the Llama Nemotron family: a new set of open, enterprise-grade large language models (LLMs) built with Meta’s Llama foundation.

Agentic AI: The Next Frontier

Agentic AI moves beyond basic chatbots by enabling multiple, coordinated AI agents that can reason, perceive, and act in the world—revolutionizing how businesses approach automation and problem-solving. These advanced agents demand robust, efficient, and customizable language and vision models.

Introducing Llama Nemotron and Cosmos Nemotron

Llama Nemotron: Optimized for enterprise use, these models excel at instruction following, chat, function calling, code, and math. They are pruned, distilled, and aligned using NVIDIA NeMo to boost agentic performance, accuracy, and throughput—while remaining compact enough for deployment across NVIDIA GPUs, from edge devices to data centers.
Cosmos Nemotron VLMs: NVIDIA’s new vision-language models and NIM microservices allow AI agents to analyze and respond to images and video for tasks in robotics, healthcare, retail, logistics, sports, and more.

Key Features and Benefits

Enterprise-Grade, Open Models: Built on Llama—one of the world’s most popular open LLMs—Nemotron is designed for commercial viability, with over 650 million downloads to date.
Optimized for Agentic AI: Supports advanced reasoning, multi-step workflows, and seamless integration of language and vision, empowering businesses to automate complex processes.
Flexible Deployment: Available as downloadable models or as NVIDIA NIM microservices for clouds, data centers, workstations, and even PCs—ensuring enterprise-grade security and integration.
Customizable at Every Scale: Three sizes—Nano (edge devices), Super (single GPU), and Ultra (highest-accuracy, data center)—enable deployment for any workload.

Customization and Enterprise Tools

Enterprises can easily fine-tune Nemotron models to their unique data and use cases using NVIDIA NeMo microservices. NeMo Retriever enables retrieval-augmented generation for connecting AI to real business knowledge. And with NVIDIA Blueprints, organizations can quickly build end-to-end agentic applications—including advanced video search and summarization.

Industry Impact and Partnerships

Major platform providers like SAP and ServiceNow are integrating Llama Nemotron models to power next-generation enterprise AI agents for customer support, fraud detection, supply chain management, and more. As these agents collaborate, enterprises can unlock new levels of productivity, efficiency, and automation.

Availability

The Llama Nemotron and Cosmos Nemotron model families will be available soon for free development and testing on build.nvidia.com and Hugging Face, with production deployment via NVIDIA AI Enterprise. Join the NVIDIA Developer Program to get notified about early access and updates.

Conclusion

As agentic AI becomes the new paradigm, NVIDIA’s Llama Nemotron models offer enterprises the tools and flexibility to lead the next wave of intelligent automation—delivering scalable, open, and powerful AI agents for real-world impact.