Llama 3.3 Nemotron 49B Super - NVIDIA NIM
A high-efficiency model with leading accuracy for reasoning, tool calling, chat, and instruction following.
Deploy Llama 3.3 Nemotron 49B Super - NVIDIA NIM behind an API endpoint in seconds.
Llama 3.3 Nemotron 49B Super is an NVIDIA NIM large language model (LLM) derived from Llama 3.3 70B Instruct. It can be deployed with early access on Baseten.
Llama 3.3 Nemotron is a reasoning model post-trained for enterprise AI agent use cases, including reasoning, tool calling, chat, and instruction following tasks with a 128k token context length.