34 large language models

DeepSeek-R1 Llama 70B (DeepSeek)
LLM · R1 · Llama · TRT-LLM · H100

Llama 3.3 70B Instruct (Meta)
LLM · 3.3 · TRT-LLM · H100

DeepSeek-R1 Qwen 32B (DeepSeek)
LLM · R1 · Qwen · TRT-LLM · H100

Qwen 2.5 14B Instruct (Qwen)
LLM · 2.5 · TRT-LLM · H100

Qwen 2.5 32B Coder Instruct (Qwen)
LLM · 2.5 · Coder · TRT-LLM · H100

Qwen 2.5 7B Math Instruct (Qwen)
LLM · 2.5 · Math · TRT-LLM · H100 MIG 40GB

Llama 3.1 8B Instruct (Meta)
LLM · 3.1 · Instruct · TRT-LLM · H100

Qwen 2.5 32B QwQ (Qwen)
LLM · 2.5 · QwQ · TRT-LLM · H100

DeepSeek-R1 (DeepSeek)
LLM · R1 · SGLang · H200

DeepSeek-R1 Qwen 7B (DeepSeek)
LLM · R1 · Qwen · TRT-LLM · H100 MIG 40GB

Llama 3.1 Nemotron 70B (NVIDIA)
LLM · 3.1 · Nemotron · A100

Llama 3.1 405B Instruct (Meta)
LLM · 3.1 · Instruct · H100

Ultravox v0.4 (Fixie)
LLM · 0.4 · vLLM · H100 MIG 40GB

Qwen 2.5 72B Instruct (Qwen)
LLM · 2.5 · TRT-LLM · H100

Qwen 2.5 72B Math Instruct (Qwen)
LLM · 2.5 · Math · TRT-LLM · H100

Qwen 2.5 14B Coder Instruct (Qwen)
LLM · 2.5 · Coder · TRT-LLM · H100

Qwen 2.5 32B Instruct (Qwen)
LLM · 2.5 · TRT-LLM · H100

Qwen 2.5 7B Coder Instruct (Qwen)
LLM · 2.5 · Coder · TRT-LLM · H100 MIG 40GB

Qwen 2.5 7B Instruct (Qwen)
LLM · 2.5 · TRT-LLM · H100 MIG 40GB

Mistral 7B Instruct (Mistral AI)
LLM · v3 · TRT-LLM · H100 MIG 40GB

Llama 3.1 70B Instruct (Meta)
LLM · 3.1 · Instruct · TRT-LLM · H100

Qwen 2.5 3B Instruct (Qwen)
LLM · 2.5 · TRT-LLM · A10G

DeepSeek-R1 Zero (DeepSeek)
LLM · R1 · Zero · SGLang · H200

DeepSeek-V3 (DeepSeek)
LLM · V3 · SGLang · H200

Pixtral 12B (Mistral AI)
LLM · Pixtral · vLLM · A100

Phi 3.5 Mini Instruct (Microsoft)
LLM · 3.5 · 128k · vLLM · A10G

Gemma 2 9B (Google)
LLM · vLLM · A100

Gemma 2 27B (Google)
LLM · vLLM · A100

Mixtral 8x7B Instruct (Mistral AI)
LLM · v1 · TRT-LLM · H100

Deploy any model in just a few commands

Avoid getting tangled in complex deployment processes. Deploy best-in-class open-source models and take advantage of optimized serving for your own models.

$ truss init --example stable-diffusion-2-1-base ./my-sd-truss
$ cd ./my-sd-truss
$ export BASETEN_API_KEY=MdNmOCXc.YBtEZD0WFOYKso2A6NEQkRqTe
$ truss push
INFO Serializing Stable Diffusion 2.1 truss.
INFO Making contact with Baseten 👋 👽
INFO 🚀 Uploading model to Baseten 🚀
Upload progress: 0% | | 0.00G/2.39G