29 large language models

Meta logoLlama 3.1 70B Instruct

LLM
3.1InstructTRT-LLMH100

Qwen LogoQwen 2.5 7B Math Instruct

LLM
2.5MathTRT-LLMH100 MIG 40GB

Qwen LogoQwen 2.5 14B Instruct

LLM
2.5TRT-LLMH100

Qwen LogoQwen 2.5 32B Coder Instruct

LLM
2.5CoderTRT-LLMH100

Meta logoLlama 3.1 8B Instruct

LLM
3.1InstructTRT-LLMH100

NVIDIA logoLlama 3.1 Nemotron 70B

LLM
3.1NemotronA100

Meta logoLlama 3.1 405B Instruct

LLM
3.1InstructH100

Fixie LogoUltravox v0.4

LLM
0.4vLLMH100 MIG 40GB

Meta logoLlama 3 70B Instruct

LLM
3TRT-LLMH100

Meta logoLlama 3 8B Instruct

LLM
3InstructTRT-LLMH100

Mistral AI logoMistral 7B Instruct

LLM
v3TRT-LLMH100 MIG 40GB

Qwen LogoQwen 2.5 14B Coder Instruct

LLM
2.5CoderTRT-LLMH100

Qwen LogoQwen 2.5 7B Coder Instruct

LLM
2.5CoderTRT-LLMH100 MIG 40GB

Qwen LogoQwen 2.5 72B Math Instruct

LLM
2.5MathTRT-LLMH100

Qwen LogoQwen 2.5 72B Instruct

LLM
2.5TRT-LLMH100

Qwen LogoQwen 2.5 32B Instruct

LLM
2.5TRT-LLMH100

Qwen LogoQwen 2.5 7B Instruct

LLM
2.5TRT-LLMH100 MIG 40GB

Qwen LogoQwen 2.5 3B Instruct

LLM
2.5TRT-LLMA10G

Mistral AI logoPixtral 12B

LLM
PixtralvLLMA100

Microsoft LogoPhi 3.5 Mini Instruct

LLM
3.5128kvLLMA10G

google logoGemma 2 9B

LLM
vLLMA100

google logoGemma 2 27B

LLM
vLLMA100

Hugging Face logoZephyr 7B Alpha

LLM
AlphaA10G

Mistral AI logoMixtral 8x7B Instruct

LLM
v1TRT-LLMH100

Deploy any model in just a few commands

Avoid getting tangled in complex deployment processes. Deploy best-in-class open-source models and take advantage of optimized serving for your own models.

$

truss init -- example stable-diffusion-2-1-base ./my-sd-truss

$

cd ./my-sd-truss

$

export BASETEN_API_KEY=MdNmOCXc.YBtEZD0WFOYKso2A6NEQkRqTe

$

truss push

INFO

Serializing Stable Diffusion 2.1 truss.

INFO

Making contact with Baseten 👋 👽

INFO

🚀 Uploading model to Baseten 🚀

Upload progress: 0% | | 0.00G/2.39G