Glossary | Page 2

Topics

Latest Model performance Hacks & projects GPU guides ML models Glossary Community Product News

Introduction to quantizing ML models

Quantizing ML models like LLMs makes it possible to run big models on less expensive GPUs. But it must be done carefully to avoid quality reduction.

Abu Qader

1 other

Prompt: A steampunk microscope in a lab run by lord of the rings elves. Model: Playground 2

How to benchmark image generation models like Stable Diffusion XL

Benchmarking Stable Diffusion XL performance across latency, throughput, and cost depends on factors from hardware to model variant to inference config.

Philip Kiely

Prompt: a sleek bus driving through the mountains. Model: Playground 2

Understanding performance benchmarks for LLM inference

This guide helps you interpret LLM performance metrics to make direct comparisons on latency, throughput, and cost.

Philip Kiely

Prompt: Two racecars on the beach at sunset. Model: Playground 2.

AI infrastructure: build vs. buy

AI infrastructure, ML infrastructure, build vs. buy, model deployment

Baseten

1 2