Early Access: Announcing B200s on Baseten

We're thrilled to announce early access to NVIDIA B200 GPUs on Baseten!

From benchmarks on models like DeepSeek R1, Llama 4, and Qwen, we’re already seeing 5x higher throughput, over 2x better cost per token, and 38% lower latency—powering use cases from code generation to search and more.

If you want to start using B200s, you can reach out to our team for access here.