
Product | Page 2


New in April 2024

Use four new best-in-class LLMs, stream synthesized speech with XTTS, and deploy models with CI/CD.

Baseten

Prompt: the steps and entrance to a solarpunk museum

New in March 2024

Fast Mistral 7B, fractional H100 GPUs, FP8 quantization, and API endpoints for model management.

Baseten

New in February 2024

3x throughput with H100 GPUs, 40% lower SDXL latency with TensorRT, and multimodal open source models.

Baseten

Prompt: A futuristic submarine in a colorful coral reef

New in January 2024

A library for open source models, general availability for L4 GPUs, and performance benchmarking for ML inference.

Baseten

Prompt: A futuristic bullet train crossing under a waterfall with soft lighting. Model: Playground v2.

New in December 2023

Faster Mixtral inference, Playground v2 image generation, and ComfyUI pipelines as API endpoints.

Baseten

Prompt: A forest green airplane on the runway at dawn. Model: Playground v2.

New in November 2023

Switching to open source ML, a guide to model inference math, and Stability AI's new generative AI image-to-video model.

Baseten

SDXL prompt: A green sailboat in the icy sea

New in October 2023

All-new model management, a text embedding model that matches OpenAI, and misgif, the most fun you’ll have with AI all week.

Baseten

Prompt: A glowing cyberpunk car racing through an enchanted forest

New in September 2023

Mistral 7B LLM, GPU comparisons, model observability features, and an open source AI event series.

Baseten

New in August 2023

Truss' latest update addresses key ML model serving issues. Discover how to speed up SDXL inference to 3s and build ChatGPT-like apps with Llama 2 & Chainlit.

Baseten

Prompt: a heavily constructed solarpunk bridge over a chasm
