Hacks & projects

CI/CD for AI model deployments

In this article, we outline a continuous integration and continuous deployment (CI/CD) pipeline for deploying AI models to production.


Streaming real-time text to speech with XTTS V2

In this tutorial, we'll build a streaming endpoint for the XTTS V2 text-to-speech model with real-time narration and a 200 ms time to first chunk.

How to serve your ComfyUI model behind an API endpoint

This guide shows how to deploy ComfyUI image generation pipelines behind an API endpoint for app integration, using Truss for packaging and production deployment.

GPT vs Llama: Migrate to open source LLMs seamlessly

Use the ChatCompletions API to test open-source LLMs like Llama in your AI app with just three minor code modifications.
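
As a rough sketch of the kind of changes the article covers, the snippet below points the standard OpenAI Python client at an OpenAI-compatible endpoint serving Llama; the base URL, API key, and model name are placeholder assumptions, not values from the article.

from openai import OpenAI

# Hypothetical endpoint and credentials: redirect the ChatCompletions
# client from api.openai.com to an OpenAI-compatible Llama deployment.
client = OpenAI(
    base_url="https://example.com/v1",   # 1. swap the API base URL
    api_key="YOUR_MODEL_API_KEY",        # 2. swap the API key
)

response = client.chat.completions.create(
    model="llama-2-70b-chat",            # 3. swap the model name
    messages=[{"role": "user", "content": "Say hello in one sentence."}],
)
print(response.choices[0].message.content)

The rest of the application code that already calls the ChatCompletions API stays unchanged, which is the point of the migration path the article describes.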

Build your own open-source ChatGPT with Llama 2 and Chainlit

Llama 2 rivals GPT-3.5, the model behind ChatGPT, in quality. Chainlit makes it easy to build ChatGPT-like chat interfaces. This guide shows how to combine the two to create your own open-source ChatGPT-style app.

Build a chatbot with Llama 2 and LangChain

Build a ChatGPT-style chatbot with open-source Llama 2 and LangChain in a Python notebook.

Three techniques to adapt LLMs for any use case

Prompt engineering, embeddings with vector databases, and fine-tuning are three techniques for adapting large language models (LLMs) to your data and use case.

Serving four million Riffusion requests in two days

Riffusion is a fine-tuned version of Stable Diffusion. Baseten served more than four million Riffusion requests in a couple of days while handling top-of-Hacker-News traffic.