Baseten Blog | Page 8
New in November 2023
Switching to open source ML, a guide to model inference math, and Stability.ai's new generative AI image-to-video model.
NVIDIA A10 vs A10G for ML model inference
The A10, an Ampere-series GPU, excels in tasks like running 7B parameter LLMs. AWS's A10G variant, similar in GPU memory & bandwidth, is mostly interchangeable.
Stable Video Diffusion now available
Stability AI announced the release of Stable Video Diffusion, marking a huge leap forward for open source novel video synthesis
GPT vs Llama: Migrate to open source LLMs seamlessly
Use ChatCompletions API to test open-source LLMs like Llama in your AI app with just three minor code modifications.
Open source alternatives for machine learning models
Building on top of open source models gives you access to a wide range of capabilities that you would otherwise lack from a black box endpoint provider.
A checklist for switching to open source ML models
Transitioning from using ML models via closed source APIs to open source ML models? This checklist provides all necessary resources for the shift.
A guide to LLM inference and performance
Learn if LLM inference is compute or memory bound to fully utilize GPU power. Get insights on better GPU resource utilization.
Pinning ML model revisions for compatibility and security
Pin versions of open source packages like PyPi's transformers to avoid breaking changes or security issues; similarly, pin model revisions for stability.
Deployment and inference for open source text embedding models
Text embedding models convert text into semantic vectors. Numerous open source models cater to search, recommendation, classification & LLM-augmented retrieval.