Baseten Hybrid: control and flexibility in your cloud and ours
Get the performance of a managed service in your own VPC, with seamless overflow to Baseten Cloud.
Baseten built for maximum flexibility
Flex your cloud
Gain seamless multi-cloud flexibility, use existing cloud credits, avoid vendor lock-in, and maintain SLAs during traffic spikes.
Reduce latency
Leverage our optimized infrastructure for ultra-low-latency inference with elastic, traffic-driven autoscaling.
Meet compliance
Host your data where you need it, ensuring compliance with strict government and industry standards.
Choosing Baseten Hybrid, Self-hosted, or Cloud
Feature | Baseten Hybrid | Baseten Self-hosted | Baseten Cloud |
---|---|---|---|
Data control | Full data control in your VPC; managed data security on Baseten Cloud | Full data control | Managed data security |
Data residency requirements | Region-locked data and deployments with multi-region support | Region-locked data and deployments | Multi-region support |
Compute capacity | Leverage existing resources or Baseten compute for overflow | Leverage existing in-house resources | Leverage on-demand compute with SOTA GPUs |
Cost efficiency | Use in-house compute whenever available for optimized costs | Utilize dedicated resources without extra spend on hardware | Gain cost-effective, on-demand compute |
Integration with internal systems | Custom or out-of-the-box integrations | Custom or out-of-the-box integrations | Easy integration via Baseten's ecosystem |
Performance optimization | SOTA on-chip model performance and low network latency | SOTA on-chip model performance and low network latency | SOTA on-chip model performance and low network latency |
Scalability | High, tailored scalability with flex capacity on Baseten Cloud | High, tailored scalability | High, flexible scaling options |
Security and compliance | Adhere to custom policies or lean on our SOC 2 Type II and HIPAA compliance | Adhere to custom organizational policies | SOC 2 Type II certified and HIPAA compliant |
Support and maintenance | Comprehensive support and managed services | Comprehensive support and managed services | Comprehensive support and managed services |
Utilization of existing cloud commits | Use credits or commits | Use credits or commits | Spend down existing cloud commits |
Blend the best of Self-hosted and Cloud deployments.
Flex on-demand
Use internal resources whenever they’re available, and transition seamlessly to Baseten Cloud when necessary.
Control data residency
Keep data where you need it. Host in your VPC, or use Baseten Cloud for data with less stringent requirements.
Scale elastically
Rapidly scale up or down automatically based on traffic, future-proofing your infrastructure against traffic bursts.
Meet compliance
Self-host data when required or lean on the SOC 2 Type II, HIPAA, and GDPR compliance of Baseten Cloud.
Optimize costs
Use existing hardware or cloud commits, and take advantage of Baseten’s transparent, on-demand pricing for overflow.
Ship faster
Leverage our performant, scalable, secure ML inference infrastructure without the time investment needed to build your own.
Low cost, high performance.
In your cloud and ours.
Lower infra costs
Baseten Hybrid optimizes resource utilization. Reduce hardware and operational costs while maintaining high performance.
Meet advanced compliance needs
Ensure your AI infrastructure complies with your policies along with regional and industry-specific regulations.
Save development time
Baseten's platform simplifies AI model management and deployment, cutting the time your engineering teams spend managing infrastructure.
Get started with Baseten Hybrid
Guides and examples
Learn about the Baseten platform
Read our documentation to understand how the Baseten platform operates in both Cloud and Self-hosted deployments.
Deploy a model in minutes
Experience the Baseten UI and inference capabilities before customizing your deployments.
Security and compliance
Learn how Baseten ensures security and compliance in Hybrid, Self-hosted, and Cloud deployments.