Salad is a fully managed orchestration platform for AI/ML deployment. With just one click, the Salad Inference Endpoints API allows you to scale inferences to infinity without configuring infrastructure.
Salad GPU nodes cost less and perform better than conventional public cloud instances.
Salad Inference Endpoints achieves 4X more inferences per dollar spent.
Never pay for more than you use. Our inference rates are industry-leading, and they get even better with volume pricing.
1,100+ Stable Diffusion images per dollar
900,000+ BERT inferences per dollar
Save with volume pricing