Skip to content

jamesmurdza/awesome-inference-hosts

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

11 Commits
 
 

Repository files navigation

List of cloud hosts for inference and fine-tuning

A list of cloud hosting services offering resources for AI inference and fine-tuning:

Hosts

Service Focus GPU Types Pricing Additional Features
Banana.dev High-throughput inference Not specified Scales to zero Deploys from GitHub repo
Fal.ai Inference, image and audio generation A100s, A10Gs, T4s Scales to zero Playgrounds and shared endpoints
Replicate.com Text, image, music, speech generation A100s, A40s, T4s Scales to zero Preconfigured models and playgrounds
Lepton.ai Hosting and inference via CLI Not specified Scales to zero Deploys HuggingFace models
TitanML.co CLI and enterprise inference deployments Not specified Not specified -
Anyscale.com Inference and fine-tuning A10s, A100s, H100s (soon) Not specified Shared endpoints for various models
Together.ai Inference and fine-tuning A40s, A100s, H100s Scales to zero 68 shared endpoints and playgrounds
Brev.dev Hosting and CLI tools A100s, H100s, others Varies, Free for existing cloud Deploys to AWS and GCP
Gradient.ai Fine-tuning via CLI/Python/NodeJS Not specified Not specified Online console for job management
Fireworks.ai CLI inference platform Not specified Varies Shared endpoints, supports LoRA addons

About

List of cloud inference hosts.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published