Pricing

CentML offers flexible pricing options for small to
large-scale model deployment on a wide range of accelerators

Contact Us for Custom Options

Serverless Endpoints

Serverless endpoint usage is billed according to the total number of tokens generated and processed

Dedicated Deployments

Pricing for dedicated deployments is based on the type and duration of hardware used, following a per-minute billing system

Customized Pricing

Looking for specialized requirements or larger-scale deployments? We offer customizable plans to suit enterprise needs. Contact us for details.