Platform Pricing
Free Credits to all new users (worth 2 million tokens on Llama 3.3-70B)
CentML offers competitive pricing for GenAI model deployment,
with flexible options to suit a wide range of
models, from small to large-scale deployments.
Developer
Best performance - no hidden fees
- Free credits for all new users (worth 2 million tokens on Llama 3.3-70B)
- Full pay-as-you-go billing - per minute/per token
- On-demand dedicated endpoints - no rate limits
- Planner feature available for all deployments
- No daily limits
Enterprise
Custom solutions for scaling
- Custom pricing
- Unlimited rate limits
- Unlimited deployed models
- Dedicated and self-hosted deployments
- Guaranteed uptime SLA
- 24/7 tech support
- Plus all features from the Developer package
Platform Pricing Overview
Deploying Applications are calculated on a credit-based billing system, where 1 CentML credit equals 1 USD.
You can buy credits through the Platform by going to your Account page.
Serverless Endpoint usage is billed according to the total number of tokens generated and processed.
Dedicated Deployments
Dedicated deployments are charged based on the type and duration of hardware used,
following a per-minute
billing system.
Customized Plans
Looking for specialized requirements or larger-scale deployments?
We offer customizable plans to suit
enterprise needs. Contact us for details.