May 8, 2025
Accelerating AI Optimization for Enterprise Customers

Updates
Apr 29, 2025
CentML welcomes three new members to the Llama herd
Updates
Apr 22, 2025
Building a Team that Powers Innovation
Updates
Apr 10, 2025
Enterprise growth now means building with AI, not just hiring.
Updates
CentML is proud to announce our participation in the UTE Startup Perks program
Updates
Since the release of DeepSeek-R1, the open-source community has been working to optimize its inference speed. While low-level GPU optimizations have improved performance, CentML took it a step further using speculative decoding. By repurposing DeepSeek’s MTP module and implementing EAGLE-style recursive generation, we achieved a 2x speedup, generating up to 70 tokens/second.
Updates
ECR Anywhere for Cross-Cloud Container Flexibility
From vendor lock-in and security overhead to reduced agility, multi-cloud deployments present some sizeable hurdles. With a new cross-cloud solution, ECR Anywhere, developers can now eliminate the complexity of native registries, allowing for secure, seamless multi-cloud deployment of Docker images on any Kubernetes cluster. Managing containerized applications across multiple […]
Updates
Nov 6, 2024
The team is thrilled to have been accepted into the program, which supports startups working to revolutionize industries.
Updates
The CentML team is thrilled to announce the launch of the CentML Platform — a frictionless and economical AI deployment solution for enterprises and startups alike. Since ChatGPT’s launch two years ago, GenAI has reshaped industries and unlocked new possibilities. Yet, for many businesses, adopting GenAI remains challenging. High costs, complex deployments, significant compute resource […]
Updates
Learn how to deploy your applications on Snowpark Container Services.