Apr 29, 2025
CentML welcomes three new members to the Llama herd

Apr 29, 2025
CentML welcomes three new members to the Llama herd
Updates
Apr 22, 2025
Building a Team that Powers Innovation
Updates
Apr 10, 2025
Enterprise growth now means building with AI, not just hiring.
Updates
CentML is proud to announce our participation in the UTE Startup Perks program
Updates
Since the release of DeepSeek-R1, the open-source community has been working to optimize its inference speed. While low-level GPU optimizations have improved performance, CentML took it a step further using speculative decoding. By repurposing DeepSeek’s MTP module and implementing EAGLE-style recursive generation, we achieved a 2x speedup, generating up to 70 tokens/second.
Updates
ECR Anywhere for Cross-Cloud Container Flexibility From vendor lock-in and security overhead to reduced agility, multi-cloud deployments present some sizeable […]
Updates
Nov 6, 2024
The team is thrilled to have been accepted into the program, which supports startups working to revolutionize industries.
Updates
The CentML team is thrilled to announce the launch of the CentML Platform — a frictionless and economical AI deployment […]
Updates
Learn how to deploy your applications on Snowpark Container Services.
Updates
Tally allows multiple AI tasks to share the same GPU, allowing for superior infrastructure efficiency.