Blog

LlamaGuard Now Available on the CentML Platform

Apr 29, 2025

CentML welcomes three new members to the Llama herd

Updates

Updates

CentML is a Great Place to Work

Apr 22, 2025

Building a Team that Powers Innovation

Updates

Welcome to the Team, AI Agents

Apr 10, 2025

Enterprise growth now means building with AI, not just hiring.

Updates

CentML x U of Toronto Entrepreneurship: Driving Innovation, Together

Feb 27, 2025

CentML is proud to announce our participation in the UTE Startup Perks program

Updates

How CentML Achieved 2x Inference Speed on DeepSeek-R1 using Speculative Decoding

Feb 24, 2025

Since the release of DeepSeek-R1, the open-source community has been working to optimize its inference speed. While low-level GPU optimizations have improved performance, CentML went a step further with speculative decoding. By repurposing DeepSeek’s Multi-Token Prediction (MTP) module and implementing EAGLE-style recursive generation, we achieved a 2x speedup, generating up to 70 tokens/second.

Updates

Introducing ‘ECR Anywhere’: A New Tool for Simplifying Multi-Cloud Deployments

Nov 18, 2024

ECR Anywhere for Cross-Cloud Container Flexibility. From vendor lock-in and security overhead to reduced agility, multi-cloud deployments present some sizeable […]