How CentML Achieved 2x Inference Speed on DeepSeek-R1 using Speculative Decoding

Feb 24, 2025

Since the release of DeepSeek-R1, the open-source community has been working to optimize its inference speed. While low-level GPU optimizations have improved performance, CentML took it a step further with speculative decoding: by repurposing DeepSeek's Multi-Token Prediction (MTP) module as a draft model and implementing EAGLE-style recursive generation, we achieved a 2x speedup, generating up to 70 tokens/second.
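To make the idea concrete, here is a minimal, greedy-acceptance sketch of speculative decoding. This is not CentML's implementation: `draft_next` and `target_next` are hypothetical stand-ins for the cheap MTP head and the full DeepSeek-R1 model, and a production system verifies all drafted positions in one batched forward pass rather than one call per token.

```python
from typing import Callable, List

def speculative_decode(
    target_next: Callable[[List[int]], int],  # greedy next token from the full model
    draft_next: Callable[[List[int]], int],   # greedy next token from the draft model
    prompt: List[int],
    max_new_tokens: int,
    k: int = 4,                               # tokens drafted per verification round
) -> List[int]:
    """Greedy speculative decoding sketch.

    The draft model proposes k tokens autoregressively; the target model then
    checks them. Tokens are accepted while they match the target's own greedy
    choice; the first mismatch is replaced by the target's token, and drafting
    resumes from there.
    """
    tokens = list(prompt)
    while len(tokens) - len(prompt) < max_new_tokens:
        # 1. Draft: cheaply propose k candidate tokens.
        draft = []
        ctx = list(tokens)
        for _ in range(k):
            t = draft_next(ctx)
            draft.append(t)
            ctx.append(t)
        # 2. Verify: accept draft tokens while the target model agrees.
        #    (A real implementation scores all k positions in a single
        #    batched target forward pass -- that is the source of the speedup.)
        for t in draft:
            expected = target_next(tokens)
            if t == expected:
                tokens.append(t)          # accepted "for free"
            else:
                tokens.append(expected)   # correction from the target model
                break
    return tokens[len(prompt) : len(prompt) + max_new_tokens]
```

When most drafted tokens are accepted, each expensive target-model pass yields several tokens instead of one, which is how a well-aligned draft head like MTP can roughly double decoding throughput.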
