Apr 10, 2025
Enterprise growth now means building with AI, not just hiring.

Apr 10, 2025
Enterprise growth now means building with AI, not just hiring.
Guides
Learn how we automated the search and get our top sessions for GTC 2025!
Guides
Discover how CentML optimized DeepSeek-R1 AWQ for record-breaking inference speed using Hidet
Updates
CentML is proud to announce our participation in the UTE Startup Perks program
Updates
Since the release of DeepSeek-R1, the open-source community has been working to optimize its inference speed. While low-level GPU optimizations have improved performance, CentML took it a step further using speculative decoding. By repurposing DeepSeek’s MTP module and implementing EAGLE-style recursive generation, we achieved a 2x speedup, generating up to 70 tokens/second.
Guides
AI inference forms the foundation of modern AI applications, transforming trained models into tools for actionable insight and real-world solutions. […]
Guides
Nov 22, 2024
As AI progresses at a blazing pace, LLMs have emerged as a go-to tool for text generation, from code to […]
Guides
Nov 21, 2024
AI has already left an indelible mark on the corporate world. And although it’s become the buzzword of the times, […]
Updates
ECR Anywhere for Cross-Cloud Container Flexibility From vendor lock-in and security overhead to reduced agility, multi-cloud deployments present some sizeable […]
Updates
Nov 6, 2024
The team is thrilled to have been accepted into the program, which supports startups working to revolutionize industries.
Updates
The CentML team is thrilled to announce the launch of the CentML Platform — a frictionless and economical AI deployment […]