Insights
CentML's New Platform Enables Rapid, Economical AI Deployment for All
The CentML team is thrilled to announce the launch of the CentML Platform — a frictionless and economical AI deployment […]
Introducing ‘ECR Anywhere’: A New Tool for Simplifying Multi-Cloud Deployments
ECR Anywhere for Cross-Cloud Container Flexibility. From vendor lock-in and security overhead to reduced agility, multi-cloud deployments present some sizeable […]
CentML Joins NVIDIA Inception Program
The team is thrilled to have been accepted into the program, which supports startups working to revolutionize industries.
Building Modern GenAI: How We Packaged CServe as a Snowflake Native App
Learn how to deploy your applications on Snowpark Container Services.
Introducing ‘Tally’: A Novel Method of Efficient GPU Sharing for AI Workloads
Tally lets multiple AI tasks share the same GPU, improving infrastructure efficiency.
Leading Compiler Engineer and Researcher Tatiana Shpeisman Joins CentML as Director of Engineering
Tatiana brings more than two decades of experience in groundbreaking compiler development to the CentML team.
CServe Now on Snowflake: Deploy Secure, Optimized LLMs For Less
This groundbreaking solution allows you to self-host LLMs with 81% lower compute costs, enhanced security, and flexible model support.
A Fine-Tuning Breakthrough: CentML’s New Sylva Method Revolutionizes LLM Adaptation
Learn about CentML’s next-gen fine-tuning, which will take your LLM performance to new heights.
Enterprise LLMs: Comparing Leading Models
Uncover the unique features of leading enterprise LLMs to help you make smart deployment choices.
Take Our New ‘Simple Sidecar’ Solution for a Spin
Discover how our new Simple Sidecar tool can streamline sidecar implementation, reducing friction and accelerating app development across any cloud environment.
CentML Named ‘Rising Star’ by the Intelligent Applications Summit
We are thrilled to announce that CentML has been recognized as a 'Rising Star' in the 2024 Intelligent Applications Top 40 list (IA40).
From Constraint to Competitive Edge: Exploring EquoAI’s Tech Leap with CentML
In this case study, we take a closer look at how EquoAI reduced its LLM deployment costs, improved deployment efficiency, […]
Optimize or Overpay? Navigating Cloud GPU Choices for ML Models
DeepView accurately predicts ML model performance across various cloud GPUs, helping you choose the most cost-effective option. It reveals whether […]
How to profile a Hugging Face model with DeepView
Hugging Face has become a leading platform for natural language processing (NLP) and machine learning (ML) enthusiasts. It provides a […]
Introducing DeepView: Visualize your neural network performance
Optimize your PyTorch neural networks for peak performance and cost efficiency in your deep learning projects.
Maximizing LLM training and inference efficiency using CentML on OCI
In partnership with CentML, Oracle has developed innovative solutions to meet the growing demand for high-performance NVIDIA GPUs for machine […]
GenAI company cuts training costs by 36% with CentML
A growing generative AI company partnered with CentML to accelerate their API-as-a-service and iterate with foundational models—all without using top-of-the-line […]
Building Better AI Infrastructure
Learn how to build robust, scalable AI infrastructure that maximizes performance, conserves resources, and future-proofs your projects.