Insights - CentML

CentML's New Platform Enables Rapid, Economical AI Deployment for All

The CentML team is thrilled to announce the launch of the CentML Platform — a frictionless and economical AI deployment […]

Updates

Introducing ‘ECR Anywhere’: A New Tool for Simplifying Multi-Cloud Deployments

ECR Anywhere for Cross-Cloud Container Flexibility From vendor lock-in and security overhead to reduced agility, multi-cloud deployments present some sizeable […]

Updates

CentML Joins NVIDIA Inception Program

The team is thrilled to have been accepted into the program, which supports startups working to revolutionize industries.

Updates

Building Modern GenAI: How We Packaged CServe as a Snowflake Native App

Learn how to deploy your applications on Snowpark Container Services.

Updates

Introducing ‘Tally’: A Novel Method of Efficient GPU Sharing for AI Workloads

Tally allows multiple AI tasks to share the same GPU, allowing for superior infrastructure efficiency.

Updates

Leading Compiler Engineer and Researcher Tatiana Shpeisman Joins CentML as Director of Engineering

Tatiana brings more than two decades of experience in groundbreaking compiler development to the CentML team.

Updates

CServe Now on Snowflake: Deploy Secure, Optimized LLMs For Less

This groundbreaking solution allows you to self-host LLMs with 81% lower compute costs, enhanced security, and flexible model support.

Updates

A Fine-Tuning Breakthrough: CentML’s New Sylva Method Revolutionizes LLM Adaptation

Learn about CentML’s next-gen fine-tuning, which will take your LLM performance to new heights.

Guides

Enterprise LLMs: Comparing Leading Models

Uncover the unique features of leading enterprise LLMs to help you make smart deployment choices.

Updates

Take Our New ‘Simple Sidecar’ Solution for a Spin

Discover how our new Simple Sidecar tool can streamline sidecar implementation, reducing friction and accelerating app development across any cloud environment.

Updates

CentML Named ‘Rising Star’ by the Intelligent Applications Summit

We are thrilled to announce that CentML has been recognized as a 'Rising Star' in the 2024 Intelligent Applications Top 40 list (IA40).

Case Studies

From Constraint to Competitive Edge: Exploring EquoAI’s Tech Leap with CentML

In this case study, we take a closer look at how EquoAI reduced its LLM deployment costs, improved deployment efficiency, […]

Updates

Optimize or Overpay? Navigating Cloud GPU Choices for ML Models

DeepView accurately predicts ML model performance across various cloud GPUs, helping you choose the most cost-effective option. It reveals whether […]

Updates

How to profile a Hugging Face model with DeepView

Hugging Face has become a leading platform for natural language processing (NLP) and machine learning (ML) enthusiasts. It provides a […]

Updates

Introducing DeepView: Visualize your neural network performance

Optimize PyTorch neural networks, peak performance, and cost efficiency for your deep learning projects

Case Studies

Maximizing LLM training and inference efficiency using CentML on OCI

In partnership with CentML, Oracle has developed innovative solutions to meet the growing demand for high-performance NVIDIA GPUs for machine […]

Case Studies

GenAI company cuts training costs by 36% with CentML

A growing generative AI company partnered with CentML to accelerate their API-as-a-service and iterate with foundational models—all without using top-of-the-line […]

Guides

AI Inference: Understanding the Cornerstone of Modern AI

AI inference forms the foundation of modern AI applications, transforming trained models into tools for actionable insight and real-world solutions. […]

Guides

The Basics of LLM Training

As AI progresses at a blazing pace, LLMs have emerged as a go-to tool for text generation, from code to […]

Guides

Mastering Enterprise AI to Scale Innovation

AI has already left an indelible mark on the corporate world. And although it’s become the buzzword of the times, […]

1 2 3 … 11 Next

Get started

Let's make your LLM better! Book a Demo