Blog

All

blog post

Updates

How to profile a Hugging Face model with DeepView

Mar 18, 2024

Hugging Face has become a leading platform for natural language processing (NLP) and machine learning (ML) enthusiasts. It provides a large repository of pre-trained models and tools for developing advanced applications. But before you start using an ML model, you need to profile it. Our open-source tool, DeepView, can help you analyze the behavior and […]

blog post

Case Studies

Maximizing LLM training and inference efficiency using CentML on OCI

Feb 12, 2024

In partnership with CentML, Oracle has developed innovative solutions to meet the growing demand for high-performance NVIDIA GPUs for machine learning (ML) model training and inference. Utilizing CentML’s state-of-the-art ML optimization software and Oracle Cloud Infrastructure (OCI), the collaboration has achieved significant performance improvements for both training and inference tasks, specifically with the LLaMa-V2 and […]

blog post

Case Studies

GenAI company cuts training costs by 36% with CentML

Feb 5, 2024

A growing generative AI company partnered with CentML to accelerate their API-as-a-service and iterate with foundational models—all without using top-of-the-line NVIDIA GPUs like the A100. The challenge A growing generative AI company realized that their modern Large Language Models (LLMs) needed powerful GPUs for pre-training, fine-tuning, and inference. They used the Hugging Face Trainer to […]

blog post

Updates

Charting a secure path amidst AI legal challenge

Jan 5, 2024

The New York Times has filed a lawsuit against OpenAI and Microsoft over alleged copyright infringement in AI model training. The generative AI is at a crossroads. The implications are clear: content creators recognize the value of their work, lawyers see an opportunity to challenge tech giants like Microsoft and Google, and enterprises are seeking […]

blog post

Updates

Introducing CServe: Reduce LLM deployment cost by more than 50%

Dec 18, 2023

tl;dr: We’re excited to introduce CServe—an easy-to-deploy, highly efficient, and low-cost serving framework for LLMs to help you cut your operational costs in half while optimizing for both server-side throughput and client-side latency constraints. Generative AI is BIG. We’ve witnessed its power with ChatGPT. But if AI is the future that can benefit us all, […]

blog post

Updates

CentML Secures $27 Million Funding to Realize its Vision in AI Model Optimization

Nov 23, 2023

October 25, 2023 – CentML, a software platform that dramatically improves the performance and cost of deploying ML models, today announced the completion of a $27 million seed round. The round was led by Gradient Ventures, Google’s AI-focused venture fund, with participation from Radical Ventures, NVIDIA, Deloitte Ventures, and Thomson Reuters Ventures. As AI and […]

Get started

Let's make your LLM better!

Book a Demo