Introducing ‘Tally’: A Novel Method of Efficient GPU Sharing for AI Workloads

Updates

Introducing ‘Tally’: A Novel Method of Efficient GPU Sharing for AI Workloads

Tally allows multiple AI tasks to share the same GPU, allowing for superior infrastructure efficiency.

Read More

Leading Compiler Engineer and Researcher Tatiana Shpeisman Joins CentML as Director of Engineering

Updates

Leading Compiler Engineer and Researcher Tatiana Shpeisman Joins CentML as Director of Engineering

Tatiana brings more than two decades of experience in groundbreaking compiler development to the CentML team.

Read More

CServe Now on Snowflake: Deploy Secure, Optimized LLMs For Less

Updates

CServe Now on Snowflake: Deploy Secure, Optimized LLMs For Less

This groundbreaking solution allows you to self-host LLMs with 81% lower compute costs, enhanced security, and flexible model support.

Read More

A Fine-Tuning Breakthrough: CentML’s New Sylva Method Revolutionizes LLM Adaptation

Updates

A Fine-Tuning Breakthrough: CentML’s New Sylva Method Revolutionizes LLM Adaptation

Learn about CentML’s next-gen fine-tuning, which will take your LLM performance to new heights.

Read More

Get started

Let's make your LLM better! Book a Demo