60-90%
Estimated monthly savings on GPU spend
3.10X
LLaMa inference acceleration
3.98X
RoBERTa inference acceleration
7.86X
InceptionV3 inference acceleration