Model Library

Qwen2-VL-7B-Instruct (Serverless)

An advanced vision-language model purpose-built for tasks that involve both visual and textual understanding, including image captioning, visual question answering, and content generation.
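For illustration, here is a minimal sketch of running Qwen2-VL-7B-Instruct for image captioning with the Hugging Face transformers library. The catalog does not document a serving API, so local inference is assumed; the image URL and prompt are placeholders.

```python
# Minimal local-inference sketch for Qwen2-VL-7B-Instruct (assumes a
# transformers version with Qwen2-VL support plus the qwen_vl_utils helper).
from transformers import Qwen2VLForConditionalGeneration, AutoProcessor
from qwen_vl_utils import process_vision_info

model = Qwen2VLForConditionalGeneration.from_pretrained(
    "Qwen/Qwen2-VL-7B-Instruct", torch_dtype="auto", device_map="auto"
)
processor = AutoProcessor.from_pretrained("Qwen/Qwen2-VL-7B-Instruct")

# A single-turn chat message mixing an image and a text instruction.
messages = [{
    "role": "user",
    "content": [
        {"type": "image", "image": "https://example.com/photo.jpg"},  # placeholder URL
        {"type": "text", "text": "Describe this image in one sentence."},
    ],
}]

# Render the chat template and pack the vision inputs.
text = processor.apply_chat_template(messages, tokenize=False, add_generation_prompt=True)
image_inputs, video_inputs = process_vision_info(messages)
inputs = processor(
    text=[text], images=image_inputs, videos=video_inputs,
    padding=True, return_tensors="pt",
).to(model.device)

# Generate a caption and decode only the newly generated tokens.
output_ids = model.generate(**inputs, max_new_tokens=128)
trimmed = [out[len(inp):] for inp, out in zip(inputs.input_ids, output_ids)]
print(processor.batch_decode(trimmed, skip_special_tokens=True)[0])
```

The same pattern applies to the other Qwen vision-language models below; only the checkpoint name changes.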
Qwen2-VL-2B-Instruct

A powerful multimodal model that excels in visual understanding tasks, including image and video comprehension.
Qwen2.5-VL-7B-Instruct (Serverless)

A 7B-parameter multimodal model designed for complex vision-language tasks; ideal for document understanding, video summarization, and interactive applications that require both visual perception and language reasoning.
Llama 4 Scout 17B (16E) Instruct (Serverless)

A mixture-of-experts (MoE) language model that activates 17 billion of its 109 billion total parameters per token, designed for assistant-style interaction and visual reasoning.
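Several entries in this library are tagged Serverless. The catalog does not specify the endpoint protocol, but serverless model providers commonly expose an OpenAI-compatible chat completions API; the sketch below assumes that, and the base URL, credential variable, and model identifier are all hypothetical.

```python
# Hypothetical serverless call: assumes an OpenAI-compatible chat
# completions endpoint. Base URL, API-key variable, and model ID are
# placeholders, not documented by this catalog.
import os
from openai import OpenAI

client = OpenAI(
    base_url="https://api.example.com/v1",   # hypothetical endpoint
    api_key=os.environ["PROVIDER_API_KEY"],  # hypothetical credential
)

response = client.chat.completions.create(
    model="Llama-4-Scout-17B-16E-Instruct",  # assumed model identifier
    messages=[{
        "role": "user",
        "content": "In two sentences, what does a mixture-of-experts model do?",
    }],
    max_tokens=256,
)
print(response.choices[0].message.content)
```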
Llama 4 Maverick 17B (128E) Instruct (Serverless)

A high-capacity multimodal language model built on a mixture-of-experts (MoE) architecture with 128 experts and 17 billion active parameters; it supports multilingual text and image input.
Llama 3.2 11B Vision Instruct

Part of Meta's Llama 3.2 vision model family, designed for a wide range of vision-language tasks, including visual recognition, image reasoning, and captioning.
