Skip to main content

Supported Endpoints

Learn how to deploy + call models from different providers on LiteLLM

🗃️ /chat/completions

3 items

📄️ /responses [Beta]

LiteLLM provides a BETA endpoint in the spec of OpenAI's /responses API

📄️ /completions

Usage

📄️ /embeddings

Quick Start

📄️ /v1/messages

Use LiteLLM to call all your LLM APIs in the Anthropic v1/messages format.

📄️ /mcp - Model Context Protocol

LiteLLM Proxy provides an MCP Gateway that allows you to use a fixed endpoint for all MCP tools and control MCP access by Key, Team.

📄️ Google AI generateContent

Use LiteLLM to call Google AI's generateContent endpoints for text generation, multimodal interactions, and streaming responses.

🗃️ /images

3 items

🗃️ /audio

2 items

🗃️ /vector_stores

1 item

🗃️ Pass-through Endpoints (Anthropic SDK, etc.)

12 items

📄️ /rerank

LiteLLM Follows the cohere api request / response for the rerank api

📄️ /assistants

Covers Threads, Messages, Assistants.

🗃️ /files

2 items

🗃️ /batches

2 items

📄️ /realtime

Use this to loadbalance across Azure + OpenAI.

🗃️ /fine_tuning

2 items

📄️ /moderations

Usage

📄️ /guardrails/apply_guardrail

Use this endpoint to directly call a guardrail configured on your LiteLLM instance. This is useful when you have services that need to directly call a guardrail.