[Preview] v1.80.11 - Google Interactions API
Deploy this version​
- Docker
- Pip
docker run litellm
docker run \
-e STORE_MODEL_IN_DB=True \
-p 4000:4000 \
docker.litellm.ai/berriai/litellm:v1.80.11.rc.1
pip install litellm
pip install litellm==1.80.11
Key Highlights​
- Gemini 3 Flash Preview - Day 0 support for Google's Gemini 3 Flash Preview with reasoning capabilities
- Stability AI Image Generation - New provider for Stability AI image generation and editing
- LiteLLM Content Filter - Built-in guardrails for harmful content, bias, and PII detection with image support
- New Provider: Venice.ai - Support for Venice.ai API via providers.json
- Unified Skills API - Skills API works across Anthropic, Vertex, Azure, and Bedrock
- Azure Sentinel Logging - New logging integration for Azure Sentinel
- Guardrails Load Balancing - Load balance between multiple guardrail providers
- Email Budget Alerts - Send email notifications when budgets are reached
- Cloudzero Integration on UI - Setup your Cloudzero Integration Directly on the UI
Cloudzero Integration on UI​
Users can now configure their Cloudzero Integration directly on the UI.
Performance: 50% Reduction in Memory Usage and Import Latency for the LiteLLM SDK​
We've completely restructured litellm.__init__.py to defer heavy imports until they're actually needed, implementing lazy loading for 109 components.
This refactoring includes 41 provider config classes, 40 utility functions, cache implementations (Redis, DualCache, InMemoryCache), HTTP handlers, logging, types, and other heavy dependencies. Heavy libraries like tiktoken and boto3 are now loaded on-demand rather than eagerly at import time.
This makes LiteLLM especially beneficial for serverless functions, Lambda deployments, and containerized environments where cold start times and memory footprint matter.
New Providers and Endpoints​
New Providers (5 new providers)​
| Provider | Supported LiteLLM Endpoints | Description |
|---|---|---|
| Stability AI | /images/generations, /images/edits | Stable Diffusion 3, SD3.5, image editing and generation |
| Venice.ai | /chat/completions, /messages, /responses | Venice.ai API integration via providers.json |
| Pydantic AI Agents | /a2a | Pydantic AI agents for A2A protocol workflows |
| VertexAI Agent Engine | /a2a | Google Vertex AI Agent Engine for agentic workflows |
| LinkUp Search | /search | LinkUp web search API integration |
New LLM API Endpoints (2 new endpoints)​
| Endpoint | Method | Description | Documentation |
|---|---|---|---|
/interactions | POST | Google Interactions API for conversational AI | Docs |
/search | POST | RAG Search API with rerankers | Docs |
New Models / Updated Models​
New Model Support (55+ new models)​
| Provider | Model | Context Window | Input ($/1M tokens) | Output ($/1M tokens) | Features |
|---|---|---|---|---|---|
| Gemini | gemini/gemini-3-flash-preview | 1M | $0.50 | $3.00 | Reasoning, vision, audio, video, PDF |
| Vertex AI | vertex_ai/gemini-3-flash-preview | 1M | $0.50 | $3.00 | Reasoning, vision, audio, video, PDF |
| Azure AI | azure_ai/deepseek-v3.2 | 164K | $0.58 | $1.68 | Reasoning, function calling, caching |
| Azure AI | azure_ai/cohere-rerank-v4.0-pro | 32K | $0.0025/query | - | Rerank |
| Azure AI | azure_ai/cohere-rerank-v4.0-fast | 32K | $0.002/query | - | Rerank |
| OpenRouter | openrouter/openai/gpt-5.2 | 400K | $1.75 | $14.00 | Reasoning, vision, caching |
| OpenRouter | openrouter/openai/gpt-5.2-pro | 400K | $21.00 | $168.00 | Reasoning, vision |
| OpenRouter | openrouter/mistralai/devstral-2512 | 262K | $0.15 | $0.60 | Function calling |
| OpenRouter | openrouter/mistralai/ministral-3b-2512 | 131K | $0.10 | $0.10 | Function calling, vision |
| OpenRouter | openrouter/mistralai/ministral-8b-2512 | 262K | $0.15 | $0.15 | Function calling, vision |
| OpenRouter | openrouter/mistralai/ministral-14b-2512 | 262K | $0.20 | $0.20 | Function calling, vision |
| OpenRouter | openrouter/mistralai/mistral-large-2512 | 262K | $0.50 | $1.50 | Function calling, vision |
| OpenAI | gpt-4o-transcribe-diarize | 16K | $6.00/audio | - | Audio transcription with diarization |
| OpenAI | gpt-image-1.5-2025-12-16 | - | Various | Various | Image generation |
| Stability | stability/sd3-large | - | - | $0.065/image | Image generation |
| Stability | stability/sd3.5-large | - | - | $0.065/image | Image generation |
| Stability | stability/stable-image-ultra | - | - | $0.08/image | Image generation |
| Stability | stability/inpaint | - | - | $0.005/image | Image editing |
| Stability | stability/outpaint | - | - | $0.004/image | Image editing |
| Bedrock | stability.stable-conservative-upscale-v1:0 | - | - | $0.40/image | Image upscaling |
| Bedrock | stability.stable-creative-upscale-v1:0 | - | - | $0.60/image | Image upscaling |
| Vertex AI | vertex_ai/deepseek-ai/deepseek-ocr-maas | - | $0.30 | $1.20 | OCR |
| LinkUp | linkup/search | - | $5.87/1K queries | - | Web search |
| LinkUp | linkup/search-deep | - | $58.67/1K queries | - | Deep web search |
| GitHub Copilot | 20+ models | Various | - | - | Chat completions |
Features​
- Gemini
- Anthropic
- Vertex AI
- Add Gemini 3 Flash Preview support - PR #18164
- Add reasoning support for gemini-3-flash-preview - PR #18175
- Fix image edit credential source - PR #18121
- Pass credentials to PredictionServiceClient for custom endpoints - PR #17757
- Fix multimodal embeddings for text + base64 image combinations - PR #18172
- Add OCR support for DeepSeek model - PR #17971
- Azure AI
- Fireworks AI
- Add reasoning param support for Fireworks AI models - PR #17967
- Bedrock
- Perplexity
- Use API-provided cost instead of manual calculation - PR #17887
- OpenAI
- GitHub Copilot
- Add github_copilot model info - PR #17858
- Custom LLM
- Add image_edit and aimage_edit support - PR #17999
Bug Fixes​
LLM API Endpoints​
Features​
- Responses API
- Add provider specific tools support - PR #17980
- Add custom headers support - PR #18036
- Fix tool calls transformation in completion bridge - PR #18226
- Use list format with input_text for tool results - PR #18257
- Add cost tracking in background mode - PR #18236
- Fix Claude code responses API bridge errors - PR #18194
- Chat Completions API
- Add support for agent skills - PR #18031
- Skills API
- Unified Skills API works across Anthropic, Vertex, Azure, Bedrock - PR #18232
- Search API
- Add new RAG Search API with rerankers - PR #18217
- Interactions API
- Image Edit API
- Add drop_params support and fix Vertex AI config - PR #18077
- General
Bugs​
- General
Management Endpoints / UI​
Features​
- Virtual Keys
- Models + Endpoints
- Vector Stores
- Logs & Spend
- SSO & Health
- CloudZero
- General
- Update UI path handling for non-root Docker - PR #17989
Bugs​
- UI Fixes
AI Integrations​
Logging​
- Azure Sentinel
- Add new Azure Sentinel Logger integration - PR #18146
- Prometheus
- Add extraction of top level metadata for custom labels - PR #18087
- Langfuse
- Fix not working log_failure_event - PR #18234
- Arize Phoenix
- Fix nested spans - PR #18102
- General
- Change extra_headers to additional_headers - PR #17950
Guardrails​
- LiteLLM Content Filter
- Guardrails API
- Support LLM tool call response checks on
/chat/completions,/v1/responses,/v1/messages- PR #17619 - Add guardrails load balancing - PR #18181
- Fix guardrails for passthrough endpoint - PR #18109
- Add headers to metadata for guardrails on pass-through endpoints - PR #17992
- Various fixes for guardrail on OpenRouter models - PR #18085
- Support LLM tool call response checks on
- Lakera
- Add monitor mode for Lakera - PR #18084
- Pillar Security
- Add masking support and MCP call support - PR #17959
- Bedrock Guardrails
Secret Managers​
- HashiCorp Vault
- UI
- Add secret manager settings controls to team management UI - PR #18149
Spend Tracking, Budgets and Rate Limiting​
- Email Budget Alerts - Send email notifications when budgets are reached - PR #17995
MCP Gateway​
- Auth Header Propagation - Add MCP auth header propagation - PR #17963
- Fix deepcopy error - Fix MCP tool call deepcopy error when processing requests - PR #18010
- Fix list tool - Fix MCP list_tools not working without database connection - PR #18161
Agent Gateway (A2A)​
- New Provider: Agent Gateway - Add pydantic ai agents support - PR #18013
- VertexAI Agent Engine - Add Vertex AI Agent Engine provider - PR #18014
- Fix model extraction - Fix get_model_from_request() to extract model ID from Vertex AI passthrough URLs - PR #18097
Performance / Loadbalancing / Reliability improvements​
- Lazy Imports - Use per-attribute lazy imports and extract shared constants - PR #17994
- Lazy Load HTTP Handlers - Lazy load http handlers - PR #17997
- Lazy Load Caches - Lazy load caches - PR #18001
- Lazy Load Types - Lazy load bedrock types, .types.utils, GuardrailItem - PR #18053, PR #18054, PR #18072
- Lazy Load Configs - Lazy load 41 configuration classes - PR #18267
- Lazy Load Client Decorators - Lazy load heavy client decorator imports - PR #18064
- Prisma Build Time - Download Prisma binaries at build time instead of runtime for security restricted environments - PR #17695
- Docker Alpine - Add libsndfile to Alpine image for ARM64 audio processing - PR #18092
- Security - Prevent LiteLLM API key leakage on /health endpoint failures - PR #18133
Documentation Updates​
- SAP Docs - Update SAP documentation - PR #17974
- Pydantic AI Agents - Add docs on using pydantic ai agents with LiteLLM A2A gateway - PR #18026
- Vertex AI Agent Engine - Add Vertex AI Agent Engine documentation - PR #18027
- Router Order - Add router order parameter documentation - PR #18045
- Secret Manager Settings - Improve secret manager settings documentation - PR #18235
- Gemini 3 Flash - Add version requirement in Gemini 3 Flash blog - PR #18227
- README - Expand Responses API section and update endpoints - PR #17354
- Amazon Nova - Add Amazon Nova to sidebar and supported models - PR #18220
- Benchmarks - Add infrastructure recommendations to benchmarks documentation - PR #18264
- Broken Links - Fix broken link corrections - PR #18104
- README Fixes - Various README improvements - PR #18206
Infrastructure / CI/CD​
- PR Templates - Add LiteLLM team PR template and CI/CD rules - PR #17983, PR #17985
- Issue Labeling - Improve issue labeling with component dropdown and more provider keywords - PR #17957
- PR Template Cleanup - Remove redundant fields from PR template - PR #17956
- Dependencies - Bump altcha-lib from 1.3.0 to 1.4.1 - PR #18017
New Contributors​
- @dongbin-lunark made their first contribution in PR #17757
- @qdrddr made their first contribution in PR #18004
- @donicrosby made their first contribution in PR #17962
- @NicolaivdSmagt made their first contribution in PR #17992
- @Reapor-Yurnero made their first contribution in PR #18085
- @jk-f5 made their first contribution in PR #18086
- @castrapel made their first contribution in PR #18077
- @dtikhonov made their first contribution in PR #17484
- @opleonnn made their first contribution in PR #18175
- @eurogig made their first contribution in PR #18084

