Skip to main content

[Preview] v1.80.11 - Google Interactions API

Krrish Dholakia
CEO, LiteLLM
Ishaan Jaff
CTO, LiteLLM

Deploy this version​

docker run litellm
docker run \
-e STORE_MODEL_IN_DB=True \
-p 4000:4000 \
docker.litellm.ai/berriai/litellm:v1.80.11.rc.1

Key Highlights​


Cloudzero Integration on UI​

Users can now configure their Cloudzero Integration directly on the UI.


Performance: 50% Reduction in Memory Usage and Import Latency for the LiteLLM SDK​

We've completely restructured litellm.__init__.py to defer heavy imports until they're actually needed, implementing lazy loading for 109 components.

This refactoring includes 41 provider config classes, 40 utility functions, cache implementations (Redis, DualCache, InMemoryCache), HTTP handlers, logging, types, and other heavy dependencies. Heavy libraries like tiktoken and boto3 are now loaded on-demand rather than eagerly at import time.

This makes LiteLLM especially beneficial for serverless functions, Lambda deployments, and containerized environments where cold start times and memory footprint matter.


New Providers and Endpoints​

New Providers (5 new providers)​

ProviderSupported LiteLLM EndpointsDescription
Stability AI/images/generations, /images/editsStable Diffusion 3, SD3.5, image editing and generation
Venice.ai/chat/completions, /messages, /responsesVenice.ai API integration via providers.json
Pydantic AI Agents/a2aPydantic AI agents for A2A protocol workflows
VertexAI Agent Engine/a2aGoogle Vertex AI Agent Engine for agentic workflows
LinkUp Search/searchLinkUp web search API integration

New LLM API Endpoints (2 new endpoints)​

EndpointMethodDescriptionDocumentation
/interactionsPOSTGoogle Interactions API for conversational AIDocs
/searchPOSTRAG Search API with rerankersDocs

New Models / Updated Models​

New Model Support (55+ new models)​

ProviderModelContext WindowInput ($/1M tokens)Output ($/1M tokens)Features
Geminigemini/gemini-3-flash-preview1M$0.50$3.00Reasoning, vision, audio, video, PDF
Vertex AIvertex_ai/gemini-3-flash-preview1M$0.50$3.00Reasoning, vision, audio, video, PDF
Azure AIazure_ai/deepseek-v3.2164K$0.58$1.68Reasoning, function calling, caching
Azure AIazure_ai/cohere-rerank-v4.0-pro32K$0.0025/query-Rerank
Azure AIazure_ai/cohere-rerank-v4.0-fast32K$0.002/query-Rerank
OpenRouteropenrouter/openai/gpt-5.2400K$1.75$14.00Reasoning, vision, caching
OpenRouteropenrouter/openai/gpt-5.2-pro400K$21.00$168.00Reasoning, vision
OpenRouteropenrouter/mistralai/devstral-2512262K$0.15$0.60Function calling
OpenRouteropenrouter/mistralai/ministral-3b-2512131K$0.10$0.10Function calling, vision
OpenRouteropenrouter/mistralai/ministral-8b-2512262K$0.15$0.15Function calling, vision
OpenRouteropenrouter/mistralai/ministral-14b-2512262K$0.20$0.20Function calling, vision
OpenRouteropenrouter/mistralai/mistral-large-2512262K$0.50$1.50Function calling, vision
OpenAIgpt-4o-transcribe-diarize16K$6.00/audio-Audio transcription with diarization
OpenAIgpt-image-1.5-2025-12-16-VariousVariousImage generation
Stabilitystability/sd3-large--$0.065/imageImage generation
Stabilitystability/sd3.5-large--$0.065/imageImage generation
Stabilitystability/stable-image-ultra--$0.08/imageImage generation
Stabilitystability/inpaint--$0.005/imageImage editing
Stabilitystability/outpaint--$0.004/imageImage editing
Bedrockstability.stable-conservative-upscale-v1:0--$0.40/imageImage upscaling
Bedrockstability.stable-creative-upscale-v1:0--$0.60/imageImage upscaling
Vertex AIvertex_ai/deepseek-ai/deepseek-ocr-maas-$0.30$1.20OCR
LinkUplinkup/search-$5.87/1K queries-Web search
LinkUplinkup/search-deep-$58.67/1K queries-Deep web search
GitHub Copilot20+ modelsVarious--Chat completions

Features​

Bug Fixes​

  • Gemini
    • Fix pricing for Gemini 3 Flash on Vertex AI - PR #18202
    • Add output_cost_per_image_token for gemini-2.5-flash-image models - PR #18156
    • Fix properties should be non-empty for OBJECT type - PR #18237
  • Qwen
    • Add qwen3-embedding-8b input per token price - PR #18018
  • General
    • Fix image URL handling - PR #18139
    • Support Signed URLs with Query Parameters in Image Processing - PR #17976
    • Add none to encoding_format instead of omitting it - PR #18042

LLM API Endpoints​

Features​

Bugs​

  • General
    • Fix basemodel import in guardrail translation - PR #17977
    • Fix No module named 'fastapi' error - PR #18239

Management Endpoints / UI​

Features​

  • Virtual Keys
    • Add master key rotation for credentials table - PR #17952
    • Fix tag management to preserve encrypted fields in litellm_params - PR #17484
    • Fix key delete and regenerate permissions - PR #18214
  • Models + Endpoints
    • Add Models Conditional Rendering in UI - PR #18071
    • Add Health Check Model for Wildcard Model in UI - PR #18269
    • Auto Resolve Vector Store Embedding Model Config - PR #18167
  • Vector Stores
    • Add Milvus Vector Store UI support - PR #18030
    • Persist Vector Store Settings in Team Update - PR #18274
  • Logs & Spend
    • Add LiteLLM Overhead to Logs - PR #18033
    • Show LiteLLM Overhead in Logs UI - PR #18034
    • Resolve Team ID to Team Alias in Usage Page - PR #18275
    • Fix Usage Page Top Key View Button Visibility - PR #18203
  • SSO & Health
    • Add SSO Readiness Health Check - PR #18078
    • Fix /health/test_connection to resolve env variables like /chat/completions - PR #17752
  • CloudZero
  • General
    • Update UI path handling for non-root Docker - PR #17989

Bugs​

  • UI Fixes
    • Fix Login Page Failed To Parse JSON Error - PR #18159
    • Fix new user route user_id collision handling - PR #17559
    • Fix Callback Environment Variables Casing - PR #17912

AI Integrations​

Logging​

Guardrails​

Secret Managers​


Spend Tracking, Budgets and Rate Limiting​

  • Email Budget Alerts - Send email notifications when budgets are reached - PR #17995

MCP Gateway​

  • Auth Header Propagation - Add MCP auth header propagation - PR #17963
  • Fix deepcopy error - Fix MCP tool call deepcopy error when processing requests - PR #18010
  • Fix list tool - Fix MCP list_tools not working without database connection - PR #18161

Agent Gateway (A2A)​

  • New Provider: Agent Gateway - Add pydantic ai agents support - PR #18013
  • VertexAI Agent Engine - Add Vertex AI Agent Engine provider - PR #18014
  • Fix model extraction - Fix get_model_from_request() to extract model ID from Vertex AI passthrough URLs - PR #18097

Performance / Loadbalancing / Reliability improvements​

  • Lazy Imports - Use per-attribute lazy imports and extract shared constants - PR #17994
  • Lazy Load HTTP Handlers - Lazy load http handlers - PR #17997
  • Lazy Load Caches - Lazy load caches - PR #18001
  • Lazy Load Types - Lazy load bedrock types, .types.utils, GuardrailItem - PR #18053, PR #18054, PR #18072
  • Lazy Load Configs - Lazy load 41 configuration classes - PR #18267
  • Lazy Load Client Decorators - Lazy load heavy client decorator imports - PR #18064
  • Prisma Build Time - Download Prisma binaries at build time instead of runtime for security restricted environments - PR #17695
  • Docker Alpine - Add libsndfile to Alpine image for ARM64 audio processing - PR #18092
  • Security - Prevent LiteLLM API key leakage on /health endpoint failures - PR #18133

Documentation Updates​

  • SAP Docs - Update SAP documentation - PR #17974
  • Pydantic AI Agents - Add docs on using pydantic ai agents with LiteLLM A2A gateway - PR #18026
  • Vertex AI Agent Engine - Add Vertex AI Agent Engine documentation - PR #18027
  • Router Order - Add router order parameter documentation - PR #18045
  • Secret Manager Settings - Improve secret manager settings documentation - PR #18235
  • Gemini 3 Flash - Add version requirement in Gemini 3 Flash blog - PR #18227
  • README - Expand Responses API section and update endpoints - PR #17354
  • Amazon Nova - Add Amazon Nova to sidebar and supported models - PR #18220
  • Benchmarks - Add infrastructure recommendations to benchmarks documentation - PR #18264
  • Broken Links - Fix broken link corrections - PR #18104
  • README Fixes - Various README improvements - PR #18206

Infrastructure / CI/CD​

  • PR Templates - Add LiteLLM team PR template and CI/CD rules - PR #17983, PR #17985
  • Issue Labeling - Improve issue labeling with component dropdown and more provider keywords - PR #17957
  • PR Template Cleanup - Remove redundant fields from PR template - PR #17956
  • Dependencies - Bump altcha-lib from 1.3.0 to 1.4.1 - PR #18017

New Contributors​

  • @dongbin-lunark made their first contribution in PR #17757
  • @qdrddr made their first contribution in PR #18004
  • @donicrosby made their first contribution in PR #17962
  • @NicolaivdSmagt made their first contribution in PR #17992
  • @Reapor-Yurnero made their first contribution in PR #18085
  • @jk-f5 made their first contribution in PR #18086
  • @castrapel made their first contribution in PR #18077
  • @dtikhonov made their first contribution in PR #17484
  • @opleonnn made their first contribution in PR #18175
  • @eurogig made their first contribution in PR #18084

Full Changelog​

View complete changelog on GitHub