v1.85.1 - Gemini 3.5 Flash & Reliability Fixes

Deploy this version

Docker
Pip

docker run \
-e STORE_MODEL_IN_DB=True \
-p 4000:4000 \
docker.litellm.ai/berriai/litellm:1.85.1

pip install litellm==1.85.1

v1.85.1 is a patch release on top of v1.85.0. It adds day-0 support for Gemini 3.5 Flash and ships two reliability fixes — cross-pod spend accuracy and Vertex AI tool calling.

New Models / Updated Models

New Model Support (1 new model)

Provider	Model	Context Window	Input ($/1M tokens)	Output ($/1M tokens)	Features
Gemini / Vertex AI	`gemini/gemini-3.5-flash`, `vertex_ai/gemini-3.5-flash`	1M	$1.50	$9.00	Reasoning, vision, audio input, PDF input, prompt caching, web search, function calling, response schema

Features

Gemini / Vertex AI
- Day-0 support for Gemini 3.5 Flash on both Google AI Studio and Vertex AI - PR #28268

Bug Fixes

Vertex AI
- Omit the function_call / function_response id on Vertex Gemini 3.5+ tool turns, fixing HTTP 400 Unknown name "id" errors. Google AI Studio (gemini provider) still forwards the id on Gemini 3.5+ for strict tool-call matching - PR #28324

Spend Tracking, Budgets and Rate Limiting

Seed the Redis spend counter via SET NX instead of INCRBYFLOAT to prevent cross-pod double-seeding. On multi-pod deployments this previously caused team spend to jump to ~Nx the pod count after a Redis cache miss / TTL expiry, triggering false "Budget Crossed" alerts - PR #27854

Full Changelog

https://github.com/BerriAI/litellm/compare/v1.85.0...v1.85.1

Deploy this version​

New Models / Updated Models​

New Model Support (1 new model)​

Features​

Bug Fixes​

Spend Tracking, Budgets and Rate Limiting​

Full Changelog​

Deploy this version

New Models / Updated Models

New Model Support (1 new model)

Features

Bug Fixes

Spend Tracking, Budgets and Rate Limiting

Full Changelog