Skip to main content

v1.85.1 - Gemini 3.5 Flash & Reliability Fixes

Deploy this version​

docker run \
-e STORE_MODEL_IN_DB=True \
-p 4000:4000 \
docker.litellm.ai/berriai/litellm:1.85.1

v1.85.1 is a patch release on top of v1.85.0. It adds day-0 support for Gemini 3.5 Flash and ships two reliability fixes — cross-pod spend accuracy and Vertex AI tool calling.

New Models / Updated Models​

New Model Support (1 new model)​

ProviderModelContext WindowInput ($/1M tokens)Output ($/1M tokens)Features
Gemini / Vertex AIgemini/gemini-3.5-flash, vertex_ai/gemini-3.5-flash1M$1.50$9.00Reasoning, vision, audio input, PDF input, prompt caching, web search, function calling, response schema

Features​

Bug Fixes​

  • Vertex AI
    • Omit the function_call / function_response id on Vertex Gemini 3.5+ tool turns, fixing HTTP 400 Unknown name "id" errors. Google AI Studio (gemini provider) still forwards the id on Gemini 3.5+ for strict tool-call matching - PR #28324

Spend Tracking, Budgets and Rate Limiting​

  • Seed the Redis spend counter via SET NX instead of INCRBYFLOAT to prevent cross-pod double-seeding. On multi-pod deployments this previously caused team spend to jump to ~Nx the pod count after a Redis cache miss / TTL expiry, triggering false "Budget Crossed" alerts - PR #27854

Full Changelog​

https://github.com/BerriAI/litellm/compare/v1.85.0...v1.85.1

🚅
LiteLLM Enterprise
SSO/SAML, audit logs, spend tracking, multi-team management, and guardrails — built for production.
Learn more →