v1.84.1 - Gemini 3.5 Flash & Reliability Fixes
Deploy this version

- Docker

      docker run \
        -e STORE_MODEL_IN_DB=True \
        -p 4000:4000 \
        docker.litellm.ai/berriai/litellm:1.84.1

- Pip

      pip install litellm==1.84.1

v1.84.1 is a patch release on top of v1.84.0. It adds day-0 support for Gemini 3.5 Flash and ships two reliability fixes: cross-pod spend accuracy and Vertex AI tool calling.
New Models / Updated Models
New Model Support (1 new model)
| Provider | Model | Context Window | Input ($/1M tokens) | Output ($/1M tokens) | Features |
|---|---|---|---|---|---|
| Gemini / Vertex AI | gemini/gemini-3.5-flash, vertex_ai/gemini-3.5-flash | 1M | $1.50 | $9.00 | Reasoning, vision, audio input, PDF input, prompt caching, web search, function calling, response schema |
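To make the per-token prices in the table concrete, here is a minimal sketch of a flat-rate cost estimate. The helper name `estimate_cost_usd` is hypothetical, and the calculation assumes the listed prices apply uniformly; it ignores prompt-caching discounts and any tiered pricing, which the table does not describe.

```python
from decimal import Decimal

# List prices copied from the table above, in USD per 1M tokens.
INPUT_USD_PER_1M = Decimal("1.50")
OUTPUT_USD_PER_1M = Decimal("9.00")


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> Decimal:
    """Hypothetical helper: flat-rate estimate for one request.

    Assumes no caching discount and no tiered pricing.
    """
    per_million = Decimal(1_000_000)
    total = (
        Decimal(input_tokens) * INPUT_USD_PER_1M
        + Decimal(output_tokens) * OUTPUT_USD_PER_1M
    )
    return total / per_million


# 200k input tokens and 10k output tokens at the listed rates.
print(estimate_cost_usd(200_000, 10_000))
```

`Decimal` is used here so the arithmetic stays exact for currency-style values; a float-based version would work equally well for rough estimates.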
Features
- Gemini / Vertex AI
  - Day-0 support for Gemini 3.5 Flash on both Google AI Studio and Vertex AI - PR #28268
Bug Fixes
- Vertex AI
  - Omit the `function_call` / `function_response` `id` on Vertex Gemini 3.5+ tool turns, fixing HTTP 400 `Unknown name "id"` errors. Google AI Studio (`gemini` provider) still forwards the `id` on Gemini 3.5+ for strict tool-call matching - PR #28324
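The provider-dependent handling of the tool-call `id` described above can be sketched roughly as follows. This is an illustration only, not LiteLLM's actual code: the helper `build_function_call_part`, the provider strings, and the exact shape of the dictionary are assumptions, and the sketch assumes the model is already known to be Gemini 3.5 or later.

```python
from typing import Any, Dict, Optional


def build_function_call_part(
    provider: str,
    name: str,
    args: Dict[str, Any],
    call_id: Optional[str],
) -> Dict[str, Any]:
    """Hypothetical helper: build a tool-call part for a Gemini 3.5+ request.

    provider is assumed to be "gemini" (Google AI Studio) or "vertex_ai".
    """
    function_call: Dict[str, Any] = {"name": name, "args": args}
    # Per the fix above: Vertex AI rejects an "id" field with HTTP 400,
    # while Google AI Studio keeps it for strict tool-call matching.
    if provider == "gemini" and call_id is not None:
        function_call["id"] = call_id
    return {"function_call": function_call}


studio_part = build_function_call_part("gemini", "get_weather", {"city": "Paris"}, "call_1")
vertex_part = build_function_call_part("vertex_ai", "get_weather", {"city": "Paris"}, "call_1")
print(studio_part)
print(vertex_part)
```

The same rule would apply to the matching `function_response` part; it is left out here to keep the sketch short.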
Spend Tracking, Budgets and Rate Limiting
- Seed the Redis spend counter via `SET NX` instead of `INCRBYFLOAT` to prevent cross-pod double-seeding. On multi-pod deployments this previously caused team `spend` to jump to roughly N times its actual value (N being the pod count) after a Redis cache miss / TTL expiry, triggering false "Budget Crossed" alerts - PR #27854
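The difference between the two seeding strategies can be illustrated with a small in-memory simulation. This is a sketch under stated assumptions, not LiteLLM's implementation: `FakeRedis` is a stand-in that mimics only the semantics of `INCRBYFLOAT` and `SET ... NX`, the key name is made up, and the race is modeled by letting every pod observe the cache miss before any pod has seeded the counter.

```python
class FakeRedis:
    """Tiny in-memory stand-in for the two Redis commands discussed above."""

    def __init__(self):
        self.data = {}

    def incrbyfloat(self, key, amount):
        # INCRBYFLOAT treats a missing key as 0 and always adds the amount.
        self.data[key] = self.data.get(key, 0.0) + amount
        return self.data[key]

    def set_nx(self, key, value):
        # SET key value NX writes only when the key does not exist yet.
        if key in self.data:
            return False
        self.data[key] = value
        return True


def seed_after_cache_miss(store, pods, db_spend, use_set_nx):
    """Simulate `pods` pods seeding the counter from the database value."""
    key = "spend:team:example"  # hypothetical key name
    # Every pod checks for the key before any of them has written it.
    pods_that_saw_miss = [pod for pod in range(pods) if key not in store.data]
    for _ in pods_that_saw_miss:
        if use_set_nx:
            store.set_nx(key, db_spend)
        else:
            store.incrbyfloat(key, db_spend)
    return store.data[key]


print(seed_after_cache_miss(FakeRedis(), pods=3, db_spend=10.0, use_set_nx=False))  # 30.0
print(seed_after_cache_miss(FakeRedis(), pods=3, db_spend=10.0, use_set_nx=True))   # 10.0
```

With three pods and a true spend of 10.0, additive seeding leaves the counter at 30.0, while the set-if-absent variant lets only the first write win and keeps it at 10.0.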
Full Changelog
https://github.com/BerriAI/litellm/compare/v1.84.0...v1.84.1