Skip to main content

[Preview] v1.80.10.rc.1 - Agent Gateway & A2A Cost Tracking

Krrish Dholakia
CEO, LiteLLM
Ishaan Jaff
CTO, LiteLLM

Deploy this version​

docker run litellm
docker run \
-e STORE_MODEL_IN_DB=True \
-p 4000:4000 \
ghcr.io/berriai/litellm:v1.80.10.rc.1

Key Highlights​


Agent (A2A) Usage UI​

Users can now filter usage statistics by agents, providing the same granular filtering capabilities available for teams, organizations, and customers.

Details:

  • Filter usage analytics, spend logs, and activity metrics by agent ID
  • View breakdowns on a per-agent basis
  • Consistent filtering experience across all usage and analytics views

New Providers and Endpoints​

New Providers (5 new providers)​

ProviderSupported LiteLLM EndpointsDescription
SAP Gen AI Hub/chat/completions, /messages, /responsesSAP Generative AI Hub integration for enterprise AI
LangGraph/chat/completions, /messages, /responses, /a2aLangGraph agents for agentic workflows
Azure AI Foundry Agents/chat/completions, /messages, /responses, /a2aAzure AI Foundry Agents for enterprise agent deployments
Voyage AI Rerank/rerankVoyage AI rerank models support
Fireworks AI Rerank/rerankFireworks AI rerank endpoint support

New LLM API Endpoints (4 new endpoints)​

EndpointMethodDescriptionDocumentation
/containers/{id}/filesGETList files in a containerDocs
/containers/{id}/files/{file_id}GETRetrieve container file metadataDocs
/containers/{id}/files/{file_id}DELETEDelete a file from a containerDocs
/containers/{id}/files/{file_id}/contentGETRetrieve container file contentDocs

New Models / Updated Models​

New Model Support (270+ new models)​

ProviderModelContext WindowInput ($/1M tokens)Output ($/1M tokens)Features
OpenAIgpt-5.2400K$1.75$14.00Reasoning, vision, PDF, caching
OpenAIgpt-5.2-pro400K$21.00$168.00Reasoning, web search, vision
Azureazure/gpt-5.2400K$1.75$14.00Reasoning, vision, PDF, caching
Azureazure/gpt-5.2-pro400K$21.00$168.00Reasoning, web search
Bedrockus.writer.palmyra-x4-v1:0128K$2.50$10.00Function calling, PDF input
Bedrockus.writer.palmyra-x5-v1:01M$0.60$6.00Function calling, PDF input
Bedrockeu.anthropic.claude-opus-4-5-20251101-v1:0200K$5.00$25.00Reasoning, computer use, vision
Bedrockgoogle.gemma-3-12b-it128K$0.10$0.30Audio input
Bedrockmoonshot.kimi-k2-thinking128K$0.60$2.50Reasoning
Bedrocknvidia.nemotron-nano-12b-v2128K$0.20$0.60Vision
Bedrockqwen.qwen3-next-80b-a3b128K$0.15$1.20Function calling
Vertex AIvertex_ai/deepseek-ai/deepseek-v3.2-maas164K$0.56$1.68Reasoning, caching
Mistralmistral/codestral-2508256K$0.30$0.90Function calling
Mistralmistral/devstral-2512256K$0.40$2.00Function calling
Mistralmistral/labs-devstral-small-2512256K$0.10$0.30Function calling
Cerebrascerebras/zai-glm-4.6128K--Chat completions
NVIDIA NIMnvidia_nim/ranking/nvidia/llama-3.2-nv-rerankqa-1b-v2-FreeFreeRerank
Voyagevoyage/rerank-2.532K$0.05/1K tokens-Rerank
Fireworks AI227 new modelsVariousVariousVariousFull model catalog

Features​

Bug Fixes​


LLM API Endpoints​

Features​

Bugs​

  • General
    • Fix handle string content in is_cached_message - PR #17853

Management Endpoints / UI​

Features​

  • UI Settings
    • Add Get and Update Backend Routes for UI Settings - PR #17689
    • UI Settings page implementation - PR #17697
    • Ensure Model Page honors UI Settings - PR #17804
    • Add All Proxy Models to Default User Settings - PR #17902
  • Agent & Usage UI
  • Logs & Spend
    • Enhanced spend analytics in logs view - PR #17623
    • Add user info delete modal for user management - PR #17625
    • Show request and response details in logs view - PR #17928
  • Virtual Keys
    • Fix x-litellm-key-spend header update - PR #17864
  • Models & Endpoints
    • Model Hub Useful Links Rearrange - PR #17859
    • Create Team Model Dropdown honors Organization's Models - PR #17834
  • SSO & Auth
    • Allow upserting user role when SSO provider role changes - PR #17754
    • Allow fetching role from generic SSO provider (Keycloak) - PR #17787
    • JWT Auth - allow selecting team_id from request header - PR #17884
    • Remove SSO Config Values from Config Table on SSO Update - PR #17668
  • Teams
    • Attach team to org table - PR #17832
    • Expose the team alias when authenticating - PR #17725
  • MCP Server Management
    • Add extra_headers and allowed_tools to UpdateMCPServerRequest - PR #17940
  • Notifications
    • Show progress and pause on hover for Notifications - PR #17942
  • General
    • Allow Root Path to Redirect when Docs not on Root Path - PR #16843
    • Show UI version number on top left near logo - PR #17891
    • Re-organize left navigation with correct categories and agents on root - PR #17890
    • UI Playground - allow custom model names in model selector dropdown - PR #17892

Bugs​

  • UI Fixes
    • Fix links + old login page deprecation message - PR #17624
    • Filtering for Chat UI Endpoint Selector - PR #17567
    • Race Condition Handling in SCIM v2 - PR #17513
    • Make /litellm_model_cost_map public - PR #16795
    • Custom Callback on UI - PR #17522
    • Add User Writable Directory to Non Root Docker for Logo - PR #17180
    • Swap URL Input and Display Name inputs - PR #17682
    • Change deprecation banner to only show on /sso/key/generate - PR #17681
    • Change credential encryption to only affect db credentials - PR #17741
  • Auth & Routes
    • Return 403 instead of 503 for unauthorized routes - PR #17723
    • AI Gateway Auth - allow using wildcard patterns for public routes - PR #17686

AI Integrations​

New Integrations (4 new integrations)​

IntegrationTypeDescription
SumoLogicLoggingNative webhook integration for SumoLogic - PR #17630
Arize PhoenixPrompt ManagementArize Phoenix OSS prompt management integration - PR #17750
SendgridEmailSendgrid email notifications integration - PR #17775
OnyxGuardrailsOnyx guardrail hooks integration - PR #16591

Logging​

  • Langfuse
    • Propagate Langfuse trace_id - PR #17669
    • Prefer standard trace id for Langfuse logging - PR #17791
    • Move query params to create_pass_through_route call in Langfuse passthrough - PR #17660
    • Add support for custom masking function - PR #17826
  • Prometheus
    • Add 'exception_status' to prometheus logger - PR #17847
  • OpenTelemetry
    • Add latency metrics (TTFT, TPOT, Total Generation Time) to OTEL payload - PR #17888
  • General
    • Add polling via cache feature for async logging - PR #16862

Guardrails​

Prompt Management​

  • General
    • New API for integrating prompt management providers - PR #17829

Spend Tracking, Budgets and Rate Limiting​

  • Service Tier Pricing - Extract service_tier from response/usage for OpenAI flex pricing - PR #17748
  • Agent Cost Tracking - Track agent_id in SpendLogs - PR #17795
  • Tag Activity - Deduplicate /tag/daily/activity metadata - PR #16764
  • Rate Limiting - Dynamic Rate Limiter - allow specifying ttl for in memory cache - PR #17679

MCP Gateway​

  • Chat Completions Integration - Add support for using MCPs on /chat/completions - PR #17747
  • UI Session Permissions - Fix UI session MCP permissions across real teams - PR #17620
  • OAuth Callback - Fix MCP OAuth callback routing and URL handling - PR #17789
  • Tool Name Prefix - Fix MCP tool name prefix - PR #17908

Agent Gateway (A2A)​

  • Cost Per Query - Add cost per query for agent invocations - PR #17774
  • Token Counting - Add token counting non streaming + streaming - PR #17779
  • Cost Per Token - Add cost per token pricing for A2A - PR #17780
  • LangGraph Provider - Add LangGraph provider for Agent Gateway - PR #17783
  • Bedrock & LangGraph Agents - Allow using Bedrock AgentCore, LangGraph agents with A2A Gateway - PR #17786
  • Agent Management - Allow adding LangGraph, Bedrock Agent Core agents - PR #17802
  • Azure Foundry Agents - Add Azure AI Foundry Agents support - PR #17845
  • Azure Foundry UI - Allow adding Azure Foundry Agents on UI - PR #17909
  • Azure Foundry Fixes - Ensure Azure Foundry agents work correctly - PR #17943

Performance / Loadbalancing / Reliability improvements​

  • Memory Leak Fix - Cut memory leak in half - PR #17784
  • Spend Logs Memory - Reduce memory accumulation of spend_logs - PR #17742
  • Router Optimization - Replace time.perf_counter() with time.time() - PR #17881
  • Filter Internal Params - Filter internal params in fallback code - PR #17941
  • Gunicorn Suggestion - Suggest Gunicorn instead of uvicorn when using max_requests_before_restart - PR #17788
  • Pydantic Warnings - Mitigate PydanticDeprecatedSince20 warnings - PR #17657
  • Python 3.14 Support - Add Python 3.14 support via grpcio version constraints - PR #17666
  • OpenAI Package - Bump openai package to 2.9.0 - PR #17818

Documentation Updates​

  • Contributing - Update clone instructions to recommend forking first - PR #17637
  • Getting Started - Improve Getting Started page and SDK documentation structure - PR #17614
  • JSON Mode - Make it clearer how to get Pydantic model output - PR #17671
  • drop_params - Update litellm docs for drop_params - PR #17658
  • Environment Variables - Document missing environment variables and fix incorrect types - PR #17649
  • SumoLogic - Add SumoLogic integration documentation - PR #17647
  • SAP Gen AI - Add SAP Gen AI provider documentation - PR #17667
  • Authentication - Add Note for Authentication - PR #17733
  • Known Issues - Adding known issues to 1.80.5-stable docs - PR #17738
  • Supported Endpoints - Fix Supported Endpoints page - PR #17710
  • Token Count - Document token count endpoint - PR #17772
  • Overview - Made litellm proxy and SDK difference cleaner in overview with a table - PR #17790
  • Containers API - Add docs for containers files API + code interpreter on LiteLLM - PR #17749
  • Target Storage - Add documentation for target storage - PR #17882
  • Agent Usage - Agent Usage documentation - PR #17931, PR #17932, PR #17934
  • Cursor Integration - Cursor Integration documentation - PR #17855, PR #17939
  • A2A Cost Tracking - A2A cost tracking docs - PR #17913
  • Azure Search - Update azure search docs - PR #17726
  • Milvus Client - Fix milvus client docs - PR #17736
  • Streaming Logging - Remove streaming logging doc - PR #17739
  • Integration Docs - Update integration docs location - PR #17644
  • Links - Updated docs links for mistral and anthropic - PR #17852
  • Community - Add community doc link - PR #17734
  • Pricing - Update pricing for global.anthropic.claude-haiku-4-5-20251001-v1:0 - PR #17703
  • gpt-image-1-mini - Correct model type for gpt-image-1-mini - PR #17635

Infrastructure / Deployment​

  • Docker - Use python instead of wget for healthcheck in docker-compose.yml - PR #17646
  • Helm Chart - Add extraResources support for Helm chart deployments - PR #17627
  • Helm Versioning - Add semver prerelease suffix to helm chart versions - PR #17678
  • Database Schema - Add storage_backend and storage_url columns to schema.prisma for target storage feature - PR #17936

New Contributors​

  • @xianzongxie-stripe made their first contribution in PR #16862
  • @krisxia0506 made their first contribution in PR #17637
  • @chetanchoudhary-sumo made their first contribution in PR #17630
  • @kevinmarx made their first contribution in PR #17632
  • @expruc made their first contribution in PR #17627
  • @rcII made their first contribution in PR #17626
  • @tamirkiviti13 made their first contribution in PR #16591
  • @Eric84626 made their first contribution in PR #17629
  • @vasilisazayka made their first contribution in PR #16053
  • @juliettech13 made their first contribution in PR #17663
  • @jason-nance made their first contribution in PR #17660
  • @yisding made their first contribution in PR #17671
  • @emilsvennesson made their first contribution in PR #17656
  • @kumekay made their first contribution in PR #17646
  • @chenzhaofei01 made their first contribution in PR #17584
  • @shivamrawat1 made their first contribution in PR #17733
  • @ephrimstanley made their first contribution in PR #17723
  • @hwittenborn made their first contribution in PR #17743
  • @peterkc made their first contribution in PR #17727
  • @saisurya237 made their first contribution in PR #17725
  • @Ashton-Sidhu made their first contribution in PR #17728
  • @CyrusTC made their first contribution in PR #17810
  • @jichmi made their first contribution in PR #17703
  • @ryan-crabbe made their first contribution in PR #17852
  • @nlineback made their first contribution in PR #17851
  • @butnarurazvan made their first contribution in PR #17468
  • @yoshi-p27 made their first contribution in PR #17915

Full Changelog​

View complete changelog on GitHub