Skip to main content

[Pre-Release] v1.74.15-stable

Krrish Dholakia
CEO, LiteLLM
Ishaan Jaffer
CTO, LiteLLM

Deploy this versionโ€‹

docker run litellm
docker run \
-e STORE_MODEL_IN_DB=True \
-p 4000:4000 \
ghcr.io/berriai/litellm:v1.74.15.rc.1

Key Highlightsโ€‹

  • User Agent Activity Tracking - Track how much usage each coding tool gets.
  • Prompt Management - Use Git-Ops style prompt management with prompt templates.
  • MCP Gateway: Guardrails - Support for using Guardrails with MCP servers.
  • Google AI Studio Imagen4 - Support for using Imagen4 models on Google AI Studio.

User Agent Activity Trackingโ€‹


This release brings support for tracking usage and costs for AI-powered coding tools like Claude Code, Roo Code, Gemini CLI through LiteLLM. You can now track LLM cost, total tokens used, and DAU/WAU/MAU for each coding tool.

This is great to central AI Platform teams looking to track how they are helping developer productivity.

Read More


Prompt Managementโ€‹


Read More


New Models / Updated Modelsโ€‹

New Model Supportโ€‹

ProviderModelContext WindowInput ($/1M tokens)Output ($/1M tokens)Cost per Image
OpenRouteropenrouter/x-ai/grok-4256k$3$15N/A
Google AI Studiogemini/imagen-4.0-generate-preview-06-06N/AN/AN/A$0.04
Google AI Studiogemini/imagen-4.0-ultra-generate-preview-06-06N/AN/AN/A$0.06
Google AI Studiogemini/imagen-4.0-fast-generate-preview-06-06N/AN/AN/A$0.02
Google AI Studiogemini/imagen-3.0-generate-002N/AN/AN/A$0.04
Google AI Studiogemini/imagen-3.0-generate-001N/AN/AN/A$0.04
Google AI Studiogemini/imagen-3.0-fast-generate-001N/AN/AN/A$0.02

Featuresโ€‹

Bugsโ€‹


LLM API Endpointsโ€‹

Bugsโ€‹


MCP Gatewayโ€‹

Featuresโ€‹

  • Health Check Improvements
    • Add health check endpoints for MCP servers - PR #13106
  • Guardrails Integration
    • Add pre and during call hooks initialization - PR #13067
    • Move pre and during hooks to ProxyLogging - PR #13109
    • MCP pre and during guardrails implementation - PR #13188
  • Protocol & Header Support
  • URL & Namespacing
    • Improve MCP server URL validation for internal/Kubernetes URLs - PR #13099

Bugsโ€‹

  • UI

Read More


Management Endpoints / UIโ€‹

Featuresโ€‹

  • Usage Analytics

    • New tab for user agent activity tracking - PR #13146
    • Daily usage per user analytics - PR #13147
    • Default usage chart date range set to last 7 days - PR #12917
    • New advanced date range picker component - PR #13141, PR #13221
    • Show loader on usage cost charts after date selection - PR #13113
  • Models

    • Added Voyage, Jinai, Deepinfra and VolcEngine providers on UI - PR #13131
    • Added Sagemaker on UI - PR #13117
    • Preserve model order in /v1/models and /model_group/info endpoints - PR #13178
  • Key Management

    • Properly parse JSON options for key generation in UI - PR #12989
  • Authentication

    • JWT Fields
      • Add dot notation support for all JWT fields - PR #13013

Bugsโ€‹

  • Permissions
    • Fix object permission for organizations - PR #13142
    • Fix list team v2 security check - PR #13094
  • Models
    • Fix model reload on model update - PR #13216
  • Router Settings
    • Fix displaying models for fallbacks in UI - PR #13191
    • Fix wildcard model name handling with custom values - PR #13116
    • Fix fallback delete functionality - PR #12606

Logging / Guardrail Integrationsโ€‹

Featuresโ€‹

Bugsโ€‹

  • API Key Logging
    • Fix API Key being logged inappropriately - PR #12978
  • MCP Spend Tracking
    • Set default value for MCP namespace tool name in spend table - PR #12894

Performance / Loadbalancing / Reliability improvementsโ€‹

Featuresโ€‹

  • Background Health Checks
    • Allow disabling background health checks for specific deployments - PR #13186
  • Database Connection Management
    • Ensure stale Prisma clients disconnect DB connections properly - PR #13140
  • Jitter Improvements
    • Fix jitter calculation (should be added not multiplied) - PR #12901

Bugsโ€‹

  • Anthropic Streaming
    • Always use choice index=0 for Anthropic streaming responses - PR #12666
  • Custom Auth
    • Bubble up custom exceptions properly - PR #13093
  • OTEL with Managed Files
    • Fix using managed files with OTEL integration - PR #13171

General Proxy Improvementsโ€‹

Featuresโ€‹

  • Database Migration
    • Move to use_prisma_migrate by default - PR #13117
    • Resolve team-only models on auth checks - PR #13117
  • Infrastructure
    • Loosened MCP Python version restrictions - PR #13102
    • Migrate build_and_test to CI/CD Postgres DB - PR #13166
  • Helm Charts
    • Allow Helm hooks for migration jobs - PR #13174
    • Fix Helm migration job schema updates - PR #12809

Bugsโ€‹

  • Docker
    • Remove obsolete version attribute in docker-compose - PR #13172
    • Add openssl in runtime stage for non-root Dockerfile - PR #13168
  • Database Configuration
    • Fix DB config through environment variables - PR #13111
  • Logging
  • Token Counting
    • Ignore unsupported keys like prefix in token counter - PR #11954

New Contributorsโ€‹

Full Changelogโ€‹