[Preview] v1.83.3.rc.1 - Introducing MCP Skills Marketplace
Deploy this version

Docker:

    docker run \
      -e STORE_MODEL_IN_DB=True \
      -p 4000:4000 \
      docker.litellm.ai/berriai/litellm:main-v1.83.3.rc.1

Pip:

    pip install litellm==1.83.3rc1
Key Highlights
- MCP Toolsets: Create curated tool subsets from one or more MCP servers with scoped permissions, and manage them from the UI or API
- Skills Marketplace: Browse, install, and publish Claude Code skills from a self-hosted marketplace that works across Anthropic, Vertex AI, Azure, and Bedrock
- Guardrail Fallbacks: Configure `on_error` behavior so guardrail failures degrade gracefully instead of blocking the request
- Team Bring Your Own Guardrails: Teams can now attach and manage their own guardrails directly from team settings in the UI
Skills Marketplace
The Skills Marketplace gives teams a self-hosted catalog for discovering, installing, and publishing Claude Code skills. Skills are portable across Anthropic, Vertex AI, Azure, and Bedrock, so a skill published once works with every provider your gateway routes to.

Guardrail Fallbacks
Guardrail pipelines now support an optional `on_error` behavior. When a guardrail check fails or errors out, you can configure the pipeline to fall back gracefully (logging the failure and continuing the request) instead of returning a hard 500 to the caller. This is especially useful for non-critical guardrails where availability matters more than enforcement.
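The fail-open versus fail-closed choice can be illustrated with a minimal Python sketch. This is not LiteLLM's implementation: the `run_guardrail` helper, the `GuardrailError` class, and the `"block"` / `"continue"` values used for `on_error` here are hypothetical names chosen for the example, and the real option values may differ.

```python
import logging
from typing import Callable

logger = logging.getLogger("guardrail_sketch")


class GuardrailError(Exception):
    """Raised when a guardrail check fails or cannot complete (hypothetical)."""


def run_guardrail(
    check: Callable[[str], None],
    prompt: str,
    on_error: str = "block",  # hypothetical values: "block" or "continue"
) -> bool:
    """Run one guardrail check; return True if the request may proceed."""
    try:
        check(prompt)
        return True
    except GuardrailError as exc:
        if on_error == "continue":
            # Fail open: record the failure and let the request through.
            logger.warning("guardrail failed, continuing: %s", exc)
            return True
        # Fail closed: re-raise so the caller rejects the request.
        raise


def flaky_check(prompt: str) -> None:
    # Stand-in for a guardrail whose backend is unavailable.
    raise GuardrailError("guardrail backend timed out")
```

With `on_error="continue"` the failing check above is logged and the request proceeds; with the default it raises, which is the behavior you would likely keep for guardrails that must always be enforced.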
Team Bring Your Own Guardrails
Teams can now attach guardrails directly from the team management UI. Admins configure available guardrails at the project or proxy level, and individual teams select which ones apply to their traffic, with no config file changes or proxy restarts needed. This release also adds project-level guardrail support to the project create/edit flows.
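As a rough mental model of the admin/team split described above, the sketch below validates a team's selection against the set of guardrails an admin has made available. The function name `resolve_team_guardrails` and the guardrail names are hypothetical and only serve the illustration; the proxy's actual data model is not shown in these notes.

```python
from typing import Iterable, List, Set


def resolve_team_guardrails(
    available: Set[str],           # guardrails an admin has made available
    team_selected: Iterable[str],  # guardrails the team picked in its settings
) -> List[str]:
    """Return the team's guardrails, rejecting any the admin has not enabled."""
    selected = list(dict.fromkeys(team_selected))  # de-duplicate, keep order
    unknown = [name for name in selected if name not in available]
    if unknown:
        raise ValueError(f"guardrails not available to teams: {unknown}")
    return selected


# Hypothetical guardrail names, for illustration only.
available = {"pii-masking", "prompt-injection", "toxicity"}
print(resolve_team_guardrails(available, ["pii-masking", "toxicity"]))
```

Keeping the admin-controlled set separate from the per-team selection is what lets teams change their own guardrails without touching shared configuration.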
MCP Toolsets
MCP Toolsets let AI platform admins create curated subsets of tools from one or more MCP servers and assign them to teams and keys with scoped permissions. Instead of granting access to an entire MCP server, you can bundle specific tools into a named toolset and control exactly which tools each team or API key can invoke. Toolsets are managed through the UI (new Toolsets tab) and the API, and work with the Responses API and Playground.
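The scoping idea can be sketched as a small in-memory model: a toolset is a named set of (server, tool) pairs, and a key may invoke a tool only if one of its granted toolsets contains that pair. The `Toolset` and `ToolsetRegistry` classes, and the server, tool, and key names below, are hypothetical; they are not the proxy's DB types or API.

```python
from dataclasses import dataclass, field
from typing import Dict, Set, Tuple


@dataclass
class Toolset:
    name: str
    # (server, tool) pairs, so one toolset can span several MCP servers.
    tools: Set[Tuple[str, str]] = field(default_factory=set)


@dataclass
class ToolsetRegistry:
    toolsets: Dict[str, Toolset] = field(default_factory=dict)
    grants: Dict[str, Set[str]] = field(default_factory=dict)  # key id -> toolset names

    def add(self, toolset: Toolset) -> None:
        self.toolsets[toolset.name] = toolset

    def grant(self, key_id: str, toolset_name: str) -> None:
        if toolset_name not in self.toolsets:
            raise KeyError(f"unknown toolset: {toolset_name}")
        self.grants.setdefault(key_id, set()).add(toolset_name)

    def can_invoke(self, key_id: str, server: str, tool: str) -> bool:
        """True only if some toolset granted to the key contains this tool."""
        return any(
            (server, tool) in self.toolsets[name].tools
            for name in self.grants.get(key_id, set())
        )


registry = ToolsetRegistry()
registry.add(Toolset("support-tools", {("github", "search_issues"), ("slack", "post_message")}))
registry.grant("key-123", "support-tools")

print(registry.can_invoke("key-123", "github", "search_issues"))  # granted via the toolset
print(registry.can_invoke("key-123", "github", "delete_repo"))    # same server, tool not in toolset
```

The second check is the point of the feature: access to one tool on a server does not imply access to the server's other tools.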

New Models / Updated Models
New Model Support
| Provider | Model | Context Window | Input ($/1M tokens) | Output ($/1M tokens) | Features |
|---|---|---|---|---|---|
| Brave Search | brave/search | - | - | - | Search tool integration metadata in cost map (PR #25042) |
| AWS Bedrock | nvidia.nemotron-super-3-120b | 256K | Added | Added | Chat completions, function calling, system messages (PR #24588) |
| OCI GenAI | Multiple new chat + embedding entries | Varies | Updated | Updated | Expanded chat + embedding model catalog |
Features
- Add unversioned Claude Haiku pricing entry to ensure accurate spend accounting - PR #25151
Bug Fixes
- General
LLM API Endpoints
Features
Bugs
Management Endpoints / UI
Features
- Virtual Keys
- Teams + Organizations
- Usage + Analytics
- Models + Providers
- Guardrails UI
- UI Cleanup
  - Migrate Tremor Text/Badge to antd Tag and native spans - PR #24750
Bugs
- Fix logs page showing unfiltered results when backend filter returns zero rows - PR #24745
- Enforce upperbound key params on `/key/update` and bulk update hook paths - PR #25110, PR #25103
- Fix team model update 500 due to unsupported Prisma JSON path filter - PR #25152
AI Integrations
Logging
- General
Guardrails
- Add optional `on_error` for guardrail pipeline failures - PR #25150, PR #24831
- Return HTTP 400 (instead of 500) for Model Armor streaming blocks - PR #24693
Prompt Management
- Add environment + user tracking for prompts (`development/staging/production`) in CRUD + UI flows - PR #25110, PR #24855
Secret Managers
- No major new secret manager provider additions in this RC.
Spend Tracking, Budgets and Rate Limiting
- Enforce budget for models not directly present in the cost map - PR #24949
- Add per-model rate limits in team settings/info UI - PR #25144
- Fix unversioned Vertex Claude Haiku pricing entry to avoid `$0.00` accounting - PR #25151
MCP Gateway
- Introduce MCP Toolsets with DB types, CRUD APIs, scoped permissions, and UI management tab - PR #25155
- Resolve toolset names and enforce toolset access correctly in Responses API and streamable MCP paths - PR #25155
- Switch toolset permission caching to shared cache path and improve cache invalidation behavior - PR #25155
- Allow JWT auth for `/v1/mcp/server/*` sub-paths - PR #25113, PR #24698
- Add STS AssumeRole support for MCP SigV4 auth - PR #25151
- Add tag query fix + MCP metadata support cherry-pick - PR #25145
Performance / Load Balancing / Reliability Improvements
- Integrate router health-check failures with cooldown behavior and transient 429/408 handling - PR #25150, PR #24988
- Add distributed lock for key rotation job execution - PR #25150, PR #23364, PR #23834
- Improve team routing reliability with deterministic grouping, isolation fixes, stale alias controls, and order-based fallback - PR #25154, PR #25148
- Regenerate GCP IAM token per async Redis cluster connection (fix token TTL failures) - PR #25155, PR #24426
- Restore MCP server fields dropped by schema sync migration - PR #24078
- Proxy server reliability hardening with bounded queue usage - PR #25155
Documentation Updates
- Improve HA control plane diagram clarity + mobile rendering updates - PR #24747
- Document `default_team_params` in config reference and examples - PR #25032
- Add JWT to Virtual Key mapping guide - PR #24882
- Add MCP Toolsets docs and sidebar updates - PR #25155
- Security docs updates and April hardening blog - PR #24867, PR #24868, PR #24871, PR #25102
- General docs cleanup + townhall announcement updates - PR #24839, PR #25026, PR #25021
Infrastructure / Security Notes
- Harden npm and Docker supply chain workflows and release pipeline checks - PR #24838, PR #24877, PR #24881, PR #24905, PR #24951, PR #25023, PR #25034, PR #25036, PR #25037, PR #25136, PR #25158
- Resolve CodeQL/security workflow issues and fix broken action SHA references - PR #24880, PR #24815
- Re-add Codecov reporting in GHA matrix workflows - PR #24804
- Load enterprise hooks in the non-root Docker runtime image - PR #24917
- Apply Black formatting to 14 files - PR #24532
- Fix lint issues - PR #24932
New Contributors
- @vanhtuan0409 made their first contribution in https://github.com/BerriAI/litellm/pull/24078
- @clfhhc made their first contribution in https://github.com/BerriAI/litellm/pull/24932
Full Changelog: https://github.com/BerriAI/litellm/compare/v1.83.0-nightly...v1.83.3.rc.1