Multi-Tenant Architecture with LiteLLM

Overview

LiteLLM provides a centralized solution that scales across multiple tenants, enabling organizations to:

Centrally manage LLM access for multiple tenants (organizations, teams, departments)
Isolate spend and usage across different organizational units
Delegate administration without compromising security
Track costs at granular levels (organization → team → user → key)
Scale seamlessly as new teams and users are added

Open Source vs. Enterprise

Teams + Virtual Keys: ✅ Available in open source
Organizations + Org Admins: ✨ Enterprise feature (Get a 7 day trial)

You can implement multi-tenancy using Teams alone in the open source version, or add Organizations on top for additional hierarchy in the enterprise version.

The Multi-Tenant Challenge

Organizations with multi-tenant architectures face several challenges when deploying LLM solutions:

Centralized vs. Decentralized: Need a single unified gateway while maintaining tenant isolation
Cost Attribution: Tracking spend across different business units, departments, or customers
Access Control: Different teams need different models, budgets, and rate limits
Delegation: Team leads should manage their teams without platform-wide admin access
Scalability: Solution must scale from 10 to 10,000+ users without architectural changes

How LiteLLM Solves Multi-Tenancy

LiteLLM implements a hierarchical multi-tenant architecture with four levels:

1. Organizations (Top-Level Tenants) ✨ Enterprise Feature

Organizations represent the highest level of tenant isolation - typically different business units, departments, or customers.

Each organization has its own:
- Budget limits
- Allowed models
- Admin users (org admins)
- Teams
- Spend tracking

Use Cases:

Enterprise Departments: Separate organizations for Engineering, Marketing, Sales
Multi-Customer SaaS: Each customer is an organization with full isolation
Geographic Regions: EMEA, APAC, Americas as separate organizations

Key Features:

Organizations cannot see each other's data
Each organization can have multiple teams
Organization admins manage teams within their organization only
Spend and usage tracked at organization level

API Reference for Organizations

2. Teams (Mid-Level Grouping) ✅ Open Source

Teams can work independently or sit within organizations, representing logical groupings of users working together.

tip

Teams are available in open source and can be used as your primary multi-tenant boundary without needing Organizations. Organizations provide an additional layer of hierarchy for enterprise deployments.

Each team has:
- Team-specific budgets and rate limits
- Team admins who manage members
- Service account keys for shared resources
- Model access controls
- Granular team member permissions

Use Cases:

Project Teams: ML Research team, Product team, Data Science team
Customer Sub-Groups: Different divisions within a customer organization
Environment Separation: Development, Staging, Production teams

Key Features:

Teams inherit organization constraints (can't exceed org budget/models)
Team admins can manage their team without affecting others
Service account keys survive team member changes
Per-team spend tracking and billing

API Reference for Teams

3. Users (Individual Members) ✅ Open Source

Users are individuals who belong to teams and create/use API keys.

Each user can:
- Belong to multiple teams
- Have their own budget limits
- Create personal API keys
- Track individual spend

User Types:

Internal Users: Employees, developers, data scientists
Team Admins: Lead their teams, manage members
Org Admins: Manage multiple teams within their organization
Proxy Admins: Platform-wide administrators

Key Features:

User spend tracked individually
Users can be on multiple teams simultaneously
Role-based permissions control what users can do
User keys deleted when user is removed

API Reference for Users

4. Virtual Keys (Authentication Layer) ✅ Open Source

Virtual Keys are the API keys used to authenticate requests and track spend.

Each key can be one of three types:

Key Type	Configuration	Use Case	Spend Tracking	Lifecycle
User-only	`user_id` only	Developer personal keys	User level	Deleted with user
Team Service Account	`team_id` only	Production apps, CI/CD	Team level	Survives member changes
User + Team	Both `user_id` and `team_id`	User within team context	User AND Team	Deleted with user

Example Scenarios:

Use user-only keys for developers testing locally
Use team service account keys for your production application that shouldn't break when employees leave
Use user + team keys when you want individual accountability within a team budget

API Reference for Keys

Role-Based Access Control (RBAC)

LiteLLM provides granular RBAC across the hierarchy:

Global Proxy Roles (Platform-Wide)

Role	Scope	Permissions
Proxy Admin	Entire platform	Create orgs, teams, users. View all spend. Full control.
Proxy Admin Viewer	Entire platform	View-only access to all data. Cannot make changes.
Internal User	Own resources	Create/delete own keys. View own spend.

Organization/Team Roles (Scoped)

Role	Scope	Permissions
Org Admin ✨	Specific organization	Create teams, add users, view org spend within their org only.
Team Admin ✨	Specific team	Manage team members, budgets, keys within their team only.

✨ = Premium Feature

Team Member Permissions

Team admins can configure granular permissions for regular team members:

Read-only (default):

["/key/info", "/key/health"]

Allow key creation:

["/key/info", "/key/health", "/key/generate", "/key/update"]

Full key management:

["/key/info", "/key/health", "/key/generate", "/key/update", "/key/delete", "/key/regenerate", "/key/block", "/key/unblock"]

Learn more about RBAC

Spend Tracking & Cost Attribution

LiteLLM provides multi-level spend tracking that flows through the hierarchy:

Hierarchical Spend Flow

Organization Spend
    ├── Team 1 Spend
    │   ├── User A Spend
    │   │   ├── Key 1 Spend
    │   │   └── Key 2 Spend
    │   └── Service Account Spend
    │       └── Key 3 Spend
    └── Team 2 Spend
        └── User B Spend
            └── Key 4 Spend

Budget Enforcement

Budgets can be set at every level with inheritance:

Organization Budget: $10,000/month
- Team 1: $6,000/month (within org limit)
  - User A: $3,000/month (within team limit)
  - User B: $3,000/month (within team limit)
- Team 2: $4,000/month (within org limit)

Enforcement Rules:

Team budgets cannot exceed organization budget
User budgets cannot exceed team budget
Requests blocked when any level exceeds budget
Real-time tracking prevents overruns

Learn more about Budgets

Common Multi-Tenant Patterns

Pattern 1: Enterprise Departments

Scenario: Large enterprise with multiple departments needing centralized LLM access

Enterprise Setup (with Organizations):

Platform (LiteLLM Instance)
├── Engineering Organization ✨
│   ├── Backend Team
│   ├── Frontend Team
│   └── ML Team
├── Marketing Organization ✨
│   ├── Content Team
│   └── Analytics Team
└── Sales Organization ✨
    ├── Sales Ops Team
    └── Customer Success Team

Open Source Alternative (Teams only):

Platform (LiteLLM Instance)
├── Engineering Backend Team
├── Engineering Frontend Team
├── Engineering ML Team
├── Marketing Content Team
├── Marketing Analytics Team
├── Sales Ops Team
└── Customer Success Team

Benefits:

Each department/team manages their own budget
Department leads (org/team admins) control their teams
Centralized billing and model access
Cross-department cost visibility for finance

Pattern 2: Multi-Customer SaaS

Scenario: SaaS provider offering LLM-powered features to multiple customers

Enterprise Setup (with Organizations):

Platform (LiteLLM Instance)
├── Customer A Organization ✨
│   ├── Production Team (Service Accounts)
│   ├── Development Team
│   └── QA Team
├── Customer B Organization ✨
│   ├── Production Team (Service Accounts)
│   └── Development Team
└── Customer C Organization ✨
    └── Production Team (Service Accounts)

Open Source Alternative (Teams only):

Platform (LiteLLM Instance)
├── Customer A Production Team (Service Accounts)
├── Customer A Development Team
├── Customer A QA Team
├── Customer B Production Team (Service Accounts)
├── Customer B Development Team
└── Customer C Production Team (Service Accounts)

Benefits:

Complete isolation between customers/teams
Per-customer/team billing and usage tracking
Customer/team admins can self-serve
Production service account keys survive employee turnover

Pattern 3: Environment Separation

Scenario: Single organization with multiple environments

Platform (LiteLLM Instance)
└── Company Organization
    ├── Production Team
    │   └── Service Account Keys (strict rate limits)
    ├── Staging Team
    │   └── Service Account Keys (moderate limits)
    └── Development Team
        └── User Keys (generous limits for testing)

Benefits:

Separate budgets for each environment
Different model access (production vs. development)
Prevent development usage from affecting production budget
Easy cost attribution by environment

Delegation & Self-Service

One of LiteLLM's key advantages is delegated administration:

Without LiteLLM

Every team → Requests platform admin → Admin makes changes

❌ Bottleneck on platform team
❌ Slow onboarding
❌ Poor scalability

With LiteLLM

Proxy Admin → Creates org + org admin
Org Admin → Creates teams + team admins  
Team Admin → Manages their team independently

✅ Decentralized management
✅ Fast onboarding
✅ Scales to thousands of users

Self-Service Capabilities

Team Admins Can:

Add/remove team members
Create API keys for team members
Update team budgets (within org limits)
Configure team member permissions
View team usage and spend

Org Admins Can:

Create new teams within their organization
Assign team admins
View organization-wide spend
Manage users across their teams

Platform Admins Can:

Create organizations
Assign org admins
Set organization-level policies
View platform-wide analytics

Scalability

LiteLLM's architecture scales from small teams to enterprise deployments:

Small Team (10-100 users)

Single organization
Few teams (5-10)
Proxy admins manage everything

Mid-Size (100-1,000 users)

Multiple organizations
Many teams (50+)
Org admins delegate to team admins

Enterprise (1,000+ users)

Many organizations (departments/regions)
Hundreds of teams
Fully delegated admin structure
Centralized observability and billing

Key Scalability Features:

No architectural changes needed as you grow
Database-backed (PostgreSQL) for reliability
Horizontal scaling support
Efficient spend tracking and logging

Security & Isolation

Tenant Isolation

Each tenant (organization) is isolated:

✅ Cannot view other organizations' data
✅ Cannot access other organizations' keys
✅ Cannot exceed their budget limits
✅ Cannot access models not in their allowed list

Authentication Security

Master key for platform admins
Virtual keys with scoped permissions
SSO integration support
JWT authentication
IP allowlisting

Audit & Compliance

All API calls logged with user/team/org context
Spend tracking for chargeback/showback
Admin actions audited
Integration with observability tools

Learn more about Security

Getting Started

Enterprise vs. Open Source Setup

The steps below show the full enterprise hierarchy with Organizations.

For open source, skip Steps 1-2 and start directly with Step 3 (creating teams). Teams can function as your top-level tenant boundary without Organizations.

Step 1: Set Up Organizations ✨ Enterprise

Create your first organization:

curl --location 'http://0.0.0.0:4000/organization/new' \
    --header 'Authorization: Bearer sk-1234' \
    --header 'Content-Type: application/json' \
    --data '{
        "organization_alias": "engineering_department",
        "models": ["gpt-4", "gpt-4o", "claude-3-5-sonnet"],
        "max_budget": 10000
    }'

Step 2: Add an Organization Admin ✨ Enterprise

curl -X POST 'http://0.0.0.0:4000/organization/member_add' \
    -H 'Authorization: Bearer sk-1234' \
    -H 'Content-Type: application/json' \
    -d '{
        "organization_id": "org-123",
        "member": {
            "role": "org_admin",
            "user_id": "admin@company.com"
        }
    }'

Step 3: Create Teams ✅ Open Source

For Enterprise: Organization admin creates team within their organization
For Open Source: Proxy admin creates team directly (no organization_id needed)

# Enterprise: Org admin creates team in their organization
curl --location 'http://0.0.0.0:4000/team/new' \
    --header 'Authorization: Bearer sk-org-admin-key' \
    --header 'Content-Type: application/json' \
    --data '{
        "team_alias": "ml_team",
        "organization_id": "org-123",
        "max_budget": 5000
    }'

# Open Source: Proxy admin creates team directly
curl --location 'http://0.0.0.0:4000/team/new' \
    --header 'Authorization: Bearer sk-1234' \
    --header 'Content-Type: application/json' \
    --data '{
        "team_alias": "ml_team",
        "max_budget": 5000
    }'

Step 4: Add Team Admin

curl -X POST 'http://0.0.0.0:4000/team/member_add' \
    -H 'Authorization: Bearer sk-org-admin-key' \
    -H 'Content-Type: application/json' \
    -d '{
        "team_id": "team-456",
        "member": {
            "role": "admin",
            "user_id": "team-lead@company.com"
        }
    }'

Step 5: Team Admin Manages Their Team

# Team admin adds members
curl -X POST 'http://0.0.0.0:4000/team/member_add' \
    -H 'Authorization: Bearer sk-team-admin-key' \
    -H 'Content-Type: application/json' \
    -d '{
        "team_id": "team-456",
        "member": {
            "role": "user",
            "user_id": "developer@company.com"
        }
    }'

# Team admin creates keys for members
curl --location 'http://0.0.0.0:4000/key/generate' \
    --header 'Authorization: Bearer sk-team-admin-key' \
    --header 'Content-Type: application/json' \
    --data '{
        "user_id": "developer@company.com",
        "team_id": "team-456"
    }'

Use Case Examples

Example 1: Chargeback Model

Goal: Each business unit pays for their own LLM usage

Setup:

Create organization per business unit
Set budgets based on allocated budgets
Track spend per organization
Generate monthly reports for finance

Result: Finance can charge back costs to respective departments with accurate attribution.

Example 2: Customer-Facing AI Product

Goal: Provide LLM capabilities to customers with isolation and cost tracking

Setup:

Create organization per customer
Use service account keys for production workloads
Track spend per customer organization
Set rate limits per customer tier

Result: Bill customers accurately, prevent noisy neighbors, maintain isolation.

Example 3: Development vs. Production

Goal: Separate development and production environments with different policies

Setup:

Create "Development" and "Production" teams
Development: Generous budgets, all models, user keys
Production: Strict budgets, approved models only, service account keys
Different rate limits per environment

Result: Developers can experiment freely without impacting production budget or reliability.

Best Practices

1. Organization Design

✅ Map organizations to cost centers or customers
✅ Set realistic budgets with buffer for growth
✅ Assign 1-2 org admins per organization
❌ Don't create too many organizations (adds management overhead)

2. Team Structure

✅ Keep teams aligned with actual working groups
✅ Use service account keys for production
✅ Give team admins enough permissions to self-serve
❌ Don't create single-user teams (use user-only keys instead)

3. Key Management

✅ Use descriptive key names
✅ Rotate keys regularly
✅ Delete unused keys
✅ Use appropriate key type for use case
❌ Don't share keys across users/teams

4. Budget Management

✅ Set budgets at multiple levels (org → team → user)
✅ Monitor spend regularly
✅ Alert before budget exhaustion
❌ Don't set budgets too tight (may block legitimate usage)

5. Delegation

✅ Assign org admins for large organizations
✅ Assign team admins for active teams
✅ Configure team member permissions appropriately
❌ Don't make everyone a proxy admin

Monitoring & Observability

LiteLLM provides comprehensive monitoring:

Spend Tracking: Real-time spend by org/team/user/key
Usage Analytics: Request counts, token usage, model usage
Admin UI: Visual dashboard for all metrics
Logging: Detailed logs with tenant context
Alerting: Budget alerts, rate limit alerts, error alerts

Learn more about Logging

Comparison with Other Approaches

Approach	Pros	Cons	LiteLLM Advantage
Separate instances per tenant	Strong isolation	High operational overhead, cost inefficient	Single instance, same isolation, 90% cost reduction
Single shared pool	Simple setup	No cost attribution, no access control	Full attribution, granular access control
API key prefixes	Basic separation	Manual tracking, no hierarchy, no RBAC	Automatic tracking, hierarchical, full RBAC
External auth layer	Flexible	Complex integration, no built-in budgets	Native integration, built-in budgets

FAQ

Q: Can users belong to multiple teams?
A: Yes, users can be members of multiple teams and have different keys for each team.

Q: What happens when a user leaves?
A: User-specific keys are deleted, but team service account keys remain active.

Q: Can team budgets exceed organization budget?
A: No, the system enforces that team budgets cannot exceed their organization's budget.

Q: How granular is the cost tracking?
A: Every API call is tracked with organization, team, user, and key context.

Q: Can I have teams without organizations?
A: Yes! Teams work independently in open source without needing Organizations. Organizations are an enterprise feature that adds an additional hierarchy layer on top of teams.

Q: Is there a limit to hierarchy depth?
A: The hierarchy is: Organization → Team → User → Key (4 levels). This covers most use cases.

Q: How do I migrate from flat structure to hierarchical?
A: You can gradually create organizations and teams, then move existing users/keys into them.

User Management Hierarchy - Visual hierarchy overview
Access Control (RBAC) - Detailed role permissions
Team Budgets - Budget management guide
Virtual Keys - API key management
Admin UI - Visual dashboard for management

Summary

LiteLLM solves multi-tenant architecture challenges through:

Hierarchical Structure: Organizations → Teams → Users → Keys
Granular RBAC: Platform-wide and tenant-scoped roles
Cost Attribution: Spend tracking at every level
Delegation: Org admins and team admins self-manage
Isolation: Strong tenant boundaries
Scalability: Handles 10 to 10,000+ users with same architecture

Open Source vs. Enterprise

Open Source (Teams + Users + Keys):

✅ Teams as primary tenant boundary
✅ Team admins manage their teams
✅ Virtual keys with team/user tracking
✅ Budget and rate limits per team
✅ Spend tracking and logging

Enterprise (Adds Organizations layer):

✨ Organizations for top-level tenant isolation
✨ Organization admins manage multiple teams
✨ Organization-level budgets and model access
✨ Hierarchical delegation and reporting

This makes LiteLLM ideal for:

✅ Enterprises with multiple departments
✅ SaaS providers with multiple customers
✅ Organizations needing cost chargeback/showback
✅ Teams requiring self-service LLM access
✅ Any multi-tenant LLM deployment

Start with LiteLLM Proxy →

Overview​

The Multi-Tenant Challenge​

How LiteLLM Solves Multi-Tenancy​

1. Organizations (Top-Level Tenants) ✨ Enterprise Feature​

2. Teams (Mid-Level Grouping) ✅ Open Source​

3. Users (Individual Members) ✅ Open Source​

4. Virtual Keys (Authentication Layer) ✅ Open Source​

Role-Based Access Control (RBAC)​

Global Proxy Roles (Platform-Wide)​

Organization/Team Roles (Scoped)​

Team Member Permissions​

Spend Tracking & Cost Attribution​

Hierarchical Spend Flow​

Budget Enforcement​

Common Multi-Tenant Patterns​

Pattern 1: Enterprise Departments​

Pattern 2: Multi-Customer SaaS​

Pattern 3: Environment Separation​

Delegation & Self-Service​

Without LiteLLM​

With LiteLLM​

Self-Service Capabilities​

Scalability​

Small Team (10-100 users)​

Mid-Size (100-1,000 users)​

Enterprise (1,000+ users)​

Security & Isolation​

Tenant Isolation​

Authentication Security​

Audit & Compliance​

Getting Started​

Step 1: Set Up Organizations ✨ Enterprise​

Step 2: Add an Organization Admin ✨ Enterprise​

Step 3: Create Teams ✅ Open Source​

Step 4: Add Team Admin​

Step 5: Team Admin Manages Their Team​

Use Case Examples​

Example 1: Chargeback Model​

Example 2: Customer-Facing AI Product​

Example 3: Development vs. Production​

Best Practices​

1. Organization Design​

2. Team Structure​

3. Key Management​

4. Budget Management​

5. Delegation​

Monitoring & Observability​

Comparison with Other Approaches​

FAQ​

Related Documentation​

Summary​

Open Source vs. Enterprise​

Overview

The Multi-Tenant Challenge

How LiteLLM Solves Multi-Tenancy

1. Organizations (Top-Level Tenants) ✨ Enterprise Feature

2. Teams (Mid-Level Grouping) ✅ Open Source

3. Users (Individual Members) ✅ Open Source

4. Virtual Keys (Authentication Layer) ✅ Open Source

Role-Based Access Control (RBAC)

Global Proxy Roles (Platform-Wide)

Organization/Team Roles (Scoped)

Team Member Permissions

Spend Tracking & Cost Attribution

Hierarchical Spend Flow

Budget Enforcement

Common Multi-Tenant Patterns

Pattern 1: Enterprise Departments

Pattern 2: Multi-Customer SaaS

Pattern 3: Environment Separation

Delegation & Self-Service

Without LiteLLM

With LiteLLM

Self-Service Capabilities

Scalability

Small Team (10-100 users)

Mid-Size (100-1,000 users)

Enterprise (1,000+ users)

Security & Isolation

Tenant Isolation

Authentication Security

Audit & Compliance

Getting Started

Step 1: Set Up Organizations ✨ Enterprise

Step 2: Add an Organization Admin ✨ Enterprise

Step 3: Create Teams ✅ Open Source

Step 4: Add Team Admin

Step 5: Team Admin Manages Their Team

Use Case Examples

Example 1: Chargeback Model

Example 2: Customer-Facing AI Product

Example 3: Development vs. Production

Best Practices

1. Organization Design

2. Team Structure

3. Key Management

4. Budget Management

5. Delegation

Monitoring & Observability

Comparison with Other Approaches

FAQ

Related Documentation

Summary

Open Source vs. Enterprise