# Rubrik Guardrail
Use Rubrik's tool blocking and logging integration to validate LLM tool calls against an external policy service and batch-log all LLM requests/responses.
Key features:
- Tool blocking: Validates tool calls against an external Rubrik service after LLM completion. Blocked tool calls trigger a policy violation response.
- Batch logging: Logs all LLM requests and responses to Rubrik with configurable sampling and batching.
- Fail-open: If the tool blocking service is unavailable, requests are allowed through unchanged.
## Quick Start
### 1. Configure config.yaml
Credentials can be set directly in the YAML config or via environment variables. The config approach is recommended.
```yaml
model_list:
  - model_name: gpt-4
    litellm_params:
      model: openai/gpt-4
      api_key: os.environ/OPENAI_API_KEY

guardrails:
  - guardrail_name: "rubrik"
    litellm_params:
      guardrail: rubrik
      mode: "post_call"
      api_key: "your-rubrik-api-key"
      api_base: "https://your-rubrik-service.example.com"
      default_on: true
```
You can also reference environment variables in the config:
```yaml
guardrails:
  - guardrail_name: "rubrik"
    litellm_params:
      guardrail: rubrik
      mode: "post_call"
      api_key: os.environ/RUBRIK_API_KEY
      api_base: os.environ/RUBRIK_WEBHOOK_URL
      default_on: true
```
As an alternative, you can configure the Rubrik service URL and API key purely through environment variables. When set, these are used as fallbacks if `api_base` / `api_key` are not provided in the config.

```shell
export RUBRIK_WEBHOOK_URL="https://your-rubrik-service.example.com"
export RUBRIK_API_KEY="your-rubrik-api-key"
```
With a minimal config:
```yaml
model_list:
  - model_name: gpt-4
    litellm_params:
      model: openai/gpt-4
      api_key: os.environ/OPENAI_API_KEY

guardrails:
  - guardrail_name: "rubrik"
    litellm_params:
      guardrail: rubrik
      mode: "post_call"
      default_on: true
```
### 2. Launch the Proxy

```shell
litellm --config config.yaml --port 4000
```
### 3. Test It
```shell
curl -X POST http://localhost:4000/chat/completions \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer sk-1234" \
  -d '{
    "model": "gpt-4",
    "messages": [{"role": "user", "content": "What is the weather in SF?"}],
    "tools": [
      {
        "type": "function",
        "function": {
          "name": "get_weather",
          "description": "Get the weather for a location",
          "parameters": {
            "type": "object",
            "properties": {
              "location": {"type": "string"}
            },
            "required": ["location"]
          }
        }
      }
    ]
  }'
```
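Equivalently, you can test from Python with the OpenAI SDK pointed at the proxy. This mirrors the curl example above; `sk-1234` is your proxy API key:

```python
# Send the same tool-call request through the LiteLLM proxy using the
# OpenAI Python SDK (pip install openai).
from openai import OpenAI

client = OpenAI(
    base_url="http://localhost:4000",  # the LiteLLM proxy
    api_key="sk-1234",                 # your proxy API key
)

response = client.chat.completions.create(
    model="gpt-4",
    messages=[{"role": "user", "content": "What is the weather in SF?"}],
    tools=[
        {
            "type": "function",
            "function": {
                "name": "get_weather",
                "description": "Get the weather for a location",
                "parameters": {
                    "type": "object",
                    "properties": {"location": {"type": "string"}},
                    "required": ["location"],
                },
            },
        }
    ],
)

# If the guardrail blocked the tool call, the policy explanation appears
# here in place of the original tool call.
print(response.choices[0].message)
```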
## Configuration Reference
### YAML Config Parameters
These are set under `guardrails[].litellm_params` in your config.yaml:
| Parameter | Required | Description |
|---|---|---|
| `guardrail: rubrik` | Yes | Selects the Rubrik guardrail integration |
| `mode: "post_call"` | Yes | Run after the LLM response is received |
| `api_base` | Yes | Rubrik webhook base URL. Can use `os.environ/RUBRIK_WEBHOOK_URL`. Falls back to the `RUBRIK_WEBHOOK_URL` env var if omitted. |
| `api_key` | No | Rubrik API key. Can use `os.environ/RUBRIK_API_KEY`. Falls back to the `RUBRIK_API_KEY` env var if omitted. |
| `default_on` | No | When `true`, the guardrail runs on all requests without needing per-request opt-in |
### Environment Variables
These are optional fallbacks used when `api_base` / `api_key` are not set in the YAML config. `RUBRIK_SAMPLING_RATE` and `RUBRIK_BATCH_SIZE` can only be set via environment variables.
| Variable | Required | Default | Description |
|---|---|---|---|
| `RUBRIK_WEBHOOK_URL` | Only if `api_base` not in config | — | Base URL of the Rubrik webhook service |
| `RUBRIK_API_KEY` | No | — | Bearer token for authenticating with the Rubrik service |
| `RUBRIK_SAMPLING_RATE` | No | `1.0` | Fraction of requests to log (0.0 to 1.0). Does not affect tool blocking, which always runs. Set to `0.5` to log ~50% of requests. |
| `RUBRIK_BATCH_SIZE` | No | `512` | Number of log entries to buffer before flushing. Logs are also flushed on a periodic interval. |
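To make the sampling semantics concrete, here is a conceptual sketch (not LiteLLM's internal code): each request is independently logged with probability equal to the configured rate.

```python
# Conceptual illustration of probabilistic log sampling -- not LiteLLM's
# actual implementation.
import os
import random

sampling_rate = float(os.environ.get("RUBRIK_SAMPLING_RATE", "1.0"))

def should_log() -> bool:
    # With RUBRIK_SAMPLING_RATE=0.5, roughly half of all requests are logged.
    return random.random() < sampling_rate
```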
## How Tool Blocking Works
- After the LLM returns a response with tool calls, the Rubrik guardrail sends them to the blocking service at `{api_base}/v1/after_completion/openai/v1`.
- The service evaluates each tool call against configured policies and returns the set of allowed tool calls.
- If any tool calls are blocked, the proxy returns the policy violation explanation as a response instead of the original LLM response.
- If the blocking service is unreachable or returns an error, the guardrail fails open and the original response is returned unchanged.
### Request/Response Format
The guardrail sends a JSON envelope to the blocking service:
```json
{
  "request": {
    "messages": [...],
    "model": "gpt-4",
    "proxy_server_request": {...}
  },
  "response": {
    "id": "chatcmpl-...",
    "object": "chat.completion",
    "choices": [{
      "message": {
        "role": "assistant",
        "tool_calls": [...]
      }
    }]
  }
}
```
The service should return a response in OpenAI chat completion format containing only the allowed tool calls, plus an optional `content` field with the blocking explanation.
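For local experimentation, the sketch below shows a stand-in blocking service that accepts this envelope and responds in the expected shape. It assumes FastAPI, and the denylist policy (blocking `get_weather`) is purely illustrative; a real Rubrik deployment applies its own configured policies.

```python
# Mock blocking service for local testing (assumptions: FastAPI/uvicorn,
# and a toy denylist policy -- not the real Rubrik policy engine).
from fastapi import FastAPI, Request

app = FastAPI()
BLOCKED_TOOLS = {"get_weather"}  # illustrative denylist

@app.post("/v1/after_completion/openai/v1")
async def after_completion(request: Request):
    envelope = await request.json()
    message = envelope["response"]["choices"][0]["message"]
    tool_calls = message.get("tool_calls") or []

    allowed = [t for t in tool_calls if t["function"]["name"] not in BLOCKED_TOOLS]
    blocked = [t["function"]["name"] for t in tool_calls
               if t["function"]["name"] in BLOCKED_TOOLS]

    # Respond in OpenAI chat completion format: only the allowed tool calls,
    # plus an optional content field explaining any blocks.
    return {
        "id": envelope["response"].get("id", "mock-id"),
        "object": "chat.completion",
        "choices": [
            {
                "index": 0,
                "message": {
                    "role": "assistant",
                    "content": f"Blocked by policy: {', '.join(blocked)}" if blocked else None,
                    "tool_calls": allowed or None,
                },
                "finish_reason": "tool_calls" if allowed else "stop",
            }
        ],
    }
```

Run it with `uvicorn app:app --port 8080` and point the guardrail's `api_base` at `http://localhost:8080` to watch blocking behavior end to end.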
## How Batch Logging Works
All LLM requests (successes and failures) are queued and sent in batches to `{api_base}/v1/litellm/batch`.
- Logs are flushed when the queue reaches `RUBRIK_BATCH_SIZE` (default 512) or on a periodic interval (default 5 seconds). These defaults are inherited from LiteLLM's global settings.
- Use `RUBRIK_SAMPLING_RATE` to reduce logging volume in high-traffic deployments. Sampling only affects logging; tool blocking always runs regardless of the sampling rate.
- For Anthropic `/v1/messages` requests, the log ID is normalized to `litellm_call_id` for consistency across tool blocking and logging.
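To observe the batched log traffic locally, a stub like the one below can stand in for the batch endpoint (same FastAPI assumption as the mock above). The exact entry schema is not documented here, so the stub simply accepts whatever JSON arrives and reports its size.

```python
# Stub receiver for the batch logging endpoint (illustrative only; the
# real Rubrik service defines the actual payload schema).
from fastapi import FastAPI, Request

app = FastAPI()

@app.post("/v1/litellm/batch")
async def receive_batch(request: Request):
    payload = await request.json()
    # Assumption: a JSON array is the batch itself; otherwise count the
    # top-level object as a single entry.
    entries = payload if isinstance(payload, list) else [payload]
    print(f"received batch of {len(entries)} log entries")
    return {"status": "ok"}
```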