Leaky Bucket

Incoming requests fill the bucket. The bucket drains (leaks) at a constant rate over time. If the bucket is full when a new request arrives, the request is denied.

How It Works

Compute elapsed time since the last check.
Drain: subtract elapsed * rate from the water level (min 0).
If water_level < capacity, increment water level and allow.
Otherwise, deny.

flowchart TD
    Start[Request arrives] --> Drain["Subtract elapsed x rate from water level"]
    Drain --> Floor["Floor at 0"]
    Floor --> Check{"water_level < capacity?"}
    Check -->|Yes| Fill["Increment water level"] --> Allow["ALLOW"]
    Check -->|No| Deny["DENY"]

Parameters

Name	Type	Description
`capacity`	`int`	Maximum number of requests the bucket can hold
`rate`	`int`	Requests drained per second

Trade-offs

Pros:

Enforces a smooth, steady output rate regardless of input burstiness — ideal for protecting downstream services
O(1) time and memory per request

Cons:

Does not permit bursts at all; even legitimate traffic spikes are rejected once the bucket is full
A sustained burst fills the bucket instantly, and recovery depends entirely on the drain rate

Comparison

vs Token Bucket: Token Bucket allows bursts up to capacity (spend accumulated tokens at once). Leaky Bucket absorbs bursts and enforces a uniform drain rate. Choose Leaky Bucket when you need strict output smoothing; choose Token Bucket when bursts are acceptable.

View Source on GitHub