Rate limits are per key, not per user or per workspace. Each key has its own counter, so minting two keys in the same workspace gives each its own budget. The goal: don’t let one runaway script slow everyone else down.Documentation Index
Fetch the complete documentation index at: https://uselora.dev/llms.txt
Use this file to discover all available pages before exploring further.
Default policy
Every key starts with these defaults:| Setting | Value |
|---|---|
| Window | 1 minute |
| Max requests | 600 (10 rps sustained) |
| Refill | Counter resets when the window rolls |
When you hit the limit
You get429 Too Many Requests with a Retry-After header in seconds:
Choosing a key budget
A few guidelines:- For an interactive integration like an internal tool or one-user script, leave the default alone. You won’t hit 600/min under normal use.
- For background jobs and batch imports, pace the worker fleet so the combined rate stays under 600/min. If you need a higher cap, get in touch.
- For untrusted clients like browser extensions, public widgets, or embeds, mint a short-lived key per surface. Per-key isolation means a leaky client burns its own budget instead of starving the rest of the workspace.