Docs/Rate limits

Rate limits.

Sensible defaults that scale automatically as your usage grows. Read the rate-limit headers on every response to know exactly where you stand.

Defaults

TierRequests / secondConcurrent streamsCharacters / month
Free10510 000
Pay-as-you-go6030unlimited
Pro (auto-upgrade past $200 / month)200100unlimited
Enterprisenegotiatednegotiatedunlimited

Tiers move automatically as you accrue usage. The free tier never auto-converts to paid — you're never surprise-billed.

Headers

Every REST response carries:

HeaderValue
X-RateLimit-LimitPer-second cap currently applied.
X-RateLimit-RemainingRequests left in the current 1-second window.
X-RateLimit-ResetUnix seconds until the window resets.
Retry-AfterSeconds to wait before retrying. Sent only on 429.

Raising your limits

Most teams never hit the ceiling. If you do, email hello@leanvoice.ai with the expected RPS and concurrent-stream count; we lift the cap the same business day for any account in good standing.

Streaming sessions count differently

An open WebSocket counts as one concurrent stream regardless of how many input.text frames flow through it. Idle streams (no synthesis in progress) don't consume a slot.