Limits
| Limit | Default | Description |
|---|---|---|
| Requests per minute (RPM) | 5 | Maximum API calls to generation endpoints per minute |
| Concurrent generations | 10 | Maximum in-flight generation jobs at any time |
| Spending limit | $5,000 | Maximum spending per billing period (in dollars) |
Rate limits apply only to write endpoints (
POST /v1/switchx/generations).
Read endpoints (status polling, listing jobs) have a separate limit of 5
requests per minute per account.Spending Limit
Each account has aspending_limit (in dollars) that caps total spending per billing period. If a new job would push your current period usage over this limit, the API returns 402 with error code HARD_LIMIT_EXCEEDED.
The error message includes your current usage and the job cost so you can decide whether to wait for the next billing cycle or request a limit increase.