Skip to main content
We enforce rate limits to ensure fair usage and stable performance for all users. The rate limit specifies the maximum number of API requests that can be made within a specified time frame for a given workspace.

Default Rate Limit

Every account starts with the following default rate limits:
APIRate limit
TTS20 requests per second
LLM25 requests per second
Embedding50 requests per second
STT50 audio chunks per second
These limits are usually sufficient for hundreds of concurrent users in interactive use cases, such as real-time conversations, and for thousands of concurrent users in less interactive use cases.

Request a Rate Limit Increase

If you require a higher number of requests, we will be happy to increase your rate limit at no additional cost. Please follow the steps below and you can expect a response within 48 hours:
  1. In Inworld Portal, click on your profile icon in the top right corner and select Billing.
  2. Click Increase rate limit in the top right corner, and populate details about your request. This includes your expected usage increase and more details about why a rate limit increase is requested.
  3. Click Submit. Our team will review your request, and may reach out if there are any other questions.