Create chat completion

curl --location 'https://api.inworld.ai/llm/v1alpha/completions:completeChat' \ --header 'Authorization: Basic <your-api-key>' \ --header 'Content-Type: application/json' \ --data '{ "servingId": { "modelId": { "model": "gpt-4o-mini", "serviceProvider": "SERVICE_PROVIDER_OPENAI" }, "userId": "user-123" }, "messages": [ {"role": "MESSAGE_ROLE_USER", "content": "Hello, how are you?"} ], "textGenerationConfig": { "maxTokens": 100, "stream": false } }'

{ "result": { "id": "chatcmpl-D0KDzqXlvlNfWI00Ynj5eRFoI14Dw", "choices": [ { "finishReason": "FINISH_REASON_STOP", "message": { "content": "Hello! I'm just a program, so I don't have feelings, but I'm here and ready to help you. How can I assist you today?", "role": "MESSAGE_ROLE_ASSISTANT" } } ], "createTime": "2026-01-21T04:35:15Z", "model": "gpt-4o-mini", "usage": { "completionTokens": 29, "promptTokens": 13 }, "serviceProvider": "SERVICE_PROVIDER_OPENAI" } }

Authorizations

Authorization

string

header

required

Should follow the format Basic {credentials}. The {credentials} consists of the Base64-encoded string of the API key and the secret in the format key:secret

Body

application/json

Chat completion request.

servingId

object

required

Describes the serving ID of the request to select the right model.

Show child attributes

messages

(Text Content · object | Multi-modal Content · object)[]

required

A list of messages comprising the conversation so far.

Chat message.

Text Content
Multi-modal Content

Show child attributes

tools

object[]

A list of tools the model may call. Currently, only functions are supported as a tool. Use this to provide a list of functions the model may generate JSON inputs for. Only supported for OpenAI.

Show child attributes

toolChoice

object

Controls which (if any) function is called by the model. Only supported for OpenAI.

Show child attributes

textGenerationConfig

object

Configuration for chat completion generation.

Show child attributes

responseFormat

enum<string>

default:RESPONSE_FORMAT_UNSPECIFIED

Format that the model must output..

RESPONSE_FORMAT_UNSPECIFIED: Response format is not specified. Defaults to "text".
RESPONSE_FORMAT_TEXT: Text response format.
RESPONSE_FORMAT_JSON: Only supported when stream = False. JSON response format. This guarantees that the message the model generates is valid JSON. Note that your system prompt must still instruct the model to produce JSON, and to help ensure you don't forget, the API will throw an error if the string JSON does not appear in your system message. Also note that the message content may be partial (i.e. cut off) if finish_reason="length", which indicates the generation exceeded max_tokens or the conversation exceeded the max context length. Only supported for OpenAI.
RESPONSE_FORMAT_JSON_SCHEMA: JSON schema response format. It enables Structured Outputs which ensures the model will match your supplied JSON schema. Only supported for OpenAI.

Available options:

RESPONSE_FORMAT_UNSPECIFIED,

RESPONSE_FORMAT_TEXT,

RESPONSE_FORMAT_JSON,

RESPONSE_FORMAT_JSON_SCHEMA

requestTimeout

number<float>

Request timeout in seconds. This setting applies only to selected clients and configured by a separate request to Inworld. Make sure to configure these specific requests accordingly, as this timeout will not affect others.

jsonSchema

object

JSON schema configuration. Only supported for OpenAI.

Show child attributes

Response

A successful response.(streaming responses)

result

object

Chat completion response.

Show child attributes

error

object

Show child attributes

Overview

Text-to-Speech

Voices

Speech-to-Text

Realtime API

LLM

Router

Models

Embeddings

Authorizations

Body

Response