Version: v0.2.23

Generate a chat completion for the given messages using the specified model.

POST /v1/inference/chat-completion

Request

Responses

200
400
429
500
default

If stream=False, returns a ChatCompletionResponse with the full completion. If stream=True, returns an SSE event stream of ChatCompletionResponseStreamChunk.

Generate a chat completion for the given messages using the specified model.

/v1/inference/chat-completion

Request​

Responses​

Request

Responses