Skip to main content
Version: v0.2.23

Generate a chat completion for the given messages using the specified model.

POST 

/v1/inference/chat-completion

Generate a chat completion for the given messages using the specified model.

Request​

Responses​

If stream=False, returns a ChatCompletionResponse with the full completion. If stream=True, returns an SSE event stream of ChatCompletionResponseStreamChunk.