Generate a chat completion for the given messages using the specified model.
POST /v1/inference/chat-completion
Request
Responses
- 200: If stream=False, returns a ChatCompletionResponse with the full completion. If stream=True, returns an SSE event stream of ChatCompletionResponseStreamChunk.
- 400: The request was invalid or malformed.
- 429: The client has sent too many requests in a given amount of time.
- 500: The server encountered an unexpected error.
- default: An unexpected error occurred.
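The following is a minimal sketch of how a client might build a request for this endpoint, interpret the status codes listed above, and read the streamed variant. Several details are assumptions rather than part of this reference: the base URL and port, the body field names `model_id` and `messages`, and the idea that each SSE `data:` line carries one complete JSON object. The helper names (`build_request`, `describe_status`, `iter_sse_data`) are hypothetical.

```python
import json
import urllib.request
from typing import Iterable, Iterator

# Assumption: placeholder host and port; adjust to your deployment.
BASE_URL = "http://localhost:8321"
ENDPOINT = "/v1/inference/chat-completion"

# Descriptions copied from the response list in this reference.
STATUS_MEANINGS = {
    400: "The request was invalid or malformed",
    429: "The client has sent too many requests in a given amount of time",
    500: "The server encountered an unexpected error",
}


def describe_status(status: int) -> str:
    """Map an HTTP status code to its description; unknown codes use 'default'."""
    if status == 200:
        return "OK"
    return STATUS_MEANINGS.get(status, "An unexpected error occurred")


def build_request(model_id: str, messages: list, stream: bool = False) -> urllib.request.Request:
    """Build the POST request. The body field names are assumptions."""
    body = json.dumps(
        {"model_id": model_id, "messages": messages, "stream": stream}
    ).encode("utf-8")
    headers = {
        "Content-Type": "application/json",
        # Ask for an event stream only when streaming is requested.
        "Accept": "text/event-stream" if stream else "application/json",
    }
    return urllib.request.Request(
        BASE_URL + ENDPOINT, data=body, headers=headers, method="POST"
    )


def iter_sse_data(lines: Iterable[str]) -> Iterator[dict]:
    """Yield the JSON payload of each `data:` line.

    Simplification: assumes one JSON object per `data:` line and skips
    blank lines, comments, and other SSE fields.
    """
    for line in lines:
        line = line.strip()
        if line.startswith("data:"):
            payload = line[len("data:"):].strip()
            if payload:
                yield json.loads(payload)
```

To send the request, pass it to `urllib.request.urlopen`; a non-2xx response raises `urllib.error.HTTPError`, whose `code` attribute can be given to `describe_status`. With stream=True, the decoded response lines can be fed to `iter_sse_data` to obtain one dictionary per chunk.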