Batch inference API for generating completions and chat completions.
This is an asynchronous API. If the request is successful, the response will be a job which can be polled for completion.
NOTE: This API is not yet implemented and is subject to change in concert with other asynchronous APIs including (post-training, evals, etc).
📄️ Generate a chat completion for the given messages using the specified model.
Generate a chat completion for the given messages using the specified model.
📄️ Generate a completion for the given content using the specified model.
Generate a completion for the given content using the specified model.