Skip to main content
Version: v0.2.23

Batch inference API for generating completions and chat completions.

This is an asynchronous API. If the request is successful, the response will be a job which can be polled for completion.

NOTE: This API is not yet implemented and is subject to change in concert with other asynchronous APIs including (post-training, evals, etc).