Skip to main content
Version: v0.3.0

remote::runpod

Description​

RunPod inference provider for running models on RunPod's cloud GPU platform.

Configuration​

FieldTypeRequiredDefaultDescription
allowed_modelslist[str | NoneNoList of models that should be registered with the model registry. If None, all models are allowed.
refresh_models<class 'bool'>NoFalseWhether to refresh models periodically from the provider
api_tokenpydantic.types.SecretStr | NoneNoThe API token
urlstr | NoneNoThe URL for the Runpod model serving endpoint

Sample Configuration​

url: ${env.RUNPOD_URL:=}
api_token: ${env.RUNPOD_API_TOKEN}