remote::vertexai
Description
Google Vertex AI inference provider enables you to use Google's Gemini models through Google Cloud's Vertex AI platform, providing several advantages:
• Enterprise-grade security: Uses Google Cloud's security controls and IAM • Better integration: Seamless integration with other Google Cloud services • Advanced features: Access to additional Vertex AI features like model tuning and monitoring • Authentication: Uses Google Cloud Application Default Credentials (ADC) instead of API keys
Configuration:
- Set VERTEX_AI_PROJECT environment variable (required)
- Set VERTEX_AI_LOCATION environment variable (optional, defaults to global)
- Use Google Cloud Application Default Credentials or service account key
Authentication Setup: Option 1 (Recommended): gcloud auth application-default login Option 2: Set GOOGLE_APPLICATION_CREDENTIALS to service account key path
Available Models:
- vertex_ai/gemini-2.0-flash
- vertex_ai/gemini-2.5-flash
- vertex_ai/gemini-2.5-pro
Configuration
| Field | Type | Required | Default | Description |
|---|---|---|---|---|
allowed_models | list[str] | None | No | List of models that should be registered with the model registry. If None, all models are allowed. | |
refresh_models | bool | No | False | Whether to refresh models periodically from the provider |
network | NetworkConfig | None | No | Network configuration including TLS, proxy, and timeout settings. | |
network.tls | TLSConfig | None | No | TLS/SSL configuration for secure connections. | |
network.tls.verify | bool | Path | No | True | Whether to verify TLS certificates. Can be a boolean or a path to a CA certificate file. |
network.tls.min_version | Literal[TLSv1.2, TLSv1.3] | None | No | Minimum TLS version to use. Defaults to system default if not specified. | |
network.tls.ciphers | list[str] | None | No | List of allowed cipher suites (e.g., ['ECDHE+AESGCM', 'DHE+AESGCM']). | |
network.tls.client_cert | Path | None | No | Path to client certificate file for mTLS authentication. | |
network.tls.client_key | Path | None | No | Path to client private key file for mTLS authentication. | |
network.proxy | ProxyConfig | None | No | Proxy configuration for HTTP connections. | |
network.proxy.url | HttpUrl | None | No | Single proxy URL for all connections (e.g., 'http://proxy.example.com:8080'). | |
network.proxy.http | HttpUrl | None | No | Proxy URL for HTTP connections. | |
network.proxy.https | HttpUrl | None | No | Proxy URL for HTTPS connections. | |
network.proxy.cacert | Path | None | No | Path to CA certificate file for verifying the proxy's certificate. Required for proxies in interception mode. | |
network.proxy.no_proxy | list[str] | None | No | List of hosts that should bypass the proxy (e.g., ['localhost', '127.0.0.1', '.internal.corp']). | |
network.timeout | float | TimeoutConfig | None | No | Timeout configuration. Can be a float (for both connect and read) or a TimeoutConfig object with separate connect and read timeouts. | |
network.timeout.connect | float | None | No | Connection timeout in seconds. | |
network.timeout.read | float | None | No | Read timeout in seconds. | |
network.headers | dict[str, str] | None | No | Additional HTTP headers to include in all requests. | |
project | str | No | Google Cloud project ID for Vertex AI | |
location | str | No | global | Google Cloud location for Vertex AI |
Sample Configuration
project: ${env.VERTEX_AI_PROJECT:=}
location: ${env.VERTEX_AI_LOCATION:=global}