This guide is for enterprise customers who want to connect to GCP Vertex AI privately from their GCP Runner. If you’re using Ona Intelligence, this configuration is not required.
AI Model Support
The GCP Runner currently supports the following AI model providers:- Anthropic - Claude models via Anthropic API
- Google Vertex AI - Gemini and other models via Vertex AI API
- Ona Intelligence - All supported models through Ona’s managed AI service (recommended)
When using Ona Intelligence, you don’t need to configure private connectivity to Vertex AI or Anthropic. Ona Intelligence provides a unified interface to multiple AI providers with built-in security and compliance features.
Overview
Private Service Connect (PSC) endpoints allow you to access Google Vertex AI services from your GCP Runner without traffic leaving your VPC network. This provides enhanced security and compliance for AI workloads running in your development environments.Benefits
- Private connectivity: All traffic to Vertex AI stays within your VPC network
- No external IPs: Access Vertex AI using internal IP addresses
- Network isolation: Traffic never traverses the public internet
- Compliance: Meet regulatory requirements for data residency and network isolation
Architecture
When you configure Private Service Connect endpoints for Vertex AI:- A consumer endpoint is created in your VPC network
- The endpoint uses an internal IP address from your subnet
- Traffic from your GCP Runner to Vertex AI is routed through the PSC endpoint
- Google performs network address translation (NAT) to route requests to Vertex AI services
Prerequisites
Before configuring Private Service Connect endpoints for Vertex AI, ensure you have:- GCP Runner deployed - Follow the Setup guide to deploy your runner
- Vertex AI API enabled in your GCP project
- Subnet with available IPs - You’ll need IP addresses for PSC endpoints
- IAM permissions to create forwarding rules and manage Private Service Connect
Configuration Methods
You can configure Private Service Connect endpoints for Vertex AI using either the Google Cloud Console (UI) or the gcloud CLI.Method 1: Using Google Cloud Console (Recommended)
Step 1: Navigate to Private Service Connect
- Open the Google Cloud Console
- Navigate to Network services → Private Service Connect
- Click on the Connected endpoints tab
- Click Connect endpoint
Step 2: Configure the Endpoint
- Target: Select Published service
- 
Target service: Enter the Vertex AI service attachment for your region:
ReplaceREGIONwith your GCP Runner’s region (e.g.,us-central1)
- 
Endpoint name: Provide a descriptive name (e.g., vertex-ai-psc-endpoint)
- Network: Select your VPC network (the same network where your GCP Runner is deployed)
- 
Subnetwork: Select the subnet where you want to allocate the endpoint IP
- This should be the same subnet as your runner or a routable subnet
- Ensure the subnet has available IP addresses
 
- 
IP address: Choose how to assign the IP:
- Automatic: Let Google Cloud assign an available IP
- Manual: Specify a specific IP address from the subnet range
 
- Enable global access (optional): Enable if you need to access the endpoint from other regions
- Click Add endpoint
Step 3: Verify the Endpoint
- Wait for the endpoint to be created (typically 2-5 minutes)
- Note the IP address assigned to the endpoint
- Verify the status shows as Accepted
Method 2: Using gcloud CLI
Step 1: Set Environment Variables
Step 2: Create the PSC Endpoint
Step 3: Get the Endpoint IP Address
DNS Configuration
To use the Private Service Connect endpoint, you need to configure DNS to resolve Vertex AI API hostnames to your PSC endpoint IP address.Option 1: Cloud DNS Private Zone (Recommended)
Create a Cloud DNS private zone to override Vertex AI API DNS resolution:PSC_ENDPOINT_IP with the IP address from the previous step.
Option 2: Custom DNS Server
If you’re using a custom DNS server in your VPC:- Add an A record for REGION-aiplatform.googleapis.compointing to your PSC endpoint IP
- Ensure your GCP Runner VMs use this DNS server
Firewall Configuration
Ensure your VPC firewall rules allow traffic from your GCP Runner subnet to the PSC endpoint:PSC_ENDPOINT_IP with your endpoint IP address.
Verification
Test Connectivity
From a development environment or runner VM, test connectivity to Vertex AI through the PSC endpoint:Verify Traffic Routing
Check that traffic is routing through the PSC endpoint:Regional Endpoints
Vertex AI service attachments are regional. You must create a PSC endpoint in the same region as your GCP Runner:| Region | Service Attachment | 
|---|---|
| us-central1 | projects/cloud-ai-platform-public/regions/us-central1/serviceAttachments/vertex-ai-api | 
| us-east1 | projects/cloud-ai-platform-public/regions/us-east1/serviceAttachments/vertex-ai-api | 
| us-west1 | projects/cloud-ai-platform-public/regions/us-west1/serviceAttachments/vertex-ai-api | 
| europe-west1 | projects/cloud-ai-platform-public/regions/europe-west1/serviceAttachments/vertex-ai-api | 
| europe-west4 | projects/cloud-ai-platform-public/regions/europe-west4/serviceAttachments/vertex-ai-api | 
| asia-southeast1 | projects/cloud-ai-platform-public/regions/asia-southeast1/serviceAttachments/vertex-ai-api | 
projects/cloud-ai-platform-public/regions/REGION/serviceAttachments/vertex-ai-api
Cost Considerations
Private Service Connect endpoints incur charges:- PSC endpoint: ~$0.01 per hour per endpoint
- Data processing: ~$0.01 per GB processed through the endpoint
Next Steps
- Configure Anthropic API access - Set up private connectivity to Anthropic
- Ona Intelligence - Learn about Ona’s managed AI service
- Troubleshooting Guide - Comprehensive troubleshooting for GCP Runner