The Vertex API for accessing Claude is nearly-identical to the Messages API and supports all of the same options, with two key differences:

  • In Vertex, model is not passed in the request body. Instead, it is specified in the Google Cloud endpoint URL.
  • In Vertex, anthropic_version is passed in the request body (rather than as a header), and must be set to the value vertex-2023-10-16.

Vertex is also supported by Anthropic’s official client SDKs. This guide will walk you through the process of making a request to Claude on Vertex AI in either Python or TypeScript.

Note that this guide assumes you have already have a GCP project that is able to use Vertex AI. See using the Claude 3 models from Anthropic for more information on the setup required, as well as a full walkthrough.

Install an SDK for accessing Vertex AI

First, install Anthropic’s client SDK for your language of choice.

pip install -U google-cloud-aiplatform "anthropic[vertex]"

Accessing Vertex AI

Model Availability

Note that Anthropic model availability varies by region. Search for “Claude” in the Vertex AI Model Garden or go to Use Claude 3 for the latest information.

API model names

ModelVertex AI API model name
Claude Opus 4claude-opus-4@20250514
Claude Sonnet 4claude-sonnet-4@20250514
Claude Sonnet 3.7claude-3-7-sonnet@20250219
Claude Haiku 3.5claude-3-5-haiku@20241022
Claude Sonnet 3.5claude-3-5-sonnet-v2@20241022
Claude Opus 3 (Public Preview)claude-3-opus@20240229
Claude Sonnet 3claude-3-sonnet@20240229
Claude Haiku 3claude-3-haiku@20240307

Making requests

Before running requests you may need to run gcloud auth application-default login to authenticate with GCP.

The following examples shows how to generate text from Claude on Vertex AI:

from anthropic import AnthropicVertex

project_id = "MY_PROJECT_ID"
# Where the model is running
region = "us-east5"

client = AnthropicVertex(project_id=project_id, region=region)

message = client.messages.create(
    model="claude-opus-4@20250514",
    max_tokens=100,
    messages=[
        {
            "role": "user",
            "content": "Hey Claude!",
        }
    ],
)
print(message)

See our client SDKs and the official Vertex AI docs for more details.

Activity logging

Vertex provides a request-response logging service that allows customers to log the prompts and completions associated with your usage.

Anthropic recommends that you log your activity on at least a 30-day rolling basis in order to understand your activity and investigate any potential misuse.

Turning on this service does not give Google or Anthropic any access to your content.

Feature support

You can find all the features currently supported on Vertex here.