Zohreh Norouz
Kia ora!
Customers in New Zealand have been asking for access to foundation models (FMs) on Amazon Bedrock from their local AWS Region.
Today, we’re excited to announce that Amazon Bedrock is now available in the Asia Pacific (New Zealand) Region (ap-southeast-6). Customers in New Zealand can now access Anthropic Claude models (Claude Opus 4.5, Opus 4.6, Sonnet 4.5, Sonnet 4.6, and Haiku 4.5) and Amazon (Nova 2 Lite) models directly in the Auckland Region with cross region inference.
In this post, we explore how cross-Region inference works from the New Zealand Region, the models available through geographic and global routing, and how to get started with your first API call. We cover three key areas:
Cross-Region inference is an Amazon Bedrock capability that distributes inference processing across multiple AWS Regions to help you achieve higher throughput at scale.
When you invoke a cross-Region inference profile, Amazon Bedrock routes your request from the source Region (where you initiate the API call) to a destination Region (where inference processing occurs). All data transmitted during cross-Region operations remains on the AWS network and doesn’t traverse the public internet, and data is encrypted in transit between AWS Regions. All cross-Region inference requests are logged in AWS CloudTrail in your source Region. If you configure model invocation logging, logs are published to Amazon CloudWatch Logs or Amazon Simple Storage Service (Amazon S3) in the same Region.
Amazon Bedrock provides two types of cross-Region inference profiles:
With this launch, Auckland (ap-southeast-6) becomes a new source Region for both AU geographic and global cross-Region inference on Amazon Bedrock. This means that you can now make Amazon Bedrock API calls from the New Zealand Region, and cross-Region inference routes your requests to destination Regions where the FMs process inference.
The AU cross-Region profile now spans three Regions across Australia and New Zealand. The following table details the source and destination Region routing.
| Source Region | Destination Regions | Description |
Auckland (ap-southeast-6) | ap-southeast-6, ap-southeast-2, ap-southeast-4 | New – Requests from Auckland can be routed to Sydney, Melbourne, or Auckland |
Sydney (ap-southeast-2) | ap-southeast-2, ap-southeast-4 | Requests from Sydney can be routed to Sydney or Melbourne |
Melbourne (ap-southeast-4) | ap-southeast-2, ap-southeast-4 | Requests from Melbourne can be routed to Sydney or Melbourne |
There are two important details to note:
For organizations without strict data residency requirements, global cross-Region inference from the Auckland Region provides access to inference capacity across all supported AWS commercial Regions worldwide. Global cross-Region inference delivers two key advantages:
Cross-Region inference from the New Zealand Region supports foundation models from multiple providers across both AU geographic and global cross-Region inference profiles. The following table shows examples of the latest models available at launch.
| Cross-Region inference type | Example models |
| AU geographic cross-Region inference | Anthropic Claude Opus 4.6, Claude Sonnet 4.6, Claude Sonnet 4.5, Claude Haiku 4.5 |
| Global cross-Region inference | Anthropic Claude Opus 4.6, Claude Sonnet 4.6, Claude Opus 4.5, Claude Sonnet 4.5, Claude Haiku 4.5 |
AU geographic cross-Region inference currently supports Anthropic Claude models, keeping inference processing within the ANZ geography. Global cross-Region inference provides access to a broader set of foundation models from multiple providers. To use a cross-Region inference profile, replace the foundational model ID with the geographic (au.) or global (global.) prefix — for example, anthropic.claude-sonnet-4-6 becomes au.anthropic.claude-sonnet-4-6 or global.anthropic.claude-sonnet-4-6.
For the complete and up-to-date list of supported models and inference profile IDs, refer to Supported Regions and models for inference profiles.
Cross-Region inference profiles work with the InvokeModel, InvokeModelWithResponseStream, Converse, and ConverseStream APIs. The Converse API provides a consistent request and response format across different foundation models, making it straightforward to switch between models without rewriting integration code.
To invoke foundation models through AU geographic cross-Region inference from the Auckland Region, your AWS Identity and Access Management (IAM) policy needs two statements:
The following IAM policy example grants access to invoke Anthropic Claude Sonnet 4.6 through AU geographic cross-Region inference from Auckland. Replace <ACCOUNT_ID> with your AWS account ID.
The first statement allows invoking the AU inference profile from the Auckland source Region. The second statement allows the FM to be invoked in the three destination Regions, but only when the request is routed through the AU inference profile. This follows the principle of least privilege by preventing direct model invocation in those Regions.
The same two-statement pattern applies to any model in the AU cross-Region inference profile—replace the model ID in the resource ARNs. For global cross-Region inference IAM policies, service control policies (SCP) configurations, and advanced security patterns, refer to Securing Amazon Bedrock cross-Region inference: Geographic and global.
Cross-Region inference is designed with security at its core. All requests travel exclusively over the AWS Global Network with end-to-end encryption, and your data at rest remains in the source Region.
For organizations using SCPs to restrict access to specific AWS Regions, note the following when calling from the Auckland source Region (ap-southeast-6):
ap-southeast-2, ap-southeast-4, and ap-southeast-6 for Amazon Bedrock actions in your SCPs, because Auckland’s AU profile routes to all three ANZ Regions.The following example SCP restricts services to the Auckland Region, with exceptions for Amazon Bedrock and global services like IAM. It limits Amazon Bedrock to the three ANZ Regions, and requires that Amazon Bedrock access in Sydney and Melbourne go through cross-Region inference profiles rather than direct model invocation:
In the previous policy:
For detailed SCP configuration examples, global cross-Region inference IAM policies, disabling specific cross-Region inference types, and AWS Control Tower integration guidance, refer to Securing Amazon Bedrock cross-Region inference: Geographic and global.
AWS CloudTrail logs all cross-Region inference calls in the source Region. The additionalEventData.inferenceRegion field records where each request was processed, so you can audit exactly where inference occurred:
For real-time operational monitoring, Amazon CloudWatch provides metrics for cross-Region inference requests in your source Region. Key metrics include:
Amazon Bedrock service quotas are managed at the source Region level. Quota increases requested from the Auckland Region (ap-southeast-6) apply only to requests originating from Auckland.
Quotas are measured in two dimensions:
When calculating your required quota, account for the token burndown rate. For Anthropic Claude Opus 4.6, Sonnet 4.6, and Sonnet 4.5, output tokens consume five times more quota than input tokens (5:1 burndown rate). For Claude Haiku 4.5 and Amazon Nova models, the burndown rate is 1:1.
Quota consumption formula:
Quota consumption = Input tokens + Cache write tokens + (Output tokens x Burndown rate)
To request quota increases, navigate to the AWS Service Quotas console in your source Region, select Amazon Bedrock, and search for the relevant cross-Region inference quota for your model.
In this post, we introduced cross-Region inference support from the New Zealand Region on Amazon Bedrock. Customers in New Zealand can now make API calls from Auckland and access foundation models through geographic and global cross-Region inference profiles.Key takeaways:
You can get started with cross-Region inference from the New Zealand Region with the following steps:
ap-southeast-6).For more information, refer to:
I created a completely local Ethot online as an experiment. I dream of a world…
Traditional databases answer a well-defined question: does the record matching these criteria exist?
Despite their output being ultimately consumed by human viewers, 3D Gaussian Splatting (3DGS) methods often…
How we built lightweight, real-time map collaboration for teams operating at the edge.About This SeriesFrontend engineering at…
AI has made it easier than ever for student developers to work efficiently, tackle harder…
A judge temporarily blocked the Trump administration’s designation, clearing the way for Anthropic to keep…