Purchase a Provisioned Throughput for an Amazon Bedrock model - Amazon Bedrock

Purchase a Provisioned Throughput for an Amazon Bedrock model

When you purchase a Provisioned Throughput for a model, you specify the level of commitment for it and the number of model units (MUs) to allot. For MU quotas, see Provisioned Throughput quotas. The number of MUs that you can allot to your Provisioned Throughputs depends on the commitment term for the Provisioned Throughput:

  • By default, your account provides you with 2 MUs to distribute between Provisioned Throughputs with no commitment.

  • If you're purchasing a Provisioned Throughput with commitment, you must first visit the AWS support center to request MUs for your account to distribute between Provisioned Throughputs with commitment. After your request is granted, you can purchase a Provisioned Throughput with commitment.

Note

After you purchase the Provisioned Throughput, you can change the associated model only if the Provisioned Throughput is for a custom model. In that case, you can change the associated model to one of the following:

  • The base model that it's customized from.

  • Another custom model that's derived from the same base model.

To learn how to purchase Provisioned Throughput for a model, select the tab corresponding to your method of choice and follow the steps.

Console
  1. Sign in to the AWS Management Console, and open the Amazon Bedrock console at https://console.aws.amazon.com/bedrock/.

  2. Select Provisioned Throughput under Assessment and deployment from the left navigation pane.

  3. In the Provisioned Throughput section, choose Purchase Provisioned Throughput.

  4. For the Provisioned Throughput details section, do the following:

    1. In the Provisioned Throughput name field, enter a name for the Provisioned Throughput.

    2. Under Select model, select a base model provider or a custom model category. Then select the model for which to provision throughput.

      Note

      To see the base models for which you can purchase Provisioned Throughput without commitment, see Supported region and models for Provisioned Throughput.

      In the AWS GovCloud (US) region, you can only purchase Provisioned Throughput for custom models with no commitment.

    3. (Optional) To associate tags with your Provisioned Throughput, expand the Tags section and choose Add new tag. For more information, see Tag resources.

  5. For the Commitment term & model units section, do the following:

    1. In the Select commitment term section, select the amount of time for which you want to commit to using the Provisioned Throughput.

    2. In the Model units field, enter the desired number of model units (MUs). If you are provisioning a model with commitment, you must first visit the AWS support center to request an increase in the number of MUs that you can purchase.

  6. Under Estimated purchase summary, review the estimated cost.

  7. Choose Purchase Provisioned Throughput.

  8. Review the note that appears and acknowledge the commitment duration and price by selecting the checkbox. Then choose Confirm purchase.

  9. The console displays the Provisioned Throughput overview page. The Status of the Provisioned Throughput in the Provisioned Throughput table becomes Creating. When creation finishes, the Status becomes In service. If creation fails, the Status becomes Failed.

API

To purchase a Provisioned Throughput, send a CreateProvisionedModelThroughput request to an Amazon Bedrock control plane endpoint. For request and response formats and field details, see the CreateProvisionedModelThroughput API reference.

Note

To see the base models for which you can purchase Provisioned Throughput without commitment, see Supported region and models for Provisioned Throughput.

In the AWS GovCloud (US) region, you can only purchase Provisioned Throughput for custom models with no commitment.

The following table briefly describes the parameters and request body (for detailed information and the request structure, see the CreateProvisionedModelThroughput request syntax):

Field                | Required? | Use case
---------------------|-----------|---------------------------------------------
modelId              | Yes       | To specify the model for which to purchase Provisioned Throughput: the ID or ARN of a base model, or the name or ARN of a custom model
modelUnits           | Yes       | To specify the number of model units (MUs) to purchase. To increase the number of MUs that you can purchase, request an increase through the AWS support center
provisionedModelName | Yes       | To specify a name for the Provisioned Throughput
commitmentDuration   | No        | To specify the duration for which to commit to the Provisioned Throughput. Omit this field to opt for no-commitment pricing
tags                 | No        | To associate tags with your Provisioned Throughput
clientRequestToken   | No        | To prevent a retried request from being processed more than once
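
The fields in the table can be assembled into a request as in the following minimal sketch. It assumes a boto3-style Amazon Bedrock control plane client (for example, one created with boto3.client("bedrock")) that exposes a create_provisioned_model_throughput method. The helper names build_purchase_request and purchase_provisioned_throughput are hypothetical, and the accepted commitmentDuration values (OneMonth, SixMonths) and the tag shape are assumptions to verify against the CreateProvisionedModelThroughput request syntax.

```python
import uuid
from typing import Any, Dict, Optional


def build_purchase_request(
    name: str,
    model_id: str,
    model_units: int,
    commitment_duration: Optional[str] = None,
    tags: Optional[Dict[str, str]] = None,
    client_request_token: Optional[str] = None,
) -> Dict[str, Any]:
    """Assemble keyword arguments for CreateProvisionedModelThroughput."""
    if model_units < 1:
        raise ValueError("model_units must be a positive integer")
    # Assumed set of commitment terms; None means no commitment.
    if commitment_duration not in (None, "OneMonth", "SixMonths"):
        raise ValueError("commitment_duration must be OneMonth, SixMonths, or None")

    request: Dict[str, Any] = {
        "provisionedModelName": name,
        "modelId": model_id,
        "modelUnits": model_units,
        # A token lets a retried request be recognized as the same purchase.
        "clientRequestToken": client_request_token or str(uuid.uuid4()),
    }
    # Omitting commitmentDuration opts for no-commitment pricing.
    if commitment_duration is not None:
        request["commitmentDuration"] = commitment_duration
    if tags:
        # Assumed tag shape: a list of {"key": ..., "value": ...} objects.
        request["tags"] = [{"key": k, "value": v} for k, v in tags.items()]
    return request


def purchase_provisioned_throughput(client: Any, **kwargs: Any) -> str:
    """Send the request and return the provisionedModelArn from the response."""
    request = build_purchase_request(**kwargs)
    response = client.create_provisioned_model_throughput(**request)
    return response["provisionedModelArn"]
```

Passing the client in as an argument keeps the request-building logic free of network calls, so the field mapping can be checked before any purchase is made.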

The response returns a provisionedModelArn that you can use as the modelId in model inference requests. To check whether the Provisioned Throughput is ready for use, send a GetProvisionedModelThroughput request and check that the status is InService. If creation fails, the status is Failed, and the GetProvisionedModelThroughput response contains a failureMessage.
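
The status check described above can be sketched as a polling loop. This is an illustration under assumptions, not a prescribed approach: it assumes a boto3-style client with a get_provisioned_model_throughput method that accepts provisionedModelId and returns status and, on failure, failureMessage. The function name wait_until_in_service, the polling interval, and the poll limit are hypothetical choices.

```python
import time
from typing import Any, Callable, Dict


def wait_until_in_service(
    client: Any,
    provisioned_model_id: str,
    poll_seconds: float = 60.0,
    max_polls: int = 60,
    sleep: Callable[[float], None] = time.sleep,
) -> Dict[str, Any]:
    """Poll GetProvisionedModelThroughput until the status is InService.

    Raises RuntimeError if the status becomes Failed, and TimeoutError if
    the status is still not InService after max_polls checks.
    """
    for _ in range(max_polls):
        response = client.get_provisioned_model_throughput(
            provisionedModelId=provisioned_model_id
        )
        status = response["status"]
        if status == "InService":
            return response
        if status == "Failed":
            # failureMessage explains why creation did not succeed.
            raise RuntimeError(response.get("failureMessage", "no failureMessage returned"))
        # Any other status (for example, Creating): wait and check again.
        sleep(poll_seconds)
    raise TimeoutError(f"{provisioned_model_id} was not InService after {max_polls} checks")
```

The sleep function is injectable so that the loop can be exercised without real delays. In practice the interval should be generous, since provisioning is not instantaneous.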
