Skip to content

/AWS1/CL_SGMINFERENCECOMPONE11

Specifies the type and size of the endpoint capacity to activate for a rolling deployment or a rollback strategy. You can specify your batches as either of the following:

  • A count of inference component copies

  • The overall percentage or your fleet

For a rollback strategy, if you don't specify the fields in this object, or if you set the Value parameter to 100%, then SageMaker AI uses a blue/green rollback strategy and rolls all traffic back to the blue fleet.

CONSTRUCTOR

IMPORTING

Required arguments:

iv_type TYPE /AWS1/SGMINFERENCECOMPONENTC01 /AWS1/SGMINFERENCECOMPONENTC01

Specifies the endpoint capacity type.

COPY_COUNT

The endpoint activates based on the number of inference component copies.

CAPACITY_PERCENT

The endpoint activates based on the specified percentage of capacity.

iv_value TYPE /AWS1/SGMCAPACITYSIZEVALUE /AWS1/SGMCAPACITYSIZEVALUE

Defines the capacity size, either as a number of inference component copies or a capacity percentage.


Queryable Attributes

Type

Specifies the endpoint capacity type.

COPY_COUNT

The endpoint activates based on the number of inference component copies.

CAPACITY_PERCENT

The endpoint activates based on the specified percentage of capacity.

Accessible with the following methods

Method Description
GET_TYPE() Getter for TYPE, with configurable default
ASK_TYPE() Getter for TYPE w/ exceptions if field has no value
HAS_TYPE() Determine if TYPE has a value

Value

Defines the capacity size, either as a number of inference component copies or a capacity percentage.

Accessible with the following methods

Method Description
GET_VALUE() Getter for VALUE, with configurable default
ASK_VALUE() Getter for VALUE w/ exceptions if field has no value
HAS_VALUE() Determine if VALUE has a value