DescribeInferenceComponent

Returns information about an inference component.

Request Syntax

{ "InferenceComponentName": "string" }

Request Parameters

For information about the parameters that are common to all actions, see Common Parameters.

The request accepts the following data in JSON format.

InferenceComponentName

The name of the inference component.

Type: String

Length Constraints: Maximum length of 63.

Pattern: ^[a-zA-Z0-9]([\-a-zA-Z0-9]*[a-zA-Z0-9])?$

Required: Yes
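
The constraints above can be checked on the client side before a request is sent. The following is a minimal sketch, not part of the API itself: `build_request` and `describe` are hypothetical helper names, and `client` is assumed to be a boto3-style SageMaker client that exposes `describe_inference_component`.

```python
import re

# Constraints copied from the InferenceComponentName parameter description.
NAME_PATTERN = re.compile(r"[a-zA-Z0-9]([\-a-zA-Z0-9]*[a-zA-Z0-9])?")
MAX_NAME_LENGTH = 63


def build_request(name: str) -> dict:
    """Return the request body, or raise ValueError if the name is invalid."""
    if len(name) > MAX_NAME_LENGTH:
        raise ValueError(f"name exceeds {MAX_NAME_LENGTH} characters")
    # fullmatch plays the role of the ^...$ anchors in the documented pattern.
    if NAME_PATTERN.fullmatch(name) is None:
        raise ValueError(f"name does not match the documented pattern: {name!r}")
    return {"InferenceComponentName": name}


def describe(client, name: str) -> dict:
    """Call DescribeInferenceComponent through a boto3-style client.

    The client is passed in (an assumption of this sketch) so the helper
    has no hard dependency on boto3 or on AWS credentials.
    """
    return client.describe_inference_component(**build_request(name))
```

With boto3 installed and credentials configured, a call might look like `describe(boto3.client("sagemaker"), "my-component")`, where `my-component` is a placeholder name.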

Response Syntax

{
   "CreationTime": number,
   "EndpointArn": "string",
   "EndpointName": "string",
   "FailureReason": "string",
   "InferenceComponentArn": "string",
   "InferenceComponentName": "string",
   "InferenceComponentStatus": "string",
   "LastDeploymentConfig": {
      "AutoRollbackConfiguration": {
         "Alarms": [
            {
               "AlarmName": "string"
            }
         ]
      },
      "RollingUpdatePolicy": {
         "MaximumBatchSize": {
            "Type": "string",
            "Value": number
         },
         "MaximumExecutionTimeoutInSeconds": number,
         "RollbackMaximumBatchSize": {
            "Type": "string",
            "Value": number
         },
         "WaitIntervalInSeconds": number
      }
   },
   "LastModifiedTime": number,
   "RuntimeConfig": {
      "CurrentCopyCount": number,
      "DesiredCopyCount": number
   },
   "Specification": {
      "BaseInferenceComponentName": "string",
      "ComputeResourceRequirements": {
         "MaxMemoryRequiredInMb": number,
         "MinMemoryRequiredInMb": number,
         "NumberOfAcceleratorDevicesRequired": number,
         "NumberOfCpuCoresRequired": number
      },
      "Container": {
         "ArtifactUrl": "string",
         "DeployedImage": {
            "ResolutionTime": number,
            "ResolvedImage": "string",
            "SpecifiedImage": "string"
         },
         "Environment": {
            "string" : "string"
         }
      },
      "ModelName": "string",
      "StartupParameters": {
         "ContainerStartupHealthCheckTimeoutInSeconds": number,
         "ModelDataDownloadTimeoutInSeconds": number
      }
   },
   "VariantName": "string"
}

Response Elements

If the action is successful, the service sends back an HTTP 200 response.

The following data is returned in JSON format by the service.

CreationTime

The time when the inference component was created.

Type: Timestamp

EndpointArn

The Amazon Resource Name (ARN) of the endpoint that hosts the inference component.

Type: String

Length Constraints: Minimum length of 20. Maximum length of 2048.

Pattern: arn:aws[a-z\-]*:sagemaker:[a-z0-9\-]*:[0-9]{12}:endpoint/.*
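
The ARN pattern above can be used to pull the region, account ID, and endpoint name out of `EndpointArn`. The following is a minimal sketch: the named capture groups are an addition to the documented pattern, and `parse_endpoint_arn` is a hypothetical helper name.

```python
import re

# The documented pattern with named groups added for illustration.
ENDPOINT_ARN = re.compile(
    r"arn:(?P<partition>aws[a-z\-]*):sagemaker:(?P<region>[a-z0-9\-]*):"
    r"(?P<account>[0-9]{12}):endpoint/(?P<endpoint>.*)"
)


def parse_endpoint_arn(arn: str) -> dict:
    """Split an endpoint ARN into its parts, or raise ValueError."""
    # Length limits copied from the EndpointArn description.
    if not 20 <= len(arn) <= 2048:
        raise ValueError("ARN length must be between 20 and 2048 characters")
    # The documented pattern is unanchored; fullmatch is slightly stricter.
    match = ENDPOINT_ARN.fullmatch(arn)
    if match is None:
        raise ValueError(f"not a SageMaker endpoint ARN: {arn!r}")
    return match.groupdict()
```

For example, `parse_endpoint_arn("arn:aws:sagemaker:us-east-1:123456789012:endpoint/my-endpoint")` returns a dictionary with the keys `partition`, `region`, `account`, and `endpoint`; the ARN shown is a made-up value.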

EndpointName

The name of the endpoint that hosts the inference component.

Type: String

Length Constraints: Maximum length of 63.

Pattern: ^[a-zA-Z0-9](-*[a-zA-Z0-9]){0,62}

FailureReason

If the inference component status is Failed, the reason for the failure.

Type: String

Length Constraints: Maximum length of 1024.

InferenceComponentArn

The Amazon Resource Name (ARN) of the inference component.

Type: String

Length Constraints: Minimum length of 20. Maximum length of 2048.

InferenceComponentName

The name of the inference component.

Type: String

Length Constraints: Maximum length of 63.

Pattern: ^[a-zA-Z0-9]([\-a-zA-Z0-9]*[a-zA-Z0-9])?$

InferenceComponentStatus

The status of the inference component.

Type: String

Valid Values: InService | Creating | Updating | Failed | Deleting
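
A caller that creates or updates an inference component will often poll this action until the status settles. The following is a minimal sketch under stated assumptions: it treats `Creating` and `Updating` as transitional and every other documented value as final, `wait_for_stable_status` is a hypothetical helper name, and `describe_fn` stands for any callable that takes a name and returns the response as a dictionary.

```python
import time

# Assumption: these two documented values mean work is still in progress.
TRANSITIONAL = {"Creating", "Updating"}


def wait_for_stable_status(describe_fn, name, poll_seconds=30.0,
                           max_attempts=40, sleep=time.sleep):
    """Poll describe_fn(name) until the status is no longer transitional.

    Returns the last response. Raises RuntimeError when the status is
    Failed (including FailureReason if present) and TimeoutError when
    max_attempts polls all report a transitional status.
    """
    for attempt in range(max_attempts):
        response = describe_fn(name)
        status = response["InferenceComponentStatus"]
        if status == "Failed":
            reason = response.get("FailureReason", "no FailureReason returned")
            raise RuntimeError(f"inference component {name} failed: {reason}")
        if status not in TRANSITIONAL:
            return response
        # Skip the sleep after the final attempt.
        if attempt < max_attempts - 1:
            sleep(poll_seconds)
    raise TimeoutError(f"{name} still transitional after {max_attempts} polls")
```

The `sleep` parameter exists only so the loop can be exercised without real delays; the default interval and attempt count are arbitrary choices, not service recommendations.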

LastDeploymentConfig

The deployment and rollback settings that you assigned to the inference component.

Type: InferenceComponentDeploymentConfig object

LastModifiedTime

The time when the inference component was last updated.

Type: Timestamp

RuntimeConfig

Details about the runtime settings for the model that is deployed with the inference component.

Type: InferenceComponentRuntimeConfigSummary object
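
The response syntax shows two fields under `RuntimeConfig`: `CurrentCopyCount` and `DesiredCopyCount`. The following is a minimal sketch that compares them. It assumes, from the field names alone, that a positive difference means copies are still being started; `copy_count_gap` is a hypothetical helper name.

```python
def copy_count_gap(response: dict) -> int:
    """Return DesiredCopyCount minus CurrentCopyCount from a response.

    Missing fields are treated as 0, which is a choice made for this
    sketch and not documented service behavior.
    """
    runtime = response.get("RuntimeConfig", {})
    desired = runtime.get("DesiredCopyCount", 0)
    current = runtime.get("CurrentCopyCount", 0)
    return desired - current
```

A result of 0 indicates that the two counts in the response agree.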

Specification

Details about the resources that are deployed with this inference component.

Type: InferenceComponentSpecificationSummary object

VariantName

The name of the production variant that hosts the inference component.

Type: String

Length Constraints: Maximum length of 63.

Pattern: ^[a-zA-Z0-9](-*[a-zA-Z0-9]){0,62}

Errors

For information about the errors that are common to all actions, see Common Errors.

See Also

For more information about using this API, see the reference documentation for the language-specific AWS SDK that you use.

© 2025, Amazon Web Services, Inc. or its affiliates. All rights reserved.