Model parallelism and large model inference
State-of-the-art deep learning models for applications such as natural language processing (NLP) are large, typically with tens or hundreds of billions of parameters. Larger models are often more accurate, which makes them attractive to machine learning practitioners. However, these models are often too large to fit on a single accelerator or GPU device, making it difficult to achieve low-latency inference. You can avoid this memory bottleneck by using model parallelism techniques to partition a model across multiple accelerators or GPUs.
Amazon SageMaker includes specialized deep learning containers (DLCs), libraries, and tooling for model parallelism and large model inference (LMI). In the following sections, you can find resources to get started with LMI on SageMaker.
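As a quick orientation before the topics below, the following is a minimal sketch of what deploying a large model with an LMI deep learning container can look like using the SageMaker Python SDK. The image URI, S3 path, role ARN, endpoint name, and environment settings shown here are illustrative placeholders, not official values; see the sections that follow for the supported containers and configuration options.

```python
# Minimal sketch: deploy a model with a SageMaker LMI deep learning container.
# All identifiers below (image URI, S3 location, role ARN, env settings,
# endpoint name) are placeholder assumptions for illustration only.
import sagemaker
from sagemaker.model import Model

session = sagemaker.Session()
role = "arn:aws:iam::111122223333:role/SageMakerExecutionRole"  # placeholder role ARN

model = Model(
    # Placeholder LMI DLC image URI; look up the current image for your Region.
    image_uri="763104351884.dkr.ecr.us-east-1.amazonaws.com/djl-inference:latest",
    # Placeholder S3 prefix containing the model artifacts and serving configuration.
    model_data="s3://amzn-s3-demo-bucket/large-model/",
    role=role,
    env={
        # Illustrative setting: partition the model across 4 GPUs on the instance.
        "OPTION_TENSOR_PARALLEL_DEGREE": "4",
    },
    sagemaker_session=session,
)

# Deploy to a multi-GPU instance so the partitioned model fits in aggregate GPU memory.
predictor = model.deploy(
    initial_instance_count=1,
    instance_type="ml.g5.12xlarge",  # example multi-GPU instance; size to your model
    endpoint_name="my-lmi-endpoint",
)
```

The key idea is that the container handles model partitioning at load time, driven by configuration such as the tensor parallel degree, so the endpoint definition itself stays a standard SageMaker model-and-deploy workflow.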
Topics
- Deep learning containers for large model inference
- SageMaker endpoint parameters for large model inference
- Large model inference tutorials
- Configurations and settings
- Choosing instance types for large model inference
- Deploying uncompressed models
- Large model inference FAQs
- Large model inference troubleshooting
- Release notes for large model inference deep learning containers