Data Processing with Framework Processors - Amazon SageMaker

Data Processing with Framework Processors

A FrameworkProcessor can run Processing jobs with a specified machine learning framework, providing you with an Amazon SageMaker–managed container for whichever machine learning framework you choose. FrameworkProcessor provides premade containers for the following machine learning frameworks: Hugging Face, MXNet, PyTorch, TensorFlow, and XGBoost.

The FrameworkProcessor class also provides you with customization over the container configuration. The FrameworkProcessor class supports specifying a source directory source_dir for your processing scripts and dependencies. With this capability, you can give the processor access to multiple scripts in a directory instead of only specifying one script. FrameworkProcessor also supports including a requirements.txt file in the source_dir for customizing the Python libraries to install in the container.

For more information on the FrameworkProcessor class and its methods and parameters, see FrameworkProcessor in the Amazon SageMaker Python SDK.

To see examples of using a FrameworkProcessor for each of the supported machine learning frameworks, see the following topics.