
Use Notebook Instances to build models

One of the best ways for machine learning (ML) practitioners to use Amazon SageMaker is to train and deploy ML models using SageMaker notebook instances. SageMaker notebook instances set up the environment for you by launching Jupyter servers on Amazon Elastic Compute Cloud (Amazon EC2) and providing preconfigured kernels with the following packages: the Amazon SageMaker Python SDK, AWS SDK for Python (Boto3), AWS Command Line Interface (AWS CLI), Conda, Pandas, deep learning framework libraries, and other libraries for data science and machine learning.
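
For example, a quick check from a notebook cell confirms that these libraries are already importable in a preconfigured kernel (the exact kernel you pick, such as a Conda-based Python 3 kernel, determines the installed versions; this is only an illustrative sketch):

    import boto3
    import pandas as pd
    import sagemaker

    # Print the versions of the preinstalled libraries to confirm that the
    # environment is ready; exact versions depend on the kernel you choose.
    print("SageMaker Python SDK:", sagemaker.__version__)
    print("Boto3:", boto3.__version__)
    print("Pandas:", pd.__version__)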

Machine Learning with the SageMaker Python SDK

To train, validate, deploy, and evaluate an ML model in a SageMaker notebook instance, use the SageMaker Python SDK. The SageMaker Python SDK abstracts the AWS SDK for Python (Boto3) and the SageMaker API operations. It enables you to integrate with and orchestrate other AWS services, such as Amazon Simple Storage Service (Amazon S3) for storing data and model artifacts, Amazon Elastic Container Registry (Amazon ECR) for importing and serving the ML models, and Amazon Elastic Compute Cloud (Amazon EC2) for training and inference.
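
As a rough sketch of what that abstraction looks like in a notebook cell, the snippet below starts a SageMaker session, reads the IAM execution role attached to the notebook instance, and uploads a local file to the session's default Amazon S3 bucket. The file name train.csv and the key prefix are placeholders for this example:

    import sagemaker

    # The SageMaker session wraps a Boto3 session and issues the underlying
    # API calls to Amazon S3, Amazon ECR, and Amazon EC2 on your behalf.
    session = sagemaker.Session()
    role = sagemaker.get_execution_role()  # IAM role of the notebook instance

    # Upload a local file to the default bucket paired with the session.
    # "train.csv" and "demo/data" are placeholders for this sketch.
    train_uri = session.upload_data(path="train.csv", key_prefix="demo/data")
    print("Training data uploaded to:", train_uri)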

You can also take advantage of SageMaker features that help you handle every stage of a complete ML cycle: data labeling, data preprocessing, model training, model deployment, evaluation of prediction performance, and monitoring of model quality in production.

If you're a first-time SageMaker user, we recommend that you use the SageMaker Python SDK by following the end-to-end ML tutorial. To find the open source documentation, see the Amazon SageMaker Python SDK.

Tutorial Overview

This Get Started tutorial walks you through how to create a SageMaker notebook instance, open a Jupyter notebook with a kernel preconfigured with a Conda environment for machine learning, and start a SageMaker session to run an end-to-end ML cycle. You'll learn how to save a dataset to the default Amazon S3 bucket that is automatically paired with the SageMaker session, submit a training job for an ML model to Amazon EC2, and deploy the trained model for prediction through real-time hosting or batch inference on Amazon EC2.
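
The training step, for example, comes down to configuring an estimator and calling its fit method, which launches the training job on Amazon EC2 capacity that SageMaker manages for you. The sketch below assumes the session, role, and train_uri from the previous snippet and uses the built-in XGBoost image described in the next section; the instance type, framework version, output path, and hyperparameter values are illustrative rather than prescriptive:

    import sagemaker
    from sagemaker.estimator import Estimator
    from sagemaker.inputs import TrainingInput

    # Retrieve the URI of the built-in XGBoost container image from Amazon ECR.
    image_uri = sagemaker.image_uris.retrieve(
        framework="xgboost", region=session.boto_region_name, version="1.5-1"
    )

    estimator = Estimator(
        image_uri=image_uri,
        role=role,
        instance_count=1,
        instance_type="ml.m5.xlarge",
        output_path=f"s3://{session.default_bucket()}/demo/output",
        sagemaker_session=session,
    )
    estimator.set_hyperparameters(objective="binary:logistic", num_round=100)

    # fit() submits the training job; SageMaker provisions the EC2 instances,
    # runs the container, and writes the model artifact to the output path.
    estimator.fit({"train": TrainingInput(train_uri, content_type="text/csv")})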

This tutorial explicitly shows a complete ML flow of training the XGBoost model from the SageMaker built-in model pool. You use the US Adult Census dataset, and you evaluate how well the trained SageMaker XGBoost model predicts individuals' income.
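
A hosted deployment then follows the same pattern, as in the hedged sketch below: deploy provisions a real-time endpoint on the EC2 instance type you choose, predict sends it a CSV-formatted record, and delete_endpoint releases the instance when you are done. The payload here is a placeholder row of numeric features; the actual tutorial derives it from the preprocessed US Adult Census data:

    from sagemaker.serializers import CSVSerializer

    # Deploy the trained model behind a real-time HTTPS endpoint.
    predictor = estimator.deploy(
        initial_instance_count=1,
        instance_type="ml.m5.xlarge",
        serializer=CSVSerializer(),
    )

    # Send one CSV-formatted feature row (placeholder values) and read the
    # model's prediction of whether the individual's income exceeds $50K.
    response = predictor.predict("39,77516,13,2174,0,40")
    print(response)

    # Delete the endpoint to stop incurring charges.
    predictor.delete_endpoint()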