In this tutorial, you use Amazon SageMaker Studio to track the lineage of an Amazon SageMaker AI ML Pipeline.
The pipeline was created by the Orchestrating Jobs with Amazon SageMaker Model Building Pipelines notebook.
Lineage tracking in Studio is centered around a directed acyclic graph (DAG). The DAG represents the steps in a pipeline. From the DAG you can track the lineage from any step to any other step. The following diagram displays the steps in the pipeline. These steps appear as a DAG in Studio.
[Figure: A diagram of the steps of a pipeline workflow.]
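As a rough illustration of what tracking lineage across a DAG means, the following Python sketch walks a small graph backward from one step to every step it depends on. The step names and edges are hypothetical and are not taken from the pipeline in this tutorial. Studio performs this kind of traversal for you, so the code is only meant to make the idea concrete.

```python
from collections import deque

# Hypothetical step names and edges, for illustration only.
# Each key is a step; its value lists the steps that consume its output.
PIPELINE_DAG = {
    "Process": ["Train", "Evaluate"],
    "Train": ["Evaluate", "RegisterModel"],
    "Evaluate": ["RegisterModel"],
    "RegisterModel": [],
}


def upstream_steps(dag, target):
    """Return every step that the target depends on, directly or indirectly."""
    # Invert the edges so we can walk from a step back to its producers.
    parents = {step: [] for step in dag}
    for step, children in dag.items():
        for child in children:
            parents.setdefault(child, []).append(step)

    seen = set()
    queue = deque([target])
    while queue:
        current = queue.popleft()
        for parent in parents.get(current, []):
            if parent not in seen:
                seen.add(parent)
                queue.append(parent)
    return seen


print(sorted(upstream_steps(PIPELINE_DAG, "RegisterModel")))
```

Because the graph is acyclic, the walk always terminates, and the result for a model registration step is the full set of steps that contributed to the registered model.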
To track the lineage of a pipeline in the Amazon SageMaker Studio console, complete the following steps based on whether you use Studio or Studio Classic.
To track the lineage of a pipeline
- Open the SageMaker Studio console by following the instructions in Launch Amazon SageMaker Studio.
- In the left navigation pane, select Pipelines.
- (Optional) To filter the list of pipelines by name, enter a full or partial pipeline name in the search field.
- In the Name column, select a pipeline name to view details about the pipeline.
- Choose the Executions tab.
- In the Name column of the Executions table, select the name of a pipeline execution to view.
- At the top right of the Executions page, choose the vertical ellipsis, and then choose Download pipeline definition (JSON). You can view the file to see how the pipeline graph was defined.
- Choose Edit to open the Pipeline Designer.
- Use the resizing and zoom controls at the top right corner of the canvas to zoom in and out of the graph, fit the graph to screen, or expand the graph to full screen.
- To view your training, validation, and test datasets, complete the following steps:
  - Choose the Processing step in your pipeline graph.
  - In the right sidebar, choose the Overview tab.
  - In the Files section, find the Amazon S3 paths to the training, validation, and test datasets.
- To view your model artifacts, complete the following steps:
  - Choose the Training step in your pipeline graph.
  - In the right sidebar, choose the Overview tab.
  - In the Files section, find the Amazon S3 path to the model artifacts.
- To find the model package ARN, complete the following steps:
  - Choose the Register model step.
  - In the right sidebar, choose the Overview tab.
  - In the Files section, find the ARN of the model package.
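The same ARNs that the procedure above surfaces in the Overview tab can likely also be read programmatically. The following Python sketch is one possible approach, not part of this tutorial: it assumes the response shape of the boto3 SageMaker `list_pipeline_execution_steps` call, where each step carries a `StepName` and a `Metadata` entry keyed by step type. The helper names `summarize_steps` and `fetch_steps`, the step names, and the ARNs in the sample are all hypothetical placeholders.

```python
def summarize_steps(response):
    """Map each step name to the ARN recorded in its execution metadata."""
    summary = {}
    for step in response.get("PipelineExecutionSteps", []):
        arn = None
        # Metadata is assumed to hold one entry keyed by step type,
        # for example "TrainingJob" or "RegisterModel".
        for details in step.get("Metadata", {}).values():
            if isinstance(details, dict) and "Arn" in details:
                arn = details["Arn"]
                break
        summary[step["StepName"]] = arn
    return summary


def fetch_steps(execution_arn):
    """Call the SageMaker API. Requires boto3 and AWS credentials."""
    import boto3  # imported here so summarize_steps runs without boto3

    client = boto3.client("sagemaker")
    # Pagination through NextToken is omitted to keep the sketch short.
    return client.list_pipeline_execution_steps(
        PipelineExecutionArn=execution_arn
    )


# Hand-written sample shaped like the API response; all values are placeholders.
sample = {
    "PipelineExecutionSteps": [
        {
            "StepName": "Process",
            "Metadata": {"ProcessingJob": {
                "Arn": "arn:aws:sagemaker:us-east-1:111122223333:processing-job/example"}},
        },
        {
            "StepName": "Train",
            "Metadata": {"TrainingJob": {
                "Arn": "arn:aws:sagemaker:us-east-1:111122223333:training-job/example"}},
        },
        {"StepName": "CheckMetric", "Metadata": {"Condition": {"Outcome": "True"}}},
        {
            "StepName": "RegisterModel",
            "Metadata": {"RegisterModel": {
                "Arn": "arn:aws:sagemaker:us-east-1:111122223333:model-package/example/1"}},
        },
    ]
}

for name, arn in summarize_steps(sample).items():
    print(name, arn)
```

To run this against a real execution, you would pass the result of `fetch_steps` with your own execution ARN to `summarize_steps`. Steps that do not create a resource, such as a condition step, map to `None`. The Amazon S3 paths for datasets and model artifacts are not part of this response; retrieving them would take a further describe call on the processing or training job.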