Creating ETL jobs with AWS Glue Studio - AWS Glue Studio

Creating ETL jobs with AWS Glue Studio

You can use the simple graphical interface in AWS Glue Studio to create your ETL jobs. You use the Jobs page to create new jobs.

On the Jobs page, you can see all the jobs that you have created either with AWS Glue Studio or the AWS Glue console. You can view, manage, and run your jobs on this page.

Start the job creation process

You use the visual graph editor to create and customize your jobs. When you create a new job, you have the option of starting with an empty canvas, a job graph with only a source node, or a job graph with a data source, transform, and data target node.

To create a job in AWS Glue Studio

  1. Sign in to the AWS Management Console and open the AWS Glue Studio console at https://console.aws.amazon.com/gluestudio/.

  2. You can either choose Create and manage jobs from the AWS Glue Studio landing page, or you can choose Jobs from the navigation pane.

    The Jobs page appears.

  3. In the Create job section, choose a configuration option for your job.

    • To create a job starting with an empty canvas, choose Blank graph.

    • To create a job starting with source node, or with a source, transform and target node, choose Source and target added to the graph.

      You then choose the data source type. You can also choose the data target type, or you can choose the Choose later option to start with only a data source node in the graph.

  4. Choose Create to open the visual job editor.

    
            The screen shot shows the Jobs page of AWS Glue Studio. In the "Create job"
              section, the "Source and target added to the graph" option is selected, and the Source
              drop-down list shows the various available data source types: S3, Kinesis, Kafka, RDS,
              Redshift, Cdata Salesforce, "My Snowflake connector", and "Go to AWS Marketplace"
              (link). To the right of the Source drop-down list is the Target drop-down list, which
              shows "Data Catalog". The Create button is highlighted in orange near the top
              right of the image.

Create jobs that use a connector

After you have added a connector to AWS Glue Studio and created a connection for that connector, you can create a job that uses the connection for the data source.

For detailed instructions, see Authoring jobs with custom connectors.

Next steps for creating a job in AWS Glue Studio

You use the visual job editor to configure nodes in the graph for your job. Each node represents an action, such as reading data from the source location or applying a transform to the data. Each node you add to your job graph has properties that provide information about either the data location or the transform.

The next steps for creating and managing your jobs are: