Amazon EMR
Amazon EMR Release Guide

The AWS Documentation website is getting a new look!
Try it now and let us know what you think. Switch to the new look >>

You can return to the original look by selecting English in the language selector above.

Apache Sqoop

Apache Sqoop is a tool for transferring data between Amazon S3, Hadoop, HDFS, and RDBMS databases. For more information, see the Apache Sqoop website. Sqoop is included in Amazon EMR release version 5.0.0 and later. Earlier release versions include Sqoop as a sandbox application. For more information, see Amazon EMR 4.x Release Versions.

The following table lists the version of Sqoop included in the latest release of Amazon EMR, along with the components that Amazon EMR installs with Sqoop.

For the version of components installed with Sqoop in this release, see Release 5.27.0 Component Versions.

Sqoop Version Information for emr-5.27.0

Amazon EMR Release Label Sqoop Version Components Installed With Sqoop

emr-5.27.0

Sqoop 1.4.7

emrfs, emr-ddb, emr-goodies, hadoop-client, hadoop-mapred, hadoop-hdfs-datanode, hadoop-hdfs-library, hadoop-hdfs-namenode, hadoop-httpfs-server, hadoop-kms-server, hadoop-yarn-nodemanager, hadoop-yarn-resourcemanager, hadoop-yarn-timeline-server, mysql-server, sqoop-client