Amazon Elastic MapReduce
Developer Guide (API Version 2009-03-31)
Did this page help you?  Yes | No |  Tell us about it...
« PreviousNext »
View the PDF for this guide.Go to the AWS Discussion Forum for this product.Go to the Kindle Store to download this guide in Kindle format.

Tutorial: Launching and Querying Impala Clusters on Amazon EMR

This tutorial demonstrates how you can perform interactive queries with Impala on Amazon EMR. The instructions in this tutorial include how to:

  • Sign up for Amazon EMR

  • Launch a long-running cluster with Impala installed

  • Connect to the cluster using SSH

  • Generate a test data set

  • Create Impala tables and populate them with data

  • Perform interactive queries on Impala tables

Amazon EMR provides several tools you can use to launch and manage clusters: the console, a CLI, an API, and several SDKs. For more information about these tools, see What Tools are Available for Amazon EMR?.