Amazon Elastic MapReduce
Developer Guide (API Version 2009-03-31)

Share Data Between Hive Versions

Upgrading Hive lets your existing Hive clusters take advantage of bug fixes and performance improvements. Different versions of Hive, however, have different schemas. To share data between two versions of Hive, create an external table in each version and give both tables the same LOCATION parameter, so that both tables read the same underlying files.
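As a minimal sketch of the shared-location idea, the Python helper below renders a HiveQL CREATE EXTERNAL TABLE statement that can be run unchanged in both versions of Hive. The helper name `external_table_ddl`, the table `page_views`, its columns, and the bucket `s3://example-bucket/hive/page_views/` are hypothetical placeholders, not values from this guide.

```python
def external_table_ddl(table, columns, location):
    """Render a HiveQL CREATE EXTERNAL TABLE statement.

    columns is a list of (name, hive_type) pairs and location is the
    storage path shared by both Hive versions. All names are illustrative.
    """
    cols = ", ".join(f"{name} {hive_type}" for name, hive_type in columns)
    return f"CREATE EXTERNAL TABLE {table} ({cols}) LOCATION '{location}';"


# Hypothetical table and bucket: substitute your own values.
ddl = external_table_ddl(
    "page_views",
    [("user_id", "STRING"), ("view_time", "STRING")],
    "s3://example-bucket/hive/page_views/",
)

# Run the same statement in the old and the new version of Hive.
print(ddl)
```

Because the table is external, dropping it in one version removes only that version's table definition and leaves the files at the shared location in place for the other version.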

To share data between Hive versions

1. Start a cluster with the new version of Hive. This procedure assumes that you already have a cluster running the old version of Hive.

2. Configure the two clusters to allow communication:

   On the cluster with the old version of Hive, set the target of INSERT OVERWRITE DIRECTORY to a location in the HDFS of the cluster with the new version of Hive.

3. Export and reimport the data.
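One possible way to carry out the export and reimport is sketched below in Python, which builds the two HiveQL statements as strings. The guide does not spell out the exact commands, so treat this as an assumption: the export uses INSERT OVERWRITE DIRECTORY on the old cluster, and the reimport defines an external table over the exported directory on the new cluster. The host name `ip-10-0-0-12.ec2.internal`, the NameNode port `9000`, the path `/export/page_views`, and the table and column names are all hypothetical; check the NameNode address of your own cluster.

```python
def hdfs_uri(namenode_host, namenode_port, path):
    """Build an hdfs:// URI for a path on the new cluster's HDFS."""
    return f"hdfs://{namenode_host}:{namenode_port}{path}"


def export_statement(table, target_uri):
    """HiveQL to run on the OLD cluster: write the table's rows to target_uri."""
    return f"INSERT OVERWRITE DIRECTORY '{target_uri}' SELECT * FROM {table};"


def reimport_statement(table, columns, target_uri):
    """HiveQL to run on the NEW cluster: expose the exported files as a table."""
    cols = ", ".join(f"{name} {hive_type}" for name, hive_type in columns)
    return f"CREATE EXTERNAL TABLE {table} ({cols}) LOCATION '{target_uri}';"


# Hypothetical NameNode address and export path on the new cluster.
target = hdfs_uri("ip-10-0-0-12.ec2.internal", 9000, "/export/page_views")
columns = [("user_id", "STRING"), ("view_time", "STRING")]

export_sql = export_statement("page_views", target)
reimport_sql = reimport_statement("page_views", columns, target)

print(export_sql)    # run in the old version of Hive
print(reimport_sql)  # run in the new version of Hive
```

The sketch relies on INSERT OVERWRITE DIRECTORY writing delimited text files, which a table declared without a ROW FORMAT clause reads with Hive's default delimiters. If your source table uses a custom format, the reimport statement needs matching format clauses.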