Process Data Using Cascading
Cascading is an open-source Java library that provides a query API, a query planner, and a job scheduler for creating and running Hadoop MapReduce applications. Applications developed with Cascading are compiled and packaged into standard Hadoop-compatible JAR files similar to other native Hadoop applications. A Cascading step is submitted as a custom JAR in the Amazon EMR console. For more information about Cascading, go to http://www.cascading.org.