Crunch on EMR

Som Satpathy Tue, 01 Oct 2013 12:01:04 -0700

Hi All,

I have been trying to run crunch jobs on amazon EMR and faced a problem
while job execution -


"found class org.apache.hadoop.mapreduce.taskinputoutputcontext but
interface was expected"

This is happening because of hadoop incompatibilities between APIs used
while implementing the hadoop job, and the hadoop-code that runs in the
cluster.

My crunch fat jar is based on crunch version 0.7 (CDH 4.3.0) while EMR runs
hadoop 1.0.3 (where TaskInputOutputContext is implemented as an abstract
class)

Has any one been able to successfully execute their crunch jobs on EMR?

If yes, what are the best practices to make custom crunch fat jars work on
EMR?


Look forward to hearing your thoughts.

Thanks,

Som

Crunch on EMR

Reply via email to