Hello,

Is it possible to chain Avro MapReduce jobs using ChainMapper? I'm looking to 
chain two map tasks and a reducer, but haven't been able to find any examples.

Chain summary:
a) first map task: takes non-Avro input and produces K/V output in the form of 
AvroKey(Record), NullWritable
b) second map task: takes the output of the first task as its input [mapper 
extends AvroMapper(Record, Pair(Record, NullWritable))]
c) reducer: AvroReducer

In particular, how would I specify the input and output schemas? Is it enough 
to call AvroJob.setInputSchema/setOutputSchema on the individual chained job 
conf objects?

Thanks,

Andrew
