Re: Map-only conversion job

Markus Weimer Tue, 12 Apr 2011 14:20:19 -0700

Hi Doug,

I seem to hit a case not covered by the mapred package documentation:
I'd like to read from a TextInputFormat and produce AVRO data in a
map-only job. How do I do that?
In short, the way to do this is to:
- use an org.apache.hadoop.mapred.Mapper<K,V,AvroWrapper<O>,NullWritable>
- call AvroJob.setOutputSchema(job,schema) with O's schema
Does that make sense? If that works for you, I can add it to the javadoc.

Yes, it worked. Incidentally, it also reduced my file size to 33% of
my previous custom-avro-writable-in-sequence-file approach.
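
For the archive, here is a minimal sketch of the two steps quoted above. It is not the actual job from this thread. The class names TextToAvro and LineMapper, the job name, the choice of a plain Avro "string" schema for O, and the input/output paths taken from args are all assumptions made for illustration. Setting the output format explicitly is a precaution, on the assumption that it does no harm if AvroJob already configures it.

```java
import java.io.IOException;

import org.apache.avro.Schema;
import org.apache.avro.mapred.AvroJob;
import org.apache.avro.mapred.AvroOutputFormat;
import org.apache.avro.mapred.AvroWrapper;
import org.apache.avro.util.Utf8;
import org.apache.hadoop.fs.Path;
import org.apache.hadoop.io.LongWritable;
import org.apache.hadoop.io.NullWritable;
import org.apache.hadoop.io.Text;
import org.apache.hadoop.mapred.FileInputFormat;
import org.apache.hadoop.mapred.FileOutputFormat;
import org.apache.hadoop.mapred.JobClient;
import org.apache.hadoop.mapred.JobConf;
import org.apache.hadoop.mapred.MapReduceBase;
import org.apache.hadoop.mapred.Mapper;
import org.apache.hadoop.mapred.OutputCollector;
import org.apache.hadoop.mapred.Reporter;
import org.apache.hadoop.mapred.TextInputFormat;

// Hypothetical example class: converts lines of text into an Avro data file.
public class TextToAvro {

  // Step 1: a plain Hadoop Mapper<K,V,AvroWrapper<O>,NullWritable>.
  // Here K,V come from TextInputFormat and O is an Avro string (Utf8).
  public static class LineMapper extends MapReduceBase
      implements Mapper<LongWritable, Text, AvroWrapper<Utf8>, NullWritable> {

    public void map(LongWritable offset, Text line,
                    OutputCollector<AvroWrapper<Utf8>, NullWritable> out,
                    Reporter reporter) throws IOException {
      // Wrap each input line as the Avro datum; the value is unused.
      out.collect(new AvroWrapper<Utf8>(new Utf8(line.toString())),
                  NullWritable.get());
    }
  }

  public static void main(String[] args) throws IOException {
    JobConf job = new JobConf(TextToAvro.class);
    job.setJobName("text-to-avro");  // assumed name

    job.setInputFormat(TextInputFormat.class);
    job.setMapperClass(LineMapper.class);
    job.setNumReduceTasks(0);        // map-only job

    // Step 2: declare O's schema as the job's output schema.
    AvroJob.setOutputSchema(job, Schema.create(Schema.Type.STRING));
    // Precaution (assumption): make sure output is written as Avro data files.
    job.setOutputFormat(AvroOutputFormat.class);

    FileInputFormat.setInputPaths(job, new Path(args[0]));
    FileOutputFormat.setOutputPath(job, new Path(args[1]));
    JobClient.runJob(job);
  }
}
```

For a record type, the same shape should apply with O replaced by the record class and its schema passed to AvroJob.setOutputSchema.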


Thanks,

Markus
