Re: Hi,all. How can I involve two avro files with different schema into one M/R job?

Doug Cutting Fri, 18 Mar 2011 12:59:45 -0700

On 03/18/2011 11:31 AM, Harsh J wrote:
> Probably a small case, in which I would require reading from multiple
> sources in my job (perhaps even process them differently until the Map
> phase), with special reader-schemas for each of my sources.


How would your mapper detect which schema was in use?  Would it use
something like instanceof?  If that's the case, then you could simply
use a union as the job's schema.
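
A minimal sketch of the union idea, in plain Java with no Avro dependency so it can be run on its own. The `User` and `Click` classes are hypothetical stand-ins for two Avro-generated record types, and `unionSchemaJson` only illustrates that an Avro union schema is written as a JSON array of its member schemas. None of these names come from the thread.

```java
class UnionDispatchSketch {
    // Hypothetical stand-ins for two Avro-generated record classes.
    static final class User {
        final String name;
        User(String name) { this.name = name; }
    }

    static final class Click {
        final String url;
        Click(String url) { this.url = url; }
    }

    // An Avro union schema is a JSON array whose elements are the member schemas.
    static String unionSchemaJson(String... memberSchemas) {
        return "[" + String.join(",", memberSchemas) + "]";
    }

    // One map function for the whole job: the datum is typed by the union,
    // so the branch taken is decided per record with instanceof.
    static String map(Object datum) {
        if (datum instanceof User) {
            return "user:" + ((User) datum).name;
        }
        if (datum instanceof Click) {
            return "click:" + ((Click) datum).url;
        }
        throw new IllegalArgumentException("Datum is not a branch of the union: " + datum);
    }
}
```

In a real job the union would presumably be built with Avro's `Schema.createUnion` and set as the job's input schema through `AvroJob.setInputSchema`; that wiring is left out here.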

Or would you want a different mapper for each input type?  That seems
like a higher-level tool, similar to Hadoop's MultipleInputs.  It
shouldn't be too hard to build, but I think it belongs in a layer above
the base MapReduce API rather than in the API itself, no?
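
A rough sketch of what such a layer above the base API might look like, again in plain Java. It is not Hadoop's MultipleInputs and not an existing Avro class: `PerInputMapperSketch`, `addInput`, and the path prefixes are all hypothetical, and records are simplified to strings. The point is only that each input source is registered with its own mapper, and the layer picks the mapper from the path a record came from.

```java
import java.util.LinkedHashMap;
import java.util.Map;
import java.util.function.Function;

class PerInputMapperSketch {
    // Registered mappers, keyed by the input path prefix they handle.
    private final Map<String, Function<String, String>> mappersByPrefix = new LinkedHashMap<>();

    // Associate one input location with its own mapper.
    void addInput(String pathPrefix, Function<String, String> mapper) {
        mappersByPrefix.put(pathPrefix, mapper);
    }

    // Route a record to the mapper registered for the path it was read from.
    String map(String inputPath, String record) {
        for (Map.Entry<String, Function<String, String>> entry : mappersByPrefix.entrySet()) {
            if (inputPath.startsWith(entry.getKey())) {
                return entry.getValue().apply(record);
            }
        }
        throw new IllegalArgumentException("No mapper registered for " + inputPath);
    }
}
```

A caller would register something like `addInput("/data/users/", ...)` and `addInput("/data/clicks/", ...)` once, and the base map function would stay unaware of how many input types exist.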

Doug
