Re: Is it possible to input two different files under same mapper

Mori Bellamy Fri, 11 Jul 2008 13:42:37 -0700

Hey Amer,

It sounds to me like you're going to have to write your own InputFormat (or at least modify an existing one). Take a look here:

http://hadoop.apache.org/core/docs/current/api/org/apache/hadoop/mapred/FileSplit.html


I'm not sure exactly how you'd go about doing this, but I hope this helps.

(Also, have you considered preprocessing your input so that any arbitrary mapper can know whether or not it's looking at a line from the "large file"?)
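
To make the preprocessing idea a bit more concrete, here is a rough sketch in plain Java (no Hadoop classes involved). The class name TaggedLines, the "L"/"S" tags, and the tab separator are all made up for illustration; the only point is that each line gets a prefix naming its source before the job runs, so a mapper can check the prefix and strip it off.

```java
public class TaggedLines {
    // Hypothetical tags: "L" marks the large file, "S" marks any of the smaller files.
    static final String LARGE = "L";
    static final String SMALL = "S";
    // Assumed separator between the tag and the original line.
    static final char SEP = '\t';

    // Preprocessing step: prefix a line with the tag of the file it came from.
    static String tag(String source, String line) {
        return source + SEP + line;
    }

    // Mapper-side check: does this tagged line come from the large file?
    static boolean isFromLargeFile(String tagged) {
        return tagged.startsWith(LARGE + SEP);
    }

    // Mapper-side helper: recover the original line by dropping the tag.
    // Only the first separator is consumed, so tabs inside the line survive.
    static String payload(String tagged) {
        return tagged.substring(tagged.indexOf(SEP) + 1);
    }
}
```

Inside a map function you would then branch on isFromLargeFile(line) and work on payload(line). The cost is one extra pass over the data to add the tags, which may or may not be acceptable for a TB-sized file.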

On Jul 11, 2008, at 12:31 PM, Muhammad Ali Amer wrote:

Hi,
My requirement is to compare the contents of one very large file (GB to TB size) with a bunch of smaller files (100s of MB to GB sizes). Is there a way I can give the mapper the 1st file independently of the remaining bunch?
Amer
