I don't know how dirty a hack this is, but what about this: store the Path.getName() of each of the two input Paths before launching the job, then within the Mapper, look up context.getWorkingDirectory().getName() and compare it to the two stored names. Whichever one matches tells you which SequenceFile you're working with.
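To make the comparison step concrete, here is a rough sketch in plain Java with no Hadoop dependency, so the matching logic can be tried on its own. SourceTagger, classify and the config key mentioned in the comments are names I made up for illustration. One caveat I'm fairly sure of: getWorkingDirectory() returns the job's working directory, not the file being read, so the sketch assumes the path would come from the input split instead, e.g. ((FileSplit) context.getInputSplit()).getPath(), which should be available when the input format is file-based.

```java
import java.util.Objects;

/**
 * Hypothetical helper: decides whether a record came from input A or input B
 * by comparing stored names against the path of the split being read.
 *
 * Assumed wiring inside the Mapper (not shown here, needs Hadoop on the classpath):
 *   setup():  read the two names from the job Configuration under keys of your
 *             choosing (e.g. a made-up key such as "myjob.input.a.name"),
 *             then build a SourceTagger from them.
 *   map():    pass ((FileSplit) context.getInputSplit()).getPath().toString()
 *             to classify().
 */
final class SourceTagger {

    enum Source { A, B, UNKNOWN }

    private final String nameA;
    private final String nameB;

    SourceTagger(String nameA, String nameB) {
        this.nameA = Objects.requireNonNull(nameA, "nameA");
        this.nameB = Objects.requireNonNull(nameB, "nameB");
    }

    /**
     * Checks the last two path components (the file and its parent directory),
     * so it matches whether the input was given as a single file ".../A"
     * or as a directory of part files ".../A/part-00000".
     */
    Source classify(String splitPath) {
        String[] parts = splitPath.split("/");
        int lowest = Math.max(0, parts.length - 2);
        for (int i = parts.length - 1; i >= lowest; i--) {
            if (parts[i].equals(nameA)) {
                return Source.A;
            }
            if (parts[i].equals(nameB)) {
                return Source.B;
            }
        }
        return Source.UNKNOWN;
    }
}
```

This only works if the two inputs have different names; two inputs that share a final name under different parent directories would need the full path compared instead.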
Would that even work?

On Tue, Aug 3, 2010 at 3:50 PM, Shannon Quinn <[email protected]> wrote:

> Here's my next question, then: within the Mapper itself, how do I know the
> source SequenceFile of the VectorWritable I'm currently holding, A or B?
>
> On Tue, Aug 3, 2010 at 1:33 PM, Ted Dunning <[email protected]> wrote:
>
>> Well if both vectors are the same size, then a map-reduce on vector number
>> is the natural solution here.
>>
>> Map-side reduce is only useful when one or the other operand is relatively
>> small.
>>
>> On Tue, Aug 3, 2010 at 10:12 AM, Shannon Quinn <[email protected]> wrote:
>>
>> > but how to gain access
>> > to the two original vectors at the same time is beyond me.
