That's right. You'd just have to place your input where the job expects the first phase's output to have been written, which is a subdirectory of your --temp directory (userVectors, I think).
On Thu, Sep 1, 2011 at 4:30 PM, Grant Ingersoll <[email protected]> wrote:
>
> On Sep 1, 2011, at 10:04 AM, Sean Owen wrote:
>
> > Your input needs to be CSV if you want to use it all as-is. But, it quickly
> > creates vectors out of things, so really you can comment out the first
> > mapper that creates user vectors, and just wire it to use yours instead. It
> > should do all the rest from there.
> >
>
> I could use the --startPhase functionality to skip the first two phases,
> right?
>
