Turns out that apache-nutch*.jar was packed inside the jobfile's 'lib' directory along with the classes. And hadoop picked the Fetcher class from the jar inside 'lib'.
On Thu, Dec 27, 2012 at 11:46 PM, Sourajit Basak <[email protected]>wrote: > Maybe on hadoop 1.1, any job submitted via ToolRunner is stored in the > distributed cache. > Will keep the thread updated. > > > On Thu, Dec 27, 2012 at 8:24 PM, Sourajit Basak > <[email protected]>wrote: > >> This is what I did. >> >> Our nutch directory only contains the following structure. Basically the >> script does what I was doing previously. >> >> apache-nutch-1.5.1.job >> +bin >> nutch >> >> Even in this case, I deleted the entire fetcher package. The fetch >> command worked !!! >> >> Is anyone in a position to repeat this exercise ? Maybe change a >> LOG.info(..) in Fetcher and see what happens ? >> >> >> >> On Thu, Dec 27, 2012 at 7:42 PM, Sourajit Basak <[email protected] >> > wrote: >> >>> Are you saying that I put hadoop binary on the path and use the nutch >>> script like on local. >>> >>> >>> On Thu, Dec 27, 2012 at 7:35 PM, Sourajit Basak < >>> [email protected]> wrote: >>> >>>> Didn't understand. >>>> Lets say I put the job file in HADOOP_HOME/bin. What commands do I fire >>>> ? >>>> >>>> >>>> >>>> On Thu, Dec 27, 2012 at 7:27 PM, Markus Jelsma < >>>> [email protected]> wrote: >>>> >>>>> CWD >>>> >>>> >>>> >>> >> >

