Turns out that apache-nutch*.jar was packed inside the jobfile's 'lib'
directory along with the classes. And hadoop picked the Fetcher class from
the jar inside 'lib'.



On Thu, Dec 27, 2012 at 11:46 PM, Sourajit Basak
<[email protected]>wrote:

> Maybe on hadoop 1.1, any job submitted via ToolRunner is stored in the
> distributed cache.
> Will keep the thread updated.
>
>
> On Thu, Dec 27, 2012 at 8:24 PM, Sourajit Basak 
> <[email protected]>wrote:
>
>> This is what I did.
>>
>> Our nutch directory only contains the following structure. Basically the
>> script does what I was doing previously.
>>
>> apache-nutch-1.5.1.job
>> +bin
>>    nutch
>>
>> Even in this case, I deleted the entire fetcher package. The fetch
>> command worked !!!
>>
>> Is anyone in a position to repeat this exercise ? Maybe change a
>> LOG.info(..) in Fetcher and see what happens ?
>>
>>
>>
>> On Thu, Dec 27, 2012 at 7:42 PM, Sourajit Basak <[email protected]
>> > wrote:
>>
>>> Are you saying that I put hadoop binary on the path and use the nutch
>>> script like on local.
>>>
>>>
>>> On Thu, Dec 27, 2012 at 7:35 PM, Sourajit Basak <
>>> [email protected]> wrote:
>>>
>>>> Didn't understand.
>>>> Lets say I put the job file in HADOOP_HOME/bin. What commands do I fire
>>>> ?
>>>>
>>>>
>>>>
>>>> On Thu, Dec 27, 2012 at 7:27 PM, Markus Jelsma <
>>>> [email protected]> wrote:
>>>>
>>>>> CWD
>>>>
>>>>
>>>>
>>>
>>
>

Reply via email to