It seems to me that if you run Nutch in deploy mode and change the config
files, you need to rebuild the .job file, unless you specify the config_dir
option in the hadoop command.
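
A rough way to spot a stale .job file is to compare modification times. The
sketch below is only an illustration in Python: the function name is made up,
and it assumes the .job file and the conf directory sit on the same local
filesystem, which may not match your layout.

```python
import os
from pathlib import Path


def stale_config_files(conf_dir, job_file):
    """Return the names of files in conf_dir modified after job_file was built."""
    job_mtime = os.path.getmtime(job_file)
    return sorted(
        p.name
        for p in Path(conf_dir).iterdir()
        # Only regular files count; a newer mtime means the .job predates the edit.
        if p.is_file() and p.stat().st_mtime > job_mtime
    )
```

If the returned list is non-empty, those config files were edited after the
.job file was last built, so a rebuild is probably needed before the changes
take effect in deploy mode.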

Alex.
 

-----Original Message-----
From: Christopher Gross <[email protected]>
To: user <[email protected]>
Sent: Mon, Oct 1, 2012 1:22 pm
Subject: Re: Building Nutch 2.0


I have my 1.3 set up in a /proj/nutch/ directory that has the bin,
conf, lib, logs, ..etc.., with NUTCH_HOME pointing there.  I don't
quite see what the difference would be for 2.x as long as NUTCH_HOME
pointed to the right place.

Is there documentation anywhere on how to do a deployment?

-- Chris


On Mon, Oct 1, 2012 at 3:59 PM, Lewis John Mcgibbney
<[email protected]> wrote:
> Hi Chris,
>
> On Mon, Oct 1, 2012 at 8:52 PM, Christopher Gross <[email protected]> wrote:
>> OK, I added the port being used by hbase to iptables, and now I'm farther.
>>
>> I'm getting:
>> 12/10/01 19:44:17 ERROR fetcher.FetcherJob: Fetcher: No agents listed
>> in 'http.agent.name' property.
>>
>> But I do have an entry there, and it matches the first in the
>> robots.agents as well.
>
> This can only mean that you have not recompiled this stuff into the
> runtime/local directory.
>
>>
>> How should I have this laid out?  Should I be running out of the
>> 'runtime' dir, or is it fine that I've pulled all those files out and
>> into a /proj/nutch-2.1/ directory (so there's a bin, conf, lib,
>> ..etc.. in there, with NUTCH_HOME pointing to that dir).
>
> OK so you are running locally. I can't say whether it's OK to copy the
> directories and their content elsewhere as I've never done it; however,
> I would avoid it unless absolutely necessary. In terms of the directory
> layout Nutch 2.x is identical to 1.x.
>
> It really helps if you make explicit which back end you intend to use
> as the config may alter accordingly.
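
For the "No agents listed in 'http.agent.name' property" error quoted above,
one quick check is whether the nutch-site.xml that the runtime actually reads
contains the property at all. A minimal sketch in Python, assuming the usual
Hadoop-style configuration/property/name/value layout; the function name is
hypothetical and not part of Nutch:

```python
import xml.etree.ElementTree as ET


def get_property(config_path, prop_name):
    """Return the stripped value of prop_name from a Hadoop-style config file, or None."""
    root = ET.parse(config_path).getroot()
    for prop in root.iter("property"):
        if prop.findtext("name", default="").strip() == prop_name:
            value = prop.findtext("value", default="")
            # Treat an empty or whitespace-only value the same as a missing one.
            return value.strip() or None
    return None
```

Running it against both the edited conf/nutch-site.xml and the copy under
runtime/local/conf would show whether the change ever reached the runtime
directory: a result of None for the second file would point at a missing
rebuild rather than a bad value.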

 
