Hi Guruprasad,

  The property should be set to the agent name that you would like to appear
identifying your organization when your Nutch crawling agent visits websites
during its crawl. You could set it to "foo/bar" and it would work fine, but
you probably want to think of an appropriate identifying name and then set
it to that.

Cheers,
  Chris



On 10/11/06 8:36 AM, "Guruprasad Iyer" <[EMAIL PROTECTED]> wrote:

> Hi Chris,
> 
> Thanks for the reply. But, what value should I set it to? Can you help me on
> this?
> 
> Thanks once again.
> 
> Cheers,
> Guruprasad
> 
> On 10/11/06, Chris Mattmann <[EMAIL PROTECTED]> wrote:
>> 
>> Hi there,
>> 
>> You need to set your "http.agent.name" property within
>> $NUTCH_HOME/conf/nutch-default.xml.
>> 
>> HTH,
>>   Chris
>> 
>> 
>> 
>> On 10/11/06 3:57 AM, "Guruprasad Iyer" <[EMAIL PROTECTED]> wrote:
>> 
>>> Hello,
>>> 
>>> I have Nutch 0.8.1 installed on linux (FC3) along with java 1.5.0_07.
>> When I
>>> run the crawl command I get the above error.
>>> 
>>> Here is a snapshot of the log file-
>>> 
>>> 2006-10-11 15:39:16,234 FATAL api.RobotRulesParser - Agent we advertise
>>> (null) not listed first in 'http.robots.agents' property!
>>> 
>>> and it says   "fetcher.Fetcher - fetch of"   the site  "failed with:
>>> java.lang.NullPointerException"
>>> 
>>> Can anybody help? Thanks.
>> 
>> 
>> 
> 



-------------------------------------------------------------------------
Using Tomcat but need to do more? Need to support web services, security?
Get stuff done quickly with pre-integrated technology to make your job easier
Download IBM WebSphere Application Server v.1.0.1 based on Apache Geronimo
http://sel.as-us.falkag.net/sel?cmd=lnk&kid=120709&bid=263057&dat=121642
_______________________________________________
Nutch-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-general

Reply via email to