Hi Guruprasad, The property should be set to the agent name that you would like to appear identifying your organization when your Nutch crawling agent visits websites during its crawl. You could set it to "foo/bar" and it would work fine, but you probably want to think of an appropriate identifying name and then set it to that.
Cheers, Chris On 10/11/06 8:36 AM, "Guruprasad Iyer" <[EMAIL PROTECTED]> wrote: > Hi Chris, > > Thanks for the reply. But, what value should I set it to? Can you help me on > this? > > Thanks once again. > > Cheers, > Guruprasad > > On 10/11/06, Chris Mattmann <[EMAIL PROTECTED]> wrote: >> >> Hi there, >> >> You need to set your "http.agent.name" property within >> $NUTCH_HOME/conf/nutch-default.xml. >> >> HTH, >> Chris >> >> >> >> On 10/11/06 3:57 AM, "Guruprasad Iyer" <[EMAIL PROTECTED]> wrote: >> >>> Hello, >>> >>> I have Nutch 0.8.1 installed on linux (FC3) along with java 1.5.0_07. >> When I >>> run the crawl command I get the above error. >>> >>> Here is a snapshot of the log file- >>> >>> 2006-10-11 15:39:16,234 FATAL api.RobotRulesParser - Agent we advertise >>> (null) not listed first in 'http.robots.agents' property! >>> >>> and it says "fetcher.Fetcher - fetch of" the site "failed with: >>> java.lang.NullPointerException" >>> >>> Can anybody help? Thanks. >> >> >> > ------------------------------------------------------------------------- Using Tomcat but need to do more? Need to support web services, security? Get stuff done quickly with pre-integrated technology to make your job easier Download IBM WebSphere Application Server v.1.0.1 based on Apache Geronimo http://sel.as-us.falkag.net/sel?cmd=lnk&kid=120709&bid=263057&dat=121642 _______________________________________________ Nutch-general mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/nutch-general
