Re: plugin configuration

2016-09-15 Thread Sebastian Nagel
Hi Kris, when the plugins (index-anchor or index-more) are enabled and the configuration is properly deployed they should work immediately starting with the segments/steps indexed from now on. Would it be possible to get examples what values are missing in the index? Best, Sebastian On

UpdateDb job fails everytime

2016-09-15 Thread shubham.gupta
Hey, Whenever the update job is executed the following errors occur: INFO mapreduce.Job: Task Id : attempt_1473832356852_0104_m_00_2, Status : FAILED Error: java.net.MalformedURLException: no protocol:

RE: plugin configuration

2016-09-15 Thread Kris Musshorn
Thx for the reply Sebastian. I made the request for info before I tried it out.. just because I couldn’t find any documentation. -Original Message- From: Sebastian Nagel [mailto:wastl.na...@googlemail.com] Sent: Thursday, September 15, 2016 7:40 AM To: user@nutch.apache.org Subject:

Re: UpdateDb job fails everytime

2016-09-15 Thread Sebastian Nagel
Hi, this looks like a bug in Nutch 2.x. Please, open an issue at http://issues.apache.org/jira/NUTCH and add information about the exact Nutch version and the configuration. Invalid URLs should normally be filtered out or corrected by URL normalizers during the parsing step. Thanks, Sebastian

Re: UpdateDb job fails everytime

2016-09-15 Thread Sebastian Nagel
Sorry, the correct link is: https://issues.apache.org/jira/browse/NUTCH On 09/15/2016 01:34 PM, Sebastian Nagel wrote: > Hi, > > this looks like a bug in Nutch 2.x. > > Please, open an issue at http://issues.apache.org/jira/NUTCH > and add information about the exact Nutch version and the >