RE: nutch 1.12 How can I force a URL to get re-indexed

2016-10-07 Thread Markus Jelsma
t; > Sent: Friday 7th October 2016 14:11 > To: user@nutch.apache.org > Subject: RE: nutch 1.12 How can I force a URL to get re-indexed > > Thanks Markus. > > I can not use freegen as this tool is not available via REST api. > > With the combination of -adddays and -exp

RE: nutch 1.12 How can I force a URL to get re-indexed

2016-10-07 Thread Sujan Suppala
ubject: RE: nutch 1.12 How can I force a URL to get re-indexed Hi You can use -adddays N in the generator job to fool it, or just use a lower interval. Or, use the freegen tool to immediately crawl a set of URL's. Markus -Original message- > From:Sujan Suppala <ssupp...@open

RE: nutch 1.12 How can I force a URL to get re-indexed

2016-10-06 Thread Markus Jelsma
6 > To: user@nutch.apache.org > Subject: nutch 1.12 How can I force a URL to get re-indexed > > Hi, > > By default the nutch is fetching the URL based on the already set next fetch > interval(30 days), suppose if the page is updated before this interval (30 > days) how can I force