nutch 1.12 + windows : UnsatisfiedLinkError exception while running inject command

2016-08-08 Thread Sujan Suppala
Hi, I am seeing the exception below when I run the inject command. I installed Nutch 1.12 by following the wiki tutorial at http://wiki.apache.org/nutch/NutchTutorial on Windows 7, with JAVA_HOME set to a 64-bit JDK 1.7. I am using cygwin64 to run the inject command. $ bin/nutch injec

RE: nutch 1.12 + windows : UnsatisfiedLinkError exception while running inject command

2016-08-08 Thread Sujan Suppala
heir dependencies. Could be done by copying over into runtime/local/lib/ but it's better (and much easier over time if recompiling) to change the dependency in ivy/ivy.xml Best, Sebastian On 08/08/2016 10:49 AM, Sujan Suppala wrote: > Hi, > > I am seeing the

RE: nutch 1.12 + windows : UnsatisfiedLinkError exception while running inject command

2016-08-10 Thread Sujan Suppala
On 08/08/2016 12:01 PM, Sujan Suppala wrote: > Hi Sebastian, > > I am launching Nutch in local mode. I compiled nutch 1.12 source and > executing the inject command from the local directory in > Cygwin(/cygdrive/c/dev/apach

nutch 1.12 How can I force a URL to get re-indexed

2016-10-06 Thread Sujan Suppala
Hi, By default Nutch fetches a URL according to its already-set next fetch interval (30 days). If the page is updated before this interval elapses, how can I force it to be re-indexed? How can I just 're-inject' the URLs to set the next fetch date to 'immediately'? FYI, I am using the nu

RE: nutch 1.12 How can I force a URL to get re-indexed

2016-10-07 Thread Sujan Suppala
Thanks Markus. I cannot use freegen because this tool is not available via the REST API. I met my requirement by combining the -adddays and -expr options of the generator. Here is what I did: 1. Inject the URLs with some metadata, say pageId= Seed file contains the below entry: h
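
To make the effect of the -adddays option mentioned in the message above more concrete, here is a minimal Python sketch of the general idea: the generator compares each URL's next fetch time against the current time shifted forward by the given number of days, so a URL that is not yet due can still be selected. This is a simplified model, not Nutch's actual code; the function name `is_due` and the dates are hypothetical, and the 30-day interval is the default quoted in the thread.

```python
from datetime import datetime, timedelta


def is_due(next_fetch, now, add_days=0):
    """Return True when next_fetch is on or before `now` shifted forward by add_days.

    Hypothetical helper: a simplified model of the due check, not Nutch code.
    """
    return next_fetch <= now + timedelta(days=add_days)


# Illustrative values only: a page fetched "today" with a 30-day interval.
now = datetime(2016, 10, 7)
next_fetch = now + timedelta(days=30)

# Without a shift the page is not due yet.
print(is_due(next_fetch, now))

# Shifting the reference time past the interval makes the page eligible again.
print(is_due(next_fetch, now, add_days=31))
```

In this model a shift larger than the remaining interval is what makes an already-fetched URL eligible again; restricting the selection to specific URLs (for example via the -expr option and injected metadata, as the message describes) is a separate filter that is not modelled here.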

nutch 1.12 INJECT REST call not honoring db.injector.overwrite

2016-10-14 Thread Sujan Suppala
Hi, I am using Nutch 1.12. I don't see an option to overwrite a URL in the crawl db via the INJECT REST call. I have also observed that this REST call does not honor the config settings for the "db.injector.overwrite" and "db.injector.update" properties when these

RE: nutch 1.12 INJECT REST call not honoring db.injector.overwrite

2016-10-14 Thread Sujan Suppala
Thanks. https://issues.apache.org/jira/browse/NUTCH-2325 Hoping this will be fixed in 1.13. Thanks Sujan -Original Message- From: Markus Jelsma [mailto:markus.jel...@openindex.io] Sent: Friday, October 14, 2016 4:43 PM To: user@nutch.apache.org Subject: RE: nutch 1.12 INJECT REST cal