Hi,
I am seeing the below exception when I run the Inject command.
I have installed nutch 1.12 by following the wiki
http://wiki.apache.org/nutch/NutchTutorial on windows 7 and the java_home is
set to 64-bit jdk1.7 . Using cygwin64 to run the inject command.
$ bin/nutch injec
heir dependencies.
Could be done by copying over into runtime/local/lib/ but it's better (and much
easier over time if recompiling) to change the dependency in ivy/ivy.xml
Best,
Sebastian
On 08/08/2016 10:49 AM, Sujan Suppala wrote:
> Hi,
>
> I am seeing the
HGoIu3Lg_miIvTGU1oi04bnz_ILphJoC66rr_8iOdtQ&e=
On 08/08/2016 12:01 PM, Sujan Suppala wrote:
> Hi Sebastian,
>
> I am launching Nutch in local mode. I compiled nutch 1.12 source and
> executing the inject command from the local directory in
> Cygwin(/cygdrive/c/dev/apach
Hi,
By default the nutch is fetching the URL based on the already set next fetch
interval(30 days), suppose if the page is updated before this interval (30
days) how can I force to re-index?
How can I just 're-inject' the URLs to set the next fetch date to 'immediately'?
Fyi, I am using the nu
Thanks Markus.
I can not use freegen as this tool is not available via REST api.
With the combination of -adddays and -expr options of generator I achieved my
requirement.
Here is what I did:
1. inject the urls with some metadata say pageId=
Seed file contains the below entry:
h
Hi,
I am using the Nutch 1.12.
I don't see an option to overwrite an url in the crawl db via
the INJECT REST call, also I have observed that this rest call is not honoring
the config settings for "db.injector.overwrite" , "db.injector.update"
properties when these
Thanks.
https://issues.apache.org/jira/browse/NUTCH-2325
Hoping this would be fixed in 1.13 .
Thanks
Sujan
-Original Message-
From: Markus Jelsma [mailto:markus.jel...@openindex.io]
Sent: Friday, October 14, 2016 4:43 PM
To: user@nutch.apache.org
Subject: RE: nutch 1.12 INJECT REST cal
7 matches
Mail list logo