Thanks.
https://issues.apache.org/jira/browse/NUTCH-2325
Hoping this will be fixed in 1.13.
Thanks
Sujan
-----Original Message-----
From: Markus Jelsma [mailto:markus.jel...@openindex.io]
Sent: Friday, October 14, 2016 4:43 PM
To: user@nutch.apache.org
Subject: RE: nutch 1.12 INJECT REST
Hi,
I am using Nutch 1.12.
I don't see an option to overwrite a URL in the crawl db via
the INJECT REST call. I have also observed that this REST call does not honor
the config settings for the "db.injector.overwrite" and "db.injector.update"
properties when
Thanks Markus.
I cannot use freegen, as this tool is not available via the REST API.
By combining the -adddays and -expr options of the generator, I achieved my
requirement.
Here is what I did:
1. Inject the URLs with some metadata, say pageId=
The seed file contains the entry below:
Hi,
By default, Nutch fetches a URL based on its already set next fetch
interval (30 days). If the page is updated before this interval elapses,
how can I force it to be re-indexed?
How can I just 're-inject' the URLs to set the next fetch date to 'immediately'?
FYI, I am using the
>
> For local mode, you have to replace all Hadoop jars and their dependencies.
> Could be done by copying over into runtime/local/lib/ but it's better
> (and much easier over time if recompiling) to change
by copying over into runtime/local/lib/ but it's better (and much
easier over time if recompiling) to change the dependency in ivy/ivy.xml
Best,
Sebastian
On 08/08/2016 10:49 AM, Sujan Suppala wrote:
> Hi,
>
> I am seeing the below exception when I run the Inject
> command.
Hi,
I am seeing the below exception when I run the Inject command.
I have installed Nutch 1.12 by following the wiki
http://wiki.apache.org/nutch/NutchTutorial on Windows 7, and JAVA_HOME is
set to a 64-bit JDK 1.7. I am using cygwin64 to run the inject command.
$ bin/nutch