Same issue here. What did you do about the URL regex and normalization? These
configurations may differ from one site to another.
Kind regards,
Hany Shehata
Enterprise Engineer
Green Six Sigma Certified
Solutions Architect, Marketing and Communications IT
Corporate Functions | HSBC Operations, Ser
Hi,
Nutch loads all configuration files from the Java class path and picks the first
file found on the class path (ignoring other files with the same name).
If there are multiple crawls with different configurations, just place a
crawl-specific configuration directory in front of the classpat
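
The lookup rule described in this message (the first matching file on the class path wins, later ones are ignored) can be illustrated with a small simulation. This is only a sketch of the ordering behaviour in Python, not Nutch's or Java's actual class loader; the directory names `conf` and `conf-crawl-a` and the helper `first_on_path` are hypothetical, and `nutch-site.xml` merely stands in for any configuration file.

```python
import os
import tempfile


def first_on_path(search_path, filename):
    """Return the first existing file named `filename` along `search_path`.

    Mimics "first match wins": directories are checked in order and
    any later file with the same name is never looked at.
    """
    for directory in search_path:
        candidate = os.path.join(directory, filename)
        if os.path.isfile(candidate):
            return candidate
    return None


def demo():
    """Create two config dirs holding the same file name and resolve it."""
    with tempfile.TemporaryDirectory() as root:
        default_conf = os.path.join(root, "conf")        # hypothetical default dir
        crawl_conf = os.path.join(root, "conf-crawl-a")  # hypothetical per-crawl dir
        for directory, marker in ((default_conf, "default"),
                                  (crawl_conf, "crawl-a")):
            os.makedirs(directory)
            with open(os.path.join(directory, "nutch-site.xml"), "w") as f:
                f.write(marker)

        # The crawl-specific directory is placed in front, so it shadows
        # the default file of the same name.
        winner = first_on_path([crawl_conf, default_conf], "nutch-site.xml")
        with open(winner) as f:
            return f.read()


if __name__ == "__main__":
    print(demo())
```

Swapping the order of the two directories in the list passed to `first_on_path` would make the default file win instead, which is why the crawl-specific directory has to come first.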
Hi,
Sorry for the late reply. This looks like one of those really nasty dependency
conflicts between incompatible class implementations or versions, which only
show up at runtime.
These are the potentially conflicting candidates (from current master):
runtime/local/plugins/lib-selenium/xml-apis-1.4.01.
Hi Sebastian,
Please find the link for the issue: https://issues.apache.org/jira/browse/NUTCH-2681
Thanks & Regards
Venkata MR
+91 98455 77125
-----Original Message-----
From: Sebastian Nagel
Sent: 21 December 2018 19:19
To: user@nutch.apache.org
Cc: Venkata MR
Subject: Re: Apache Nutch 2.3.1 not a