Hiran Chaudhuri created NUTCH-2429:
--------------------------------------
Summary: Fix Plugin System to allow protocol plugins to bundle
their URLStreamHandlers
Key: NUTCH-2429
URL: https://issues.apache.org/jira/browse/NUTCH-2429
Project: Nutch
Issue Type: Improvement
Components: commoncrawl
Affects Versions: 1.14
Environment: Tested on both Nutch 1.13 and 1.14 in Ubuntu Linux with
OpenJDK 1.8.
Reporter: Hiran Chaudhuri
While trying to use the protocol-smb plugin (which is not part of the Nutch
distribution) I realized there are four steps to successfully make use of a
protocol plugin:
1 - put the artifact into the plugins directory
2 - modify Nutch configuration files to allow smb:// urls plus include the
plugin to the loaded list
3 - extract jcifs.jar and place it on the system classpath
4 - run nutch with the correct system property
While steps 1 and 2 seem obvious, 3 and 4 require knowledge of plugin internals
which does not feel right for nutch and plugin users. Even more, the jcifs.jar
would exist twice on the classpath and could even cause further problems during
runtime.
--
This message was sent by Atlassian JIRA
(v6.4.14#64029)