Here's an odd one (Nutch 1.11):
I haven't tested this with other extension points, but if you extend or
depend on the "protocol-http" plugin in a new plugin, the name of the new
plugin affects which plugin ProtocolFactory picks for fetching.
In other words:
Create a plugin "protocol-httpfoo" that is dependent on "protocol-http" to
do the heavy lifting. Its plugin.xml contains this section (note the
additional dependency):
  <requires>
    <import plugin="nutch-extensionpoints"/>
    <import plugin="lib-http"/>
    <import plugin="protocol-http"/>
  </requires>
Now, in nutch-site.xml, specify *only* "protocol-httpfoo" in
plugin.includes.
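For reference, the property would look roughly like the sketch below. Only
"protocol-httpfoo" comes from my setup as described here; the other plugins
in the value are an assumption, borrowed from the usual default list, and
yours will likely differ. The point is that "protocol-http" itself does not
appear anywhere in the value.

```xml
<!-- nutch-site.xml (sketch): only protocol-httpfoo is taken from this report; -->
<!-- the remaining plugins in the value are assumed for illustration.          -->
<property>
  <name>plugin.includes</name>
  <value>protocol-httpfoo|urlfilter-regex|parse-(html|tika)|index-(basic|anchor)|scoring-opic|urlnormalizer-(pass|regex|basic)</value>
</property>
```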
Then just run "bin/nutch parsechecker ..." to test. You'll see that
"protocol-http" is used rather than the "foo" version. If you rename the
plugin to "protocol-http-foo", the new plugin is used as expected.
This seems like a bug to me, though a pretty obscure one. I can file a
ticket if people think it's worthwhile.
Joe