[
https://issues.apache.org/jira/browse/NUTCH-901?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12912547#action_12912547
]
Markus Jelsma edited comment on NUTCH-901 at 9/20/10 11:53 AM:
---------------------------------------------------------------
Here's a patch for version 1.2 (that's the NUTCH-901-MarkusJelsma.998958.patch
file). It includes a backward compatible setting in nutch-default.xml and
handles the setting the the MoreIndexingFilter.java. It's tested and behaves as
expected on my 1.2 up to date check out.
was (Author: markus17):
Here's a patch for version 1.2. It includes a backward compatible setting
in nutch-default.xml and handles the setting the the MoreIndexingFilter.java.
It's tested and behaves as expected on my 1.2 up to date check out.
> Make index-more plug-in configurable
> ------------------------------------
>
> Key: NUTCH-901
> URL: https://issues.apache.org/jira/browse/NUTCH-901
> Project: Nutch
> Issue Type: Improvement
> Components: indexer
> Affects Versions: 1.2, 2.0
> Reporter: Markus Jelsma
> Fix For: 2.0
>
> Attachments: NUTCH-901-MarkusJelsma.998958.patch,
> NUTCH-901-trunk.998961.patch
>
>
> In my case, i don't want the index-more plug-in to split content-types on
> slash. Tokenization is something a Solr instance should take care of. Instead
> of removing the code (which would break compatibility for users that rely on
> it), we need a way to configure the plug-in not to split the content-type.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.