[ 
https://issues.apache.org/jira/browse/NUTCH-901?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Markus Jelsma updated NUTCH-901:
--------------------------------

    Attachment: NUTCH-901-MarkusJelsma.998958.patch

Here's a patch for version 1.2. It adds a backward-compatible setting to 
nutch-default.xml and handles that setting in MoreIndexingFilter.java. It's 
tested and behaves as expected on my up-to-date 1.2 checkout.
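
As a rough illustration of the intended behaviour (this is not the code in the 
attached patch), the sketch below gates the split behind a boolean flag. The 
class name, the method name and the flag are hypothetical, and it assumes that 
splitting adds the two parts of the content-type alongside the full value. In 
the plug-in itself the flag would presumably be read from the Nutch 
configuration.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical stand-alone sketch; names are illustrative, not taken from the patch.
public class ContentTypeFields {

    // Returns the values to index for a content-type.
    // Assumption: with splitting enabled, the full type is kept and the
    // parts before and after the slash are added as extra values.
    static List<String> typeValues(String contentType, boolean splitOnSlash) {
        List<String> values = new ArrayList<>();
        values.add(contentType);
        if (splitOnSlash) {
            int slash = contentType.indexOf('/');
            // Only split when there is text on both sides of the slash.
            if (slash > 0 && slash < contentType.length() - 1) {
                values.add(contentType.substring(0, slash));
                values.add(contentType.substring(slash + 1));
            }
        }
        return values;
    }

    public static void main(String[] args) {
        System.out.println(typeValues("text/html", true));   // [text/html, text, html]
        System.out.println(typeValues("text/html", false));  // [text/html]
    }
}
```

Defaulting the flag to true would keep the current behaviour for users who 
rely on the split values, which is the backward compatibility mentioned above.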

> Make index-more plug-in configurable
> ------------------------------------
>
>                 Key: NUTCH-901
>                 URL: https://issues.apache.org/jira/browse/NUTCH-901
>             Project: Nutch
>          Issue Type: Improvement
>          Components: indexer
>    Affects Versions: 1.2, 2.0
>            Reporter: Markus Jelsma
>             Fix For: 2.0
>
>         Attachments: NUTCH-901-MarkusJelsma.998958.patch
>
>
> In my case, I don't want the index-more plug-in to split content-types on 
> slash. Tokenization is something a Solr instance should take care of. Instead 
> of removing the code (which would break compatibility for users that rely on 
> it), we need a way to configure the plug-in not to split the content-type.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
