[
https://issues.apache.org/jira/browse/NUTCH-2502?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16335994#comment-16335994
]
Moreno Feltscher commented on NUTCH-2502:
-----------------------------------------
Pull request: https://github.com/apache/nutch/pull/280
> Any23 Plugin: Add Content-Type filtering
> ----------------------------------------
>
> Key: NUTCH-2502
> URL: https://issues.apache.org/jira/browse/NUTCH-2502
> Project: Nutch
> Issue Type: Improvement
> Reporter: Moreno Feltscher
> Assignee: Moreno Feltscher
> Priority: Major
>
> It should be possible to filter based on a document's Content-Type when using
> Any23 extractors.
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)