[
https://issues.apache.org/jira/browse/NUTCH-2703?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Markus Jelsma updated NUTCH-2703:
-
Priority: Minor (was: Critical)
> parse-tika: Boilerpipe should not run for non-(X)HTML pages
>
[
https://issues.apache.org/jira/browse/NUTCH-2703?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Markus Jelsma updated NUTCH-2703:
-
Attachment: NUTCH-2703.patch
> parse-tika: Boilerpipe should not run for non-(X)HTML pages
>
[
https://issues.apache.org/jira/browse/NUTCH-2703?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel updated NUTCH-2703:
---
Summary: parse-tika: Boilerpipe should not run for non-(X)HTML pages (was:
Boilerpipe should
[
https://issues.apache.org/jira/browse/NUTCH-2703?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel updated NUTCH-2703:
---
Component/s: plugin
> parse-tika: Boilerpipe should not run for non-(X)HTML pages
> -
4 matches
Mail list logo