[
https://issues.apache.org/jira/browse/NUTCH-2584?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16490728#comment-16490728
]
Sebastian Nagel commented on NUTCH-2584:
----------------------------------------
Actually, the Nekohtml parser was used for the DOM and robots-meta unit tests.
Now the Tika parser is used to build the DOM tree of the test documents. That
made further fixes necessary because Tika renames some of the meta elements
resp. their attributes.
> Upgrade parse-tika to use Tika 1.18
> -----------------------------------
>
> Key: NUTCH-2584
> URL: https://issues.apache.org/jira/browse/NUTCH-2584
> Project: Nutch
> Issue Type: Improvement
> Components: parser
> Affects Versions: 1.14
> Reporter: Sebastian Nagel
> Priority: Minor
> Fix For: 1.15
>
>
> Tika 1.18 is released and NUTCH-2583 includes and upgrade of tika-core.
> See
> [howto_upgrade_tika|https://github.com/apache/nutch/blob/master/src/plugin/parse-tika/howto_upgrade_tika.txt].
>
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)