[ https://issues.apache.org/jira/browse/ANY23-26?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13433795#comment-13433795 ]
Peter Ansell commented on ANY23-26:
-----------------------------------
This cannot actually be a Crawler4J bug: Crawler4J is used only in the basic
crawler plugin, and the bug appears above that level, in the core module tests.
Another candidate is that Tika-1.2 introduces a dependency on JDOM while we
also use DOM4J, which may produce a conflict. I am investigating that now.
> Upgrade dependency to Apache Tika 1.1
> -------------------------------------
>
> Key: ANY23-26
> URL: https://issues.apache.org/jira/browse/ANY23-26
> Project: Apache Any23
> Issue Type: Improvement
> Affects Versions: 0.7.0
> Reporter: Lewis John McGibbney
> Fix For: 0.8.0
>
> Attachments: 14-img-src-data-url.html, 19-object-data-data-uri.html,
> ANY23-26.patch, org.apache.any23.extractor.html.HCardExtractorTest.txt
>
>
> Upgrading to Apache Tika will hopefully provide a wealth of benefits for the
> project. This issue should act as an umbrella issue to track these changes.
> It would be great to delegate as much as possible to Tika if deemed suitable
> to enhance functionality and to reduce our dependencies on external projects.