David Morana created TIKA-1041:
----------------------------------
Summary: Tika 1.2 universalcharset errors
Key: TIKA-1041
URL: https://issues.apache.org/jira/browse/TIKA-1041
Project: Tika
Issue Type: Bug
Affects Versions: 1.2
Environment: I'm running solr 4.0 with tika 1.2 on tomcat 7.0.8 with
manifoldcf v1.1dev
Reporter: David Morana
Fix For: 1.3, 1.2
This is somewhat confusing and frustrating. I successfully crawled Opentext
using all of the above. then I recrawled and it aborted almost immediately.
It choked on images, so I excluded them for now.
but now it's choking on txt files!
sometimes I get this error
SEVERE: null:java.lang.RuntimeException: java.lang.NoClassDefFoundError:
org/mozilla/universalchardet/CharsetListener
and sometimes I get this one
SEVERE: null:java.lang.RuntimeException: java.lang.NoClassDefFoundError:
org/apache/tika/parser/txt/UniversalEncodingListener
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira