Inline the ICU4J charset detection logic
----------------------------------------

                 Key: TIKA-233
                 URL: https://issues.apache.org/jira/browse/TIKA-233
             Project: Tika
          Issue Type: Improvement
          Components: parser
            Reporter: Jukka Zitting
            Assignee: Jukka Zitting
            Priority: Minor
             Fix For: 0.4

The ICU4J library that we use for handling plain text documents is pretty large (4.3MB) and we only need a small subset of its functionality.

I just asked [1] for ICU4J to provide a smaller component jar that we could depend on, but until that happens I'd like to simply replace the entire dependency with copies of relevant classes from ICU4J.

[1] http://bugs.icu-project.org/trac/ticket/6964