[ https://issues.apache.org/jira/browse/TIKA-233?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Jukka Zitting resolved TIKA-233. -------------------------------- Resolution: Fixed Charset detection code inlined in revision 777643. > Inline the ICU4J charset detection logic > ---------------------------------------- > > Key: TIKA-233 > URL: https://issues.apache.org/jira/browse/TIKA-233 > Project: Tika > Issue Type: Improvement > Components: parser > Reporter: Jukka Zitting > Assignee: Jukka Zitting > Priority: Minor > Fix For: 0.4 > > > The ICU4J library that we use for handling plain text documents is pretty > large (4.3MB) and we only need a small subset of its functionality. > I just asked [1] for ICU4J to provide a smaller component jar that we could > depend on, but until that happens I'd like to simply replace the entire > dependency with copies of relevant classes from ICU4J. > [1] http://bugs.icu-project.org/trac/ticket/6964 -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.