[jira] [Commented] (TIKA-2044) MboxParser wrongly concatenates multiple text lines into single header line

2016-07-27 Thread Nick Burch (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2044?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15396477#comment-15396477 ] Nick Burch commented on TIKA-2044: Are you able to reproduce this in a simple JUnit unit test case?

[jira] [Updated] (TIKA-2043) junrar tika outofmemoryerror

2016-07-27 Thread Nicholas DiPiazza (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2043?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nicholas DiPiazza updated TIKA-2043: Description: I commonly see junrar-related OOM errors. How can I prevent them? It loaded a 2GB

[jira] [Created] (TIKA-2044) MboxParser wrongly concatenates multiple text lines into single header line

2016-07-27 Thread Vjeran Marcinko (JIRA)
Vjeran Marcinko created TIKA-2044: Summary: MboxParser wrongly concatenates multiple text lines into single header line Key: TIKA-2044 URL: https://issues.apache.org/jira/browse/TIKA-2044 Project:

[jira] [Created] (TIKA-2043) junrar tika outofmemoryerror

2016-07-27 Thread Nicholas DiPiazza (JIRA)
Nicholas DiPiazza created TIKA-2043: Summary: junrar tika outofmemoryerror Key: TIKA-2043 URL: https://issues.apache.org/jira/browse/TIKA-2043 Project: Tika Issue Type: Bug

[jira] [Reopened] (TIKA-2041) Charset detection doesn't appear to be thread-safe

2016-07-27 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2041?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Allison reopened TIKA-2041: Thanks to [~gagravarr] for pointing out some other Tika-custom bits. I'm reopening this until I have a

[jira] [Commented] (TIKA-2040) OOM when parsing a corrupted CHM

2016-07-27 Thread Luis Filipe Nassif (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2040?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15395467#comment-15395467 ] Luis Filipe Nassif commented on TIKA-2040: I have to thank you [~talli...@apache.org], the fix

[jira] [Commented] (TIKA-2041) Charset detection doesn't appear to be thread-safe

2016-07-27 Thread Nick Burch (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2041?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15395282#comment-15395282 ] Nick Burch commented on TIKA-2041: Running "git log" and "git diff" on the file suggests other custom

[jira] [Commented] (TIKA-2041) Charset detection doesn't appear to be thread-safe

2016-07-27 Thread Christian L. (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2041?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15395176#comment-15395176 ] Christian L. commented on TIKA-2041: Thanks for the fix! That was super-quick. You rock! > Charset