[jira] [Updated] (TIKA-2519) Issue parsing multiple CHM files concurrently

2017-12-06 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2519?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Allison updated TIKA-2519: -- Priority: Blocker (was: Minor) > Issue parsing multiple CHM files concurrently >

[jira] [Commented] (TIKA-2519) Issue parsing multiple CHM files concurrently

2017-12-06 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2519?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16281225#comment-16281225 ] Tim Allison commented on TIKA-2519: --- Thank you for opening this issue. That’s definitely a bug. Parsers

[jira] [Created] (TIKA-2519) Issue parsing multiple CHM files concurrently

2017-12-06 Thread Eamonn Saunders (JIRA)
Eamonn Saunders created TIKA-2519: - Summary: Issue parsing multiple CHM files concurrently Key: TIKA-2519 URL: https://issues.apache.org/jira/browse/TIKA-2519 Project: Tika Issue Type: Bug

Re: Tika 1.17?

2017-12-06 Thread Luís Filipe Nassif
Hi Tim, I've had a briefly look at exceptions folder, seems we are much better with ppt (4677 fixed exceptions) and pdf (7798), but there are 208 new exceptions with ppt. I did not check the files to see if they are corrupted, but some common tokens were lost. Below the most common new