[jira] [Created] (TIKA-2735) notes and footer contents are duplicated in extracting text from power point slides

2018-09-24 Thread feng ye (JIRA)
feng ye created TIKA-2735: - Summary: notes and footer contents are duplicated in extracting text from power point slides Key: TIKA-2735 URL: https://issues.apache.org/jira/browse/TIKA-2735 Project: Tika

[jira] [Commented] (TIKA-2727) Parsing and detect mime type of XML file stuck in infinite loop

2018-09-24 Thread Slava G (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2727?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16626445#comment-16626445 ] Slava G commented on TIKA-2727: --- Ok, thanks, hope you'll be able to fix this quickly. Thanks a lot >

[jira] [Reopened] (TIKA-2727) Parsing and detect mime type of XML file stuck in infinite loop

2018-09-24 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2727?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Allison reopened TIKA-2727: --- > Parsing and detect mime type of XML file stuck in infinite loop >

[jira] [Updated] (TIKA-2727) Parsing and detect mime type of XML file stuck in infinite loop

2018-09-24 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2727?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Allison updated TIKA-2727: -- Priority: Blocker (was: Major) > Parsing and detect mime type of XML file stuck in infinite loop >

[jira] [Commented] (TIKA-2727) Parsing and detect mime type of XML file stuck in infinite loop

2018-09-24 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2727?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16626439#comment-16626439 ] Tim Allison commented on TIKA-2727: --- Y. I can reproduce this on the 10th iteration single threaded. >

[jira] [Comment Edited] (TIKA-2727) Parsing and detect mime type of XML file stuck in infinite loop

2018-09-24 Thread Slava G (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2727?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16626437#comment-16626437 ] Slava G edited comment on TIKA-2727 at 9/24/18 8:41 PM: 10 iterations inside for

[jira] [Commented] (TIKA-2727) Parsing and detect mime type of XML file stuck in infinite loop

2018-09-24 Thread Slava G (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2727?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16626437#comment-16626437 ] Slava G commented on TIKA-2727: --- 10 iterations inside for loop (same thread), file

[jira] [Updated] (TIKA-2727) Parsing and detect mime type of XML file stuck in infinite loop

2018-09-24 Thread Slava G (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2727?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Slava G updated TIKA-2727: -- Attachment: 1_6e4b115e-7d2d-45f1-a842-35b5ad7ba559 > Parsing and detect mime type of XML file stuck in infinite

[jira] [Commented] (TIKA-2727) Parsing and detect mime type of XML file stuck in infinite loop

2018-09-24 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2727?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16626434#comment-16626434 ] Tim Allison commented on TIKA-2727: --- This is with the attached .xml file above?  Are you running

[jira] [Comment Edited] (TIKA-2727) Parsing and detect mime type of XML file stuck in infinite loop

2018-09-24 Thread Slava G (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2727?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16626408#comment-16626408 ] Slava G edited comment on TIKA-2727 at 9/24/18 8:34 PM: Tried to reproduce, after

[jira] [Commented] (TIKA-2727) Parsing and detect mime type of XML file stuck in infinite loop

2018-09-24 Thread Slava G (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2727?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16626408#comment-16626408 ] Slava G commented on TIKA-2727: --- Tried to reproduce, after a few hundred XML files that were transferred to TIKA for

Re: 1.19.1?

2018-09-24 Thread Nick Burch
On Mon, 24 Sep 2018, Tim Allison wrote: Aside from the problem with users and non-standard XML parsers, were there any other show-stoppers in POI 4.0.0? Is there a reason to wait for POI 4.0.1? I think, in terms of Tika affecting bugs, it was the xml parser stuff, and commons compress

[jira] [Commented] (TIKA-2732) Allow configuration of XMLReaderUtils via TikaConfig

2018-09-24 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2732?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16626286#comment-16626286 ] Hudson commented on TIKA-2732: -- SUCCESS: Integrated in Jenkins build tika-branch-1x #97 (See

[jira] [Commented] (TIKA-2733) Fix oom test in TikaServerIntegrationTest

2018-09-24 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2733?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16626287#comment-16626287 ] Hudson commented on TIKA-2733: -- SUCCESS: Integrated in Jenkins build tika-branch-1x #97 (See

[jira] [Commented] (TIKA-2732) Allow configuration of XMLReaderUtils via TikaConfig

2018-09-24 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2732?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16626275#comment-16626275 ] Hudson commented on TIKA-2732: -- SUCCESS: Integrated in Jenkins build Tika-trunk #1565 (See

[jira] [Commented] (TIKA-2733) Fix oom test in TikaServerIntegrationTest

2018-09-24 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2733?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16626276#comment-16626276 ] Hudson commented on TIKA-2733: -- SUCCESS: Integrated in Jenkins build Tika-trunk #1565 (See

[jira] [Commented] (TIKA-2733) Fix oom test in TikaServerIntegrationTest

2018-09-24 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2733?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16626249#comment-16626249 ] Hudson commented on TIKA-2733: -- UNSTABLE: Integrated in Jenkins build tika-2.x-windows #320 (See

[jira] [Commented] (TIKA-2732) Allow configuration of XMLReaderUtils via TikaConfig

2018-09-24 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2732?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16626248#comment-16626248 ] Hudson commented on TIKA-2732: -- UNSTABLE: Integrated in Jenkins build tika-2.x-windows #320 (See

[jira] [Resolved] (TIKA-2732) Allow configuration of XMLReaderUtils via TikaConfig

2018-09-24 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2732?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Allison resolved TIKA-2732. --- Resolution: Fixed Assignee: Tim Allison Fix Version/s: 1.19.1 2.0.0

[jira] [Commented] (TIKA-2732) Allow configuration of XMLReaderUtils via TikaConfig

2018-09-24 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2732?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16626211#comment-16626211 ] Tim Allison commented on TIKA-2732: --- Example: {noformat} {noformat} > Allow configuration of

[jira] [Created] (TIKA-2733) Fix oom test in TikaServerIntegrationTest

2018-09-24 Thread Tim Allison (JIRA)
Tim Allison created TIKA-2733: - Summary: Fix oom test in TikaServerIntegrationTest Key: TIKA-2733 URL: https://issues.apache.org/jira/browse/TIKA-2733 Project: Tika Issue Type: Task

[jira] [Commented] (TIKA-2727) Parsing and detect mime type of XML file stuck in infinite loop

2018-09-24 Thread Slava G (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2727?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16625985#comment-16625985 ] Slava G commented on TIKA-2727: --- Thanks, will look. Could it be that in 1.19 the solution is not always working?

[jira] [Commented] (TIKA-2727) Parsing and detect mime type of XML file stuck in infinite loop

2018-09-24 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2727?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16625979#comment-16625979 ] Tim Allison commented on TIKA-2727: --- See TIKA-2732. > Parsing and detect mime type of XML file stuck in

[jira] [Created] (TIKA-2732) Allow configuration of XMLReaderUtils via TikaConfig

2018-09-24 Thread Tim Allison (JIRA)
Tim Allison created TIKA-2732: - Summary: Allow configuration of XMLReaderUtils via TikaConfig Key: TIKA-2732 URL: https://issues.apache.org/jira/browse/TIKA-2732 Project: Tika Issue Type: Task

[jira] [Commented] (TIKA-2638) Tika server fails with status 500 if X-Tika-OCRLanguage set to multiple OCR dictionaries

2018-09-24 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2638?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16625946#comment-16625946 ] Hudson commented on TIKA-2638: -- SUCCESS: Integrated in Jenkins build tika-branch-1x #96 (See

[jira] [Commented] (TIKA-2638) Tika server fails with status 500 if X-Tika-OCRLanguage set to multiple OCR dictionaries

2018-09-24 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2638?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16625930#comment-16625930 ] Hudson commented on TIKA-2638: -- UNSTABLE: Integrated in Jenkins build Tika-trunk #1564 (See

[jira] [Commented] (TIKA-2638) Tika server fails with status 500 if X-Tika-OCRLanguage set to multiple OCR dictionaries

2018-09-24 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2638?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16625907#comment-16625907 ] Hudson commented on TIKA-2638: -- UNSTABLE: Integrated in Jenkins build tika-2.x-windows #319 (See

Re: 1.19.1?

2018-09-24 Thread Tim Allison
Nick, Aside from the problem with users and non-standard XML parsers, were there any other show-stoppers in POI 4.0.0? Is there a reason to wait for POI 4.0.1? On Fri, Sep 21, 2018 at 12:48 PM Chris Mattmann wrote: > Let’s roll it…. > From: Tim Allison > Reply-To:

[jira] [Resolved] (TIKA-2638) Tika server fails with status 500 if X-Tika-OCRLanguage set to multiple OCR dictionaries

2018-09-24 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2638?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Allison resolved TIKA-2638. --- Resolution: Fixed Assignee: Tim Allison Fix Version/s: 1.19.1 Sorry for not getting