[jira] [Commented] (TIKA-2306) Update Inception v3 to Inception v4 in Object recognition parser

2017-04-13 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2306?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15968627#comment-15968627 ] ASF GitHub Bot commented on TIKA-2306: -- KranthiGV commented on a change in pull request #163:

[jira] [Commented] (TIKA-2306) Update Inception v3 to Inception v4 in Object recognition parser

2017-04-13 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2306?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15968623#comment-15968623 ] ASF GitHub Bot commented on TIKA-2306: -- thammegowda commented on a change in pull request #163:

[jira] [Commented] (TIKA-2322) Video labeling using existing ObjectRecognition

2017-04-13 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2322?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15968622#comment-15968622 ] ASF GitHub Bot commented on TIKA-2322: -- smadha commented on a change in pull request #168: fix for

[jira] [Commented] (TIKA-2322) Video labeling using existing ObjectRecognition

2017-04-13 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2322?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15968619#comment-15968619 ] ASF GitHub Bot commented on TIKA-2322: -- smadha commented on a change in pull request #168: fix for

[jira] [Commented] (TIKA-2322) Video labeling using existing ObjectRecognition

2017-04-13 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2322?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15968615#comment-15968615 ] ASF GitHub Bot commented on TIKA-2322: -- smadha commented on a change in pull request #168: fix for

[jira] [Commented] (TIKA-2306) Update Inception v3 to Inception v4 in Object recognition parser

2017-04-13 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2306?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15968616#comment-15968616 ] ASF GitHub Bot commented on TIKA-2306: -- thammegowda commented on issue #163: TIKA-2306: Update

[jira] [Commented] (TIKA-2322) Video labeling using existing ObjectRecognition

2017-04-13 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2322?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15968613#comment-15968613 ] ASF GitHub Bot commented on TIKA-2322: -- thammegowda commented on a change in pull request #168: fix

[jira] [Commented] (TIKA-2322) Video labeling using existing ObjectRecognition

2017-04-13 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2322?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15968612#comment-15968612 ] ASF GitHub Bot commented on TIKA-2322: -- thammegowda commented on a change in pull request #168: fix

[jira] [Commented] (TIKA-2306) Update Inception v3 to Inception v4 in Object recognition parser

2017-04-13 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2306?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15968610#comment-15968610 ] ASF GitHub Bot commented on TIKA-2306: -- KranthiGV commented on a change in pull request #163:

[jira] [Commented] (TIKA-2322) Video labeling using existing ObjectRecognition

2017-04-13 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2322?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15968602#comment-15968602 ] ASF GitHub Bot commented on TIKA-2322: -- smadha commented on issue #168: fix for TIKA-2322 contributed

[jira] [Commented] (TIKA-2306) Update Inception v3 to Inception v4 in Object recognition parser

2017-04-13 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2306?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15968599#comment-15968599 ] ASF GitHub Bot commented on TIKA-2306: -- thammegowda commented on a change in pull request #163:

[jira] [Commented] (TIKA-2306) Update Inception v3 to Inception v4 in Object recognition parser

2017-04-13 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2306?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15968550#comment-15968550 ] ASF GitHub Bot commented on TIKA-2306: -- thammegowda commented on issue #163: TIKA-2306: Update

[jira] [Commented] (TIKA-1631) OutOfMemoryException in ZipContainerDetector

2017-04-13 Thread Luis Filipe Nassif (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1631?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15968184#comment-15968184 ] Luis Filipe Nassif commented on TIKA-1631: -- See Compress-382 for a related conversation. >

Re: [COMPRESS] zip-bomb prevention for Z?

2017-04-13 Thread Luís Filipe Nassif
I have reported a similar issue to them, see Compress-382, maybe those issues should be handled at Compress side, if I understood correctly the API contract. Luis Em 13 de abr de 2017 3:36 PM, "Allison, Timothy B." escreveu: On TIKA-1631 [1], users have observed that a

[jira] [Commented] (TIKA-1631) OutOfMemoryException in ZipContainerDetector

2017-04-13 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1631?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15968128#comment-15968128 ] Tim Allison commented on TIKA-1631: --- Ok, thank you. Y, you're right. I have a patch semi-ready that

[jira] [Comment Edited] (TIKA-1631) OutOfMemoryException in ZipContainerDetector

2017-04-13 Thread JIRA
[ https://issues.apache.org/jira/browse/TIKA-1631?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15968123#comment-15968123 ] Thorsten Schäfer edited comment on TIKA-1631 at 4/13/17 7:37 PM: -

[jira] [Commented] (TIKA-1631) OutOfMemoryException in ZipContainerDetector

2017-04-13 Thread JIRA
[ https://issues.apache.org/jira/browse/TIKA-1631?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15968123#comment-15968123 ] Thorsten Schäfer commented on TIKA-1631: [~talli...@mitre.org] my belief is if we can prevent the

[jira] [Commented] (TIKA-2311) Preserve "x-tika-ooxml" mime value for truncated ooxml files

2017-04-13 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2311?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15968089#comment-15968089 ] Hudson commented on TIKA-2311: -- SUCCESS: Integrated in Jenkins build Tika-trunk #1239 (See

[jira] [Commented] (TIKA-2311) Preserve "x-tika-ooxml" mime value for truncated ooxml files

2017-04-13 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2311?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15968088#comment-15968088 ] Hudson commented on TIKA-2311: -- SUCCESS: Integrated in Jenkins build tika-2.x #243 (See

[jira] [Commented] (TIKA-2311) Preserve "x-tika-ooxml" mime value for truncated ooxml files

2017-04-13 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2311?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15968059#comment-15968059 ] Hudson commented on TIKA-2311: -- UNSTABLE: Integrated in Jenkins build tika-2.x-windows #197 (See

[jira] [Commented] (TIKA-1631) OutOfMemoryException in ZipContainerDetector

2017-04-13 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1631?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15968038#comment-15968038 ] Tim Allison commented on TIKA-1631: --- Sent note to dev@compress cc'd d...@tika...we'll see what

[COMPRESS] zip-bomb prevention for Z?

2017-04-13 Thread Allison, Timothy B.
On TIKA-1631 [1], users have observed that a corrupt Z file can cause an OOM at Internal_.InternalLZWStream.initializeTable. Should we try to protect against this at the Tika level, or should we open an issue on commons-compress's JIRA? A second question, we're creating a stream with the

[jira] [Updated] (TIKA-2311) Preserve "x-tika-ooxml" mime value for truncated ooxml files

2017-04-13 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2311?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Allison updated TIKA-2311: -- Summary: Preserve "x-tika-ooxml" mime value for truncated ooxml files (was: Create x-tika-ooxml-unk

[jira] [Commented] (TIKA-1631) OutOfMemoryException in ZipContainerDetector

2017-04-13 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1631?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15967997#comment-15967997 ] Tim Allison commented on TIKA-1631: --- [~thorsten.schaefer], your recommendation is that we copy/paste the

[jira] [Commented] (TIKA-2311) Create x-tika-ooxml-unk mime type (?)

2017-04-13 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2311?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15967876#comment-15967876 ] Tim Allison commented on TIKA-2311: --- Ha, Nifi overrides our def and just calls it {{.tar}}:

[jira] [Commented] (TIKA-2311) Create x-tika-ooxml-unk mime type (?)

2017-04-13 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2311?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15967868#comment-15967868 ] Tim Allison commented on TIKA-2311: --- When I use a static MediaTypesRegistry in PackageParser, the new

[jira] [Commented] (TIKA-2320) java.util.zip.DataFormatException when parsing a PDF

2017-04-13 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2320?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15967846#comment-15967846 ] Tim Allison commented on TIKA-2320: --- Thank you, [~tilman]! > java.util.zip.DataFormatException when

[jira] [Commented] (TIKA-2320) java.util.zip.DataFormatException when parsing a PDF

2017-04-13 Thread Tilman Hausherr (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2320?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15967844#comment-15967844 ] Tilman Hausherr commented on TIKA-2320: --- Fixed in PDFBox 2.0.6 despite the user not attaching a PDF

[jira] [Commented] (TIKA-1631) OutOfMemoryException in ZipContainerDetector

2017-04-13 Thread JIRA
[ https://issues.apache.org/jira/browse/TIKA-1631?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15967838#comment-15967838 ] Thorsten Schäfer commented on TIKA-1631: Hi Tim, I have attached the file cache.mpgindex that just

[jira] [Updated] (TIKA-1631) OutOfMemoryException in ZipContainerDetector

2017-04-13 Thread JIRA
[ https://issues.apache.org/jira/browse/TIKA-1631?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Thorsten Schäfer updated TIKA-1631: --- Attachment: cache.mpgindex This file should produce a java.lang.OutOfMemoryError: Java heap

[jira] [Commented] (TIKA-2326) java.lang.OutOfMemoryError: Java heap space

2017-04-13 Thread Md (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2326?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15967812#comment-15967812 ] Md commented on TIKA-2326: -- Thanks once again, I was going through above mention discussion unfortunately none of

[jira] [Commented] (TIKA-2326) java.lang.OutOfMemoryError: Java heap space

2017-04-13 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2326?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15967797#comment-15967797 ] Tim Allison commented on TIKA-2326: --- bq. Is it possible to add a timer in the parser? Unfortunately,

[jira] [Commented] (TIKA-2326) java.lang.OutOfMemoryError: Java heap space

2017-04-13 Thread Md (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2326?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15967786#comment-15967786 ] Md commented on TIKA-2326: -- We have many files which are archived, for them, RecursiveParserWrapper is doing the

[jira] [Commented] (TIKA-2326) java.lang.OutOfMemoryError: Java heap space

2017-04-13 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2326?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15967769#comment-15967769 ] Tim Allison commented on TIKA-2326: --- Great. I'm happy to see you're using the RecursiveParserWrapper!

[jira] [Closed] (TIKA-2326) java.lang.OutOfMemoryError: Java heap space

2017-04-13 Thread Md (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2326?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Md closed TIKA-2326. Resolution: Fixed Fixed in 1.13 or later version > java.lang.OutOfMemoryError: Java heap space >

[jira] [Updated] (TIKA-2326) java.lang.OutOfMemoryError: Java heap space

2017-04-13 Thread Md (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2326?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Md updated TIKA-2326: - Fix Version/s: 1.13 > java.lang.OutOfMemoryError: Java heap space > --- > >

[jira] [Commented] (TIKA-2326) java.lang.OutOfMemoryError: Java heap space

2017-04-13 Thread Md (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2326?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15967764#comment-15967764 ] Md commented on TIKA-2326: -- Yes, you are right. It did fix in recent version(1.14). Thanks so much >

[jira] [Commented] (TIKA-2326) java.lang.OutOfMemoryError: Java heap space

2017-04-13 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2326?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15967734#comment-15967734 ] Tim Allison commented on TIKA-2326: --- Pretty sure these are duplicates. Let me know if not. Looks like

[jira] [Comment Edited] (TIKA-2326) java.lang.OutOfMemoryError: Java heap space

2017-04-13 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2326?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15967707#comment-15967707 ] Tim Allison edited comment on TIKA-2326 at 4/13/17 3:13 PM: We switched out

[jira] [Commented] (TIKA-1631) OutOfMemoryException in ZipContainerDetector

2017-04-13 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1631?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15967709#comment-15967709 ] Tim Allison commented on TIKA-1631: --- Can you share a triggering document, by chance? >

[jira] [Updated] (TIKA-2326) java.lang.OutOfMemoryError: Java heap space

2017-04-13 Thread Md (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2326?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Md updated TIKA-2326: - Description: I am using RecursiveParserWrapper with AutoDetectParser() and here is the part of my code which is doing

[jira] [Updated] (TIKA-2326) java.lang.OutOfMemoryError: Java heap space

2017-04-13 Thread Md (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2326?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Md updated TIKA-2326: - Attachment: 5d3e815263c73061d8804e15db3ammn0789_CLEAN_REVISED.docx Here is the file I am having issue with >

[jira] [Commented] (TIKA-1631) OutOfMemoryException in ZipContainerDetector

2017-04-13 Thread JIRA
[ https://issues.apache.org/jira/browse/TIKA-1631?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15967654#comment-15967654 ] Thorsten Schäfer commented on TIKA-1631: Hi, unfortunately we are also running into this bug with

Re: 1.15?

2017-04-13 Thread Konstantin Gribov
Preliminary +1 from me, I'll the a closer look this weekend чт, 13 апр. 2017, 0:00 Allison, Timothy B. : > All, > POI is voting on rc1 of the next release. Once that's released and > integrated into Tika, let's start the release process for Tika 1.15, end of > next week,