[jira] [Commented] (TIKA-3642) Getting java.lang.OutOfMemoryError: Java heap space when parsing PDF file

2022-01-12 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3642?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17475001#comment-17475001 ] Tim Allison commented on TIKA-3642: --- I regret I can't figure out what's going on with logback in the

[jira] [Commented] (TIKA-3642) Getting java.lang.OutOfMemoryError: Java heap space when parsing PDF file

2022-01-12 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3642?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17474997#comment-17474997 ] Tim Allison commented on TIKA-3642: --- Got your file.  Thank you.  That was critical.  What's going on is

[jira] [Updated] (TIKA-3643) writeLimit for bytes in addition to characters

2022-01-12 Thread Josh Burchard (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3643?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Burchard updated TIKA-3643: Description: [~jmssiera] wrote up the enhancement request TIKA-3325 where he originally requested

[jira] [Commented] (TIKA-3645) Allow for key/value attrs in params within tika-config for parsers

2022-01-12 Thread Hudson (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3645?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17474718#comment-17474718 ] Hudson commented on TIKA-3645: -- SUCCESS: Integrated in Jenkins build Tika » tika-main-jdk8 #414 (See

[jira] [Commented] (TIKA-3642) Getting java.lang.OutOfMemoryError: Java heap space when parsing PDF file

2022-01-12 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3642?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17474674#comment-17474674 ] Tim Allison commented on TIKA-3642: --- If you're getting an infinite loop with 2.2.1 but not with 1.27 on

[jira] [Commented] (TIKA-3642) Getting java.lang.OutOfMemoryError: Java heap space when parsing PDF file

2022-01-12 Thread Tika User (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3642?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17474650#comment-17474650 ] Tika User commented on TIKA-3642: - Tried using setMaxMainMemoryBytes still seeing memory issues. The same

[jira] [Created] (TIKA-3645) Allow for key/value attrs in params within tika-config for parsers

2022-01-12 Thread Tim Allison (Jira)
Tim Allison created TIKA-3645: - Summary: Allow for key/value attrs in params within tika-config for parsers Key: TIKA-3645 URL: https://issues.apache.org/jira/browse/TIKA-3645 Project: Tika

[jira] [Commented] (TIKA-3644) OfficeParser can not detect embedded zip bomb in the office documents

2022-01-12 Thread Jira
[ https://issues.apache.org/jira/browse/TIKA-3644?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17474598#comment-17474598 ] Sergen Bağ commented on TIKA-3644: -- Hi [~tallison], I set 5 to MaximumPackageEntryDepth. My expectation

[jira] [Updated] (TIKA-3644) OfficeParser can not detect embedded zip bomb in the office documents

2022-01-12 Thread Jira
[ https://issues.apache.org/jira/browse/TIKA-3644?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sergen Bağ updated TIKA-3644: - Attachment: tika_exception.PNG > OfficeParser can not detect embedded zip bomb in the office documents >

[jira] [Comment Edited] (TIKA-3644) OfficeParser can not detect embedded zip bomb in the office documents

2022-01-12 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3644?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17474453#comment-17474453 ] Tim Allison edited comment on TIKA-3644 at 1/12/22, 11:16 AM: -- How are you

[jira] [Commented] (TIKA-3644) OfficeParser can not detect embedded zip bomb in the office documents

2022-01-12 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3644?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17474453#comment-17474453 ] Tim Allison commented on TIKA-3644: --- How are you calling Tika?  I'm not able to get an exception with

[jira] [Created] (TIKA-3644) OfficeParser can not detect embedded zip bomb in the office documents

2022-01-12 Thread Jira
Sergen Bağ created TIKA-3644: Summary: OfficeParser can not detect embedded zip bomb in the office documents Key: TIKA-3644 URL: https://issues.apache.org/jira/browse/TIKA-3644 Project: Tika