[ https://issues.apache.org/jira/browse/TIKA-741?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15140227#comment-15140227 ]
Tim Allison commented on TIKA-741:
----------------------------------

If the modification is really PDFBox 2.0.0 specific (how to load a file with the new memory config, etc.), please aim at my personal branch. For more general additions (e.g. XFA extraction), please aim at trunk on Apache's [git repo|https://git-wip-us.apache.org/repos/asf/tika.git], and I'll merge those with my PDFBox 2.0.0 branch after I commit them to our trunk.

> "Zip bomb" (XML nesting) detection is too strict
> ------------------------------------------------
>
>                 Key: TIKA-741
>                 URL: https://issues.apache.org/jira/browse/TIKA-741
>             Project: Tika
>          Issue Type: Bug
>          Components: parser
>    Affects Versions: 0.10
>            Reporter: Erik Hetzner
>            Assignee: Jukka Zitting
>            Priority: Minor
>             Fix For: 1.0
>
>
> I get "zip bomb" errors from many HTML documents, e.g.
> http://www.akhbaar.org/wesima_articles/index-20100101-82736.html
> Is there a way that the element nesting level could be made configurable? 30
> elements just doesn't seem to be enough.
> Thanks!



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)