[
https://issues.apache.org/jira/browse/PDFBOX-3442?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15407405#comment-15407405
]
Egbert commented on PDFBOX-3442:
--------------------------------
If it's any help, our crawler found another similar but not exactly the same
version of the PDF I linked to before on a different website, making it run out
of memory again:
http://www.degroeneeenvoud.nl/wp-content/uploads/2011/03/algemene_voorwaarden.pdf
> OOM for single page pdf file
> ----------------------------
>
> Key: PDFBOX-3442
> URL: https://issues.apache.org/jira/browse/PDFBOX-3442
> Project: PDFBox
> Issue Type: Improvement
> Components: PDModel
> Affects Versions: 2.0.2, 2.0.3, 2.1.0
> Reporter: Tim Allison
> Priority: Minor
> Attachments: res.diff
>
>
> On TIKA-2045, a user posted a single page document that leads to OOM with
> -Xmx1g. I confirmed this with PDFBox's ExtractText.
> Might be a memory leak with the fonts? See
> [this|https://issues.apache.org/jira/browse/TIKA-2045?focusedCommentId=15399583&page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#comment-15399583]
> for some diagnostics I did.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]