[ https://issues.apache.org/jira/browse/PDFBOX-5991?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17946377#comment-17946377 ]
Tilman Hausherr edited comment on PDFBOX-5991 at 4/22/25 11:03 AM: ------------------------------------------------------------------- >From Erik Brangs in the users mailing list >https://lists.apache.org/thread/pc8wm93rbzxzv049l2ccqdrv1ndkqnfy : {quote} using PDFBox 3.0.4 to extract text from the PDF from https://d-nb.info/1349431796/34 fails with a NegativeArraySizeException after using up multiple GBs of memory. I used -Xmx12G because the extraction failed with an OutOfMemoryError for -Xmx6G. {quote} was (Author: tilman): >From Erik Branks in the users mailing list >https://lists.apache.org/thread/pc8wm93rbzxzv049l2ccqdrv1ndkqnfy : {quote} using PDFBox 3.0.4 to extract text from the PDF from https://d-nb.info/1349431796/34 fails with a NegativeArraySizeException after using up multiple GBs of memory. I used -Xmx12G because the extraction failed with an OutOfMemoryError for -Xmx6G. {quote} > Mitigate problems with PDF file with huge fonts > ----------------------------------------------- > > Key: PDFBOX-5991 > URL: https://issues.apache.org/jira/browse/PDFBOX-5991 > Project: PDFBox > Issue Type: Bug > Reporter: Tilman Hausherr > Priority: Major > Attachments: PDFBOX-5991-p57.pdf > > -- This message was sent by Atlassian Jira (v8.20.10#820010) --------------------------------------------------------------------- To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org For additional commands, e-mail: dev-h...@pdfbox.apache.org