Hi, using PDFBox 3.0.4 to extract text from the PDF from https://d-nb.info/1349431796/34 fails with a NegativeArraySizeException after using up multiple GBs of memory. I used -Xmx12G because the extraction failed with an OutOfMemoryError for -Xmx6G.
The problem also appears with 4.0.0-SNAPSHOT. Is this a bug in PDFBox or is the PDF broken? -- Erik Brangs Deutsche Nationalbibliothek Informationstechnik Adickesallee 1 60322 Frankfurt am Main Telefon: +49 69 1525-1850 Telefax: +49 69 1525-1799 mailto:e.bra...@dnb.de https://www.dnb.de