Hi,

using PDFBox 3.0.4 to extract text from the PDF from 
https://d-nb.info/1349431796/34 fails with a NegativeArraySizeException after 
using up multiple GBs of memory. I used -Xmx12G because the extraction failed 
with an OutOfMemoryError for -Xmx6G.

The problem also appears with 4.0.0-SNAPSHOT.

Is this a bug in PDFBox or is the PDF broken?


-- 
Erik Brangs
Deutsche Nationalbibliothek
Informationstechnik
Adickesallee 1
60322 Frankfurt am Main
Telefon: +49 69 1525-1850
Telefax: +49 69 1525-1799
mailto:e.bra...@dnb.de
https://www.dnb.de

Reply via email to