Am 25.01.2018 um 21:33 schrieb Hesham Gneady:
I have reported this because the PDF appeared normal to me. If there is a way to read the text in the PDF in a right way I hope you could help me with that.


See this issue:

https://issues.apache.org/jira/browse/PDFBOX-3970

You need to replace LegacyPDFStreamEngine.java with the file from this issue (start reading at "This seems to be a moving target.") and build. Then the text of your file is extracted properly.

Tilman

Reply via email to