On 25.01.2018 at 21:33, Hesham Gneady wrote:
I reported this because the PDF appeared normal to me. If there is a way to read the text in the PDF correctly, I hope you can help me with that.
See this issue: https://issues.apache.org/jira/browse/PDFBOX-3970
You need to replace LegacyPDFStreamEngine.java with the file from this issue (start reading at "This seems to be a moving target.") and build. Then the text of your file is extracted properly.
Tilman

