> > Is there a way to prevent this? I mean a way to configure PDFBox not > to extracted the scanned text and get the right displayed text?
@Andreas, This may be the bug report to follow. In short, not yet. https://issues.apache.org/jira/browse/PDFBOX-1912

