When I use PDFbox to extract text, I get lots of warnings and as output I only
get garbage. But when I use Abode Acrobat to export the attached PDF file to
text, it works fine. I have attached the original PDF file, the text output and
the log with warnings. And besides,
PDF file seems to have a Type-1 font embedded with a custom encoding.
The PDFbox version is pdfbox-app-2.0.5
The command I use is: java -jar pdfbox-app-2.0.5.jar ExtractText
FileWithIssue.pdf
I have checked lots of reports on JIRA issue tracker, still find no way to
solve it.I am looking forward to hearing from you.
Thanks & Best RegardsSunny Xia
ATTENTION
!
"
!"&