Sorry if this get this twice, I accidentally sent this to the wrong list
first.
I have been looking around and I saw where the issue with extracted
Arabic words being written in reverse was fixed, but I'm seeing an issue
where the extracted Arabic text of a sentence is in reverse. I assume
this is due to Arabic being a left-to-right language. Is there anyway to
detect this and have pdfbox extract the text in the correct order?
Expected Arabic Text:
??????? ?????? ?????? ??????? ??????? ??????
Returned Arabic Text:
?????? ?????? ??????? ?????? ????? ???????
I am using the latest version (0.8.0-incubating).
Is there something else that I am missing?
--
Matt