Hi, The text extraction from PDF including CJK text in horizontal writing mode is almost working. However, right-to-left scripts (like Arabic, Hebrew) are not supported yet, and the support for the scripts depending complex text layout feature like Indic scripts is not completed feature. In fact, the text extraction for such scripts cannot be achieved by PDF renderer only, either PDF data should include extra data to translate the composed glyph to Unicode substring... thus, please give a sample PDF for further discussion.
Regards, mpsuzuki On Thu, 7 Apr 2011 19:51:06 +0530 srinivas adicherla <[email protected]> wrote: >Hi, > > I have one pdf other than english language. I want to copy the text >from it. How can I do it? Presently iam using >poppler_page_get_selected_text(). But its not working. How poppler is doing >it? >Please provide me the solution. > >Thanks >-- >A Srinivas > _______________________________________________ poppler mailing list [email protected] http://lists.freedesktop.org/mailman/listinfo/poppler
