Re: [poppler] How to copy the text other than english language pdf

mpsuzuki Thu, 07 Apr 2011 07:46:30 -0700

Hi,

The text extraction from PDF including CJK text in
horizontal writing mode is almost working. However,
right-to-left scripts (like Arabic, Hebrew) are not
supported yet, and the support for the scripts depending
complex text layout feature like Indic scripts is
not completed feature. In fact, the text extraction
for such scripts cannot be achieved by PDF renderer
only, either PDF data should include extra data to
translate the composed glyph to Unicode substring...
thus, please give a sample PDF for further discussion.


Regards,
mpsuzuki

On Thu, 7 Apr 2011 19:51:06 +0530
srinivas adicherla <[email protected]> wrote:

>Hi,
>
>      I have one pdf other than english language. I want to copy the text
>from it. How can I do it? Presently iam using
>poppler_page_get_selected_text(). But its not working. How poppler is doing
>it?
>Please provide me the solution.
>
>Thanks
>--
>A Srinivas
>
_______________________________________________
poppler mailing list
[email protected]
http://lists.freedesktop.org/mailman/listinfo/poppler

Re: [poppler] How to copy the text other than english language pdf

Reply via email to