Re: [iText-questions] Differences btw text extraction from iText and Acrobat Reader?

wwkloo Wed, 20 Mar 2013 01:55:13 -0700

Leonard Rosenthol-3 wrote
> On 3/20/13 8:38 AM, "wwkloo" <wklogoo@> wrote:
> 
>>Additional information:
>>When I create the PDF with another program, the text can be extracted
>>correctly by both iText and Acrobat Reader XI.
>>- 1: 0xD841 0xDD47
>>- 2: 0x92DB
>>D
> 
> That tells me that whatever program you are using to create the PDF is
> broken and you should avoid it.
> 
> Leonard

Yes, iTextExtract_O.pdf has a broken font in its output, and I am not using
it. I included it only as additional information: even though it displays
incorrectly, both copy-and-paste and iText's PdfTextExtractor.GetTextFromPage
retrieve the Unicode characters correctly.

However, for iTextExtract_W.pdf, which displays correctly, I cannot retrieve
the Unicode characters correctly with iText, while Acrobat Reader does find
the correct characters.
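
To make the comparison concrete, the extracted strings can be dumped code
unit by code unit and checked against the expected values. Below is a minimal
Java sketch of such a check. The class CodeUnitDump and its methods are
hypothetical helpers, not part of iText, and the input string is just the two
characters listed in the earlier message (0xD841 0xDD47 and 0x92DB). In a real
test the string would be whatever the text extractor returns for the page.

```java
// Hypothetical helper (not an iText class): shows the UTF-16 code units
// and the Unicode code points contained in a string.
final class CodeUnitDump {

    // Formats every UTF-16 code unit as 0xXXXX, separated by spaces.
    static String codeUnits(String s) {
        StringBuilder sb = new StringBuilder();
        for (int i = 0; i < s.length(); i++) {
            if (i > 0) sb.append(' ');
            sb.append(String.format("0x%04X", (int) s.charAt(i)));
        }
        return sb.toString();
    }

    // Formats every code point as U+XXXX, combining surrogate pairs.
    static String codePoints(String s) {
        StringBuilder sb = new StringBuilder();
        for (int i = 0; i < s.length(); ) {
            int cp = s.codePointAt(i);
            if (sb.length() > 0) sb.append(' ');
            sb.append(String.format("U+%04X", cp));
            i += Character.charCount(cp); // skip both halves of a pair
        }
        return sb.toString();
    }

    public static void main(String[] args) {
        // Assumption: stands in for the text returned by the extractor.
        String extracted = "\uD841\uDD47\u92DB";
        System.out.println(codeUnits(extracted));  // 0xD841 0xDD47 0x92DB
        System.out.println(codePoints(extracted)); // U+20547 U+92DB
    }
}
```

Running the same dump on the text from iTextExtract_W.pdf and on the text
pasted from Acrobat Reader should show exactly which code units differ.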

Regards,
wwkloo

--
View this message in context: 
http://itext-general.2136553.n4.nabble.com/Differences-btw-text-extraction-from-iText-and-Acrobat-Reader-tp4657836p4657862.html
Sent from the iText - General mailing list archive at Nabble.com.

