Leonard Rosenthol-3 wrote
> On 3/20/13 8:38 AM, "wwkloo" <wklogoo@> wrote:
>
>> Additional information:
>> When the PDF is created with another program, the text can be extracted
>> correctly by iText and Acrobat Reader XI.
>> - 1: 0xD841 0xDD47
>> - 2: 0x92DB
>> D
>
> That tells me that whatever program you are using to create the PDF is
> broken and you should avoid it.
>
> Leonard

Yes, iTextExtract_O.pdf has a broken font in its output and is not used. I included it only as additional information: even though it displays incorrectly, both copy-and-paste and iText's PdfTextExtractor.GetTextFromPage retrieve the Unicode characters correctly.

However, for iTextExtract_W.pdf, which displays correctly, I cannot retrieve the Unicode characters correctly with iText, although Acrobat Reader finds the correct characters.

Regards,
wwkloo

--
View this message in context: http://itext-general.2136553.n4.nabble.com/Differences-btw-text-extraction-from-iText-and-Acrobat-Reader-tp4657836p4657862.html
Sent from the iText - General mailing list archive at Nabble.com.
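
For anyone comparing extraction results against the values quoted above: 0xD841 0xDD47 is a UTF-16 surrogate pair (one character outside the Basic Multilingual Plane), while 0x92DB is a single BMP code unit. The following is only a minimal sketch using the Java standard library, not iText; the class and method names (SurrogateDemo, combine, describe) are made up for illustration. It shows one way to check which code points an extracted string actually contains.

```java
public class SurrogateDemo {

    /** Combines a UTF-16 high/low surrogate pair into one Unicode code point. */
    public static int combine(char high, char low) {
        if (!Character.isSurrogatePair(high, low)) {
            throw new IllegalArgumentException("not a valid surrogate pair");
        }
        return Character.toCodePoint(high, low);
    }

    /** Formats every code point of a string as U+XXXX, separated by spaces. */
    public static String describe(String text) {
        StringBuilder sb = new StringBuilder();
        text.codePoints().forEach(cp -> {
            if (sb.length() > 0) {
                sb.append(' ');
            }
            sb.append(String.format("U+%04X", cp));
        });
        return sb.toString();
    }

    public static void main(String[] args) {
        // The two characters from the quoted message, as UTF-16 code units.
        String extracted = "\uD841\uDD47\u92DB";

        // Three UTF-16 code units, but only two Unicode characters.
        System.out.println(extracted.length());                              // 3
        System.out.println(extracted.codePointCount(0, extracted.length())); // 2
        System.out.println(describe(extracted));                             // U+20547 U+92DB
    }
}
```

Running describe on the text returned by an extractor and on the text copied from Acrobat Reader would show whether the two differ in code points or only in how they are displayed.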