On 7/11/2011 0:43, WMJ wrote:
Hello,

I met with a PDF file which does not embed font subsets and consequently failed to extract text from it.

The fact that a font isn't embedded doesn't mean you can't extract text. Text extraction doesn't need to know what a glyph looks like, it only needs to know the correct unicode value of each character. I don't know if iText is already able to parse CJK form. Can you share a sample PDF that fails, so that we can take a look at it?
------------------------------------------------------------------------------
RSA(R) Conference 2012
Save $700 by Nov 18
Register now
http://p.sf.net/sfu/rsa-sfdev2dev1
_______________________________________________
iText-questions mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/itext-questions

iText(R) is a registered trademark of 1T3XT BVBA.
Many questions posted to this list can (and will) be answered with a reference 
to the iText book: http://www.itextpdf.com/book/
Please check the keywords list before you ask for examples: 
http://itextpdf.com/themes/keywords.php

Reply via email to