Re: [iText-questions] How to extract text of CNS1 ordering without embedded font resource

1T3XT BVBA Mon, 07 Nov 2011 02:35:37 -0800

On 7/11/2011 0:43, WMJ wrote:

Hello,
I met with a PDF file which does not embed font subsets andconsequently failed to extract text from it.

The fact that a font isn't embedded doesn't mean you can't extract text.Text extraction doesn't need to know what a glyph looks like, it onlyneeds to know the correct unicode value of each character. I don't knowif iText is already able to parse CJK form. Can you share a sample PDFthat fails, so that we can take a look at it?

------------------------------------------------------------------------------
RSA(R) Conference 2012
Save $700 by Nov 18
Register now
http://p.sf.net/sfu/rsa-sfdev2dev1

_______________________________________________
iText-questions mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/itext-questions

iText(R) is a registered trademark of 1T3XT BVBA.
Many questions posted to this list can (and will) be answered with a reference 
to the iText book: http://www.itextpdf.com/book/
Please check the keywords list before you ask for examples: 
http://itextpdf.com/themes/keywords.php

Re: [iText-questions] How to extract text of CNS1 ordering without embedded font resource

Reply via email to