Hi,

I have debugged this and found that, in the displayPdfString method of the
PdfContentStreamProcessor class, the string parameter is valid, but it is
decoded to a string of the same length in which every character is 0.

private void displayPdfString(PdfString string){

        String unicode = decode(string);

Drilling down deeper, the CMapAwareDocumentFont has no toUnicodeCmap, so
the decodeSingleCID method falls back to cidbyte2uni. The cidbyte2uni
array has a length of 255 chars, all of which are set to 0.

cidbyte2uni is not populated because the uni2byte hashtable is empty.
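
A minimal sketch of why an empty map would produce exactly this symptom. This
is not the iText code: FallbackDecodeSketch, buildByteToUnicode and decode are
hypothetical names, a plain HashMap stands in for the uni2byte hashtable, and
the 256-entry table (one slot per byte value) is an assumption.

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical stand-in, not the iText class: models only the fallback table.
class FallbackDecodeSketch {

    // Invert a Unicode -> byte map into a byte -> Unicode lookup table.
    // Slots with no mapping keep the default char value 0.
    static char[] buildByteToUnicode(Map<Integer, Integer> uni2byte) {
        char[] table = new char[256]; // assumed size: one slot per byte value
        for (Map.Entry<Integer, Integer> e : uni2byte.entrySet()) {
            int b = e.getValue();
            if (b >= 0 && b < 256 && table[b] == 0) {
                table[b] = (char) e.getKey().intValue();
            }
        }
        return table;
    }

    // Decode each byte through the table, one output char per input byte.
    static String decode(byte[] raw, char[] table) {
        StringBuilder sb = new StringBuilder(raw.length);
        for (byte b : raw) {
            sb.append(table[b & 0xff]);
        }
        return sb.toString();
    }

    public static void main(String[] args) {
        byte[] raw = {0x48, 0x69};

        // Empty map: the table is never filled, so every decoded char is 0.
        char[] empty = buildByteToUnicode(new HashMap<Integer, Integer>());
        String broken = decode(raw, empty);
        System.out.println(broken.length() + " chars, first = " + (int) broken.charAt(0));

        // Populated map: the same bytes decode to readable text.
        Map<Integer, Integer> uni2byte = new HashMap<Integer, Integer>();
        uni2byte.put((int) 'H', 0x48);
        uni2byte.put((int) 'i', 0x69);
        System.out.println(decode(raw, buildByteToUnicode(uni2byte)));
    }
}
```

With the empty map the result has the same length as the input but contains
only 0 chars, which matches what I see in the debugger.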

I can also see that neither the fillEncoding method nor the doType1TT
method is called.

In the DocumentFont constructor, the font variable has the following
in its hash map:

{/FontBBox=[-2, -9, 38, 40], /LastChar=121, /FontMatrix=[0.240000, 0,
0, 0.240000, 0, 0], /Type=/Font, /Resources=Dictionary, /CharProcs=134
0 R, /Encoding=72 0 R, /Subtype=/Type3, /Name=/C0HN2000T1X005000,
/Widths=135 0 R, /FirstChar=32}

baseFont is null

fontName is "Unspecified Font Name"

subType is /Type3

Does this help identify the problem?

Regards

Ben



On 18 June 2010 11:01, Ben Short <b...@benshort.co.uk> wrote:
> Hi Kevin,
>
> I'm happy to dig in to the code. Can you point me to a place to start 
> debugging?
>
> Ben
>
> On 18 June 2010 00:04, Kevin Day <ke...@trumpetinc.com> wrote:
>>
>> ok - most likely the font is using an encoding that we just don't have
>> support for yet.  The encodings are a bit of a hack right now, so these
>> unusual cases are tough to deal with.
>>
>> If you are willing to dig in to the code, I can provide assistance.
>>
>> - K
>> --
>> View this message in context: 
>> http://itext-general.2136553.n4.nabble.com/NPE-while-Extracting-text-tp2256512p2259568.html
>> Sent from the iText - General mailing list archive at Nabble.com.
>>
>> ------------------------------------------------------------------------------
>> ThinkGeek and WIRED's GeekDad team up for the Ultimate
>> GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the
>> lucky parental unit.  See the prize list and enter to win:
>> http://p.sf.net/sfu/thinkgeek-promo
>> _______________________________________________
>> iText-questions mailing list
>> iText-questions@lists.sourceforge.net
>> https://lists.sourceforge.net/lists/listinfo/itext-questions
>>
>> Buy the iText book: http://www.itextpdf.com/book/
>> Check the site with examples before you ask questions: 
>> http://www.1t3xt.info/examples/
>> You can also search the keywords list: http://1t3xt.info/tutorials/keywords/
>>
>
