Re: [poppler] About parseCharName in GfxFont.cc

suzuki toshiya Fri, 24 Aug 2012 01:20:05 -0700

Thanks, it's interesting... When I posted my previous message,
I was thinking the issue would be some CJK issue, but it was
ASCII issue! And, the problematic PDF is generated by Adobe InDesign.


Regards,
mpsuzuki

王璐 wrote:

I tried to send the files through attachment, but got rejected from the
mailling list

The pdf can be found at http://dl.dropbox.com/u/75853179/med-9.pdf

Please check the 'LEKSJON' on the top left corner, without ToUnicode map
you  should get the correct characters.

btw, if you try to extract fonts using fontforge, it won't apply ToUnicode
for non-ttf fonts.


- Lu

On Fri, Aug 24, 2012 at 9:33 AM, 王璐 <[email protected]> wrote:

I've attached a problematic pdf, notice the 'LEKSJON' in the top left
corner, if you copy the text out, you'll get LeKSjoN
So in the ToUnicode map for that font, both 'E' and 'e' are mapped to 'e'

I've extracted the font as 'f2.cff' attached. The font itself is ok.
I've also attached a file showing the font->getToUnicode(), the format for
each line is

GlyphID Unicode [Unicode...] # CharCode

You can see problem at lines of 0x45 and 0x65.

Thanks

- Lu Wang



On Fri, Aug 24, 2012 at 9:21 AM, suzuki toshiya <
[email protected]> wrote:

王璐 wrote:

   Usually this is done by ToUnicode map, but I've many bad mapping for
Type 1 font, where Type 1 font itself provides good mappings.

Could you give some concrete examples?

Regards,
mpsuzuki


_______________________________________________
poppler mailing list
[email protected]
http://lists.freedesktop.org/mailman/listinfo/poppler

Re: [poppler] About parseCharName in GfxFont.cc

Reply via email to