Re: [Podofo-users] reading polish characters using PoDoFo

Etienne Robin Wed, 15 Feb 2017 05:37:02 -0800

Hi,

> okay, in that case, why not to virtualize the GetUnicodeValue()
> function and provide special implementation for CMap? It will avoid
> code duplication and will cover the fix only to CMap.
>


While progressing in the decoding, it is needed to know how many bytes are used 
to get one unicode char. 
The GetUnicodeValue() method does not return this info.

Note that, in the CMap stream, there is a part in which the byte encoding 
ranges are listed. Currently, this info is not parsed.
With this info, it would be possible to know how many bytes the next character 
code is encoded on. And this, just by looking at the next byte of the encoded 
string.
Here, it is just a simple "trial and error” approach.

Regards,

Etienne
------------------------------------------------------------------------------
Check out the vibrant tech community on one of the world's most
engaging tech sites, SlashDot.org! http://sdm.link/slashdot
_______________________________________________
Podofo-users mailing list
Podofo-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/podofo-users

Re: [Podofo-users] reading polish characters using PoDoFo

Reply via email to