I was able to extract text by making ‘m_pEncoding’ public in PdfFont.h and
changing encoding to Iso88592(which contains polish characters):
PdfEncoding* pIsoEncoding = new PdfIso88592Encoding();
pCurFont->m_pEncoding = pIsoEncoding;
I have text now, but still have not polish characters.
I think it may be bug in
PdfContsntsTokenizer.ReadNext()
which doesn’t read polish chars in the right way, as PoDoFo writes unknown
chars like ‘\200’, ‘\201’.
Will try to figure it out, but if someone has any hint, pls let me know😊
Best regards,
Fryderyk
------------------------------------------------------------------------------
Check out the vibrant tech community on one of the world's most
engaging tech sites, SlashDot.org! http://sdm.link/slashdot
_______________________________________________
Podofo-users mailing list
Podofo-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/podofo-users