Emanuel Cozariz created PDFBOX-5020:
---------------------------------------
Summary: Surrogate characters are decoded wrongly in
makeJustificationArray
Key: PDFBOX-5020
URL: https://issues.apache.org/jira/browse/PDFBOX-5020
Project: PDFBox
Issue Type: Bug
Affects Versions: 2.0.21
Reporter: Emanuel Cozariz
Given the following string 𧙗, this will be encoded as '\uD85D\uDE57'
The above string will be accepted by the font CODE2002.ttf
{code:java}
PDFont font = PDType0Font.load(doc, new File("CODE2002.ttf"));PDFont font =
PDType0Font.load(doc, new File("CODE2002.ttf")); cs.showText("\uD85D\uDE57");
{code}
But it is not correct decoded.
Method makeJustificationArray of PdfBoxFastOutputDevice
uses Character.toString(c) to add to the data array
uD85D => Character.toString(c) will decode as �
uDE57 => Character.toString(c) will decode as �
--
This message was sent by Atlassian Jira
(v8.3.4#803005)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]