chunlinyao created PDFBOX-4570:
----------------------------------

             Summary: U+2225 rendered as U+2016 glyph when use UniJIS-UCS2-H 
and non embedded font
                 Key: PDFBOX-4570
                 URL: https://issues.apache.org/jira/browse/PDFBOX-4570
             Project: PDFBox
          Issue Type: Improvement
          Components: FontBox
    Affects Versions: 2.0.15
         Environment: Windows 10 64bit, Adobe Reader 2019.012.20034
            Reporter: chunlinyao
         Attachments: correct.png, incorrect.png, u2225.pdf

Maybe this is not a bug of  PDFBox, This pdf rendered difference than adobe 
reader. it use MS PMincho font, this font has glyph for U+2225, the glyph in 
Win10 different from WinXP (I confirmed that by using FontForge.)

The Adobe Reader 2019.012.20034 ON Win10 rendered it correctly. Even Adobe 
Reader 2019.012.20034 ON macOS rendered incorrect. (with MSPMincho font 
installed)

MuPDF 1.6 on Windows, Chrome, FireFox all rendered it like PDFBox. 

Although Adobe Reader on win10 rendered it correctly, When you copy the text 
from pdf, you will get U+2016 not U+2225.

I doubt Adobe Reader doesn't use UniJIS-UCS2-H to convert unicode to cid then 
convert back to unicode when retrive glyphs.

The UniJIS-UCS2-H is obsoleted. It mapping both U+2225 and U+2016 to CID+666, 
Change to UniJIS-UTF16-H can workaround this problem.

Is there some posibility to improve PDFBox render like Adobe Reader?



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to