[ 
https://issues.apache.org/jira/browse/PDFBOX-4572?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16863650#comment-16863650
 ] 

Tilman Hausherr commented on PDFBOX-4572:
-----------------------------------------

By "magic number" I meant something like a byte order mark:

[https://en.wikipedia.org/wiki/Byte_order_mark]

But assuming that "#82l#82r" is just "M S ", I guess the answer is "no".

Maybe Adobe (which displays the name correctly) just knows all the font names? 
Or it just tries CP932 when UTF-8 doesn't work?
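
To make the second guess concrete, here is a minimal sketch of a "try UTF-8 strictly, then fall back to CP932" decoder. It is not PDFBox code and not a claim about what Adobe does: the class and method names (NameDecodeSketch, decodeName) are hypothetical, and it assumes the Java runtime ships the windows-31j charset (Java's name for MS932/CP932), which is in the extended charsets and may be missing from stripped-down runtimes.

```java
import java.nio.ByteBuffer;
import java.nio.charset.CharacterCodingException;
import java.nio.charset.Charset;
import java.nio.charset.CodingErrorAction;
import java.nio.charset.StandardCharsets;

// Hypothetical helper for illustration only; not part of PDFBox.
class NameDecodeSketch {

    // Decode the raw bytes of a PDF name (after #xx escapes have been resolved).
    static String decodeName(byte[] raw) {
        try {
            // Strict UTF-8: report malformed input instead of substituting U+FFFD.
            return StandardCharsets.UTF_8.newDecoder()
                    .onMalformedInput(CodingErrorAction.REPORT)
                    .onUnmappableCharacter(CodingErrorAction.REPORT)
                    .decode(ByteBuffer.wrap(raw))
                    .toString();
        } catch (CharacterCodingException e) {
            // Assumed fallback: treat the bytes as CP932 (windows-31j in Java).
            return new String(raw, Charset.forName("windows-31j"));
        }
    }

    public static void main(String[] args) {
        // "#82l#82r" from the sample: bytes 0x82 0x6C 0x82 0x72.
        byte[] raw = {(byte) 0x82, 0x6C, (byte) 0x82, 0x72};
        System.out.println(decodeName(raw));
    }
}
```

Under those assumptions the four bytes are rejected by the strict UTF-8 decoder (0x82 cannot start a UTF-8 sequence) and come out of the fallback as the two fullwidth letters U+FF2D U+FF33. Note that this is a heuristic: a name whose CP932 bytes happen to form valid UTF-8 would be decoded as UTF-8.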

> Font name not decoded correctly.
> --------------------------------
>
>                 Key: PDFBOX-4572
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-4572
>             Project: PDFBox
>          Issue Type: Improvement
>          Components: Parsing
>    Affects Versions: 2.0.15
>            Reporter: chunlinyao
>            Priority: Minor
>         Attachments: sample_ja.pdf
>
>
> The attached file encodes the font name in MS932, and PDFBox decodes it incorrectly. 
> Maybe this file is malformed.


