[
https://issues.apache.org/jira/browse/PDFBOX-4572?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16863650#comment-16863650
]
Tilman Hausherr commented on PDFBOX-4572:
-----------------------------------------
With "magic number" I meant something like this
[https://en.wikipedia.org/wiki/Byte_order_mark]
but assuming that "{color:#333333}#82l#82r{color}" is just "M S " then I guess
the answer is "no".
Maybe Adobe (who displays the name correctly) just knows all the fonts names?
Or just triesĀ {color:#333333}CP932{color} when UTF8 doesn't work?
> Font name not decoded correctly.
> --------------------------------
>
> Key: PDFBOX-4572
> URL: https://issues.apache.org/jira/browse/PDFBOX-4572
> Project: PDFBox
> Issue Type: Improvement
> Components: Parsing
> Affects Versions: 2.0.15
> Reporter: chunlinyao
> Priority: Minor
> Attachments: sample_ja.pdf
>
>
> The attached file encode font name in MS932, PDFBox decode it incorrectly.
> Maybe this file is malformed.
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]