[ 
https://issues.apache.org/jira/browse/PDFBOX-4785?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17054680#comment-17054680
 ] 

Ryosuke Fujita commented on PDFBOX-4785:
----------------------------------------

Thank you for replying. I understand the problem, but do you have a 
workaround for this? I can't explain to my customers why a visible 
character has suddenly stopped being extracted. For example, if I extend 
CMapParser and inject it into our environment, would that let me change the behavior?

> No Unicode mapping with MS-Mincho
> ---------------------------------
>
>                 Key: PDFBOX-4785
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-4785
>             Project: PDFBox
>          Issue Type: Bug
>          Components: FontBox
>    Affects Versions: 2.0.18, 2.0.19
>            Reporter: Ryosuke Fujita
>            Priority: Major
>         Attachments: E02779_convocation_notice_p14.pdf
>
>
> ExtractText on the attached PDF fails as of v2.0.18, whereas v2.0.17 succeeds. 
> The error message is as follows, and the character "最" (CID+7025) cannot be extracted.
> FEB 26, 2020 10:32:29 AM org.apache.pdfbox.pdmodel.font.PDType0Font toUnicode
> WARNING: No Unicode mapping for CID+7025 (7025) in font NAEGKL+MS-Mincho
> This may be related to PDFBOX-4661?



--
This message was sent by Atlassian Jira
(v8.3.4#803005)