[
https://issues.apache.org/jira/browse/PDFBOX-1598?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13655832#comment-13655832
]
James Sullivan commented on PDFBOX-1598:
----------------------------------------
Thanks Maruan. Interesting. You are correct, it doesn't display correctly with
Adobe Reader on Japanese Windows 7 complaining about a missing Gothic font. It
does display correctly with Firefox (Ubuntu 12.04), Chrome (Ubuntu and Windows)
and Document Viewer (Ubuntu 12.04).
I am going to attach another version of the file (PDFExampleError2.pdf) with
the same problem that does work on Acrobat Reader (with Japanese fonts of
course) on Windows 7 as well as all of the other PDF viewers mentioned above.
> Could not parse predefined CMAP file for UCS2 Encoding
> ------------------------------------------------------
>
> Key: PDFBOX-1598
> URL: https://issues.apache.org/jira/browse/PDFBOX-1598
> Project: PDFBox
> Issue Type: Bug
> Components: Text extraction
> Affects Versions: 1.8.1
> Environment: Ubuntu 12.04
> Reporter: James Sullivan
> Attachments: PDFExampleError.pdf
>
>
> To reproduce from the command line type: pdfbox ExtractText -console
> PDFExampleError.pdf
> org.apache.pdfbox.pdmodel.font.PDCIDFont determineEncoding
> SEVERE: Error: Could not parse predefined CMAP file for 'æ¢x-í§sO-UCS2'
> Garbled but may be UniJIS-UCS2-H encoding for Japanese
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira