[
https://issues.apache.org/jira/browse/PDFBOX-1598?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13655791#comment-13655791
]
Maruan Sahyoun commented on PDFBOX-1598:
----------------------------------------
Hi James,
Adobe Reader and other viewers have trouble displaying the content too, so IMHO
the PDF itself is corrupt. For example, Adobe Reader displays only dots instead
of text and shows an error message when the PDF is opened.
Please give it a try and close the issue if you agree.
BR
Maruan
> Could not parse predefined CMAP file for UCS2 Encoding
> ------------------------------------------------------
>
> Key: PDFBOX-1598
> URL: https://issues.apache.org/jira/browse/PDFBOX-1598
> Project: PDFBox
> Issue Type: Bug
> Components: Text extraction
> Affects Versions: 1.8.1
> Environment: Ubuntu 12.04
> Reporter: James Sullivan
> Attachments: PDFExampleError.pdf
>
>
> To reproduce, run from the command line:
>
>   pdfbox ExtractText -console PDFExampleError.pdf
>
> This logs:
>
>   org.apache.pdfbox.pdmodel.font.PDCIDFont determineEncoding
>   SEVERE: Error: Could not parse predefined CMAP file for 'æ¢x-í§sO-UCS2'
>
> The encoding name is garbled, but it may be the UniJIS-UCS2-H encoding for Japanese.