[jira] [Created] (PDFBOX-3792) Getting lots of warnings "No Unicode mapping for..." when extract text

sunny xia (JIRA) Mon, 15 May 2017 00:04:37 -0700

sunny xia created PDFBOX-3792:
---------------------------------

             Summary: Getting lots of warnings "No Unicode mapping for..." when 
extract text
                 Key: PDFBOX-3792
                 URL: https://issues.apache.org/jira/browse/PDFBOX-3792
             Project: PDFBox
          Issue Type: Bug
          Components: Text extraction
    Affects Versions: 2.0.5
            Reporter: sunny xia
         Attachments: FileWithIssue.pdf, IssueLog.txt, OutputText.txt


When I use PDFbox to extract text, I get lots of warnings and as output I only 
get garbage. But when I use Abode Acrobat to export the attached PDF file to 
text, it works fine. I have attached the original PDF file, the text output and 
the log with warnings. And besides, PDF file seems to  have a Type-1 font 
embedded with a custom encoding.I have checked lots of reports on JIRA issue 
tracker, still find no way to solve it.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

[jira] [Created] (PDFBOX-3792) Getting lots of warnings "No Unicode mapping for..." when extract text

Reply via email to