[ 
https://issues.apache.org/jira/browse/PDFBOX-4992?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17216167#comment-17216167
 ] 

Andreas Lehmkühler commented on PDFBOX-4992:
--------------------------------------------

PDFBOX-1107 is about rendering and the attached pdf renders fine with 2.0.21 
and the current trunk. If this is about "Text extraction" as the component 
choice states, then ist is not related to PDFBOX-1107. 

The pdf uses a Type3 font without any unicode mapping so that it is impossible 
to extract some readable text as the encoding uses some random values.

> PDF created by Bullzip PDF Printer / www.bullzip.com / Freeware Edition shows 
> weird characters
> ----------------------------------------------------------------------------------------------
>
>                 Key: PDFBOX-4992
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-4992
>             Project: PDFBox
>          Issue Type: Bug
>          Components: Text extraction
>    Affects Versions: 2.0.21
>         Environment: windows
>            Reporter: Peter van der Weerd
>            Priority: Major
>         Attachments: 2brightsparks.onfastspring.com - invoice.pdf
>
>
> I copy the text from the original bug (PDFBOX-1107). I experience the same 
> issue. 
> I have quite a few of these documents, but most are classified. I attached a 
> non-classified one.
> I was hoping that the recent version solved this issue, but it doesn't.
>  
> Original text from 1107:
> Opening the PDF via PDFReader 1.6 + 1.7 SNAPSHOT results in an unreadable 
> page. All other pdf viewers I tried have correctly displayed the file.
> The only related log message shown was
> 25.08.2011 11:59:41 org.apache.pdfbox.util.PDFStreamEngine processOperator
> INFO: unsupported/disabled operation: EI
> which is probably unrelated. My guess its the font they used (see screenshot) 
> however if the font is unknown or problematic, shouldn't pdfreader use a 
> default font or something? Maybe I am wrong anyway :)



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to