[
https://issues.apache.org/jira/browse/PDFBOX-4992?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17216325#comment-17216325
]
Peter van der Weerd commented on PDFBOX-4992:
---------------------------------------------
Interesting...
Indeed, I see now that the font doesn't have any glyphs or Unicode codes. But
then, how is it possible that it renders correctly? Where is that information
coming from?
(Maybe a stupid question; I'm a PDF beginner.)
> PDF created by Bullzip PDF Printer / www.bullzip.com / Freeware Edition shows weird characters
> ----------------------------------------------------------------------------------------------
>
> Key: PDFBOX-4992
> URL: https://issues.apache.org/jira/browse/PDFBOX-4992
> Project: PDFBox
> Issue Type: Bug
> Components: Text extraction
> Affects Versions: 2.0.21
> Environment: windows
> Reporter: Peter van der Weerd
> Priority: Major
> Attachments: 2brightsparks.onfastspring.com - invoice.pdf
>
>
> I have copied the text from the original bug (PDFBOX-1107) below; I am
> experiencing the same issue.
> I have quite a few of these documents, but most are classified. I have attached
> a non-classified one.
> I was hoping that the recent version had solved this issue, but it hasn't.
>
> Original text from 1107:
> Opening the PDF via PDFReader 1.6 + 1.7 SNAPSHOT results in an unreadable
> page. All other PDF viewers I tried displayed the file correctly.
> The only related log message shown was:
> 25.08.2011 11:59:41 org.apache.pdfbox.util.PDFStreamEngine processOperator
> INFO: unsupported/disabled operation: EI
> which is probably unrelated. My guess is that it's the font they used (see
> screenshot). However, if the font is unknown or problematic, shouldn't
> PDFReader use a default font or something? Maybe I am wrong anyway :)