[ 
https://issues.apache.org/jira/browse/PDFBOX-5124?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17298398#comment-17298398
 ] 

Gábor Stefanik commented on PDFBOX-5124:
----------------------------------------

Unfortunately we have no control over what kind of messed up junk other PDF 
authoring tools put out. And as long as such junk continues to circulate, 
PDFBox will come across it, and users will blame PDFBox for rendering it 
"wrong", especially when other PDF readers render it correctly.

This is not like with HTML, where back in 2004, Firefox devs could rightfully 
say, "no, it's Internet Explorer that's rendering that page wrong". HTML can be 
fixed after it's originally made; with PDF, it's a lot more difficult.

> Improperly declared OS2WindowsMetricsTable version (v0 table declard as v3) 
> in embedded font stops parsing with EOFException
> ----------------------------------------------------------------------------------------------------------------------------
>
>                 Key: PDFBOX-5124
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-5124
>             Project: PDFBox
>          Issue Type: Bug
>          Components: FontBox
>    Affects Versions: 2.0.22
>            Reporter: Gábor Stefanik
>            Priority: Major
>         Attachments: PDFBOX-5124-new.txt, PDFBOX-5124-old.txt, 
> PDFBOX-5124.pdf-1-new.png, PDFBOX-5124.pdf-1-old.png, 
> SZAMLA-20190417-20190012706-ININET-BroadBitHungary-11646-HUF.pdf
>
>
> The attached document contains an incorrectly versioned 
> OS2WindowsMetricsTable. It's a version 0 table, but claims to be version 3. 
> Due to this, when we try to parse the new fields introduced in newer 
> versions, we hit an EOFException.
> Since this issue does occur in the wild, PDFBox should tolerate it, e.g. by 
> catching the EOFException and resetting the "version" variable to the highest 
> version that doesn't have the missing fields. (Note that the version 
> constants PDFBox checks against are wrong, but that's PDFBOX-5123.)



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org

Reply via email to