[ 
https://issues.apache.org/jira/browse/PDFBOX-1279?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13271722#comment-13271722
 ] 

Guillaume Bailleul commented on PDFBOX-1279:
--------------------------------------------

@Eric

I understand that there can be any value of 8 bits characters.
ISO-8859-1 defines a character for each value, this is not the case for Cp1252 
(81, 8d, 8f, 90, 9D are not used).

So I apply that patch :
* the InputStreamParser used by javacc is initialized specifying a charset 
(ISO-8859-1)
* in the grammar, the charset is always specified in getBytes

No link but I also removed the project.build.sourceEncoding in preflight which 
was overriding the pdfbox one with no (good) reason.

                
> Preflight reports "1.1 : Body Syntax error"
> -------------------------------------------
>
>                 Key: PDFBOX-1279
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-1279
>             Project: PDFBox
>          Issue Type: Bug
>          Components: Preflight
>    Affects Versions: 1.7.0
>         Environment: Win 7 64Bit, jre 1.6.31
>            Reporter: beat weisskopf
>            Priority: Minor
>             Fix For: 1.7.0
>
>         Attachments: input_pdf_a_lvl_a_libreoffice_352.pdf, 
> pdfbox_1279_cs.patch
>
>
> Just tried the PDF/A Validation. It fails on the attached pdf with "1.1 : 
> Body Syntax error". Adobe Preflight reports success for both pdf/a level a 
> and pdf/a level b validation. PDF was created with plain LibreOffice 3.5.2 
> (export as pdf, using pdf/a level a).

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

Reply via email to