[ https://issues.apache.org/jira/browse/PDFBOX-1279?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13253150#comment-13253150 ]
Guillaume Bailleul commented on PDFBOX-1279: -------------------------------------------- Adding a printStackTrace in HeaderParseException.getErrorCode, we have (linux with file encoding cp1252) : org.apache.padaf.preflight.HeaderParseException: Lexical error at line 2, column 9. Encountered: "\u0178" (376), after : "" at org.apache.padaf.preflight.javacc.PDFParser.PDF_header(PDFParser.java:591) at org.apache.padaf.preflight.javacc.PDFParser.PDF(PDFParser.java:837) at org.apache.padaf.preflight.PdfA1bValidator.validate(PdfA1bValidator.java:61) at org.apache.padaf.preflight.Validator_A1b.main(Validator_A1b.java:51) There are also some getBytes() in javacc generated sources : target/generated-sources/javacc/org/apache/padaf/preflight/javacc/PDFParser.java: literalLength += currentToken.image.getBytes().length; target/generated-sources/javacc/org/apache/padaf/preflight/javacc/PDFParser.java: if (previous != null && previous.image.getBytes()[previous.image.getBytes().length-1]!='\u005c\u005c') { target/generated-sources/javacc/org/apache/padaf/preflight/javacc/PDFParser.java: if (previous != null && previous.image.getBytes()[previous.image.getBytes().length-1]!='\u005c\u005c') { target/generated-sources/javacc/org/apache/padaf/preflight/javacc/PDFParser.java: if (previous != null && previous.image.getBytes()[previous.image.getBytes().length-1]!='\u005c\u005c') { target/generated-sources/javacc/org/apache/padaf/preflight/javacc/PDFParser.java: if (token != null && token.image.getBytes().length > MAX_NAME_SIZE) { target/generated-sources/javacc/org/apache/padaf/preflight/javacc/PDFParser.java: throw new PdfParseException("Object Name is toot long : " + token.image.getBytes().length, ERROR_SYNTAX_NAME_TOO_LONG); target/generated-sources/javacc/org/apache/padaf/preflight/javacc/PDFParser.java: if (token != null && token.image.getBytes().length < 4) { > Preflight reports "1.1 : Body Syntax error" > ------------------------------------------- > > Key: PDFBOX-1279 > URL: https://issues.apache.org/jira/browse/PDFBOX-1279 > Project: PDFBox > Issue Type: Bug > Components: Preflight > Affects Versions: 1.7.0 > Environment: Win 7 64Bit, jre 1.6.31 > Reporter: beat weisskopf > Priority: Minor > Attachments: input_pdf_a_lvl_a_libreoffice_352.pdf > > > Just tried the PDF/A Validation. It fails on the attached pdf with "1.1 : > Body Syntax error". Adobe Preflight reports success for both pdf/a level a > and pdf/a level b validation. PDF was created with plain LibreOffice 3.5.2 > (export as pdf, using pdf/a level a). -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira