[ https://issues.apache.org/jira/browse/PDFBOX-720?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13171668#comment-13171668 ]
LynX commented on PDFBOX-720: ----------------------------- Dear Adam, Unfortunately I was not able to reproduce this problem on my Debian. I've tried different PDBbox distrs (1.1.0, 1.2.0, 1.6.0) with different JVMs (same Sun JDK 1.5.0_06 as your were using and OpenJDK 1.6). In all cases I received "Document outline was not null" message. As David stated before it is "dependant on the implementation of HashMap on the host system" so I guess my hosts system is not appropriate for this :). Are you still able to reproduce this problem on your system? If yes could you please try apply the patch from PDFBOX-569. I believe it may fix the problem. Regards, LX > Inconsistency in parsing PDFs between Windows and Linux > ------------------------------------------------------- > > Key: PDFBOX-720 > URL: https://issues.apache.org/jira/browse/PDFBOX-720 > Project: PDFBox > Issue Type: Bug > Components: Parsing > Environment: Windows Vista 32-bit, Sun JDK 1.5.0_06, PDFBox HEAD tag > (revision 941073) > vs. > Red Hat Linux, 2.6.9-67.ELsmp kernel, Java 1.5.0_06, PDFBox HEAD tag > (revision 941073) > Reporter: Adam Nichols > Attachments: 238_Page_Report.pdf > > > Run this same code using the same PDF and you'll get different results on > Linux than on Windows. Regardless of which one you consider "correct", it > should be consistent. > doc = PDDocument.load(inputFile); > PDDocumentOutline outline = doc.getDocumentCatalog().getDocumentOutline(); > if(outline == null) > System.out.println("Document outline was null"); > else > System.out.println("Document outline was not null"); > Some interesting notes about this PDF: Seems that Acrobat Distiller 8.1.0 > basically just concatenated two PDFs into one. There are two trailers, they > both refer to object "1600 0" as the root. 1600 0 appears multiple times, > one time it doesn't have "Outlines" in the dictionary, the other time it has > "Outlines 1667 0". Windows picks up the latter and shows the outline > correctly. Linux picks up the former and thus returns null for the outline. > I tried debugging through PDFParser and BaseParser, but I'm not really sure > how that code works and I quickly got lost. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira