[jira] [Commented] (PDFBOX-4049) IllegalArgumentException: root cannot be null
[ https://issues.apache.org/jira/browse/PDFBOX-4049?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16394186#comment-16394186 ]

Andreas Lehmkühler commented on PDFBOX-4049:

{quote}if this is bad mime type then all the PDFs should get this error.{quote}
Just to avoid misunderstandings: the issue is not about a wrong MIME type. The MIME type information doesn't belong at the beginning of a PDF and is just garbage.

> IllegalArgumentException: root cannot be null
> ---------------------------------------------
>
> Key: PDFBOX-4049
> URL: https://issues.apache.org/jira/browse/PDFBOX-4049
> Project: PDFBox
> Issue Type: Bug
> Components: Parsing
> Affects Versions: 2.0.8
> Environment: Windows 10
> Reporter: savan patel
> Assignee: Andreas Lehmkühler
> Priority: Major
> Labels: regression
> Attachments: 372d5dd7-d4b8-41b2-9f50-80c1353aee59.pdf
>
>
> I got a pdf in which pdfbox gives errors while parsing it.
> {code}
> Exception in thread "main" java.lang.IllegalArgumentException: root cannot be null
> at org.apache.pdfbox.pdmodel.PDPageTree.<init>(PDPageTree.java:75)
> at org.apache.pdfbox.pdmodel.PDDocumentCatalog.getPages(PDDocumentCatalog.java:129)
> at org.apache.pdfbox.pdmodel.PDDocument.getNumberOfPages(PDDocument.java:1411)
> {code}
> This did not happen with 2.0.7.

--
This message was sent by Atlassian JIRA (v7.6.3#76005)

To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org
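To illustrate the point about data in front of the PDF: a PDF file is expected to begin with a `%PDF-` header, so a quick way to spot this kind of garbage is to look for where that marker first appears. The following is only a minimal sketch using plain Java; the class `PdfHeaderCheck` and the method `headerOffset` are hypothetical names made up for this example and are not part of PDFBox.

```java
import java.nio.charset.StandardCharsets;

// Hypothetical helper for illustration only; not part of PDFBox.
final class PdfHeaderCheck {

    private static final byte[] MARKER = "%PDF-".getBytes(StandardCharsets.US_ASCII);

    /** Returns the index of the first "%PDF-" marker, or -1 if there is none. */
    static int headerOffset(byte[] data) {
        for (int i = 0; i + MARKER.length <= data.length; i++) {
            boolean match = true;
            for (int j = 0; j < MARKER.length; j++) {
                if (data[i + j] != MARKER[j]) {
                    match = false;
                    break;
                }
            }
            if (match) {
                return i;
            }
        }
        return -1;
    }

    public static void main(String[] args) {
        byte[] clean = "%PDF-1.4\n".getBytes(StandardCharsets.US_ASCII);
        byte[] wrapped = ("766440766244535832321017\r\n"
                + "Content-Type: application/pdf\r\n\r\n"
                + "%PDF-1.4\n").getBytes(StandardCharsets.US_ASCII);

        // An offset of 0 means the file starts with the header;
        // a positive offset means something precedes it.
        System.out.println("clean offset:   " + headerOffset(clean));
        System.out.println("wrapped offset: " + headerOffset(wrapped));
    }
}
```

A positive offset does not by itself say whether a given parser will tolerate the file; it only shows that extra bytes precede the header.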
[jira] [Commented] (PDFBOX-4049) IllegalArgumentException: root cannot be null
[ https://issues.apache.org/jira/browse/PDFBOX-4049?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16313499#comment-16313499 ]

savan patel commented on PDFBOX-4049:
-------------------------------------

I don't know much, but when we generated our PDFs, all of them got this type of MIME data by our mistake (fixed after this). But PDFBox is able to parse some of these files without errors and gives errors on others. If this is bad mime type then all the PDFs should get this error. So I thought this could be a parsing error.
[jira] [Commented] (PDFBOX-4049) IllegalArgumentException: root cannot be null
[ https://issues.apache.org/jira/browse/PDFBOX-4049?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16313466#comment-16313466 ]

Tilman Hausherr commented on PDFBOX-4049:
-----------------------------------------

Your PDF starts with
{code}
766440766244535832321017
Content-Disposition: form-data; name="fileData"; filename="514bf860-d583-494c-bb0b-68e8167bba53.pdf"
Content-Type: application/pdf
{code}
and ends with
{code}
766440766244535832321017--
{code}
which indicates a bad MIME extracting tool. Did you receive this file from a client or did you extract it yourself?

A coworker (great guy) wrote such a tool in Perl and cursed a lot. Years later (he had quit) I rewrote such a tool myself in Java with additional features and I quickly understood his pain.

I'm asking because if you wrote that tool yourself, or if you have influence on whoever wrote it, the whole problem would be moot.
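For a file wrapped the way described above (a boundary line, part headers, a blank line, the payload, then the boundary followed by `--`), the payload can be recovered by cutting at those points. The following is a rough sketch under those assumptions only, not a general MIME parser and not anything PDFBox provides; `MultipartEnvelopeStripper` and `unwrap` are hypothetical names, and the first line of the input is taken verbatim as the boundary.

```java
import java.nio.charset.StandardCharsets;

// Hypothetical helper for illustration only; handles a single wrapped part.
final class MultipartEnvelopeStripper {

    /** Returns the bytes between the part headers and the closing boundary line. */
    static byte[] unwrap(byte[] data) {
        // ISO-8859-1 maps every byte to exactly one char, so indices stay aligned.
        String s = new String(data, StandardCharsets.ISO_8859_1);

        int lineEnd = s.indexOf('\n');
        if (lineEnd < 0) {
            throw new IllegalArgumentException("no boundary line found");
        }
        String boundary = s.substring(0, lineEnd).trim();
        if (boundary.isEmpty()) {
            throw new IllegalArgumentException("empty boundary line");
        }

        // The part headers end at the first blank line (CRLF or LF style).
        int from = Math.max(0, lineEnd - 1);
        int crlf = s.indexOf("\r\n\r\n", from);
        int lf = s.indexOf("\n\n", from);
        int bodyStart;
        if (crlf >= 0 && (lf < 0 || crlf < lf)) {
            bodyStart = crlf + 4;
        } else if (lf >= 0) {
            bodyStart = lf + 2;
        } else {
            throw new IllegalArgumentException("no blank line after part headers");
        }

        // The closing delimiter is the boundary followed by "--".
        int bodyEnd = s.lastIndexOf(boundary + "--");
        if (bodyEnd < bodyStart) {
            throw new IllegalArgumentException("no closing boundary found");
        }
        // Drop the single line break that precedes the closing delimiter.
        if (bodyEnd > bodyStart && s.charAt(bodyEnd - 1) == '\n') {
            bodyEnd--;
        }
        if (bodyEnd > bodyStart && s.charAt(bodyEnd - 1) == '\r') {
            bodyEnd--;
        }
        return s.substring(bodyStart, bodyEnd).getBytes(StandardCharsets.ISO_8859_1);
    }

    public static void main(String[] args) {
        String pdf = "%PDF-1.4\n1 0 obj\n<<>>\nendobj\n%%EOF\n";
        String wrapped = "766440766244535832321017\r\n"
                + "Content-Disposition: form-data; name=\"fileData\"; filename=\"example.pdf\"\r\n"
                + "Content-Type: application/pdf\r\n\r\n"
                + pdf
                + "\r\n766440766244535832321017--\r\n";
        byte[] out = unwrap(wrapped.getBytes(StandardCharsets.ISO_8859_1));
        System.out.println(new String(out, StandardCharsets.ISO_8859_1).equals(pdf));
    }
}
```

Fixing the tool that produced the wrapped file, as suggested above, remains the cleaner route; a helper like this would only be a workaround for files that already exist.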