[jira] [Updated] (PDFBOX-3630) UnsupportedOperationException on a valid PDF
[ https://issues.apache.org/jira/browse/PDFBOX-3630?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Seva Alekseyev updated PDFBOX-3630:
    Attachment: pagesel.pdf

--
This message was sent by Atlassian JIRA (v6.3.4#6332)

To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org
[jira] [Created] (PDFBOX-3630) UnsupportedOperationException on a valid PDF
Seva Alekseyev created PDFBOX-3630:
--------------------------------------

Summary: UnsupportedOperationException on a valid PDF
Key: PDFBOX-3630
URL: https://issues.apache.org/jira/browse/PDFBOX-3630
Project: PDFBox
Issue Type: Bug
Components: Parsing
Affects Versions: 2.0.3
Environment: Windows 7 x64, JVM 1.8.0_101
Reporter: Seva Alekseyev
Attachments: pagesel.pdf

The attached document, which opens fine with Adobe Reader, errors out in PDDocument.load():

java.lang.UnsupportedOperationException
    at java.util.AbstractList.add(AbstractList.java:148)
    at java.util.AbstractList.add(AbstractList.java:108)
    at org.apache.pdfbox.pdfparser.COSParser.parseDictObjects(COSParser.java:610)
    at org.apache.pdfbox.pdfparser.PDFParser.initialParse(PDFParser.java:217)
    at org.apache.pdfbox.pdfparser.PDFParser.parse(PDFParser.java:252)
    at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:966)
    at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:922)
    at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:870)
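The two top frames of this trace are java.util.AbstractList.add, the default implementation that lists inherit when they do not support insertion. The sketch below reproduces that mechanism with JDK classes only. The report does not say which list PDFBox holds at COSParser.java:610, so the use of Collections.emptyList() here is an illustrative assumption, and FixedSizeListDemo and addIsRejected are hypothetical names.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.List;

final class FixedSizeListDemo {

    /** Returns true if the list refuses add() with UnsupportedOperationException. */
    static boolean addIsRejected(List<String> list) {
        try {
            list.add("x");
            return false;
        } catch (UnsupportedOperationException e) {
            // Thrown by AbstractList.add(int, E) when the subclass does not override it.
            return true;
        }
    }

    public static void main(String[] args) {
        // Immutable empty list: add() falls through to AbstractList.add and throws.
        System.out.println(addIsRejected(Collections.<String>emptyList()));
        // Ordinary ArrayList: add() is supported.
        System.out.println(addIsRejected(new ArrayList<String>()));
    }
}
```

If the parser ends up with such a list on some code path, copying it into an ArrayList before adding to it would avoid the exception; whether that is the right fix for this document is not established by the report.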
[jira] [Updated] (PDFBOX-3629) "expected number, actual=COSString" on a valid document
[ https://issues.apache.org/jira/browse/PDFBOX-3629?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Seva Alekseyev updated PDFBOX-3629:
    Attachment: Book#4 - O'Reilly - JavaScript The Definitive Guide 2ed.pdf
[jira] [Created] (PDFBOX-3629) "expected number, actual=COSString" on a valid document
Seva Alekseyev created PDFBOX-3629:
--------------------------------------

Summary: "expected number, actual=COSString" on a valid document
Key: PDFBOX-3629
URL: https://issues.apache.org/jira/browse/PDFBOX-3629
Project: PDFBox
Issue Type: Bug
Components: Parsing
Affects Versions: 2.0.3
Environment: Windows 7 x64, JVM 1.8.0_101
Reporter: Seva Alekseyev
Attachments: Book#4 - O'Reilly - JavaScript The Definitive Guide 2ed.pdf

On the attached document, which opens in Adobe Reader, PDDocument.load() throws an error:

java.io.IOException: expected number, actual=COSString{file:///C|/Oreilly Unix etc/O'Reilly Reference Library/web/cgi/index.html} at offset 845803
    at org.apache.pdfbox.pdfparser.BaseParser.parseCOSDictionaryValue(BaseParser.java:165)
    at org.apache.pdfbox.pdfparser.BaseParser.parseCOSDictionaryNameValuePair(BaseParser.java:277)
    at org.apache.pdfbox.pdfparser.BaseParser.parseCOSDictionary(BaseParser.java:210)
    at org.apache.pdfbox.pdfparser.BaseParser.parseDirObject(BaseParser.java:885)
    at org.apache.pdfbox.pdfparser.BaseParser.parseCOSDictionaryValue(BaseParser.java:153)
    at org.apache.pdfbox.pdfparser.BaseParser.parseCOSDictionaryNameValuePair(BaseParser.java:277)
    at org.apache.pdfbox.pdfparser.BaseParser.parseCOSDictionary(BaseParser.java:210)
    at org.apache.pdfbox.pdfparser.BaseParser.parseDirObject(BaseParser.java:885)
    at org.apache.pdfbox.pdfparser.COSParser.parseFileObject(COSParser.java:772)
    at org.apache.pdfbox.pdfparser.COSParser.parseObjectDynamically(COSParser.java:741)
    at org.apache.pdfbox.pdfparser.COSParser.parseObjectDynamically(COSParser.java:672)
    at org.apache.pdfbox.pdfparser.COSParser.parseDictObjects(COSParser.java:632)
    at org.apache.pdfbox.pdfparser.PDFParser.initialParse(PDFParser.java:217)
    at org.apache.pdfbox.pdfparser.PDFParser.parse(PDFParser.java:252)
    at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:966)
    at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:922)
    at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:870)
[jira] [Commented] (PDFBOX-3626) StackOverflowException on a valid PDF
[ https://issues.apache.org/jira/browse/PDFBOX-3626?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15745681#comment-15745681 ]

Seva Alekseyev commented on PDFBOX-3626:

Sorry about throwing all those trash-but-not-complete-trash documents at you guys. They were thrown at me in the first place. You should see the freak zoo of Office documents that I'm also dealing with. Corrupt and almost-corrupt PDFs are a small percentage in my Tika log.
[jira] [Created] (PDFBOX-3628) BadPaddingException on a valid document
Seva Alekseyev created PDFBOX-3628:
--------------------------------------

Summary: BadPaddingException on a valid document
Key: PDFBOX-3628
URL: https://issues.apache.org/jira/browse/PDFBOX-3628
Project: PDFBox
Issue Type: Bug
Components: Parsing
Affects Versions: 2.0.3
Environment: Windows 7 x64, JVM 1.8.0_101
Reporter: Seva Alekseyev

On the attached document, which opens fine with Adobe Reader, the PDDocument.load() throws an error:

java.io.IOException: javax.crypto.BadPaddingException: Given final block not properly padded
    at org.apache.pdfbox.pdmodel.encryption.SecurityHandler.encryptDataAESother(SecurityHandler.java:296)
    at org.apache.pdfbox.pdmodel.encryption.SecurityHandler.encryptData(SecurityHandler.java:153)
    at org.apache.pdfbox.pdmodel.encryption.SecurityHandler.decryptStream(SecurityHandler.java:454)
    at org.apache.pdfbox.pdfparser.COSParser.parseFileObject(COSParser.java:784)
    at org.apache.pdfbox.pdfparser.COSParser.parseObjectDynamically(COSParser.java:741)
    at org.apache.pdfbox.pdfparser.COSParser.parseObjectDynamically(COSParser.java:672)
    at org.apache.pdfbox.pdfparser.COSParser.parseDictObjects(COSParser.java:632)
    at org.apache.pdfbox.pdfparser.PDFParser.initialParse(PDFParser.java:217)
    at org.apache.pdfbox.pdfparser.PDFParser.parse(PDFParser.java:252)
    at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:966)
    at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:922)
    at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:870)
Caused by: javax.crypto.BadPaddingException: Given final block not properly padded
    at com.sun.crypto.provider.CipherCore.doFinal(CipherCore.java:966)
    at com.sun.crypto.provider.CipherCore.doFinal(CipherCore.java:824)
    at com.sun.crypto.provider.AESCipher.engineDoFinal(AESCipher.java:436)
    at javax.crypto.Cipher.doFinal(Cipher.java:2048)
    at org.apache.pdfbox.pdmodel.encryption.SecurityHandler.encryptDataAESother(SecurityHandler.java:276)
    ... 12 more
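The "Caused by" frames show javax.crypto.Cipher.doFinal rejecting the padding of the final AES block. As a rough illustration of what that check means, the sketch below decrypts a block whose plaintext does not end in valid PKCS#5 padding. The all-zero key and IV, the CBC mode, and the class name BadPaddingDemo are assumptions made for the demonstration. They are not taken from PDFBox or from the attached file, and the report does not establish whether the document's final block is truncated, unpadded, or decrypted with a wrong key.

```java
import javax.crypto.BadPaddingException;
import javax.crypto.Cipher;
import javax.crypto.spec.IvParameterSpec;
import javax.crypto.spec.SecretKeySpec;

final class BadPaddingDemo {

    /** Returns true if decrypting a block without valid padding raises BadPaddingException. */
    static boolean decryptFailsWithBadPadding() throws Exception {
        // Demo-only key and IV (all zero bytes); never use such values for real data.
        SecretKeySpec key = new SecretKeySpec(new byte[16], "AES");
        IvParameterSpec iv = new IvParameterSpec(new byte[16]);

        // Encrypt one block of zero bytes WITHOUT padding, so the plaintext ends in 0x00.
        Cipher enc = Cipher.getInstance("AES/CBC/NoPadding");
        enc.init(Cipher.ENCRYPT_MODE, key, iv);
        byte[] cipherText = enc.doFinal(new byte[16]);

        // Decrypt while expecting PKCS#5 padding; a trailing 0x00 is not a legal pad value.
        Cipher dec = Cipher.getInstance("AES/CBC/PKCS5Padding");
        dec.init(Cipher.DECRYPT_MODE, key, iv);
        try {
            dec.doFinal(cipherText);
            return false;
        } catch (BadPaddingException e) {
            return true;
        }
    }

    public static void main(String[] args) throws Exception {
        System.out.println(decryptFailsWithBadPadding());
    }
}
```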
[jira] [Updated] (PDFBOX-3628) BadPaddingException on a valid document
[ https://issues.apache.org/jira/browse/PDFBOX-3628?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Seva Alekseyev updated PDFBOX-3628:
    Attachment: Elisee i-765 final.pdf
[jira] [Created] (PDFBOX-3627) "/Prev loop at offset 77418" on a valid document
Seva Alekseyev created PDFBOX-3627:
--------------------------------------

Summary: "/Prev loop at offset 77418" on a valid document
Key: PDFBOX-3627
URL: https://issues.apache.org/jira/browse/PDFBOX-3627
Project: PDFBox
Issue Type: Bug
Components: Parsing
Affects Versions: 2.0.3
Environment: Windows 7 x64, JVM 1.8.0_101
Reporter: Seva Alekseyev
Attachments: CIT_2.pdf

On the attached document, which opens fine with Word, the PDDocument.load() is throwing an error:

java.io.IOException: /Prev loop at offset 77418
    at org.apache.pdfbox.pdfparser.COSParser.parseXref(COSParser.java:320)
    at org.apache.pdfbox.pdfparser.PDFParser.initialParse(PDFParser.java:194)
    at org.apache.pdfbox.pdfparser.PDFParser.parse(PDFParser.java:252)
    at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:966)
    at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:922)
    at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:870)
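The message suggests that the parser follows the chain of /Prev offsets between cross-reference sections and gives up when an offset comes around a second time. A minimal sketch of that kind of check is below. It is not PDFBox code: PrevChainDemo, findLoop, and the map from a section's offset to its /Prev offset are hypothetical stand-ins, and the offsets in main are made up.

```java
import java.util.HashMap;
import java.util.HashSet;
import java.util.Map;
import java.util.Set;

final class PrevChainDemo {

    /**
     * Follows the prev links starting at {@code start}.
     * Returns the first offset visited twice, or -1 if the chain ends normally.
     */
    static long findLoop(long start, Map<Long, Long> prevOf) {
        Set<Long> seen = new HashSet<>();
        Long current = start;
        while (current != null) {
            if (!seen.add(current)) {
                return current; // already visited: the chain loops back here
            }
            current = prevOf.get(current); // null when the section has no /Prev entry
        }
        return -1L;
    }

    public static void main(String[] args) {
        Map<Long, Long> looping = new HashMap<>();
        looping.put(300L, 200L);
        looping.put(200L, 300L); // points back to the newest section
        System.out.println(findLoop(300L, looping));
    }
}
```

A viewer that tolerates such a file presumably stops at the repeated offset and keeps the sections read so far instead of failing; the report does not say what Word or other readers actually do.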
[jira] [Updated] (PDFBOX-3627) "/Prev loop at offset 77418" on a valid document
[ https://issues.apache.org/jira/browse/PDFBOX-3627?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Seva Alekseyev updated PDFBOX-3627:
    Attachment: CIT_2.pdf
[jira] [Updated] (PDFBOX-3626) StackOverflowException on a valid PDF
[ https://issues.apache.org/jira/browse/PDFBOX-3626?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Seva Alekseyev updated PDFBOX-3626:
    Attachment: PDF-01555.PDF
[jira] [Created] (PDFBOX-3626) StackOverflowException on a valid PDF
Seva Alekseyev created PDFBOX-3626:
--------------------------------------

Summary: StackOverflowException on a valid PDF
Key: PDFBOX-3626
URL: https://issues.apache.org/jira/browse/PDFBOX-3626
Project: PDFBox
Issue Type: Bug
Components: Parsing
Affects Versions: 2.0.3
Environment: Windows 7 x64, JVM 1.8.0_101
Reporter: Seva Alekseyev
Attachments: PDF-01555.PDF

On the attached document, which opens fine in Acrobat, PDDocument.load() throws a StackOverflowError:

Exception in thread "main" java.lang.StackOverflowError
    at sun.nio.cs.UTF_8$Decoder.decodeLoop(UTF_8.java:412)
    at java.nio.charset.CharsetDecoder.decode(CharsetDecoder.java:579)
    at java.nio.charset.CharsetDecoder.decode(CharsetDecoder.java:802)
    at org.apache.pdfbox.pdfparser.BaseParser.isValidUTF8(BaseParser.java:805)
    at org.apache.pdfbox.pdfparser.BaseParser.parseCOSName(BaseParser.java:785)
    at org.apache.pdfbox.pdfparser.BaseParser.parseDirObject(BaseParser.java:905)
    at org.apache.pdfbox.pdfparser.BaseParser.parseCOSDictionaryValue(BaseParser.java:153)
    at org.apache.pdfbox.pdfparser.BaseParser.parseCOSDictionaryNameValuePair(BaseParser.java:277)
    at org.apache.pdfbox.pdfparser.BaseParser.parseCOSDictionary(BaseParser.java:210)
    at org.apache.pdfbox.pdfparser.BaseParser.parseDirObject(BaseParser.java:885)
    at org.apache.pdfbox.pdfparser.COSParser.parseFileObject(COSParser.java:772)
    at org.apache.pdfbox.pdfparser.COSParser.parseObjectDynamically(COSParser.java:741)
    at org.apache.pdfbox.pdfparser.COSParser.parseObjectDynamically(COSParser.java:672)
    at org.apache.pdfbox.pdfparser.COSParser.getLength(COSParser.java:897)
    at org.apache.pdfbox.pdfparser.COSParser.parseCOSStream(COSParser.java:949)
    at org.apache.pdfbox.pdfparser.COSParser.parseFileObject(COSParser.java:780)
    at org.apache.pdfbox.pdfparser.COSParser.parseObjectDynamically(COSParser.java:741)
    at org.apache.pdfbox.pdfparser.COSParser.parseObjectDynamically(COSParser.java:672)
    ...
[jira] [Updated] (PDFBOX-3591) IOException "expected number, actual=COSFloat{1.0}" on a valid PDF
[ https://issues.apache.org/jira/browse/PDFBOX-3591?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Seva Alekseyev updated PDFBOX-3591:
    Attachment: Vagueness and the Rule of Law- Reconsidering Installment Land Con.pdf
[jira] [Created] (PDFBOX-3591) IOException "expected number, actual=COSFloat{1.0}" on a valid PDF
Seva Alekseyev created PDFBOX-3591:
--------------------------------------

Summary: IOException "expected number, actual=COSFloat{1.0}" on a valid PDF
Key: PDFBOX-3591
URL: https://issues.apache.org/jira/browse/PDFBOX-3591
Project: PDFBox
Issue Type: Bug
Components: Parsing
Affects Versions: 2.0.3
Environment: Windows 7 x64, JVM 1.8.0_101
Reporter: Seva Alekseyev
Attachments: Vagueness and the Rule of Law- Reconsidering Installment Land Con.pdf

On the attached PDF document, which opens fine with Adobe Reader, the PDDocument.load() method throws the following:

java.io.IOException: expected number, actual=COSFloat{1.0} at offset 577113
    at org.apache.pdfbox.pdfparser.BaseParser.parseCOSDictionaryValue:162
    at org.apache.pdfbox.pdfparser.BaseParser.parseCOSDictionaryNameValuePair:274
    at org.apache.pdfbox.pdfparser.BaseParser.parseCOSDictionary:207
    at org.apache.pdfbox.pdfparser.BaseParser.parseDirObject:854
    at org.apache.pdfbox.pdfparser.BaseParser.parseCOSDictionaryValue:150
    at org.apache.pdfbox.pdfparser.BaseParser.parseCOSDictionaryNameValuePair:274
    at org.apache.pdfbox.pdfparser.BaseParser.parseCOSDictionary:207
    at org.apache.pdfbox.pdfparser.BaseParser.parseDirObject:854
    at org.apache.pdfbox.pdfparser.BaseParser.parseCOSDictionaryValue:150
    at org.apache.pdfbox.pdfparser.BaseParser.parseCOSDictionaryNameValuePair:274
    at org.apache.pdfbox.pdfparser.BaseParser.parseCOSDictionary:207
    at org.apache.pdfbox.pdfparser.BaseParser.parseDirObject:854
    at org.apache.pdfbox.pdfparser.COSParser.parseFileObject:757
    at org.apache.pdfbox.pdfparser.COSParser.parseObjectDynamically:726
    at org.apache.pdfbox.pdfparser.COSParser.parseObjectDynamically:657
    at org.apache.pdfbox.pdfparser.COSParser.parseDictObjects:617
    at org.apache.pdfbox.pdfparser.PDFParser.initialParse:215
    at org.apache.pdfbox.pdfparser.PDFParser.parse:249
    at org.apache.pdfbox.pdmodel.PDDocument.load:891
    at org.apache.pdfbox.pdmodel.PDDocument.load:831
    at org.apache.tika.parser.pdf.PDFParser.parse:129
[jira] [Updated] (PDFBOX-3556) Error "Error getting header version: %PDF--33" on a valid document
[ https://issues.apache.org/jira/browse/PDFBOX-3556?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Seva Alekseyev updated PDFBOX-3556:
    Attachment: ApproveIt 2 page data sheet.pdf
[jira] [Created] (PDFBOX-3556) Error "Error getting header version: %PDF--33" on a valid document
Seva Alekseyev created PDFBOX-3556:
--------------------------------------

Summary: Error "Error getting header version: %PDF--33" on a valid document
Key: PDFBOX-3556
URL: https://issues.apache.org/jira/browse/PDFBOX-3556
Project: PDFBox
Issue Type: Bug
Components: Parsing
Affects Versions: 2.0.3
Environment: Windows 7 x64, JVM 1.8.0_101
Reporter: Seva Alekseyev
Attachments: ApproveIt 2 page data sheet.pdf

On the attached document, which opens fine in Adobe Reader, the PDDocument.load() method throws the following error:

java.io.IOException: Error getting header version: %PDF--33
    at org.apache.pdfbox.pdfparser.COSParser.parseHeader(COSParser.java:1935)
    at org.apache.pdfbox.pdfparser.COSParser.parsePDFHeader(COSParser.java:1853)
    at org.apache.pdfbox.pdfparser.PDFParser.parse(PDFParser.java:245)
    at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:957)
    at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:913)
    at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:861)
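The header line quoted in the error, %PDF--33, does not have the usual %PDF-M.N shape. The sketch below shows one way a caller could pre-check a header line before handing a file to a parser. It is an illustration under assumptions, not a description of how COSParser.parseHeader works: HeaderVersionDemo, version, and the single-digit major.minor pattern are hypothetical choices.

```java
import java.util.Optional;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

final class HeaderVersionDemo {

    // Assumed shape of a well-formed header: "%PDF-" followed by digit, dot, digit.
    private static final Pattern HEADER = Pattern.compile("^%PDF-(\\d\\.\\d)");

    /** Returns the version text (for example "1.4") or empty if the header is malformed. */
    static Optional<String> version(String headerLine) {
        Matcher m = HEADER.matcher(headerLine);
        return m.find() ? Optional.of(m.group(1)) : Optional.empty();
    }

    public static void main(String[] args) {
        System.out.println(version("%PDF-1.4"));
        System.out.println(version("%PDF--33")); // the header from this report
    }
}
```

A tolerant reader could fall back to a default version when the result is empty; whether Adobe Reader does so for this file is not stated in the report.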
[jira] [Updated] (PDFBOX-3553) IOException "Expected root dictionary, but got this: COSInt{971}" on a valid PDF
[ https://issues.apache.org/jira/browse/PDFBOX-3553?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Seva Alekseyev updated PDFBOX-3553:
    Attachment: MyNIAID_Sprint1_ChangeRequestWires_20150422_R1.pdf
[jira] [Commented] (PDFBOX-3553) IOException "Expected root dictionary, but got this: COSInt{971}" on a valid PDF
[ https://issues.apache.org/jira/browse/PDFBOX-3553?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15633930#comment-15633930 ]

Seva Alekseyev commented on PDFBOX-3553:

Sorry. Attached now.
[jira] [Created] (PDFBOX-3553) IOException "Expected root dictionary, but got this: COSInt{971}" on a valid PDF
Seva Alekseyev created PDFBOX-3553:

Summary: IOException "Expected root dictionary, but got this: COSInt{971}" on a valid PDF
Key: PDFBOX-3553
URL: https://issues.apache.org/jira/browse/PDFBOX-3553
Project: PDFBox
Issue Type: Bug
Components: Parsing
Affects Versions: 2.0.3
Environment: Windows 7 x64, JVM 1.8.0_101
Reporter: Seva Alekseyev

On the attached PDF document, which opens fine with Adobe Reader, the PDDocument.load() method throws the following error:

java.io.IOException: Expected root dictionary, but got this: COSInt{971}
    at org.apache.pdfbox.pdfparser.PDFParser.initialParse(PDFParser.java:206)
    at org.apache.pdfbox.pdfparser.PDFParser.parse(PDFParser.java:252)
    at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:957)
    at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:913)
    at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:861)
[jira] [Created] (PDFBOX-3546) IOException over DataFormatException, "invalid stored block lengths" on a valid PDF
Seva Alekseyev created PDFBOX-3546:

Summary: IOException over DataFormatException, "invalid stored block lengths" on a valid PDF
Key: PDFBOX-3546
URL: https://issues.apache.org/jira/browse/PDFBOX-3546
Project: PDFBox
Issue Type: Bug
Components: Parsing
Affects Versions: 2.0.3
Environment: Windows 7 x64, JVM 1.8.0_101
Reporter: Seva Alekseyev
Attachments: Null-control-man.pdf

On the attached document, which loads and displays fine with Adobe Reader, PDDocument.load() throws the following exception:

java.io.IOException: java.util.zip.DataFormatException: invalid stored block lengths
    at org.apache.pdfbox.filter.FlateFilter.decode(FlateFilter.java:82)
    at org.apache.pdfbox.cos.COSInputStream.create(COSInputStream.java:69)
    at org.apache.pdfbox.cos.COSStream.createInputStream(COSStream.java:162)
    at org.apache.pdfbox.pdfparser.PDFXrefStreamParser.<init>(PDFXrefStreamParser.java:56)
    at org.apache.pdfbox.pdfparser.COSParser.parseXrefStream(COSParser.java:2053)
    at org.apache.pdfbox.pdfparser.COSParser.parseXrefObjStream(COSParser.java:333)
    at org.apache.pdfbox.pdfparser.COSParser.parseXref(COSParser.java:259)
    at org.apache.pdfbox.pdfparser.PDFParser.initialParse(PDFParser.java:194)
    at org.apache.pdfbox.pdfparser.PDFParser.parse(PDFParser.java:252)
    at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:957)
    at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:913)
    at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:861)
    at Temp.PDFTemp.App.main(App.java:19)
Caused by: java.util.zip.DataFormatException: invalid stored block lengths
    at java.util.zip.Inflater.inflateBytes(Native Method)
    at java.util.zip.Inflater.inflate(Inflater.java:259)
    at java.util.zip.Inflater.inflate(Inflater.java:280)
    at org.apache.pdfbox.filter.FlateFilter.decompress(FlateFilter.java:107)
    at org.apache.pdfbox.filter.FlateFilter.decode(FlateFilter.java:73)
    ... 12 more
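The "IOException over DataFormatException" shape in the trace above can be illustrated with the JDK alone. The following is a minimal sketch, not PDFBox's FlateFilter: the class `FlateWrapSketch` and its `inflate` helper are hypothetical names, and the byte array in `main` is a hand-built zlib stream whose stored block carries a length field that does not match its complement, which zlib is expected to reject.

```java
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.util.zip.DataFormatException;
import java.util.zip.Inflater;

class FlateWrapSketch {
    // Hypothetical helper (not PDFBox code): inflate a zlib stream and
    // rethrow the checked DataFormatException as an IOException.
    static byte[] inflate(byte[] compressed) throws IOException {
        Inflater inflater = new Inflater();
        inflater.setInput(compressed);
        byte[] buffer = new byte[1024];
        ByteArrayOutputStream out = new ByteArrayOutputStream();
        try {
            while (!inflater.finished()) {
                int n = inflater.inflate(buffer);
                if (n == 0 && (inflater.needsInput() || inflater.needsDictionary())) {
                    break; // truncated input: stop instead of looping forever
                }
                out.write(buffer, 0, n);
            }
        } catch (DataFormatException e) {
            // The cause is kept, so the original zlib message stays visible.
            throw new IOException(e);
        } finally {
            inflater.end();
        }
        return out.toByteArray();
    }

    public static void main(String[] args) {
        // zlib header (0x78 0x9C), a final stored block (0x01), then
        // LEN = 0x0005 with NLEN = 0x0000, which is not LEN's complement.
        byte[] bad = {0x78, (byte) 0x9C, 0x01, 0x05, 0x00, 0x00, 0x00};
        try {
            inflate(bad);
            System.out.println("decoded without error");
        } catch (IOException e) {
            System.out.println(e);
        }
    }
}
```

Keeping the DataFormatException as the cause is what produces the two-level trace in the report; whether a viewer tolerates such a stream is a separate question from how the exception is wrapped.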
[jira] [Updated] (PDFBOX-3546) IOException over DataFormatException, "invalid stored block lengths" on a valid PDF
[ https://issues.apache.org/jira/browse/PDFBOX-3546?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Seva Alekseyev updated PDFBOX-3546:
    Attachment: Null-control-man.pdf
[jira] [Updated] (PDFBOX-3538) IOException over NumberFormatException on a valid PDF
[ https://issues.apache.org/jira/browse/PDFBOX-3538?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Seva Alekseyev updated PDFBOX-3538: --- Attachment: PB_AGAP001539_Graphical.pdf > IOException over NumberFormatException on a valid PDF > - > > Key: PDFBOX-3538 > URL: https://issues.apache.org/jira/browse/PDFBOX-3538 > Project: PDFBox > Issue Type: Bug > Components: Parsing > Environment: Windows 7 x64, JVM 1.8.0_101 >Reporter: Seva Alekseyev > Attachments: PB_AGAP001539_Graphical.pdf > > > On the attached document, which loads and displays with Adobe Reader fine, > PDDocument.load() throws the following exception: > java.io.IOException: java.lang.NumberFormatException: For input string: > "000-21" > at > org.apache.pdfbox.pdfparser.COSParser.parseXrefTable(COSParser.java:2017) > at org.apache.pdfbox.pdfparser.COSParser.parseXref(COSParser.java:224) > at > org.apache.pdfbox.pdfparser.PDFParser.initialParse(PDFParser.java:194) > at org.apache.pdfbox.pdfparser.PDFParser.parse(PDFParser.java:252) > at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:957) > at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:913) > at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:861) > at Temp.PDFTemp.App.main(App.java:19) > Caused by: java.lang.NumberFormatException: For input string: "000-21" > at > java.lang.NumberFormatException.forInputString(NumberFormatException.java:65) > at java.lang.Long.parseLong(Long.java:589) > at java.lang.Long.parseLong(Long.java:631) > at > org.apache.pdfbox.pdfparser.COSParser.parseXrefTable(COSParser.java:2010) > ... 7 more -- This message was sent by Atlassian JIRA (v6.3.4#6332) - To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org For additional commands, e-mail: dev-h...@pdfbox.apache.org
[jira] [Created] (PDFBOX-3538) IOException over NumberFormatException on a valid PDF
Seva Alekseyev created PDFBOX-3538:

Summary: IOException over NumberFormatException on a valid PDF
Key: PDFBOX-3538
URL: https://issues.apache.org/jira/browse/PDFBOX-3538
Project: PDFBox
Issue Type: Bug
Components: Parsing
Environment: Windows 7 x64, JVM 1.8.0_101
Reporter: Seva Alekseyev

On the attached document, which loads and displays fine with Adobe Reader, PDDocument.load() throws the following exception:

java.io.IOException: java.lang.NumberFormatException: For input string: "000-21"
    at org.apache.pdfbox.pdfparser.COSParser.parseXrefTable(COSParser.java:2017)
    at org.apache.pdfbox.pdfparser.COSParser.parseXref(COSParser.java:224)
    at org.apache.pdfbox.pdfparser.PDFParser.initialParse(PDFParser.java:194)
    at org.apache.pdfbox.pdfparser.PDFParser.parse(PDFParser.java:252)
    at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:957)
    at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:913)
    at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:861)
    at Temp.PDFTemp.App.main(App.java:19)
Caused by: java.lang.NumberFormatException: For input string: "000-21"
    at java.lang.NumberFormatException.forInputString(NumberFormatException.java:65)
    at java.lang.Long.parseLong(Long.java:589)
    at java.lang.Long.parseLong(Long.java:631)
    at org.apache.pdfbox.pdfparser.COSParser.parseXrefTable(COSParser.java:2010)
    ... 7 more
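The inner exception in the trace above comes from Long.parseLong rejecting a field with a minus sign in the middle. A minimal sketch of that behaviour, using only the JDK: `XrefOffsetSketch` and `parseOffset` are hypothetical names for illustration and are not PDFBox's parseXrefTable.

```java
import java.io.IOException;

class XrefOffsetSketch {
    // Hypothetical helper (not PDFBox code): parse one numeric xref field,
    // surfacing a malformed value as an IOException with the original cause.
    static long parseOffset(String field) throws IOException {
        try {
            return Long.parseLong(field);
        } catch (NumberFormatException e) {
            throw new IOException(e);
        }
    }

    public static void main(String[] args) {
        try {
            // A sign is only accepted as the first character, so this fails.
            parseOffset("000-21");
        } catch (IOException e) {
            System.out.println(e);
        }
    }
}
```

A well-formed zero-padded field such as "0000000021" parses normally; only the embedded sign triggers the failure.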
[jira] [Updated] (PDFBOX-3533) IOException "expected number, actual=COSArray{...}" on a valid PDF
[ https://issues.apache.org/jira/browse/PDFBOX-3533?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Seva Alekseyev updated PDFBOX-3533: --- Description: On the attached PDF file, which opens fine with Adobe Reader, the PDDocument.load() method errors with the following message: {code} "expected number, actual=COSArray{[COSObject{7, 0}, COSName{XYZ}, COSNull{}, COSNull{}, COSNull{}]} at offset 497" {code} was: On the following PDF file that open with Acrobat: https://dl.dropboxusercontent.com/u/92341073/097-allowed_claims.pdf the PDDocument.load() method errors with the following message: {code} "expected number, actual=COSArray{[COSObject{7, 0}, COSName{XYZ}, COSNull{}, COSNull{}, COSNull{}]} at offset 497" {code} > IOException "expected number, actual=COSArray{...}" on a valid PDF > -- > > Key: PDFBOX-3533 > URL: https://issues.apache.org/jira/browse/PDFBOX-3533 > Project: PDFBox > Issue Type: Bug > Components: Parsing >Affects Versions: 2.1.0 > Environment: Windows 7 x64, JVM 1.8.0_101 >Reporter: Seva Alekseyev > Attachments: 097-allowed_claims.pdf > > > On the attached PDF file, which opens fine with Adobe Reader, the > PDDocument.load() method errors with the following message: > {code} > "expected number, actual=COSArray{[COSObject{7, 0}, COSName{XYZ}, COSNull{}, > COSNull{}, COSNull{}]} at offset 497" > {code} -- This message was sent by Atlassian JIRA (v6.3.4#6332) - To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org For additional commands, e-mail: dev-h...@pdfbox.apache.org
[jira] [Updated] (PDFBOX-3533) IOException "expected number, actual=COSArray{...}" on a valid PDF
[ https://issues.apache.org/jira/browse/PDFBOX-3533?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Seva Alekseyev updated PDFBOX-3533: --- Attachment: 097-allowed_claims.pdf > IOException "expected number, actual=COSArray{...}" on a valid PDF > -- > > Key: PDFBOX-3533 > URL: https://issues.apache.org/jira/browse/PDFBOX-3533 > Project: PDFBox > Issue Type: Bug > Components: Parsing >Affects Versions: 2.1.0 > Environment: Windows 7 x64, JVM 1.8.0_101 >Reporter: Seva Alekseyev > Attachments: 097-allowed_claims.pdf > > > On the following PDF file that open with Acrobat: > https://dl.dropboxusercontent.com/u/92341073/097-allowed_claims.pdf > the PDDocument.load() method errors with the following message: > {code} > "expected number, actual=COSArray{[COSObject{7, 0}, COSName{XYZ}, COSNull{}, > COSNull{}, COSNull{}]} at offset 497" > {code} -- This message was sent by Atlassian JIRA (v6.3.4#6332) - To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org For additional commands, e-mail: dev-h...@pdfbox.apache.org
[jira] [Commented] (PDFBOX-3535) ClassCastException in PDAnnotationLink.getAction()
[ https://issues.apache.org/jira/browse/PDFBOX-3535?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15592400#comment-15592400 ] Seva Alekseyev commented on PDFBOX-3535: It doesn't happen in the PDFBox trunk, I've checked. In 2.0.3 (which ships with Tika 1.13), however, the exception happens when the value of currentPage in pdftextStripper.processPages() is 4. > ClassCastException in PDAnnotationLink.getAction() > -- > > Key: PDFBOX-3535 > URL: https://issues.apache.org/jira/browse/PDFBOX-3535 > Project: PDFBox > Issue Type: Bug > Components: PDModel >Affects Versions: 2.0.3 >Reporter: Tim Allison >Priority: Trivial > Fix For: 2.0.4, 2.1.0 > > > {noformat} > Caused by: java.lang.ClassCastException: org.apache.pdfbox.cos.COSString > cannot be cast to org.apache.pdfbox.cos.COSDictionary > at > org.apache.pdfbox.pdmodel.interactive.annotation.PDAnnotationLink.getAction(PDAnnotationLink.java:88) > {noformat} > [~sevaa] raised this issue on TIKA-2121. I confirmed that it happens with > PDFBox 2.0.3. I haven't confirmed trunk yet. -- This message was sent by Atlassian JIRA (v6.3.4#6332) - To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org For additional commands, e-mail: dev-h...@pdfbox.apache.org
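The ClassCastException quoted above is the kind of failure a type check before the cast avoids. The sketch below is not the change that went into PDFBox; it uses plain `Map` objects as stand-ins for COS dictionaries, and `ActionLookupSketch` and `getAction` are hypothetical names. The only PDF-specific assumption is that a link annotation stores its action under the key `A`.

```java
import java.util.Map;
import java.util.Optional;

class ActionLookupSketch {
    // Stand-in for reading the /A entry of a link annotation. A malformed
    // file may store a string there instead of an action dictionary.
    static Optional<Map<String, Object>> getAction(Map<String, Object> annotation) {
        Object value = annotation.get("A");
        if (value instanceof Map) {
            @SuppressWarnings("unchecked")
            Map<String, Object> action = (Map<String, Object>) value;
            return Optional.of(action);
        }
        // Wrong type or missing entry: report "no action" instead of throwing.
        return Optional.empty();
    }
}
```

Returning an empty result lets a caller such as a text extractor skip the annotation and continue with the rest of the page.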
[jira] [Updated] (PDFBOX-3536) IOException "Invalid dictionary, found: 'r' but expected: '/' at offset 1148" on a valid PDF
[ https://issues.apache.org/jira/browse/PDFBOX-3536?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Seva Alekseyev updated PDFBOX-3536:
    Summary: IOException "Invalid dictionary, found: 'r' but expected: '/' at offset 1148" on a valid PDF (was: IOException "" on a valid PDF)

> IOException "Invalid dictionary, found: 'r' but expected: '/' at offset 1148" on a valid PDF
>
> Key: PDFBOX-3536
> URL: https://issues.apache.org/jira/browse/PDFBOX-3536
> Project: PDFBox
> Issue Type: Bug
> Components: Parsing
> Affects Versions: 2.0.3
> Environment: Windows 7 x64, JVM 1.8.0_101
> Reporter: Seva Alekseyev
> Attachments: resulprovao.pdf
>
> On the attached file, which loads fine with Adobe Reader, the PDDocument.load() method throws the following error:
> java.io.IOException: Unknown dir object c='>' cInt=62 peek='>' peekInt=62 at offset 1196
>     at org.apache.pdfbox.pdfparser.BaseParser.parseDirObject(BaseParser.java:982)
>     at org.apache.pdfbox.pdfparser.BaseParser.parseCOSDictionaryValue(BaseParser.java:153)
>     at org.apache.pdfbox.pdfparser.BaseParser.parseCOSDictionaryNameValuePair(BaseParser.java:277)
>     at org.apache.pdfbox.pdfparser.BaseParser.parseCOSDictionary(BaseParser.java:210)
>     at org.apache.pdfbox.pdfparser.BaseParser.parseDirObject(BaseParser.java:885)
>     at org.apache.pdfbox.pdfparser.COSParser.parseFileObject(COSParser.java:757)
>     at org.apache.pdfbox.pdfparser.COSParser.parseObjectDynamically(COSParser.java:726)
>     at org.apache.pdfbox.pdfparser.COSParser.parseObjectDynamically(COSParser.java:657)
>     at org.apache.pdfbox.pdfparser.COSParser.parseTrailerValuesDynamically(COSParser.java:2092)
>     at org.apache.pdfbox.pdfparser.PDFParser.initialParse(PDFParser.java:203)
>     at org.apache.pdfbox.pdfparser.PDFParser.parse(PDFParser.java:252)
>     at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:957)
>     at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:913)
>     at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:861)
>     at Temp.PDFTemp.App.main(App.java:19)
[jira] [Updated] (PDFBOX-3536) IOException "" on a valid PDF
[ https://issues.apache.org/jira/browse/PDFBOX-3536?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Seva Alekseyev updated PDFBOX-3536:
    Attachment: resulprovao.pdf
[jira] [Created] (PDFBOX-3536) IOException "" on a valid PDF
Seva Alekseyev created PDFBOX-3536:

Summary: IOException "" on a valid PDF
Key: PDFBOX-3536
URL: https://issues.apache.org/jira/browse/PDFBOX-3536
Project: PDFBox
Issue Type: Bug
Components: Parsing
Affects Versions: 2.0.3
Environment: Windows 7 x64, JVM 1.8.0_101
Reporter: Seva Alekseyev
Attachments: resulprovao.pdf

On the attached file, which loads fine with Adobe Reader, the PDDocument.load() method throws the following error:

java.io.IOException: Unknown dir object c='>' cInt=62 peek='>' peekInt=62 at offset 1196
    at org.apache.pdfbox.pdfparser.BaseParser.parseDirObject(BaseParser.java:982)
    at org.apache.pdfbox.pdfparser.BaseParser.parseCOSDictionaryValue(BaseParser.java:153)
    at org.apache.pdfbox.pdfparser.BaseParser.parseCOSDictionaryNameValuePair(BaseParser.java:277)
    at org.apache.pdfbox.pdfparser.BaseParser.parseCOSDictionary(BaseParser.java:210)
    at org.apache.pdfbox.pdfparser.BaseParser.parseDirObject(BaseParser.java:885)
    at org.apache.pdfbox.pdfparser.COSParser.parseFileObject(COSParser.java:757)
    at org.apache.pdfbox.pdfparser.COSParser.parseObjectDynamically(COSParser.java:726)
    at org.apache.pdfbox.pdfparser.COSParser.parseObjectDynamically(COSParser.java:657)
    at org.apache.pdfbox.pdfparser.COSParser.parseTrailerValuesDynamically(COSParser.java:2092)
    at org.apache.pdfbox.pdfparser.PDFParser.initialParse(PDFParser.java:203)
    at org.apache.pdfbox.pdfparser.PDFParser.parse(PDFParser.java:252)
    at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:957)
    at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:913)
    at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:861)
    at Temp.PDFTemp.App.main(App.java:19)
[jira] [Commented] (PDFBOX-3533) IOException "expected number, actual=COSArray{...}" on a valid PDF
[ https://issues.apache.org/jira/browse/PDFBOX-3533?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15589700#comment-15589700 ] Seva Alekseyev commented on PDFBOX-3533: I can't tell right away how many files are affected. That'll take some homework. > IOException "expected number, actual=COSArray{...}" on a valid PDF > -- > > Key: PDFBOX-3533 > URL: https://issues.apache.org/jira/browse/PDFBOX-3533 > Project: PDFBox > Issue Type: Bug > Components: Parsing >Affects Versions: 2.1.0 > Environment: Windows 7 x64, JVM 1.8.0_101 >Reporter: Seva Alekseyev > > On the following PDF file that open with Acrobat: > https://dl.dropboxusercontent.com/u/92341073/097-allowed_claims.pdf > the PDDocument.load() method errors with the following message: > {code} > "expected number, actual=COSArray{[COSObject{7, 0}, COSName{XYZ}, COSNull{}, > COSNull{}, COSNull{}]} at offset 497" > {code} -- This message was sent by Atlassian JIRA (v6.3.4#6332) - To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org For additional commands, e-mail: dev-h...@pdfbox.apache.org
[jira] [Commented] (PDFBOX-3533) IOException "expected number, actual=COSArray{...}" on a valid PDF
[ https://issues.apache.org/jira/browse/PDFBOX-3533?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15589601#comment-15589601 ]

Seva Alekseyev commented on PDFBOX-3533:

Adobe Acrobat is not "some other viewer". It's the de-facto reference implementation. The users can read this document and print this document. Ergo, the fact that I can't open and parse it is an issue I have to somehow fix.

> IOException "expected number, actual=COSArray{...}" on a valid PDF
>
> Key: PDFBOX-3533
> URL: https://issues.apache.org/jira/browse/PDFBOX-3533
> Project: PDFBox
> Issue Type: Bug
> Components: Parsing
> Affects Versions: 2.1.0
> Environment: Windows 7 x64, JVM 1.8.0_101
> Reporter: Seva Alekseyev
>
> On the following PDF file, which opens with Acrobat:
> https://dl.dropboxusercontent.com/u/92341073/097-allowed_claims.pdf
> the PDDocument.load() method errors with the following message:
> {code}
> "expected number, actual=COSArray{[COSObject{7, 0}, COSName{XYZ}, COSNull{}, COSNull{}, COSNull{}]} at offset 497"
> {code}
[jira] [Updated] (PDFBOX-3533) IOException "expected number, actual=COSArray{...}" on a valid PDF
[ https://issues.apache.org/jira/browse/PDFBOX-3533?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Seva Alekseyev updated PDFBOX-3533: --- Description: On the following PDF file that open with Acrobat: https://dl.dropboxusercontent.com/u/92341073/097-allowed_claims.pdf the PDDocument.load() method errors with the following message: "expected number, actual=COSArray{[COSObject{7, 0}, COSName{XYZ}, COSNull{}, COSNull{}, COSNull{}]} at offset 497" was: On the following PDF file that open with Acrobat: https://dl.dropboxusercontent.com/u/92341073/097-allowed_claims.pdf the Tika parser errors with the following message: "expected number, actual=COSArray{[COSObject{7, 0}, COSName{XYZ}, COSNull{}, COSNull{}, COSNull{}]} at offset 497" > IOException "expected number, actual=COSArray{...}" on a valid PDF > -- > > Key: PDFBOX-3533 > URL: https://issues.apache.org/jira/browse/PDFBOX-3533 > Project: PDFBox > Issue Type: Bug > Components: Parsing >Affects Versions: 2.1.0 > Environment: Windows 7 x64, JVM 1.8.0_101 >Reporter: Seva Alekseyev > > On the following PDF file that open with Acrobat: > https://dl.dropboxusercontent.com/u/92341073/097-allowed_claims.pdf > the PDDocument.load() method errors with the following message: > "expected number, actual=COSArray{[COSObject{7, 0}, COSName{XYZ}, COSNull{}, > COSNull{}, COSNull{}]} at offset 497" -- This message was sent by Atlassian JIRA (v6.3.4#6332) - To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org For additional commands, e-mail: dev-h...@pdfbox.apache.org
[jira] [Created] (PDFBOX-3533) IOException "expected number, actual=COSArray{...}" on a valid PDF
Seva Alekseyev created PDFBOX-3533:

Summary: IOException "expected number, actual=COSArray{...}" on a valid PDF
Key: PDFBOX-3533
URL: https://issues.apache.org/jira/browse/PDFBOX-3533
Project: PDFBox
Issue Type: Bug
Components: Parsing
Affects Versions: 2.1.0
Environment: Windows 7 x64, JVM 1.8.0_101
Reporter: Seva Alekseyev

On the following PDF file, which opens with Acrobat:
https://dl.dropboxusercontent.com/u/92341073/097-allowed_claims.pdf
the Tika parser errors with the following message:

"expected number, actual=COSArray{[COSObject{7, 0}, COSName{XYZ}, COSNull{}, COSNull{}, COSNull{}]} at offset 497"