[jira] [Commented] (PDFBOX-3934) Page missing
[ https://issues.apache.org/jira/browse/PDFBOX-3934?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16184812#comment-16184812 ] Andreas Lehmkühler commented on PDFBOX-3934: Thanks [~tilman] for the retest and good idea to add the test > Page missing > > > Key: PDFBOX-3934 > URL: https://issues.apache.org/jira/browse/PDFBOX-3934 > Project: PDFBox > Issue Type: Bug > Components: Parsing >Affects Versions: 2.0.5, 2.0.6, 2.0.7 >Reporter: Tilman Hausherr >Assignee: Andreas Lehmkühler > Labels: regression > Attachments: BCZSFNQAB62TUBURWG6B3ZOZCG5IH46P.pdf, > PDFBOX-3934-KTUUMJNQ7NYGJJMEDGSY5OEU76G6JX2V.pdf > > > The first page (with "iéseg") was in 2.0.4 but is no longer there since 2.0.5. -- This message was sent by Atlassian JIRA (v6.4.14#64029) - To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org For additional commands, e-mail: dev-h...@pdfbox.apache.org
[jira] [Commented] (PDFBOX-3934) Page missing
[ https://issues.apache.org/jira/browse/PDFBOX-3934?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16184593#comment-16184593 ] Tilman Hausherr commented on PDFBOX-3934: - All good now. I've put a test with remote loading of the genko file, this was part of PDFBox until years ago, the test had the name "testParsingTroublePDFs". The file was removed due to copyright reasons. > Page missing > > > Key: PDFBOX-3934 > URL: https://issues.apache.org/jira/browse/PDFBOX-3934 > Project: PDFBox > Issue Type: Bug > Components: Parsing >Affects Versions: 2.0.5, 2.0.6, 2.0.7 >Reporter: Tilman Hausherr >Assignee: Andreas Lehmkühler > Labels: regression > Attachments: BCZSFNQAB62TUBURWG6B3ZOZCG5IH46P.pdf, > PDFBOX-3934-KTUUMJNQ7NYGJJMEDGSY5OEU76G6JX2V.pdf > > > The first page (with "iéseg") was in 2.0.4 but is no longer there since 2.0.5. -- This message was sent by Atlassian JIRA (v6.4.14#64029) - To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org For additional commands, e-mail: dev-h...@pdfbox.apache.org
[jira] [Commented] (PDFBOX-3934) Page missing
[ https://issues.apache.org/jira/browse/PDFBOX-3934?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16184555#comment-16184555 ] ASF subversion and git services commented on PDFBOX-3934: - Commit 1810021 from [~tilman] in branch 'pdfbox/branches/2.0' [ https://svn.apache.org/r1810021 ] PDFBOX-3934: add parse test of genko file because it's been susceptible to regression for years > Page missing > > > Key: PDFBOX-3934 > URL: https://issues.apache.org/jira/browse/PDFBOX-3934 > Project: PDFBox > Issue Type: Bug > Components: Parsing >Affects Versions: 2.0.5, 2.0.6, 2.0.7 >Reporter: Tilman Hausherr >Assignee: Andreas Lehmkühler > Labels: regression > Attachments: BCZSFNQAB62TUBURWG6B3ZOZCG5IH46P.pdf, > PDFBOX-3934-KTUUMJNQ7NYGJJMEDGSY5OEU76G6JX2V.pdf > > > The first page (with "iéseg") was in 2.0.4 but is no longer there since 2.0.5. -- This message was sent by Atlassian JIRA (v6.4.14#64029) - To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org For additional commands, e-mail: dev-h...@pdfbox.apache.org
[jira] [Commented] (PDFBOX-3934) Page missing
[ https://issues.apache.org/jira/browse/PDFBOX-3934?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16184554#comment-16184554 ] ASF subversion and git services commented on PDFBOX-3934: - Commit 1810020 from [~tilman] in branch 'pdfbox/trunk' [ https://svn.apache.org/r1810020 ] PDFBOX-3934: add parse test of genko file because it's been susceptible to regression for years > Page missing > > > Key: PDFBOX-3934 > URL: https://issues.apache.org/jira/browse/PDFBOX-3934 > Project: PDFBox > Issue Type: Bug > Components: Parsing >Affects Versions: 2.0.5, 2.0.6, 2.0.7 >Reporter: Tilman Hausherr >Assignee: Andreas Lehmkühler > Labels: regression > Attachments: BCZSFNQAB62TUBURWG6B3ZOZCG5IH46P.pdf, > PDFBOX-3934-KTUUMJNQ7NYGJJMEDGSY5OEU76G6JX2V.pdf > > > The first page (with "iéseg") was in 2.0.4 but is no longer there since 2.0.5. -- This message was sent by Atlassian JIRA (v6.4.14#64029) - To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org For additional commands, e-mail: dev-h...@pdfbox.apache.org
[jira] [Commented] (PDFBOX-3934) Page missing
[ https://issues.apache.org/jira/browse/PDFBOX-3934?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16184547#comment-16184547 ] Andreas Lehmkühler commented on PDFBOX-3934: I guess I've found the missing piece > Page missing > > > Key: PDFBOX-3934 > URL: https://issues.apache.org/jira/browse/PDFBOX-3934 > Project: PDFBox > Issue Type: Bug > Components: Parsing >Affects Versions: 2.0.5, 2.0.6, 2.0.7 >Reporter: Tilman Hausherr >Assignee: Andreas Lehmkühler > Labels: regression > Attachments: BCZSFNQAB62TUBURWG6B3ZOZCG5IH46P.pdf, > PDFBOX-3934-KTUUMJNQ7NYGJJMEDGSY5OEU76G6JX2V.pdf > > > The first page (with "iéseg") was in 2.0.4 but is no longer there since 2.0.5. -- This message was sent by Atlassian JIRA (v6.4.14#64029) - To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org For additional commands, e-mail: dev-h...@pdfbox.apache.org
[jira] [Commented] (PDFBOX-3934) Page missing
[ https://issues.apache.org/jira/browse/PDFBOX-3934?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16184537#comment-16184537 ] ASF subversion and git services commented on PDFBOX-3934: - Commit 1810019 from [~lehmi] in branch 'pdfbox/trunk' [ https://svn.apache.org/r1810019 ] PDFBOX-3934: skip trailing spaces > Page missing > > > Key: PDFBOX-3934 > URL: https://issues.apache.org/jira/browse/PDFBOX-3934 > Project: PDFBox > Issue Type: Bug > Components: Parsing >Affects Versions: 2.0.5, 2.0.6, 2.0.7 >Reporter: Tilman Hausherr >Assignee: Andreas Lehmkühler > Labels: regression > Attachments: BCZSFNQAB62TUBURWG6B3ZOZCG5IH46P.pdf, > PDFBOX-3934-KTUUMJNQ7NYGJJMEDGSY5OEU76G6JX2V.pdf > > > The first page (with "iéseg") was in 2.0.4 but is no longer there since 2.0.5. -- This message was sent by Atlassian JIRA (v6.4.14#64029) - To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org For additional commands, e-mail: dev-h...@pdfbox.apache.org
[jira] [Commented] (PDFBOX-3934) Page missing
[ https://issues.apache.org/jira/browse/PDFBOX-3934?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16184536#comment-16184536 ] ASF subversion and git services commented on PDFBOX-3934: - Commit 1810018 from [~lehmi] in branch 'pdfbox/branches/2.0' [ https://svn.apache.org/r1810018 ] PDFBOX-3934: skip trailing spaces > Page missing > > > Key: PDFBOX-3934 > URL: https://issues.apache.org/jira/browse/PDFBOX-3934 > Project: PDFBox > Issue Type: Bug > Components: Parsing >Affects Versions: 2.0.5, 2.0.6, 2.0.7 >Reporter: Tilman Hausherr >Assignee: Andreas Lehmkühler > Labels: regression > Attachments: BCZSFNQAB62TUBURWG6B3ZOZCG5IH46P.pdf, > PDFBOX-3934-KTUUMJNQ7NYGJJMEDGSY5OEU76G6JX2V.pdf > > > The first page (with "iéseg") was in 2.0.4 but is no longer there since 2.0.5. -- This message was sent by Atlassian JIRA (v6.4.14#64029) - To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org For additional commands, e-mail: dev-h...@pdfbox.apache.org
[jira] [Commented] (PDFBOX-3934) Page missing
[ https://issues.apache.org/jira/browse/PDFBOX-3934?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16182983#comment-16182983 ] Tilman Hausherr commented on PDFBOX-3934: - File genko_oc_shiryo1.pdf (in PDFBOX-3788) fails with "Missing root object specification in trailer". > Page missing > > > Key: PDFBOX-3934 > URL: https://issues.apache.org/jira/browse/PDFBOX-3934 > Project: PDFBox > Issue Type: Bug > Components: Parsing >Affects Versions: 2.0.5, 2.0.6, 2.0.7 >Reporter: Tilman Hausherr >Assignee: Andreas Lehmkühler > Labels: regression > Attachments: BCZSFNQAB62TUBURWG6B3ZOZCG5IH46P.pdf, > PDFBOX-3934-KTUUMJNQ7NYGJJMEDGSY5OEU76G6JX2V.pdf > > > The first page (with "iéseg") was in 2.0.4 but is no longer there since 2.0.5. -- This message was sent by Atlassian JIRA (v6.4.14#64029) - To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org For additional commands, e-mail: dev-h...@pdfbox.apache.org
[jira] [Commented] (PDFBOX-3934) Page missing
[ https://issues.apache.org/jira/browse/PDFBOX-3934?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16182968#comment-16182968 ] Andreas Lehmkühler commented on PDFBOX-3934: I've fixed the regression. The brute force search now includes compressed objects when rebuilding the trailer information. [~tilman], [~talli...@mitre.org] Please run your tests to see if it works. > Page missing > > > Key: PDFBOX-3934 > URL: https://issues.apache.org/jira/browse/PDFBOX-3934 > Project: PDFBox > Issue Type: Bug > Components: Parsing >Affects Versions: 2.0.5, 2.0.6, 2.0.7 >Reporter: Tilman Hausherr >Assignee: Andreas Lehmkühler > Labels: regression > Attachments: BCZSFNQAB62TUBURWG6B3ZOZCG5IH46P.pdf, > PDFBOX-3934-KTUUMJNQ7NYGJJMEDGSY5OEU76G6JX2V.pdf > > > The first page (with "iéseg") was in 2.0.4 but is no longer there since 2.0.5. -- This message was sent by Atlassian JIRA (v6.4.14#64029) - To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org For additional commands, e-mail: dev-h...@pdfbox.apache.org
[jira] [Commented] (PDFBOX-3934) Page missing
[ https://issues.apache.org/jira/browse/PDFBOX-3934?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16182963#comment-16182963 ] ASF subversion and git services commented on PDFBOX-3934: - Commit 1809890 from [~lehmi] in branch 'pdfbox/trunk' [ https://svn.apache.org/r1809890 ] PDFBOX-3934: include compressed objects in brute force search when rebuilding the trailer > Page missing > > > Key: PDFBOX-3934 > URL: https://issues.apache.org/jira/browse/PDFBOX-3934 > Project: PDFBox > Issue Type: Bug > Components: Parsing >Affects Versions: 2.0.5, 2.0.6, 2.0.7 >Reporter: Tilman Hausherr >Assignee: Andreas Lehmkühler > Labels: regression > Attachments: BCZSFNQAB62TUBURWG6B3ZOZCG5IH46P.pdf, > PDFBOX-3934-KTUUMJNQ7NYGJJMEDGSY5OEU76G6JX2V.pdf > > > The first page (with "iéseg") was in 2.0.4 but is no longer there since 2.0.5. -- This message was sent by Atlassian JIRA (v6.4.14#64029) - To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org For additional commands, e-mail: dev-h...@pdfbox.apache.org
[jira] [Commented] (PDFBOX-3934) Page missing
[ https://issues.apache.org/jira/browse/PDFBOX-3934?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16182964#comment-16182964 ] ASF subversion and git services commented on PDFBOX-3934: - Commit 1809891 from [~lehmi] in branch 'pdfbox/branches/2.0' [ https://svn.apache.org/r1809891 ] PDFBOX-3934: include compressed objects in brute force search when rebuilding the trailer > Page missing > > > Key: PDFBOX-3934 > URL: https://issues.apache.org/jira/browse/PDFBOX-3934 > Project: PDFBox > Issue Type: Bug > Components: Parsing >Affects Versions: 2.0.5, 2.0.6, 2.0.7 >Reporter: Tilman Hausherr >Assignee: Andreas Lehmkühler > Labels: regression > Attachments: BCZSFNQAB62TUBURWG6B3ZOZCG5IH46P.pdf, > PDFBOX-3934-KTUUMJNQ7NYGJJMEDGSY5OEU76G6JX2V.pdf > > > The first page (with "iéseg") was in 2.0.4 but is no longer there since 2.0.5. -- This message was sent by Atlassian JIRA (v6.4.14#64029) - To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org For additional commands, e-mail: dev-h...@pdfbox.apache.org
[jira] [Commented] (PDFBOX-3934) Page missing
[ https://issues.apache.org/jira/browse/PDFBOX-3934?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16180562#comment-16180562 ] Andreas Lehmkühler commented on PDFBOX-3934: Just an update, the root cause is that the rebuild-mechanism doesn't support the recreation of compressed objects. I've a working solution for the trunk and will port it to the 2.0 branch > Page missing > > > Key: PDFBOX-3934 > URL: https://issues.apache.org/jira/browse/PDFBOX-3934 > Project: PDFBox > Issue Type: Bug > Components: Parsing >Affects Versions: 2.0.5, 2.0.6, 2.0.7 >Reporter: Tilman Hausherr >Assignee: Andreas Lehmkühler > Labels: regression > Attachments: BCZSFNQAB62TUBURWG6B3ZOZCG5IH46P.pdf, > PDFBOX-3934-KTUUMJNQ7NYGJJMEDGSY5OEU76G6JX2V.pdf > > > The first page (with "iéseg") was in 2.0.4 but is no longer there since 2.0.5. -- This message was sent by Atlassian JIRA (v6.4.14#64029) - To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org For additional commands, e-mail: dev-h...@pdfbox.apache.org
[jira] [Commented] (PDFBOX-3934) Page missing
[ https://issues.apache.org/jira/browse/PDFBOX-3934?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16178167#comment-16178167 ] Andreas Lehmkühler commented on PDFBOX-3934: The former brute force search doesn't work with truncated files as it omits any valid content after the last found startxref entry. Unfortunately the regression from PDFBOX-3318 is back. > Page missing > > > Key: PDFBOX-3934 > URL: https://issues.apache.org/jira/browse/PDFBOX-3934 > Project: PDFBox > Issue Type: Bug > Components: Parsing >Affects Versions: 2.0.5, 2.0.6, 2.0.7 >Reporter: Tilman Hausherr >Assignee: Andreas Lehmkühler > Labels: regression > Attachments: BCZSFNQAB62TUBURWG6B3ZOZCG5IH46P.pdf, > PDFBOX-3934-KTUUMJNQ7NYGJJMEDGSY5OEU76G6JX2V.pdf > > > The first page (with "iéseg") was in 2.0.4 but is no longer there since 2.0.5. -- This message was sent by Atlassian JIRA (v6.4.14#64029) - To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org For additional commands, e-mail: dev-h...@pdfbox.apache.org
[jira] [Commented] (PDFBOX-3934) Page missing
[ https://issues.apache.org/jira/browse/PDFBOX-3934?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16178166#comment-16178166 ] Tilman Hausherr commented on PDFBOX-3934: - PDFBOX-3318 fails now with "root cannot be null". > Page missing > > > Key: PDFBOX-3934 > URL: https://issues.apache.org/jira/browse/PDFBOX-3934 > Project: PDFBox > Issue Type: Bug > Components: Parsing >Affects Versions: 2.0.5, 2.0.6, 2.0.7 >Reporter: Tilman Hausherr >Assignee: Andreas Lehmkühler > Labels: regression > Attachments: BCZSFNQAB62TUBURWG6B3ZOZCG5IH46P.pdf, > PDFBOX-3934-KTUUMJNQ7NYGJJMEDGSY5OEU76G6JX2V.pdf > > > The first page (with "iéseg") was in 2.0.4 but is no longer there since 2.0.5. -- This message was sent by Atlassian JIRA (v6.4.14#64029) - To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org For additional commands, e-mail: dev-h...@pdfbox.apache.org
[jira] [Commented] (PDFBOX-3934) Page missing
[ https://issues.apache.org/jira/browse/PDFBOX-3934?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16178155#comment-16178155 ] ASF subversion and git services commented on PDFBOX-3934: - Commit 1809500 from [~lehmi] in branch 'pdfbox/branches/2.0' [ https://svn.apache.org/r1809500 ] PDFBOX-3934: removed brute force search for last startxref entry fall back to rebuildTrailer instead, improved garbage detection > Page missing > > > Key: PDFBOX-3934 > URL: https://issues.apache.org/jira/browse/PDFBOX-3934 > Project: PDFBox > Issue Type: Bug > Components: Parsing >Affects Versions: 2.0.5, 2.0.6, 2.0.7 >Reporter: Tilman Hausherr >Assignee: Andreas Lehmkühler > Labels: regression > Attachments: BCZSFNQAB62TUBURWG6B3ZOZCG5IH46P.pdf, > PDFBOX-3934-KTUUMJNQ7NYGJJMEDGSY5OEU76G6JX2V.pdf > > > The first page (with "iéseg") was in 2.0.4 but is no longer there since 2.0.5. -- This message was sent by Atlassian JIRA (v6.4.14#64029) - To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org For additional commands, e-mail: dev-h...@pdfbox.apache.org
[jira] [Commented] (PDFBOX-3934) Page missing
[ https://issues.apache.org/jira/browse/PDFBOX-3934?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16178156#comment-16178156 ] ASF subversion and git services commented on PDFBOX-3934: - Commit 1809501 from [~lehmi] in branch 'pdfbox/trunk' [ https://svn.apache.org/r1809501 ] PDFBOX-3934: removed brute force search for last startxref entry fall back to rebuildTrailer instead, improved garbage detection > Page missing > > > Key: PDFBOX-3934 > URL: https://issues.apache.org/jira/browse/PDFBOX-3934 > Project: PDFBox > Issue Type: Bug > Components: Parsing >Affects Versions: 2.0.5, 2.0.6, 2.0.7 >Reporter: Tilman Hausherr >Assignee: Andreas Lehmkühler > Labels: regression > Attachments: BCZSFNQAB62TUBURWG6B3ZOZCG5IH46P.pdf, > PDFBOX-3934-KTUUMJNQ7NYGJJMEDGSY5OEU76G6JX2V.pdf > > > The first page (with "iéseg") was in 2.0.4 but is no longer there since 2.0.5. -- This message was sent by Atlassian JIRA (v6.4.14#64029) - To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org For additional commands, e-mail: dev-h...@pdfbox.apache.org