[jira] [Commented] (PDFBOX-3934) Page missing

2017-09-28 Thread JIRA

[ 
https://issues.apache.org/jira/browse/PDFBOX-3934?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16184812#comment-16184812
 ] 

Andreas Lehmkühler commented on PDFBOX-3934:


Thanks, [~tilman], for the retest. Adding the test was a good idea.

> Page missing
> 
>
> Key: PDFBOX-3934
> URL: https://issues.apache.org/jira/browse/PDFBOX-3934
> Project: PDFBox
> Issue Type: Bug
> Components: Parsing
> Affects Versions: 2.0.5, 2.0.6, 2.0.7
> Reporter: Tilman Hausherr
> Assignee: Andreas Lehmkühler
> Labels: regression
> Attachments: BCZSFNQAB62TUBURWG6B3ZOZCG5IH46P.pdf, PDFBOX-3934-KTUUMJNQ7NYGJJMEDGSY5OEU76G6JX2V.pdf
>
>
> The first page (with "iéseg") was in 2.0.4 but is no longer there since 2.0.5.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



[jira] [Commented] (PDFBOX-3934) Page missing

2017-09-28 Thread Tilman Hausherr (JIRA)

[ 
https://issues.apache.org/jira/browse/PDFBOX-3934?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16184593#comment-16184593
 ] 

Tilman Hausherr commented on PDFBOX-3934:
-

All good now. I've added a test that loads the genko file remotely. Such a test 
was part of PDFBox until years ago under the name "testParsingTroublePDFs"; 
the file was removed for copyright reasons.




[jira] [Commented] (PDFBOX-3934) Page missing

2017-09-28 Thread ASF subversion and git services (JIRA)

[ 
https://issues.apache.org/jira/browse/PDFBOX-3934?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16184555#comment-16184555
 ] 

ASF subversion and git services commented on PDFBOX-3934:
-

Commit 1810021 from [~tilman] in branch 'pdfbox/branches/2.0'
[ https://svn.apache.org/r1810021 ]

PDFBOX-3934: add parse test of genko file because it's been susceptible to 
regression for years




[jira] [Commented] (PDFBOX-3934) Page missing

2017-09-28 Thread ASF subversion and git services (JIRA)

[ 
https://issues.apache.org/jira/browse/PDFBOX-3934?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16184554#comment-16184554
 ] 

ASF subversion and git services commented on PDFBOX-3934:
-

Commit 1810020 from [~tilman] in branch 'pdfbox/trunk'
[ https://svn.apache.org/r1810020 ]

PDFBOX-3934: add parse test of genko file because it's been susceptible to 
regression for years




[jira] [Commented] (PDFBOX-3934) Page missing

2017-09-28 Thread JIRA

[ 
https://issues.apache.org/jira/browse/PDFBOX-3934?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16184547#comment-16184547
 ] 

Andreas Lehmkühler commented on PDFBOX-3934:


I guess I've found the missing piece




[jira] [Commented] (PDFBOX-3934) Page missing

2017-09-28 Thread ASF subversion and git services (JIRA)

[ 
https://issues.apache.org/jira/browse/PDFBOX-3934?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16184537#comment-16184537
 ] 

ASF subversion and git services commented on PDFBOX-3934:
-

Commit 1810019 from [~lehmi] in branch 'pdfbox/trunk'
[ https://svn.apache.org/r1810019 ]

PDFBOX-3934: skip trailing spaces
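
The commit message does not say which spaces are skipped or where in the parser this happens. As a minimal sketch, assuming the goal is to ignore white space that follows the last meaningful byte of a buffer, the idea could look like the following. The class and method names are hypothetical and are not taken from PDFBox; the set of white-space bytes is the one defined by the PDF specification.

```java
import java.nio.charset.StandardCharsets;

final class TrailingWhitespaceSketch {
    // White-space characters as defined by the PDF specification:
    // NUL, TAB, LF, FF, CR and SPACE.
    static boolean isPdfWhitespace(int b) {
        return b == 0x00 || b == 0x09 || b == 0x0A
            || b == 0x0C || b == 0x0D || b == 0x20;
    }

    // Returns the length of the data once trailing white space is ignored.
    static int lengthWithoutTrailingWhitespace(byte[] data) {
        int end = data.length;
        while (end > 0 && isPdfWhitespace(data[end - 1])) {
            end--;
        }
        return end;
    }

    public static void main(String[] args) {
        byte[] tail = "%%EOF \r\n  ".getBytes(StandardCharsets.US_ASCII);
        // Prints 5, the length of "%%EOF" without the trailing white space.
        System.out.println(lengthWithoutTrailingWhitespace(tail));
    }
}
```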




[jira] [Commented] (PDFBOX-3934) Page missing

2017-09-28 Thread ASF subversion and git services (JIRA)

[ 
https://issues.apache.org/jira/browse/PDFBOX-3934?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16184536#comment-16184536
 ] 

ASF subversion and git services commented on PDFBOX-3934:
-

Commit 1810018 from [~lehmi] in branch 'pdfbox/branches/2.0'
[ https://svn.apache.org/r1810018 ]

PDFBOX-3934: skip trailing spaces




[jira] [Commented] (PDFBOX-3934) Page missing

2017-09-27 Thread Tilman Hausherr (JIRA)

[ 
https://issues.apache.org/jira/browse/PDFBOX-3934?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16182983#comment-16182983
 ] 

Tilman Hausherr commented on PDFBOX-3934:
-

File genko_oc_shiryo1.pdf (in PDFBOX-3788) fails with "Missing root object 
specification in trailer".




[jira] [Commented] (PDFBOX-3934) Page missing

2017-09-27 Thread JIRA

[ 
https://issues.apache.org/jira/browse/PDFBOX-3934?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16182968#comment-16182968
 ] 

Andreas Lehmkühler commented on PDFBOX-3934:


I've fixed the regression. The brute force search now includes compressed 
objects when rebuilding the trailer information.

[~tilman], [~talli...@mitre.org] Please run your tests to see if it works.
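
To illustrate what a brute force search that is aware of compressed objects has to notice, here is a rough, self-contained sketch. It is not the PDFBox implementation: the class and method names are hypothetical, and the regex-based scan is an assumption made for brevity. It scans the raw bytes for "N G obj" headers and reports which of the objects found are object streams (/Type /ObjStm), since those are the containers whose compressed objects would also have to be considered.

```java
import java.nio.charset.StandardCharsets;
import java.util.LinkedHashMap;
import java.util.Map;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

final class BruteForceObjectScanSketch {
    // Matches an object header such as "12 0 obj".
    private static final Pattern OBJ_HEADER =
            Pattern.compile("(\\d+)\\s+(\\d+)\\s+obj\\b");
    // Matches the dictionary entry that marks an object stream.
    private static final Pattern OBJ_STREAM_TYPE =
            Pattern.compile("/Type\\s*/ObjStm\\b");

    // ISO-8859-1 maps every byte to one char, so string indices are byte offsets.
    private static String asText(byte[] data) {
        return new String(data, StandardCharsets.ISO_8859_1);
    }

    // Maps each object number found in the file to the offset of its header.
    static Map<Integer, Integer> findObjectOffsets(byte[] data) {
        Map<Integer, Integer> offsets = new LinkedHashMap<>();
        Matcher m = OBJ_HEADER.matcher(asText(data));
        while (m.find()) {
            offsets.put(Integer.parseInt(m.group(1)), m.start());
        }
        return offsets;
    }

    // True if the object starting at the given offset declares /Type /ObjStm
    // before its "endobj" keyword (or before the end of the data).
    static boolean isObjectStream(byte[] data, int offset) {
        String text = asText(data);
        int end = text.indexOf("endobj", offset);
        String body = text.substring(offset, end < 0 ? text.length() : end);
        return OBJ_STREAM_TYPE.matcher(body).find();
    }

    public static void main(String[] args) {
        String pdf = "%PDF-1.5\n"
                + "1 0 obj\n<< /Type /Catalog /Pages 2 0 R >>\nendobj\n"
                + "5 0 obj\n<< /Type /ObjStm /N 2 /First 10 /Length 0 >>\n"
                + "stream\nendstream\nendobj\n";
        byte[] data = pdf.getBytes(StandardCharsets.ISO_8859_1);
        for (Map.Entry<Integer, Integer> e : findObjectOffsets(data).entrySet()) {
            System.out.println("object " + e.getKey() + " at " + e.getValue()
                    + ", object stream: " + isObjectStream(data, e.getValue()));
        }
    }
}
```

A real parser would also have to decode each object stream before it can see the objects stored inside it; the sketch only locates the containers.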




[jira] [Commented] (PDFBOX-3934) Page missing

2017-09-27 Thread ASF subversion and git services (JIRA)

[ 
https://issues.apache.org/jira/browse/PDFBOX-3934?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16182963#comment-16182963
 ] 

ASF subversion and git services commented on PDFBOX-3934:
-

Commit 1809890 from [~lehmi] in branch 'pdfbox/trunk'
[ https://svn.apache.org/r1809890 ]

PDFBOX-3934: include compressed objects in brute force search when rebuilding 
the trailer




[jira] [Commented] (PDFBOX-3934) Page missing

2017-09-27 Thread ASF subversion and git services (JIRA)

[ 
https://issues.apache.org/jira/browse/PDFBOX-3934?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16182964#comment-16182964
 ] 

ASF subversion and git services commented on PDFBOX-3934:
-

Commit 1809891 from [~lehmi] in branch 'pdfbox/branches/2.0'
[ https://svn.apache.org/r1809891 ]

PDFBOX-3934: include compressed objects in brute force search when rebuilding 
the trailer




[jira] [Commented] (PDFBOX-3934) Page missing

2017-09-26 Thread JIRA

[ 
https://issues.apache.org/jira/browse/PDFBOX-3934?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16180562#comment-16180562
 ] 

Andreas Lehmkühler commented on PDFBOX-3934:


Just an update: the root cause is that the rebuild mechanism doesn't support 
the recreation of compressed objects. I have a working solution for the trunk 
and will port it to the 2.0 branch.
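
For background, compressed objects live inside object streams. Per the PDF specification, the decoded stream starts with N pairs of integers (object number, then byte offset relative to /First), followed by the objects themselves. The sketch below shows how those pairs can be turned into an index of the compressed objects. It is only an illustration of the file format; the class and method names are hypothetical, and it says nothing about how the PDFBox fix is implemented.

```java
import java.util.LinkedHashMap;
import java.util.Map;

final class ObjectStreamHeaderSketch {
    // Parses the header of an already decoded object stream.
    // n     = value of the /N entry (number of compressed objects)
    // first = value of the /First entry (offset of the first object)
    // Returns object number -> offset of that object within the decoded stream.
    static Map<Integer, Integer> parseHeader(String decoded, int n, int first) {
        String[] tokens = decoded.substring(0, first).trim().split("\\s+");
        if (tokens.length < 2 * n) {
            throw new IllegalArgumentException(
                    "header too short for " + n + " objects");
        }
        Map<Integer, Integer> offsets = new LinkedHashMap<>();
        for (int i = 0; i < n; i++) {
            int objectNumber = Integer.parseInt(tokens[2 * i]);
            int relativeOffset = Integer.parseInt(tokens[2 * i + 1]);
            // Offsets in the header are relative to /First.
            offsets.put(objectNumber, first + relativeOffset);
        }
        return offsets;
    }

    public static void main(String[] args) {
        String catalog = "<</Type/Catalog>> ";
        String page = "<</Type/Page>>";
        // Two compressed objects, numbered 11 and 12 for the example.
        String header = "11 0 12 " + catalog.length() + " ";
        Map<Integer, Integer> offsets =
                parseHeader(header + catalog + page, 2, header.length());
        System.out.println(offsets);
    }
}
```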




[jira] [Commented] (PDFBOX-3934) Page missing

2017-09-24 Thread JIRA

[ 
https://issues.apache.org/jira/browse/PDFBOX-3934?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16178167#comment-16178167
 ] 

Andreas Lehmkühler commented on PDFBOX-3934:


The former brute force search doesn't work with truncated files, as it omits 
any valid content after the last startxref entry found. Unfortunately, the 
regression from PDFBOX-3318 is back.
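
The effect described here can be reproduced on a toy input. The following sketch is an illustration only, with hypothetical names and a deliberately crude regex scan rather than the PDFBox parser: it locates the last "startxref" keyword and counts the object headers that appear after it, which are exactly the objects a search stopping at that keyword would never see.

```java
import java.nio.charset.StandardCharsets;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

final class LastStartxrefSketch {
    // Matches an object header such as "2 0 obj".
    private static final Pattern OBJ_HEADER =
            Pattern.compile("\\d+\\s+\\d+\\s+obj\\b");

    // ISO-8859-1 maps every byte to one char, so string indices are byte offsets.
    private static String asText(byte[] data) {
        return new String(data, StandardCharsets.ISO_8859_1);
    }

    // Offset of the last "startxref" keyword, or -1 if there is none.
    static int lastStartxref(byte[] data) {
        return asText(data).lastIndexOf("startxref");
    }

    // Number of object headers that start at or after the given offset.
    static int countObjectHeadersFrom(byte[] data, int from) {
        Matcher m = OBJ_HEADER.matcher(asText(data));
        int count = 0;
        while (m.find()) {
            if (m.start() >= from) {
                count++;
            }
        }
        return count;
    }

    public static void main(String[] args) {
        // A file whose tail was cut off after one more object was appended.
        String pdf = "1 0 obj\n<< /Type /Catalog >>\nendobj\n"
                + "startxref\n0\n%%EOF\n"
                + "2 0 obj\n<< /Type /Page >>\nendobj\n";
        byte[] data = pdf.getBytes(StandardCharsets.ISO_8859_1);
        int last = lastStartxref(data);
        System.out.println("objects in file: " + countObjectHeadersFrom(data, 0));
        System.out.println("objects after last startxref: "
                + countObjectHeadersFrom(data, last));
    }
}
```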




[jira] [Commented] (PDFBOX-3934) Page missing

2017-09-24 Thread Tilman Hausherr (JIRA)

[ 
https://issues.apache.org/jira/browse/PDFBOX-3934?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16178166#comment-16178166
 ] 

Tilman Hausherr commented on PDFBOX-3934:
-

PDFBOX-3318 fails now with "root cannot be null".




[jira] [Commented] (PDFBOX-3934) Page missing

2017-09-24 Thread ASF subversion and git services (JIRA)

[ 
https://issues.apache.org/jira/browse/PDFBOX-3934?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16178155#comment-16178155
 ] 

ASF subversion and git services commented on PDFBOX-3934:
-

Commit 1809500 from [~lehmi] in branch 'pdfbox/branches/2.0'
[ https://svn.apache.org/r1809500 ]

PDFBOX-3934: removed brute force search for last startxref entry, fall back to 
rebuildTrailer instead; improved garbage detection




[jira] [Commented] (PDFBOX-3934) Page missing

2017-09-24 Thread ASF subversion and git services (JIRA)

[ 
https://issues.apache.org/jira/browse/PDFBOX-3934?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16178156#comment-16178156
 ] 

ASF subversion and git services commented on PDFBOX-3934:
-

Commit 1809501 from [~lehmi] in branch 'pdfbox/trunk'
[ https://svn.apache.org/r1809501 ]

PDFBOX-3934: removed brute force search for last startxref entry, fall back to 
rebuildTrailer instead; improved garbage detection
