[jira] [Commented] (TIKA-3111) Upgrade to PDFBox 2.0.20

2020-06-16 Thread Hudson (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3111?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17137866#comment-17137866 ] Hudson commented on TIKA-3111: -- SUCCESS: Integrated in Jenkins build tika-branch-1x #341 (See

[jira] [Commented] (TIKA-3111) Upgrade to PDFBox 2.0.20

2020-06-14 Thread Jira
[ https://issues.apache.org/jira/browse/TIKA-3111?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17135405#comment-17135405 ] Andreas Lehmkühler commented on TIKA-3111: -- Thanks for the prompt feedback [~tilman] > Upgrade

[jira] [Commented] (TIKA-3111) Upgrade to PDFBox 2.0.20

2020-06-14 Thread Tilman Hausherr (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3111?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17135203#comment-17135203 ] Tilman Hausherr commented on TIKA-3111: --- Now it works > Upgrade to PDFBox 2.0.20 >

[jira] [Commented] (TIKA-3111) Upgrade to PDFBox 2.0.20

2020-06-14 Thread Jira
[ https://issues.apache.org/jira/browse/TIKA-3111?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17135145#comment-17135145 ] Andreas Lehmkühler commented on TIKA-3111: -- I've extended my patch and taken

[jira] [Commented] (TIKA-3111) Upgrade to PDFBox 2.0.20

2020-06-13 Thread Jira
[ https://issues.apache.org/jira/browse/TIKA-3111?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17134780#comment-17134780 ] Andreas Lehmkühler commented on TIKA-3111: -- Thanks for the fast feedback and the inconvenience.

[jira] [Commented] (TIKA-3111) Upgrade to PDFBox 2.0.20

2020-06-13 Thread Tilman Hausherr (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3111?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17134768#comment-17134768 ] Tilman Hausherr commented on TIKA-3111: --- I did (after reverting my change in Tika), and it doesn't

[jira] [Commented] (TIKA-3111) Upgrade to PDFBox 2.0.20

2020-06-13 Thread Jira
[ https://issues.apache.org/jira/browse/TIKA-3111?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17134727#comment-17134727 ] Andreas Lehmkühler commented on TIKA-3111: -- I guess I've reinstated binary compatibility, see the

[jira] [Commented] (TIKA-3111) Upgrade to PDFBox 2.0.20

2020-06-12 Thread Jira
[ https://issues.apache.org/jira/browse/TIKA-3111?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17134345#comment-17134345 ] Andreas Lehmkühler commented on TIKA-3111: -- [~tilman] Yes, you're right the contract is broken,

[jira] [Commented] (TIKA-3111) Upgrade to PDFBox 2.0.20

2020-06-12 Thread Tilman Hausherr (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3111?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17134303#comment-17134303 ] Tilman Hausherr commented on TIKA-3111: --- No, I got it to work with several changes in

[jira] [Commented] (TIKA-3111) Upgrade to PDFBox 2.0.20

2020-06-12 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3111?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17134289#comment-17134289 ] Tim Allison commented on TIKA-3111: --- Thank you! So, we should switch to PDFStreamEngine from

[jira] [Commented] (TIKA-3111) Upgrade to PDFBox 2.0.20

2020-06-12 Thread Tilman Hausherr (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3111?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17134275#comment-17134275 ] Tilman Hausherr commented on TIKA-3111: --- Got it. PDFStreamEngine calls the (new) 4 parameter

[jira] [Commented] (TIKA-3111) Upgrade to PDFBox 2.0.20

2020-06-12 Thread Tilman Hausherr (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3111?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17134264#comment-17134264 ] Tilman Hausherr commented on TIKA-3111: --- Ignore my comment, it isn't helpful here, I was just

[jira] [Commented] (TIKA-3111) Upgrade to PDFBox 2.0.20

2020-06-12 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3111?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17134100#comment-17134100 ] Tim Allison commented on TIKA-3111: --- Sorry, to clarify, we don’t get character counts for _any_ pages in

[jira] [Commented] (TIKA-3111) Upgrade to PDFBox 2.0.20

2020-06-12 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3111?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17134097#comment-17134097 ] Tim Allison commented on TIKA-3111: --- Not sure I follow. Text extraction seems to be the same (on a

[jira] [Commented] (TIKA-3111) Upgrade to PDFBox 2.0.20

2020-06-12 Thread Tilman Hausherr (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3111?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17133978#comment-17133978 ] Tilman Hausherr commented on TIKA-3111: --- tail of debug log for 2.0.19: {quote} Warning

[jira] [Commented] (TIKA-3111) Upgrade to PDFBox 2.0.20

2020-06-11 Thread Hudson (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3111?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17133743#comment-17133743 ] Hudson commented on TIKA-3111: -- SUCCESS: Integrated in Jenkins build Tika-trunk #1821 (See

[jira] [Commented] (TIKA-3111) Upgrade to PDFBox 2.0.20

2020-06-11 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3111?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17133677#comment-17133677 ] Tim Allison commented on TIKA-3111: --- I made the upgrade in master, but came across a weird failure: