PDFBox 2.0.29 release?

2023-05-23 Thread Andreas Lehmkuehler
Hi, I tend to release 2.0.29 soon due to the regression which was solved with PDFBOX-5606. WDYT? Andreas - To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org For additional commands, e-mail:

[ANNOUNCE] Apache PDFBox 2.0.28 released

2023-04-13 Thread Andreas Lehmkuehler
The Apache PDFBox community is pleased to announce the release of Apache PDFBox version 2.0.28. The release is available for download at: https://pdfbox.apache.org/download.html See the full release notes below for details about this release. Release Notes -- Apache PDFBox -- Version 2.0.28

[RESULT][VOTE] Release Apache PDFBox 2.0.28

2023-04-13 Thread Andreas Lehmkuehler
Am 10.04.23 um 12:15 schrieb Andreas Lehmkuehler: Please vote on releasing this package as Apache PDFBox 2.0.28. +1 Tilman Hausherr +1 Maruan Sahyoun +1 Tim Allison +1 Andreas Lehmkühler Thanks for your support and help!! I'm going to push the release out. Andreas

Re: Apache PDFBox Board Report April 2023 due

2023-04-11 Thread Andreas Lehmkuehler
Hi, thanks for your reviews, I've submitted the report as proposed Andreas Am 10.04.23 um 17:30 schrieb Andreas Lehmkuehler: Hi, find attached a quick draft of the board report we're expected to submit this month. It's based upon the report wizard template which can be found at [1] Any

Apache PDFBox Board Report April 2023 due

2023-04-10 Thread Andreas Lehmkuehler
Hi, find attached a quick draft of the board report we're expected to submit this month. It's based upon the report wizard template which can be found at [1] Any comments or additions are appreciated ... ## Description: The mission of PDFBox is the creation and maintenance of software related

Re: Fwd: 2.0.28 release?

2023-04-10 Thread Andreas Lehmkuehler
as well. I seems as if the test run doesn't reveals any reason to stop the release. WDYT? Andreas On Mon, Apr 10, 2023 at 7:02 AM Andreas Lehmkuehler wrote: Sounds like one of those expected issues. I guess PDFBox now swallows the former exception and is able to process the pdf in question

Re: Fwd: 2.0.28 release?

2023-04-10 Thread Andreas Lehmkuehler
211) at org.apache.tika.parser.pdf.PDFParserTest.testSkipBadPage(PDFParserTest.java:1044) On Mon, Apr 10, 2023 at 6:41 AM Tim Allison wrote: Y. Will start process now. Thank you! On Mon, Apr 10, 2023 at 6:20 AM Andreas Lehmkuehler wrote: Hi, I've finished the release process and provi

Re: Fwd: 2.0.28 release?

2023-04-10 Thread Andreas Lehmkuehler
exceptions than before but you knows Thanks in advance Andreas Am 10.04.23 um 11:42 schrieb Andreas Lehmkuehler: Am 10.04.23 um 04:32 schrieb Tilman Hausherr: On 09.04.2023 22:36, Andreas Lehmkuehler wrote: OK, so there is one more question left: do we need to re-run the tests before

[VOTE] Release Apache PDFBox 2.0.28

2023-04-10 Thread Andreas Lehmkuehler
Hi, a candidate for the PDFBox 2.0.28 release is available at: https://dist.apache.org/repos/dist/dev/pdfbox/2.0.28/ The release candidate is a zip archive of the sources in: https://svn.apache.org/repos/asf/pdfbox/tags/2.0.28/ The SHA-512 checksum of the archive is

Re: Fwd: 2.0.28 release?

2023-04-10 Thread Andreas Lehmkuehler
Am 10.04.23 um 04:32 schrieb Tilman Hausherr: On 09.04.2023 22:36, Andreas Lehmkuehler wrote: OK, so there is one more question left: do we need to re-run the tests before starting the release process? Yes I prefer to have another comparison, but it can be done in parallel. Good idea, I'm

Re: Fwd: 2.0.28 release?

2023-04-09 Thread Andreas Lehmkuehler
OK, so there is one more question left: do we need to re-run the tests before starting the release process? Andreas Am 09.04.23 um 20:56 schrieb Tilman Hausherr: On 09.04.2023 17:35, Andreas Lehmkuehler wrote: Hi, I've fixed the issue with 2 of the 3 pdfs. GHOSTSCRIPT-702891-0.pdf is left

Re: Fwd: 2.0.28 release?

2023-04-09 Thread Andreas Lehmkuehler
leave it alone, as it is malformed anmd doesn't contain any useful content. More important, it is one pdf out of hundreds of thoudsands, just a corner cases. WDYT? Andreas Am 05.04.23 um 08:10 schrieb Andreas Lehmkuehler: Am 04.04.23 um 07:40 schrieb Andreas Lehmkuehler: Am 03.04.23 um 19:50

Re: Fwd: 2.0.28 release?

2023-04-05 Thread Andreas Lehmkuehler
Am 04.04.23 um 07:40 schrieb Andreas Lehmkuehler: Am 03.04.23 um 19:50 schrieb Tim Allison: https://corpora.tika.apache.org/base/reports/pdfbox-2.0.27-v-2.0.28-20230403-reports.tgz Haven't had a chance to take a look yet. :( Thanks Tim! There are still 5 new exceptions listed. All of them

Re: Fwd: 2.0.28 release?

2023-04-03 Thread Andreas Lehmkuehler
Tilman --- Original-Nachricht --- Von: Tim Allison Betreff: Re: Fwd: 2.0.28 release? Datum: 03. April 2023, 12:47 An: dev@pdfbox.apache.org Y. I can kick that off now. Or should I wait? On Sat, Apr 1, 2023 at 2:06 PM Andreas Lehmkuehler mailto:andr...@lehmi.de> > wrote: @Tim <ma

Re: Fwd: 2.0.28 release?

2023-04-01 Thread Andreas Lehmkuehler
@Tim Is there any chance to re-run the tests? Andreas Am 01.04.23 um 17:08 schrieb Andreas Lehmkuehler: Am 01.04.23 um 17:05 schrieb Andreas Lehmkuehler: I've accidentally send this to Tim only :-| Weitergeleitete Nachricht Betreff: Re: 2.0.28 release? Datum: Fri, 31 Mar

Re: Fwd: 2.0.28 release?

2023-04-01 Thread Andreas Lehmkuehler
Am 01.04.23 um 17:05 schrieb Andreas Lehmkuehler: I've accidentally send this to Tim only :-| Weitergeleitete Nachricht Betreff: Re: 2.0.28 release? Datum: Fri, 31 Mar 2023 07:50:10 +0200 Von: Andreas Lehmkuehler An: Tim Allison Am 30.03.23 um 16:27 schrieb Tim Allison

Fwd: 2.0.28 release?

2023-04-01 Thread Andreas Lehmkuehler
I've accidentally send this to Tim only :-| Weitergeleitete Nachricht Betreff: Re: 2.0.28 release? Datum: Fri, 31 Mar 2023 07:50:10 +0200 Von: Andreas Lehmkuehler An: Tim Allison Am 30.03.23 um 16:27 schrieb Tim Allison: Reports are here: https://corpora.tika.apache.org

Re: 2.0.28 release?

2023-03-28 Thread Andreas Lehmkuehler
Tilman Hausherr wrote: +1 Tilman On 28.03.2023 08:46, Andreas Lehmkuehler wrote: Hi, how about cutting a 2.0.28 release next week on Monday? there is a bunch of solved tickets and the last release dates back 6 months Andreas

2.0.28 release?

2023-03-28 Thread Andreas Lehmkuehler
Hi, how about cutting a 2.0.28 release next week on Monday? there is a bunch of solved tickets and the last release dates back 6 months Andreas - To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org For additional

Re: Minimum Java version for PDFBox 3.x

2023-03-19 Thread Andreas Lehmkuehler
I've created https://issues.apache.org/jira/browse/PDFBOX-5576 to not forget about this ;-) Andreas Am 18.03.23 um 18:07 schrieb Maruan Sahyoun: Fine - so let‘s target that for 4x Am 18.03.2023 um 16:51 schrieb Andreas Lehmkuehler : Am 18.03.23 um 10:49 schrieb Maruan Sahyoun: I‘d second

Re: Minimum Java version for PDFBox 3.x

2023-03-18 Thread Andreas Lehmkuehler
Am 18.03.23 um 10:13 schrieb Tilman Hausherr: You may have a point with some of your arguments, but not this one: Public updates for Java 8 have stopped in march 2022, now one year ago My latest jdk8 is from January 17th of this year. (Amazon Corretto) About the difficulty to find

Re: Minimum Java version for PDFBox 3.x

2023-03-18 Thread Andreas Lehmkuehler
Am 18.03.23 um 10:49 schrieb Maruan Sahyoun: I‘d second a move to 11 for 3.x as for the lifetime of 3.x this will enable us to use newer funtions without another major release. I'd like to do so for the next major version 4.0.x. Hopefully it won't take us that much time to release that version

Re: Minimum Java version for PDFBox 3.x

2023-03-18 Thread Andreas Lehmkuehler
is missing functionality, and IMHO should be replaced by SLF4J 2 or log4j. But that’s another point (and yes, I’d volunteer to do the transition provided there’s a chance to get it in). Cheers, Axel Am 17.03.2023 um 20:06 schrieb Andreas Lehmkuehler : Am 17.03.23 um 10:09 schrieb axh: Hi, I am

Re: Minimum Java version for PDFBox 3.x

2023-03-17 Thread Andreas Lehmkuehler
Am 17.03.23 um 10:09 schrieb axh: Hi, I am developing a software that relies heavily on Apache PdfBox. It uses the current the current PDFBox 3.0.0 from trunk, with some patches. I wanted to know what your thoughts are about raising the minimum Java version for PDFBox 3.x to Java 11. I know

Re: PDFBOx 3.0.0-beta1 release

2023-01-18 Thread Andreas Lehmkuehler
Am 11.01.2023 um 08:24 schrieb Andreas Lehmkuehler : Hi, I'm planning to cut our first beta release of 3.0.0. Be aware that the api is supposed to be stable after the release. Are there any objections? Are there any tickets which should be solved before? Andreas

Re: Apache PDFBox Board Report January 2023 due

2023-01-12 Thread Andreas Lehmkuehler
Hi, thanks for the feedback. I've submitted the report as proposed. Andreas Am 11.01.23 um 20:09 schrieb Andreas Lehmkuehler: Hi, find attached a quick draft of the board report we're expected to submit this month. It's based upon the report wizard template which can be found at [1] Any

Apache PDFBox Board Report January 2023 due

2023-01-11 Thread Andreas Lehmkuehler
Hi, find attached a quick draft of the board report we're expected to submit this month. It's based upon the report wizard template which can be found at [1] Any comments or additions are appreciated ... ## Description: The mission of PDFBox is the creation and maintenance of software related

PDFBOx 3.0.0-beta1 release

2023-01-10 Thread Andreas Lehmkuehler
Hi, I'm planning to cut our first beta release of 3.0.0. Be aware that the api is supposed to be stable after the release. Are there any objections? Are there any tickets which should be solved before? Andreas - To

PDFBOX-5538: introduction of a new interface/functional interface to handle a stream cache

2022-11-01 Thread Andreas Lehmkuehler
Hi, the new on demand parser doesn't use the ScratchFileBuffer anymore and my idea was to overhaul/remove the usage of the ScratchFileBuffer for the creation of new COSStreams as well. My plan was to wait for the 4.0 release. A couple of days ago I realize it might be a good idea to introduce

Re: Fwd: Migrating away from Travis-CI

2022-10-28 Thread Andreas Lehmkuehler
Hi, I've removed all Travis-CUI builds, see PDFBOX-5535 Andreas Am 25.10.22 um 21:50 schrieb Andreas Lehmkuehler: Hi, what do you think? Should we convert our travis builds to github actions? Is there any benefit in having those additional builds? Or is it save to rely on our jenkins

Fwd: Migrating away from Travis-CI

2022-10-25 Thread Andreas Lehmkuehler
Hi, what do you think? Should we convert our travis builds to github actions? Is there any benefit in having those additional builds? Or is it save to rely on our jenkins builds only? Andreas Weitergeleitete Nachricht Betreff:Migrating away from Travis-CI Datum:

Re: Apache PDFBox Board Report October 2022 due

2022-10-11 Thread Andreas Lehmkuehler
Hi, thanks for the feedback. I've submitted the report as proposed Andreas Am 09.10.22 um 14:06 schrieb Andreas Lehmkuehler: Hi, find attached a quick draft of the board report we're expected to submit this month. It's based upon the report wizard template which can be found at [1] Any

[REPORT] PDFBox - October 2022

2022-10-11 Thread Andreas Lehmkuehler
## Description: The mission of PDFBox is the creation and maintenance of software related to a Java library for working with PDF documents ## Issues: There are no issues requiring board attention at this time. ## Membership Data: Apache PDFBox was founded 2009-10-21 (13 years ago) There are

Apache PDFBox Board Report October 2022 due

2022-10-09 Thread Andreas Lehmkuehler
Hi, find attached a quick draft of the board report we're expected to submit this month. It's based upon the report wizard template which can be found at [1] Any comments or additions are appreciated ... ## Description: The mission of PDFBox is the creation and maintenance of software related

[ANNOUNCE] Apache PDFBox 2.0.27 released

2022-09-29 Thread Andreas Lehmkuehler
The Apache PDFBox community is pleased to announce the release of Apache PDFBox version 2.0.27. The release is available for download at: https://pdfbox.apache.org/download.html See the full release notes below for details about this release. Release Notes -- Apache PDFBox -- Version 2.0.27

[RESULT][VOTE] Release Apache PDFBox 2.0.27

2022-09-29 Thread Andreas Lehmkuehler
Am 26.09.22 um 17:28 schrieb Andreas Lehmkuehler: Please vote on releasing this package as Apache PDFBox 2.0.27. +1 Tim Allison +1 Tilman Hausherr +1 Maruan Sahyoun +1 Timo Boehme +1 Daniel Persson (non-binding) +1 Andreas Lehmkühler Thanks for your support and help!! I'm

[VOTE] Release Apache PDFBox 2.0.27

2022-09-26 Thread Andreas Lehmkuehler
a candidate for the PDFBox 2.0.27 release is available at: https://dist.apache.org/repos/dist/dev/pdfbox/2.0.27/ The release candidate is a zip archive of the sources in: https://svn.apache.org/repos/asf/pdfbox/tags/2.0.27/ The SHA-512 checksum of the archive is

Re: jdk20 build

2022-09-24 Thread Andreas Lehmkuehler
+1 thanks Andreas Am 24.09.22 um 13:43 schrieb Tilman Hausherr: I've reconfigured the jdk18 build to be a jdk20 build due to the release of jdk19 Tilman - To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org For

Re: Release 2.0.27

2022-09-20 Thread Andreas Lehmkuehler
Am 20.09.22 um 20:23 schrieb Tim Allison: PS thanks for doing the test! Thank _you_ for the confirmation! Onwards! Thanks to both of you for running the test and checking the results. I'm planing to cut the release next Monday if no one objects. Andreas

Re: Release 2.0.27

2022-09-19 Thread Andreas Lehmkuehler
Thanks in advance, whoever will be faster ;-) We are not in a hurry, I'll wait for the results. Andreas Am 19.09.22 um 21:09 schrieb Tim Allison: I should have time tomorrow/Wednesday. Thank you! On Mon, Sep 19, 2022 at 2:30 PM Tilman Hausherr wrote: On 19.09.2022 08:22, Andreas

Re: Release 2.0.27

2022-09-19 Thread Andreas Lehmkuehler
Am 10.09.22 um 17:30 schrieb Andreas Lehmkuehler: Am 08.09.22 um 16:12 schrieb Eloisa Costa: Hello! We're having an issue converting some PDF pages to JPG and opened a ticket with you. The fix was made in a May version (2.0.27-Snapshot) that was not released yet. We would like to know if you

[ANNOUNCE] Apache PDFBox 1.8.17 released

2022-09-15 Thread Andreas Lehmkuehler
The Apache PDFBox community is pleased to announce the release of Apache PDFBox version 1.8.17. The release is available for download at: https://pdfbox.apache.org/download.html See the full release notes below for details about this release. Release Notes -- Apache PDFBox -- Version 1.8.17

[RESULT][VOTE] Release Apache PDFBox 1.8.17

2022-09-15 Thread Andreas Lehmkuehler
Am 12.09.22 um 18:50 schrieb Andreas Lehmkuehler: Please vote on releasing this package as Apache PDFBox 1.8.17. +1 Tilman Hausherr +1 Timo Boehme +1 Maruan Sahyoun +1 Andreas Lehmkühler Thanks for your support and help!! I'm going to push the release out. Andreas

Re: [VOTE] Release Apache PDFBox 1.8.17

2022-09-13 Thread Andreas Lehmkuehler
Andreas Lehmkuehler: a candidate for the PDFBox 1.8.17 release is available at: https://dist.apache.org/repos/dist/dev/pdfbox/1.8.17/ The release candidate is a zip archive of the sources in: https://svn.apache.org/repos/asf/pdfbox/tags/1.8.17/ The SHA-512 checksum of the archive

[VOTE] Release Apache PDFBox 1.8.17

2022-09-12 Thread Andreas Lehmkuehler
a candidate for the PDFBox 1.8.17 release is available at: https://dist.apache.org/repos/dist/dev/pdfbox/1.8.17/ The release candidate is a zip archive of the sources in: https://svn.apache.org/repos/asf/pdfbox/tags/1.8.17/ The SHA-512 checksum of the archive is

Re: Release 2.0.27

2022-09-10 Thread Andreas Lehmkuehler
Am 08.09.22 um 16:12 schrieb Eloisa Costa: Hello! We're having an issue converting some PDF pages to JPG and opened a ticket with you. The fix was made in a May version (2.0.27-Snapshot) that was not released yet. We would like to know if you have any release date for the new version, as we use

New PDFBox 1.8.17

2022-09-10 Thread Andreas Lehmkuehler
Hi, Tilman asked me to cut a 1.8.17 to support our friends from Apache TIKA [1] I'm going to do so next Monday in 2 days from now if nobody objects. Cheers Andreas [1] https://issues.apache.org/jira/browse/PDFBOX-5501 - To

Re: Replace methods using an InputStream from Loader.loadPDF

2022-08-01 Thread Andreas Lehmkuehler
Am 01.08.22 um 20:20 schrieb Tilman Hausherr: +1 but - the explanation below (when to use which class) should be in the javadoc - the removal should be in the migration guide It is already on my TODO list Andreas Tilman Am 31.07.2022 um 15:18 schrieb Andreas Lehmkuehler: Hi fellow devs

Replace methods using an InputStream from Loader.loadPDF

2022-07-31 Thread Andreas Lehmkuehler
Hi fellow devs, there was a discussion on JIRA [1] about the changed behaviour of the parser due to the removal of the ScratchFileBuffer when reading a pdf. Additionally there was the post "High memory usage with pdfbox 3" on users@pdfbox targeting the very same topic After explaining

Apache PDFBox Board Report July 2022 due

2022-07-11 Thread Andreas Lehmkuehler
Hi, find attached a quick draft of the board report we're expected to submit this month. It's based upon the report wizard template which can be found at [1] Any comments or additions are appreciated ... P.S.: I'm going to add some private comment about the issue we've discussed on private@

Re: wrong commit msg in jbig2

2022-07-11 Thread Andreas Lehmkuehler
Am 11.07.22 um 19:21 schrieb Tilman Hausherr: There's a wrong commit message and I can't use     git commit --amend -m "PDFBOX-4892: update rat"     git push --force-with-lease because I get "Rewinding refs/heads/master is forbidden" and "[remote rejected] master -> master (pre-receive hook

Re: New PDFBox 3.0.0 release

2022-06-18 Thread Andreas Lehmkuehler
Am 14.06.22 um 10:20 schrieb Emmeran Seehuber: Hi Andreas, Am 14.06.2022 um 08:19 schrieb Andreas Lehmkuehler : Hi, looks like it is time for another 3.0.0 release of PDFBox. Depending on the outcome of the next regression test I'd like to cut the next 3.0.0 release. Should we target

Re: text extraction regression tests for 3.x?

2022-06-16 Thread Andreas Lehmkuehler
g this now. Y. I'll kick off the tests tomorrow morning (ET). On Sat, Jun 11, 2022 at 8:09 AM Andreas Lehmkuehler wrote: I've fixed PDFBOX-5452 and found/fixed another one, see PDFBOX-5456 @Tim is there any chance to rerun the regression tests? Thanks in advance Andreas Am 07.06.22 um 08:

Re: text extraction regression tests for 3.x?

2022-06-16 Thread Andreas Lehmkuehler
PM Tim Allison wrote: Just seeing this now. Y. I'll kick off the tests tomorrow morning (ET). On Sat, Jun 11, 2022 at 8:09 AM Andreas Lehmkuehler wrote: I've fixed PDFBOX-5452 and found/fixed another one, see PDFBOX-5456 @Tim is there any chance to rerun the regression tests? Thanks

New PDFBox 3.0.0 release

2022-06-14 Thread Andreas Lehmkuehler
Hi, looks like it is time for another 3.0.0 release of PDFBox. Depending on the outcome of the next regression test I'd like to cut the next 3.0.0 release. Should we target another alpha or maybe the first beta? Or are is it time for a stable 3.0.0 PDFBox release already? WDYT? Do you have

Re: text extraction regression tests for 3.x?

2022-06-11 Thread Andreas Lehmkuehler
I've fixed PDFBOX-5452 and found/fixed another one, see PDFBOX-5456 @Tim is there any chance to rerun the regression tests? Thanks in advance Andreas Am 07.06.22 um 08:06 schrieb Andreas Lehmkuehler: I've found another regression, see PDFBOX-5452 Andreas Am 29.05.22 um 18:37 schrieb Andreas

Re: text extraction regression tests for 3.x?

2022-06-07 Thread Andreas Lehmkuehler
I've found another regression, see PDFBOX-5452 Andreas Am 29.05.22 um 18:37 schrieb Andreas Lehmkuehler: Thanks Tim, looks like there are some regressions, see PDFBOX-5444 and PDFBOX-5447. Maybe there are more to come Andreas Am 26.05.22 um 15:04 schrieb Tim Allison: Apologies

Re: FoxitDingbats

2022-06-01 Thread Andreas Lehmkuehler
Am 01.06.22 um 16:09 schrieb Tilman Hausherr: There's a Zapf Dingbats replacement FoxitDingbats with a nice license: https://github.com/mozilla/pdf.js/blob/master/external/standard_fonts/FoxitDingbats.pfb https://github.com/mozilla/pdf.js/blob/master/external/standard_fonts/LICENSE_FOXIT

Re: text extraction regression tests for 3.x?

2022-05-29 Thread Andreas Lehmkuehler
/base/reports/reports_pdfbox_3x_20220512.tgz Happy to rerun with a more recent version of trunk. Cheers, Tim On Sun, May 8, 2022 at 1:21 PM Andreas Lehmkuehler wrote: Am 06.05.22 um 14:30 schrieb Tim Allison: All, Let me know when makes sense to run the text extraction regression

Re: [ANNOUNCE] Apache PDFBox 3.0.0-alpha2 released

2022-05-08 Thread Andreas Lehmkuehler
. Everything seems work fine. Best regards Emmeran Am 05.05.2022 um 19:46 schrieb Andreas Lehmkuehler : The Apache PDFBox community is pleased to announce the release of the third alpha release for Apache PDFBox 3.0.0. It is available for download at: https://pdfbox.apache.org/download.html

Re: text extraction regression tests for 3.x?

2022-05-08 Thread Andreas Lehmkuehler
Am 06.05.22 um 14:30 schrieb Tim Allison: All, Let me know when makes sense to run the text extraction regression Yes, it'd be useful to have some update results. How about comparing 2.0.26 vs 3.0.0-alpha3 and maybe 3.0.0-alpha2 vs. 3.0.0-alpha3? tests for 3.x. I regret I haven't been

Re: [ANNOUNCE] Apache PDFBox 3.0.0-alpha2 released

2022-05-05 Thread Andreas Lehmkuehler
Sorry for the confusion in the subject. It is the new alpha3 release. Andreas Am 05.05.22 um 19:46 schrieb Andreas Lehmkuehler: The Apache PDFBox community is pleased to announce the release of the third alpha release for Apache PDFBox 3.0.0. It is available for download at: https

[ANNOUNCE] Apache PDFBox 3.0.0-alpha2 released

2022-05-05 Thread Andreas Lehmkuehler
The Apache PDFBox community is pleased to announce the release of the third alpha release for Apache PDFBox 3.0.0. It is available for download at: https://pdfbox.apache.org/download.html The Apache PDFBox library is an open source Java tool for working with PDF documents. This is the third

[RESULT][VOTE] Release Apache PDFBox 3.0.0-alpha3

2022-05-05 Thread Andreas Lehmkuehler
Am 02.05.22 um 19:27 schrieb Andreas Lehmkuehler: Please vote on releasing this package as Apache PDFBox 3.0.0-alpha3. +1 Tilman Hausherr +1 Maruan Sahyoun +1 Andreas Lehmkühler Thanks for your support and help!! I'm going to push the release out. Andreas

Re: Jenkins build became unstable: PDFBox » PDFBox-trunk #1287

2022-05-05 Thread Andreas Lehmkuehler
I somehow missed that issue when running the tests localy :-( I'll have a look or revert the changes Andreas Am 05.05.22 um 09:12 schrieb Apache Jenkins Server: See

[VOTE] Release Apache PDFBox 3.0.0-alpha3

2022-05-02 Thread Andreas Lehmkuehler
Hi, a candidate for the PDFBox 3.0.0-alpha3 release is available at: https://dist.apache.org/repos/dist/dev/pdfbox/3.0.0-alpha3/ The release candidate is a zip archive of the sources in: http://svn.apache.org/repos/asf/pdfbox/tags/3.0.0-alpha3/ The SHA-512 checksum of the archive is

Re: New PDFBox 3.0.0 alpha release

2022-05-01 Thread Andreas Lehmkuehler
Am 25.04.22 um 07:53 schrieb Andreas Lehmkuehler: Hi, I'm planning to cut another alpha release for PDFBox 3.0.0 in a week from now on next Monday. I'm going to cut the next alpha release tomorrow, approx. about 24 hours from now. Andreas Any objections? Andreas

New PDFBox 3.0.0 alpha release

2022-04-24 Thread Andreas Lehmkuehler
Hi, I'm planning to cut another alpha release for PDFBox 3.0.0 in a week from now on next Monday. Any objections? Andreas - To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org For additional commands, e-mail:

Sonar build and failing tests

2022-04-24 Thread Andreas Lehmkuehler
Hi, does anyone know what happens with the sonar build? 60 of the tests are failing throwing an NPE. All test using org.apache.pdfbox.rendering.PDFRenderer.PDFRenderer(PDDocument) are affected. It looks like the NPE is thrown when creating an instance of PDFRenderer. Is this maybe related

[ANNOUNCE] Apache PDFBox 2.0.26 released

2022-04-21 Thread Andreas Lehmkuehler
The Apache PDFBox community is pleased to announce the release of Apache PDFBox version 2.0.26. The release is available for download at: https://pdfbox.apache.org/download.html See the full release notes below for details about this release. Release Notes -- Apache PDFBox -- Version 2.0.26

[RESULT][VOTE] Release Apache PDFBox 2.0.26

2022-04-21 Thread Andreas Lehmkuehler
Am 18.04.22 um 13:14 schrieb Andreas Lehmkuehler: Please vote on releasing this package as Apache PDFBox 2.0.26. +1 Tilman Hausherr +1 Maruan Sahyoun +1 Tim Allison +1 Andreas Lehmkühler Thanks for your support and help!! I'm going to push the release out. Andreas

[VOTE] Release Apache PDFBox 2.0.26

2022-04-18 Thread Andreas Lehmkuehler
Hi, a candidate for the PDFBox 2.0.26 release is available at: https://dist.apache.org/repos/dist/dev/pdfbox/2.0.26/ The release candidate is a zip archive of the sources in: http://svn.apache.org/repos/asf/pdfbox/tags/2.0.26/ The SHA-512 checksum of the archive is

Re: 2.0.26 release

2022-04-17 Thread Andreas Lehmkuehler
Am 17.04.22 um 20:25 schrieb Tilman Hausherr: new regression tests results at https://home.snafu.de/tilman/tmp/reports_pdfbox_2.0.25_vs_2.0.26.tar.xz IMHO we're fine now! Thanks for the fast re-test! I'm going to cut the 2.0.26 release tomorrow Andreas Tilman

Re: 2.0.26 release

2022-04-14 Thread Andreas Lehmkuehler
need investigation. commoncrawl3/7L/7LRS5U6CAFMN2P6JPTZVNBUW6XOFYH4M govdocs1/365/365260.pdf commoncrawl3/HO/HOAZTST4E26NPA7HL72WCIVMNRQ3E4M5 govdocs1/150/150282.pdf Tilman Am 12.04.2022 um 08:09 schrieb Andreas Lehmkuehler: Thanks Tim! Looks like there are 5 new exceptions left. I'm going

Re: 2.0.26 release

2022-04-12 Thread Andreas Lehmkuehler
://corpora.tika.apache.org/base/reports/tika-2.4-20220410.tgz Haven't had a chance to review. Hot off the vm. On Sun, Apr 10, 2022 at 9:58 AM Tim Allison wrote: Will try to kick off today…first thing Monday morning (EDT) at the latest. On Sun, Apr 10, 2022 at 9:05 AM Andreas Lehmkuehler

Re: 2.0.26 release

2022-04-10 Thread Andreas Lehmkuehler
Am 09.04.22 um 19:00 schrieb Tilman Hausherr: testFlattenPDFBOX2469Filled also fails in 2.0 (it is disabled by default). I've fixed all new tickets. PDFBOX-5413 fixes the issue with the disabled flatten test. @Tim Is there any chance to re-run the tests? Andreas

Apache PDFBox Board Report April 2022 due

2022-04-10 Thread Andreas Lehmkuehler
Hi, find attached a quick draft of the board report we're expected to submit this month. It's based upon the report wizard template which can be found at [1] Any comments or additions are appreciated ... ## Description: The mission of PDFBox is the creation and maintenance of software related

Re: 2.0.26 release

2022-04-07 Thread Andreas Lehmkuehler
. On Thu, Apr 7, 2022 at 9:07 AM Andreas Lehmkühler wrote: Yes, please Thanks in advance Andreas 07.04.2022 11:44:38 Tim Allison : Sounds great! Should I rerun the regression tests today? On Thu, Apr 7, 2022 at 1:41 AM Andreas Lehmkuehler wrote: Hi, sorry for the delay. I'm planning

2.0.26 release

2022-04-06 Thread Andreas Lehmkuehler
Hi, sorry for the delay. I'm planning to cut the 2.0.26 release next Saturday, the day after tomorrow, if nobody objects. Andreas P.S.: I'm targeting a new 3.0.0 alpha release once the 2.0.26 release is out - To

Re: 2.0.26 release? WAS: JBIG2 3.0.4 release?

2022-03-23 Thread Andreas Lehmkuehler
on a few files, but this was run on ~800k PDFs. There are a couple of cases where a file is now being detected as rfc822 instead of PDF.  We have to fix that on the Tika side. On Mon, Mar 21, 2022 at 12:53 PM Andreas Lehmkuehler wrote: Am 21.03.22 um 12:21 schrieb Tim Allison: I'm happy to run

Re: 2.0.26 release? WAS: JBIG2 3.0.4 release?

2022-03-21 Thread Andreas Lehmkuehler
Am 21.03.22 um 12:21 schrieb Tim Allison: I'm happy to run the tests today if that would be of any interest. Yes, please. TIA Andreas On Sun, Mar 20, 2022 at 5:01 PM Andreas Lehmkuehler wrote: Am 13.03.22 um 14:20 schrieb Tim Allison: From Tika's perspective, there's no rush. We're

Re: 2.0.26 release? WAS: JBIG2 3.0.4 release?

2022-03-20 Thread Andreas Lehmkuehler
aren't related to text extraction. Those which are related should decrease the number of exceptions and increase the accuracy. WDYT? Thank you, all! Cheers, Tim On Sat, Mar 12, 2022 at 5:29 AM Andreas Lehmkuehler wrote: Am 11.03.22 um 08:30 schrieb Tilman Hausherr: Am

Re: 2.0.26 release? WAS: JBIG2 3.0.4 release?

2022-03-13 Thread Andreas Lehmkuehler
um 19:05 schrieb Andreas Lehmkuehler: Am 09.03.22 um 17:07 schrieb Tim Allison: All, I've been out of the office for a bit and haven't caught up yet. Apologies if I've missed the discussion. Are there plans for a 2.0.26 release?  We're probably a few weeks out How about cutting the release

Re: COSBase, avoid to have the same hashCode for different objects holding the same value

2022-03-12 Thread Andreas Lehmkuehler
n Mar 5, 2022, at 10:30 AM, Andreas Lehmkuehler mailto:andr...@lehmi.de>> wrote: Hi, I'm not sure if we dicussed that topic in the past or if I simply mixed it up with a discussion about "equals" and "=" However, PDFBOX-5286 shows the we have an issue with objects which

Re: 2.0.26 release? WAS: JBIG2 3.0.4 release?

2022-03-12 Thread Andreas Lehmkuehler
Am 11.03.22 um 08:30 schrieb Tilman Hausherr: Am 11.03.2022 um 08:19 schrieb Andreas Lehmkuehler: Am 10.03.22 um 20:16 schrieb Tilman Hausherr: I'd agree but that might mean PDFBOX-5384 wouldn't be fixed. It's there for quite some time and it seems to be a seldom corner case. IMHO it can wait

Re: Suspected bug in and proposed fix for ToUnicodeWriter.writeTo

2022-03-12 Thread Andreas Lehmkuehler
Hi, Am 11.03.22 um 21:49 schrieb Ryan Jackson: Dear Apache Devs: I believe that I have identified a bug in the creation of the (begin/end)bfrange operator used when embedding fonts with the PDCIDFontType2Embedder class. The bug exists (as best I can tell) in both the main trunk and in the 2.0

Re: 2.0.26 release? WAS: JBIG2 3.0.4 release?

2022-03-10 Thread Andreas Lehmkuehler
schrieb Andreas Lehmkuehler: Am 09.03.22 um 17:07 schrieb Tim Allison: All, I've been out of the office for a bit and haven't caught up yet. Apologies if I've missed the discussion. Are there plans for a 2.0.26 release?  We're probably a few weeks out How about cutting the release next Monday

Re: 2.0.26 release? WAS: JBIG2 3.0.4 release?

2022-03-10 Thread Andreas Lehmkuehler
our next 1.x and 2.x releases on Tika, and it would be great to incorporate 2.0.26. No problem at all if 2.0.26 is slated for later. Thank you! Cheers, Tim On Fri, Mar 4, 2022 at 10:46 PM Tilman Hausherr wrote: Am 24.02.2022 um 07:41 schrieb Andreas Lehmkuehler: Am 22.02.22 um 07:49

COSBase, avoid to have the same hashCode for different objects holding the same value

2022-03-05 Thread Andreas Lehmkuehler
Hi, I'm not sure if we dicussed that topic in the past or if I simply mixed it up with a discussion about "equals" and "=" However, PDFBOX-5286 shows the we have an issue with objects which aren't the same but are treated as the same because of the same hash. This is true for all simple

[ANNOUNCE] Apache PDFBox JBIG2 ImageIO plugin 3.0.4 released

2022-03-01 Thread Andreas Lehmkuehler
The Apache PDFBox community is pleased to announce the release of Apache PDFBox JBIG2 ImageIO plugin version 3.0.4. The release is available for download at: https://pdfbox.apache.org/download.cgi See the full release notes below for details about this release. Release Notes -- Apache JBIG2

[RESULT][VOTE] Release Apache PDFBox JBIG2 ImageIO 3.0.4

2022-03-01 Thread Andreas Lehmkuehler
Am 26.02.22 um 16:38 schrieb Andreas Lehmkuehler: Please vote on releasing this package as Apache PDFBox JBIG2 ImageIO 3.0.4. +1 Tilman Hausherr +1 Maruan Sahyoun +1 Andreas Lehmkühler Thanks for your support and help!! I'm going to push the release out. Andreas

Re: [VOTE] Release Apache PDFBox JBIG2 ImageIO 3.0.4

2022-02-28 Thread Andreas Lehmkuehler
Just a friendly reminder. Is there anyone who can spare some cycles to check the release? There are round about 20 hours left Thanks in advance Andreas Am 26.02.22 um 16:38 schrieb Andreas Lehmkuehler: Hi, a candidate for the Apache PDFBox JBIG2 ImageIO 3.0.4 release is available

[VOTE] Release Apache PDFBox JBIG2 ImageIO 3.0.4

2022-02-26 Thread Andreas Lehmkuehler
Hi, a candidate for the Apache PDFBox JBIG2 ImageIO 3.0.4 release is available at: https://dist.apache.org/repos/dist/dev/pdfbox/jbig2-imageio/3.0.4/ The release candidate is a zip archive of the sources in: https://github.com/apache/pdfbox-jbig2/tree/3.0.4/ The SHA-512 checksum of

Re: JBIG2 3.0.4 release?

2022-02-23 Thread Andreas Lehmkuehler
Am 22.02.22 um 07:49 schrieb Andreas Lehmkuehler: Hi, I'm planning to cut a new JBIG2 release next week. There aren't that much changes but I think the fixes are worth to be released. [1] I'm going to cut the release next weekend, if nobody objects. Once it is done we should think about

JBIG2 3.0.4 release?

2022-02-21 Thread Andreas Lehmkuehler
Hi, I'm planning to cut a new JBIG2 release next week. There aren't that much changes but I think the fixes are worth to be released. [1] WDYT? Andreas [1]

Re: Apache PDFBox Board Report January 2022 due

2022-01-11 Thread Andreas Lehmkuehler
Hi, thanks for the feedback. I've submitted the report as proposed. Andreas Am 09.01.22 um 14:19 schrieb Andreas Lehmkuehler: Hi, find attached a quick draft of the board report we're expected to submit this month. It's based upon the report wizard template which can be found at [1] Any

[REPORT] PDFBox - January 2022

2022-01-11 Thread Andreas Lehmkuehler
## Description: The mission of PDFBox is the creation and maintenance of software related to Java library for working with PDF documents ## Issues: There are no issues requiring board attention at this time. ## Membership Data: Apache PDFBox was founded 2009-10-21 (12 years ago) There are

Apache PDFBox Board Report January 2022 due

2022-01-09 Thread Andreas Lehmkuehler
Hi, find attached a quick draft of the board report we're expected to submit this month. It's based upon the report wizard template which can be found at [1] Any comments or additions are appreciated ... ## Description: The mission of PDFBox is the creation and maintenance of software

2.0.25 javadocs

2021-12-20 Thread Andreas Lehmkuehler
Hi, I've missed to create and upload the javadocs for 2.0.25 to maven central. :-( I've put a reminder on my checklist. Please double check if javadocs are present next time we prepare a new release. Thanks in advance! Andreas

Re: [VOTE] Release Apache PDFBox 2.0.25

2021-12-16 Thread Andreas Lehmkuehler
e, it is a corner case and already fixed :-) Tilman Tilman Am 13.12.2021 um 20:02 schrieb Andreas Lehmkuehler: Hi, a candidate for the PDFBox 2.0.25 release is available at:     https://dist.apache.org/repos/dist/dev/pdfbox/2.0.25/ The release candidate is a zip archive of the sources in:   

  1   2   3   4   5   6   7   8   9   10   >