Re: [RESULT][VOTE] Release Apache Tika 1.28.4 Candidate #1

2022-06-16 Thread Tim Allison
I updated the site, but the cdn hasn't picked up the release yet. I'll wait for that to happen before announcing. On Thu, Jun 16, 2022 at 2:46 PM Tim Allison wrote: > The vote has passed with 4 PMC +1s and no -1s. > > +1 > Tim Allison > Konstantin Gribov > Tilman Hausherr > Oleg Tikhonov > > I'

[RESULT][VOTE] Release Apache Tika 1.28.4 Candidate #1

2022-06-16 Thread Tim Allison
The vote has passed with 4 PMC +1s and no -1s. +1 Tim Allison Konstantin Gribov Tilman Hausherr Oleg Tikhonov I'll make the release, update the website and make the announcement shortly, probably in coordination with the 2.4.1 release (if the vote passes) tomorrow. Thank you, all! Best,

[jira] [Comment Edited] (TIKA-3479) UniversalCharsetDetector in 2.x is misidentifying windows-1250 as ISO-8859-1

2022-06-16 Thread Ostico (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3479?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17555198#comment-17555198 ] Ostico edited comment on TIKA-3479 at 6/16/22 5:19 PM: --- Maybe, a bet

[jira] [Comment Edited] (TIKA-3479) UniversalCharsetDetector in 2.x is misidentifying windows-1250 as ISO-8859-1

2022-06-16 Thread Ostico (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3479?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17555198#comment-17555198 ] Ostico edited comment on TIKA-3479 at 6/16/22 5:19 PM: --- Maybe, a bet

[jira] [Comment Edited] (TIKA-3479) UniversalCharsetDetector in 2.x is misidentifying windows-1250 as ISO-8859-1

2022-06-16 Thread Ostico (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3479?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17555198#comment-17555198 ] Ostico edited comment on TIKA-3479 at 6/16/22 5:14 PM: --- Maybe, a bet

[jira] [Comment Edited] (TIKA-3479) UniversalCharsetDetector in 2.x is misidentifying windows-1250 as ISO-8859-1

2022-06-16 Thread Ostico (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3479?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17555198#comment-17555198 ] Ostico edited comment on TIKA-3479 at 6/16/22 5:12 PM: --- Maybe, a bet

[jira] [Comment Edited] (TIKA-3479) UniversalCharsetDetector in 2.x is misidentifying windows-1250 as ISO-8859-1

2022-06-16 Thread Ostico (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3479?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17555198#comment-17555198 ] Ostico edited comment on TIKA-3479 at 6/16/22 5:03 PM: --- Maybe, a bet

[jira] [Comment Edited] (TIKA-3479) UniversalCharsetDetector in 2.x is misidentifying windows-1250 as ISO-8859-1

2022-06-16 Thread Ostico (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3479?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17555198#comment-17555198 ] Ostico edited comment on TIKA-3479 at 6/16/22 5:03 PM: --- Maybe, a bet

[jira] [Comment Edited] (TIKA-3479) UniversalCharsetDetector in 2.x is misidentifying windows-1250 as ISO-8859-1

2022-06-16 Thread Ostico (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3479?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17555198#comment-17555198 ] Ostico edited comment on TIKA-3479 at 6/16/22 5:00 PM: --- Maybe, a bet

[jira] [Comment Edited] (TIKA-3479) UniversalCharsetDetector in 2.x is misidentifying windows-1250 as ISO-8859-1

2022-06-16 Thread Ostico (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3479?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17555198#comment-17555198 ] Ostico edited comment on TIKA-3479 at 6/16/22 4:56 PM: --- Maybe, a bet

[jira] [Commented] (TIKA-3479) UniversalCharsetDetector in 2.x is misidentifying windows-1250 as ISO-8859-1

2022-06-16 Thread Ostico (Jira)
[ https://issues.apache.org/jira/browse/TIKA-3479?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17555198#comment-17555198 ] Ostico commented on TIKA-3479: -- Maybe, a better implementation could be to exclude characters

Re: [VOTE] Release Apache Tika 1.28.4 Candidate #1

2022-06-16 Thread Oleg Tikhonov
Hey, [x] +1 Release this package as Apache Tika 1.28.4 Java 8, ubuntu 20, basic stuff. Thanks, Oleg On Thu, Jun 16, 2022, 17:42 Konstantin Gribov wrote: > Built successfully on ArchLinux, OpenJDK 11 & 17 (Temurin-11.0.15+10 & > 17.0.3+7) w/ Tesseract 5.1.0, Leptonica 1.82. > The issue with the

Re: [VOTE] Release Apache Tika 2.4.1 Candidate #1

2022-06-16 Thread Konstantin Gribov
Built successfully on ArchLinux, OpenJDK 11 & 17 (Temurin-11.0.15+10 & 17.0.3+7) w/ Tesseract 5.1.0, Leptonica 1.82. The issue with the tesseract multipage test is still the same, it extracts "Page?2" instead of "Page 2" on my laptop. GPG signatures and SHA512 hashes are fine. [x] +1 Release this

Re: [VOTE] Release Apache Tika 1.28.4 Candidate #1

2022-06-16 Thread Konstantin Gribov
Built successfully on ArchLinux, OpenJDK 11 & 17 (Temurin-11.0.15+10 & 17.0.3+7) w/ Tesseract 5.1.0, Leptonica 1.82. The issue with the tesseract multipage test is still the same, it extracts "Page?2" instead of "Page 2" on my laptop. GPG signatures and SHA512 hashes are fine. [x] +1 Release this