Re: ForkParser issues with 2.3.0

2022-04-26 Thread Nick Burch
On Tue, 26 Apr 2022, Stephen H wrote: On 26/04/2022 12:22, Nick Burch wrote: Are you able to write a short junit unit test case which shows this issue? We have a bunch of small test OOXML and ODF files that could be used I've done this - if I create an issue in Jira with it would that be best?

Re: ForkParser issues with 2.3.0

2022-04-26 Thread Stephen H
On 26/04/2022 12:22, Nick Burch wrote: Are you able to write a short junit unit test case which shows this issue? We have a bunch of small test OOXML and ODF files that could be used I've done this - if I create an issue in Jira with it would that be best? There isn't currently an ODS file in

[VOTE] Release Apache Tika 1.28.2 Candidate #1

2022-04-26 Thread Tim Allison
A candidate for the Tika 1.28.2 release is available at: https://dist.apache.org/repos/dist/dev/tika/1.28.2 The release candidate is a zip archive of the sources in: https://github.com/apache/tika/tree/tika-1.28.2-rc1/ The SHA-512 checksum of the archive is

Re: ForkParser issues with 2.3.0

2022-04-26 Thread Nick Burch
On Tue, 26 Apr 2022, Stephen H wrote: Second, there seems to be some work missing in the handling of metadata from certain parsers when using ForkParser. For example, for OpenDocument ODP and ODS files and Microsoft Open XML formats, while the document text is returned there is no metadata in