Re: Tika incorrectly detecting Canon raw image file .cr3 as video/quicktime

2023-03-22 Thread Nick Burch
On Wed, 22 Mar 2023, Tim Allison wrote: Thank you, Richard, for raising this. In looking at these file formats, it looks like crw is based on ciff, cr2 is based on tiff and cr3 is based on quicktime. Always fun when the core of a format (or at least the container) swaps between versions! F

Re: Tika incorrectly detecting Canon raw image file .cr3 as video/quicktime

2023-03-22 Thread Richard Toolan
Thanks Tim for the explanation, I’ll register for an ASF Jira account and attach the CR3 file there, I’ll also try to source some CRW files if we have any. All the best, Richard From: Tim Allison Date: Wednesday, 22 March 2023 at 14:27 To: user@tika.apache.org Subject: Re: Tika incorrectly d

Re: Tika incorrectly detecting Canon raw image file .cr3 as video/quicktime

2023-03-22 Thread Tim Allison
Let's move discussion here: https://issues.apache.org/jira/browse/TIKA-3991 @Richard, if you'd like access to our JIRA, see: https://selfserve.apache.org/jira-account.html On Wed, Mar 22, 2023 at 10:22 AM Tim Allison wrote: > > Thank you, Richard, for raising this. In looking at these file > fo

Re: Tika incorrectly detecting Canon raw image file .cr3 as video/quicktime

2023-03-22 Thread Tim Allison
Thank you, Richard, for raising this. In looking at these file formats, it looks like crw is based on ciff, cr2 is based on tiff and cr3 is based on quicktime. For some file formats we do, application/x-this-app; version=1.0, application/x-thisapp; version=2.0. For others, we create separate mai

Tika incorrectly detecting Canon raw image file .cr3 as video/quicktime

2023-03-22 Thread Richard Toolan
Hello, We’ve noticed that Tika is incorrectly detecting the file .cr3 as video/quicktime, other raw files are detected as image/tiff (including the .cr3’s predecessor the .cr2). I’ve uploaded a sample file here https://dropfiles.org/j8CS4Snr (that was taken from this review https://www.photogr