Re: Unexpected behavior when inspecting mp4 files with different ISO

2024-04-29 Thread Mauler, David
Would you like me to send them to you? Dave

Re: Unexpected behavior when inspecting mp4 files with different ISO

2024-04-29 Thread Tim Allison
I had forgotten about this, too: https://github.com/apache/tika/blob/777543d0ac2051bc2dce7b719a22c94019919ffb/tika-parsers/tika-parsers-standard/tika-parsers-standard-modules/tika-parser-audiovideo-module/src/main/java/org/apache/tika/parser/mp4/MP4Parser.java#L253 We try to update the media type

Re: Unexpected behavior when inspecting mp4 files with different ISO

2024-04-29 Thread Tim Allison
I agree with Nick. You can better understand the magic-based algorithms we're using for detection by searching for mp4 and quicktime in this file: https://github.com/apache/tika/blob/main/tika-core/src/main/resources/org/apache/tika/mime/tika-mimetypes.xml A middle ground is to have the MP4 parse

Re: Unexpected behavior when inspecting mp4 files with different ISO

2024-04-27 Thread Nick Burch
On Fri, 26 Apr 2024, Mauler, David wrote: I'm in the process of troubleshooting an issue with certain mp4 video files and tika. After a bunch of digging, it appears to be related to whatever ISO is set for the mp4 file. An mp4 with an ISO of 14496-12:2003 will be detected as video/quicktime but

Unexpected behavior when inspecting mp4 files with different ISO

2024-04-26 Thread Mauler, David
Hello, I'm in the process of troubleshooting an issue with certain mp4 video files and tika. After a bunch of digging, it appears to be related to whatever ISO is set for the mp4 file. An mp4 with an ISO of 14496-12:2003 will be detected as video/quicktime but an mp4 with an ISO of 14496-14 is
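
For readers following the thread, here is a minimal sketch of how to check which brand an MP4 file declares. It assumes that the "ISO" value in the original message corresponds to the major brand in the file's leading `ftyp` box (for example `isom` or `mp42`). The thread does not confirm this. The names `read_ftyp` and `Ftyp` are hypothetical and are not part of Tika. The sketch only reads the box; it does not reproduce Tika's detection rules, which are defined in tika-mimetypes.xml.

```python
import struct
from typing import List, NamedTuple, Optional


class Ftyp(NamedTuple):
    """Contents of an ISO base media 'ftyp' box (hypothetical helper type)."""
    major_brand: str
    minor_version: int
    compatible_brands: List[str]


def read_ftyp(data: bytes) -> Optional[Ftyp]:
    """Parse a leading 'ftyp' box from the first bytes of a file.

    Returns None if the data does not start with an 'ftyp' box.
    """
    if len(data) < 16:
        return None
    # Box header: 32-bit big-endian size, then a 4-byte type code.
    size, box_type = struct.unpack(">I4s", data[:8])
    if box_type != b"ftyp":
        return None
    header = 8
    if size == 1:
        # A size of 1 means a 64-bit "largesize" field follows the type.
        if len(data) < 24:
            return None
        size = struct.unpack(">Q", data[8:16])[0]
        header = 16
    elif size == 0:
        # A size of 0 means the box extends to the end of the data.
        size = len(data)
    end = min(size, len(data))
    if end < header + 8:
        return None
    major = data[header:header + 4].decode("latin-1")
    minor = struct.unpack(">I", data[header + 4:header + 8])[0]
    # The remainder of the box is a list of 4-byte compatible brands.
    brands = [
        data[i:i + 4].decode("latin-1")
        for i in range(header + 8, end - 3, 4)
    ]
    return Ftyp(major, minor, brands)


# Synthetic 24-byte ftyp box, used only to illustrate the layout.
sample = struct.pack(">I4s4sI", 24, b"ftyp", b"isom", 512) + b"isom" + b"mp42"
info = read_ftyp(sample)
```

To inspect a real file, pass the first few dozen bytes, for example `read_ftyp(open(path, "rb").read(64))`, and compare the major brand of a file detected as video/quicktime with one detected as video/mp4.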