2017-10-19 17:22 GMT+02:00 Aoife Fitzmaurice <[email protected]>:
>
>> Does it make sense to use the signature filter on two files with very 
>> different length?
>
> The purpose for which I am using the script is to compare two files for 
> similar content, the shorter of which I believe is a partial dupe of the 
> longer.
>
> Andrew Weaver has written an interesting blog on the subject: 
> https://ndsr.americanarchive.org/2017/04/20/adventures-in-perceptual-hashing/

(Given that kind people provide current binaries, the link to the
extremely complicated build page does not seem useful.)

> Up until now I've been generating and comparing checksums, but the perceptual 
> hashing script would definitely enhance our archival workflow.

Allow me to repeat my question (the blog is a little long and, as said
above, its links do not make it more credible):
Does it, in general, make sense to compare two files of (very)
different length with perceptual hashing?
You have definitely found a bug, but I would still like to know whether
the issue you reported happens in a real-world use case or only in
pathological cases.

In general, out-of-memory errors can happen with FFmpeg in real-world
use cases, so while we try to avoid huge memory allocations, they may
not always be avoidable (although, as far as I can tell, the allocation
in your case is avoidable).

Carl Eugen
_______________________________________________
ffmpeg-user mailing list
[email protected]
http://ffmpeg.org/mailman/listinfo/ffmpeg-user

To unsubscribe, visit link above, or email
[email protected] with subject "unsubscribe".
