audiomuze;574641 Wrote: 
> 
> Would it not be possible to limit the md5 hash to say 1000 bytes or
> something similarly small if you read forward from the midpoint of the
> audio portion of the file, regardless of file format?
> 
Yes possibly, Andy plans to try if that works better so we will know as
soon as he have implemented a new version of the Audio::Scan module that
supports this which people can try.

A possible issue is that we are talking about compressed data which
means that the compression algorithm might cause problems. I don't have
any detailed knowledge about this but I suspect the real data might be
stored in the beginning of the file and the later part of the file
might just be instructions at which points to insert the different data
sections when uncompressing. If you know about compression algorithms,
you know that most of them try to store a common data section once and
have a list of all occurrences of that section in the uncompressed
file. Of course, the list of pointers might be as good as a real data
section from a checksum perspective. 

The 0.2 version combines the MD5 checksum with the number of compressed
audio bytes in the file and this made it a lot better than the previous
approach which only used the MD5 checksum. In the result "Duplicates"
shows the files that have both the same checksum and the same number of
compressed audio bytes. The "Incorrect duplicates" is the list of files
that have the same checksum but not the same number of compressed audio
bytes.

Since the intention is to use this later on to connect manually entered
metadata/statistics to individual music files it really needs to be as
close to 100% as possible.


-- 
erland

Erland Isaksson ('My homepage' (http://erland.isaksson.info))
(Developer of 'many plugins/applets'
(http://wiki.slimdevices.com/index.php/User:Erland). If my answer
helped you and you like to encourage future presence on this forum
and/or third party plugin/applet development, 'donations are always
appreciated' (http://erland.isaksson.info/donate))
------------------------------------------------------------------------
erland's Profile: http://forums.slimdevices.com/member.php?userid=3124
View this thread: http://forums.slimdevices.com/showthread.php?t=81679

_______________________________________________
beta mailing list
[email protected]
http://lists.slimdevices.com/mailman/listinfo/beta

Reply via email to