On 02/13/2012 12:57 PM, Henri Salo wrote:
> On Mon, Feb 13, 2012 at 05:04:34AM -0500, Michael Richards wrote:
>> Do the sigmakers just waste their time sifting through tons of
>> duplicate submissions?
>
> I sure hope not. I am more than happy to help creating faster "process" for
> this if ClamAV guys can tell what they need or at least old system should be
> documented somehow. Why not create this as a open-source :) If I am correct
> the duplicates mostly come from big av-check sites. They send reports with
> old signatures and/or when they send the file it is not in fact known, but it
> is known when ClamAV guys starts to add the signature.
The duplicate submissions are not bit-to-bit identical.
Bit-to-bit identical submissions are thrown away/merged automatically early in
the process, and they don't get reported to clamav-virusdb@.
Same with files that are already detected by ClamAV.
The duplicates ("Same as") mean that ClamAV detects them _now_ with the same
virusname, but at the time
of the submission they were not detected at all.
It is easy to see why this could happen:
- if it is a file infector then we get a unique submission for each file it
infected. It is still the same malware,
and if a signature gets added to detect one particular instance of the
infection then the other infected files
should get detected as well
- if it is a polymorphic virus then each instance is unique, and depending on
how good the signature is
it may detect many instances of the malware with the same virus name
- the signature might be generic, so it detects more than one malware under
the same name
- ... etc.
Best regards,
--Edwin
_______________________________________________
Help us build a comprehensive ClamAV guide: visit http://wiki.clamav.net
http://www.clamav.net/support/ml