Sorry to go off-topic here, but I'm trying to figure out a better way of
detecting duplicates in my corpus, which right now uses a combination of
"formail -D" (i.e., the Message-Id header) and a sha1sum of the message.

Deduplicating the existing messages is pretty simple (find duplicates and
delete them), but catching duplicates in new mail going forward is more
difficult.  Ideally, I'd like something like "formail -D" that takes an
input key (say, sha1sum output) instead of just looking at the Message-Id
header.
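
To make the idea concrete, here is a rough Python sketch of a duplicate check
keyed on an arbitrary string rather than on Message-Id. It is not an existing
tool: the function names (message_key, seen_before) and the one-key-per-line
cache file are assumptions made up for illustration, and it does no file
locking, so concurrent deliveries could race.

```python
import hashlib


def message_key(raw_message: bytes) -> str:
    """Return the SHA-1 hex digest of the raw message bytes (hypothetical helper)."""
    return hashlib.sha1(raw_message).hexdigest()


def seen_before(key: str, cache_path: str) -> bool:
    """Return True if key is already in the cache file.

    If the key is new, append it to the cache and return False.
    The cache format (one key per line) is an assumption of this sketch.
    """
    try:
        with open(cache_path, "r", encoding="ascii") as cache:
            if any(line.strip() == key for line in cache):
                return True
    except FileNotFoundError:
        # First run: no cache yet, so nothing can be a duplicate.
        pass

    with open(cache_path, "a", encoding="ascii") as cache:
        cache.write(key + "\n")
    return False
```

A delivery script could read the message from stdin, call
seen_before(message_key(data), path), and drop the message when it returns
True. Unlike the "formail -D" cache, which is capped at a maximum length, this
file grows without bound and would need occasional pruning.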

Does anyone know of something like this, or do I need to go coding a little?

Thanks. :)

-- 
Randomly Generated Tagline:
Illiterate?  Write to us for a free brochure.
