Sorry to go off-topic here, but I'm in the process of trying to figure out a better way of detecting duplicates in my corpus, which right now uses a combination of "formail -D" (ie: message-id header) and a sha1sum of the message.
Unduplicating the current messages is pretty simple (find duplicates and delete them), but moving forward is more difficult. Ideally, I'd like to have something like "formail -D", but I'd like for it to take an input key (say sha1sum output) instead of just looking at the Message-Id header. Does anyone know of something like this, or do I need to go coding a little? Thanks. :) -- Randomly Generated Tagline: Illiterate? Write to us for a free brochure.
pgpOQe5VGcnNV.pgp
Description: PGP signature