> >>I am using it as a mailing list. However, over the last half year, it has
> >>gotten pretty big, and has some duplicates in it. For reporting sake, I
> >>would like to delete the duplicate records based on email addresses. If you
> >>sign up three times, I only want to keep your first record in there.
> >
> >The simplest course of action is to use a hash, and use the email address
> >as the deciding factor of uniqueness.
>
> I meant to convert the email address to lowercase...
>
Wonder where you got that idea? ;-)

Looks like that question has been quite well covered. Under Unix, my
simple version can be reduced to:

    cat list.current | perl -ne 'print if not $seen{(split /\|/)[0]}++' > list.new

(This assumes the email address is the first '|'-delimited field of each
record; change the index if it sits elsewhere.)

Jonathan Paton