Crazy nonsensical white-space within words

Morgan Bishop Mon, 06 Jun 2011 18:03:14 -0700

I'm new to all of this and I'm not sure if training with sa-learn ishaving any effect as this SPAM still scores the same and bayes thinksit's probably less than 1% SPAM (BAYES_00). I'm run a small vanitydomain for friends and family so there isn't exactly a ton of traininggoing on, but I'm sure I'm doing it right as most Bayes is 95-99% forlegitimate SPAM, and 0-5% for HAM. I only training on mail I'vepersonally made sure is HAM and SPAM, and in fact, these e-mails are theonly 1% probability I get for legitimate SPAM.

I've attached an example below. There is an HTML component as well, butother than markup it is idential. My thinking is there should be someway to write a rule checking words against a dictionary, but it soundslike an expensive filter process-wise. This poor user gets about 10 ofthese mails a day.


---------BODY----------------
http://groups.yahoo.com/group/ayazpahlmu/message/Chat/220686/

Ulti mate ly Ab ou t Per ce nt Of Ind ivi dual Re turn s Qual ifie d For e fu nd s Last Ye ar Tot alin g Abou t Bil lion Th e Re fu nds Aver ged Ab out The Sa me Am ou nt.

Crazy nonsensical white-space within words

Reply via email to