On 6/7/2011 10:34 AM, Bowie Bailey wrote:
On 6/6/2011 9:02 PM, Morgan Bishop wrote:
I'm new to all of this and I'm not sure if training with sa-learn is
having any effect as this SPAM still scores the same and bayes thinks
it's probably less than 1% SPAM (BAYES_00).  I'm run a small vanity
domain for friends and family so there isn't exactly a ton of training
going on, but I'm sure I'm doing it right as most Bayes is 95-99% for
legitimate SPAM, and 0-5% for HAM.  I only training on mail I've
personally made sure is HAM and SPAM, and in fact, these e-mails are
the only 1% probability I get for legitimate SPAM.

I've attached an example below.  There is an HTML component as well,
but other than markup it is idential.  My thinking is there should be
some way to write a rule checking words against a dictionary, but it
sounds like an expensive filter process-wise.  This poor user gets
about 10 of these mails a day.
<spam sample removed>

Please upload the full email including all headers (and your X-Spam
headers, if possible) to pastebin.com and send us the link.  We will be
able to give you much better suggestions if we can see the full headers,
HTML, etc.


Here's a complete mail for the new user having problems. http://pastebin.com/YrKGGmvD Here's the identical mail forwarded to my long lived account. http://pastebin.com/JDArUj85

As you can see, the new account does not have a Bayes score while my account does.

I set up a new mail account for my father just for Hotmail to forward to. The only hit this SPAM is getting is for the new account is the generic freemail hit since it's forwarded from hotmail. In Bayes status (X-Spam-Bayes) there is an empty summary (_TOKENSUMMARY_). I assume this is because Bayes is not running since there are less than 200 SPAM/HAM messages for new account kreg...@morg.org?

However, In my case Bayes runs and sees enough spammy tokens for the Bayes Spam probability to be 99%, which is better than before where it was reported as HAM (BAYES_00). Even still, the message is not scored high enough to be marked as SPAM and I'm worried that increasing the BAYES_99 score will mess up a fairly good spam configuration - most of the accounts on my domain see very little false positives and a lot of times a BAYES_99 score will bump a message just under the SPAM threshold for first time HAM keeping it from becoming a false-positive. Thus, even if I wait for the SPAM and HAM to trickle in to this new account for Bayes to start running on kreg...@morg.org, as things are I do not think it will work without extra tweaking somewhere.

Hope that makes sense:)

Reply via email to