Re: How to disable autolearn for FuzzyOcr?

2006-10-18 Thread Justin Mason
John Thompson writes: On 2006-10-16, Marc Perkel [EMAIL PROTECTED] wrote: What needs to be done with messages that are spam is to only learn the headers and not the body of the message. What needs to be done is some detection of deliberate bayes poisoning and removal of the poison

Re: How to disable autolearn for FuzzyOcr?

2006-10-17 Thread Frank Bures
On Mon, 16 Oct 2006 15:16:19 -0400 (EDT), Daniel T. Staal wrote: On Mon, October 16, 2006 3:07 pm, Marc Perkel said: What needs to be done with messages that are spam is to only learn the headers and not the body of the message. What needs to be

Re: How to disable autolearn for FuzzyOcr?

2006-10-17 Thread John Thompson
On 2006-10-16, Marc Perkel [EMAIL PROTECTED] wrote: What needs to be done with messages that are spam is to only learn the headers and not the body of the message. What needs to be done is some detection of deliberate bayes poisoning and removal of the poison before learning. Does Bayes

How to disable autolearn for FuzzyOcr?

2006-10-16 Thread Frank Bures
My apologies if this question has already been discussed here. I have a feeling it was but I could not find anything in archives. Question: Is there a way to disable autolearn if the spam triggers FUZZY_OCR? These spams usually contain lots of

R: How to disable autolearn for FuzzyOcr?

2006-10-16 Thread Giampaolo Tomassoni
My apologies if this question has already been discussed here. I have a feeling it was but I could not find anything in archives. Question: Is there a way to disable autolearn if the spam triggers FUZZY_OCR? These spams usually contain lots of legitimate-looking text and I worry about

RE: How to disable autolearn for FuzzyOcr?

2006-10-16 Thread Chandler, Jay
-Original Message- From: Giampaolo Tomassoni [mailto:[EMAIL PROTECTED] Sent: Monday, October 16, 2006 5:26 AM To: users@spamassassin.apache.org Subject: R: How to disable autolearn for FuzzyOcr? My apologies if this question has already been discussed here. I have a feeling

RE: How to disable autolearn for FuzzyOcr?

2006-10-16 Thread Frank Bures
On Mon, 16 Oct 2006 08:46:17 -0700, Chandler, Jay wrote: -Original Message- From: Giampaolo Tomassoni [mailto:[EMAIL PROTECTED] Sent: Monday, October 16, 2006 5:26 AM To: users@spamassassin.apache.org Subject: R: How to disable autolearn

Re: How to disable autolearn for FuzzyOcr?

2006-10-16 Thread D . J .
I think what the original poster was asking was how to make the gibberish bodies not get Bayes scanned, so as to not pollute the database with text that isn't spammy. Exactly my point. Slightly off topic here, but I have a dumb question. If you get a message with obvious bayes poison, what *should*

Re: How to disable autolearn for FuzzyOcr?

2006-10-16 Thread Jim Maul
D.J. wrote: I think what the original poster was asking was how to make the gibberish bodies not get Bayes scanned, so as to not pollute the database with text that isn't spammy. Exactly my point. Slightly off topic here, but I have a dumb question. If you get a message

RE: How to disable autolearn for FuzzyOcr?

2006-10-16 Thread John D. Hardin
On Mon, 16 Oct 2006, Frank Bures wrote: On Mon, 16 Oct 2006 08:46:17 -0700, Chandler, Jay wrote: I think what the original poster was asking was how to make the gibberish bodies not get Bayes scanned, so as to not pollute the database with text that isn't spammy. Exactly my point. Do an

Re: How to disable autolearn for FuzzyOcr?

2006-10-16 Thread D . J .
Slightly off topic here, but I have a dumb question. If you get a message with obvious bayes poison, what *should* you do? Do you remove the poison and classify, or do you just not classify that message? I train it just like you would any other message - especially since many get autolearned. The

Re: How to disable autolearn for FuzzyOcr?

2006-10-16 Thread Marc Perkel
What needs to be done with messages that are spam is to only learn the headers and not the body of the message. What needs to be done is some detection of deliberate bayes poisoning and removal of the poison before learning.
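
As a rough illustration of the "learn only the headers" idea in this message, here is a minimal Python sketch. It is not SpamAssassin code and not how its Bayes learner works internally; `headers_only` is a hypothetical helper that uses only the Python standard library to drop the body of a raw message, so that whatever learner receives the result never sees the body text. How the stripped message would then be handed to a learner is left open.

```python
from email import message_from_string


def headers_only(raw: str) -> str:
    """Return just the header block of a raw RFC 822 message.

    Hypothetical helper for illustration only: it keeps every header
    line in its original order and discards the body, where the
    poison text in this thread's examples would live.
    """
    message = message_from_string(raw)
    # items() yields (name, value) pairs for all headers in order.
    header_lines = [f"{name}: {value}" for name, value in message.items()]
    # End with a blank line so the result is still a valid, empty-bodied message.
    return "\n".join(header_lines) + "\n\n"


if __name__ == "__main__":
    sample = (
        "From: sender@example.com\n"
        "Subject: Cheap pills\n"
        "\n"
        "random legitimate-looking poison words here\n"
    )
    print(headers_only(sample))
```

Dropping the body wholesale is the bluntest form of the suggestion; it also throws away any genuinely spammy body text, which is the trade-off the rest of the thread goes on to discuss.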

Re: How to disable autolearn for FuzzyOcr?

2006-10-16 Thread Marc Perkel
John D. Hardin wrote: On Mon, 16 Oct 2006, Frank Bures wrote: On Mon, 16 Oct 2006 08:46:17 -0700, Chandler, Jay wrote: I think what the original poster was asking was how to make the gibberish bodies not get Bayes scanned, so as to not pollute the database with

Re: How to disable autolearn for FuzzyOcr?

2006-10-16 Thread Daniel T. Staal
On Mon, October 16, 2006 3:07 pm, Marc Perkel said: What needs to be done with messages that are spam is to only learn the headers and not the body of the message. What needs to be done is some detection of deliberate bayes poisoning and removal of the poison before learning. In all honesty:

Re: How to disable autolearn for FuzzyOcr?

2006-10-16 Thread Marc Perkel
Daniel T. Staal wrote: On Mon, October 16, 2006 3:07 pm, Marc Perkel said: What needs to be done with messages that are spam is to only learn the headers and not the body of the message. What needs to be done is some detection of deliberate bayes poisoning and removal of the