At 03:16 PM 5/27/2004, Mabry Tyson wrote:
>
This illustrates one of my complaints of most spam reporting (including for Razor). The basic idea people have is "I should report spam".
THAT ISN'T ENOUGH. You have to report non-spam as well. If all that is reported is spam, then there is nothing to compare it against. You bias the system to recognize everything as spam and your false positives (non-spam classified as spam) soar (where 0.5% is too high).

Dude, razor is NOT a bayes subsystem.

Do not try to apply bayes concepts to razor, as it doesn't work in anything REMOTELY resembling the bayes model.

Unless the message is already in the razor database, reporting a nonspam message to razor-revoke is pointless and a waste of bandwidth.

Razor is a database of hashes of known spam messages. Period.

Razor is NOT a tokenizer. Razor is NOT a learning system. You don't "train" razor. You can't teach it to recognize nonspam by feeding it random nonspam messages. Razor recognizes only the exact message you report. If that message never appears again in the world, reporting it is a waste.







-------------------------------------------------------
This SF.Net email is sponsored by: Oracle 10g
Get certified on the hottest thing ever to hit the market... Oracle 10g. Take an Oracle 10g class now, and we'll give you the exam FREE.
http://ads.osdn.com/?ad_id=3149&alloc_id=8166&op=click
_______________________________________________
Razor-users mailing list
[EMAIL PROTECTED]
https://lists.sourceforge.net/lists/listinfo/razor-users

Reply via email to