Hello Gerald,

Thursday, April 7, 2005, 6:58:55 PM, you wrote:

GVLI> My question is, should I set up BAYES at all?

Yes. User-specific if you can do it, domain-level or site-wide
otherwise.

GVLI> I'm fairly certain domain level BAYES would be a bad thing with
GVLI> our demographic. We have people with family, friends, or
GVLI> business partners in APNIC countries, we have customers who
GVLI> frequent spam havens (online porn gatherers), we have ultra
GVLI> religious customers, and we have middle of the road customers.

The email host for my family domain does the same, and runs site-wide
bayes.

GVLI> I'm afraid domain wide bayes would show up as many FPs for the
GVLI> first two groups or many FNs for the last two -- or the database
GVLI> would just stay hosed up with customers shoving conflicting spam
GVLI> and ham into the learning folders.

It balances out.  Granny puts the porn into her spam box, and Ginger
puts a graphic discussion of last night's wet dream into her ham box.
Over time bayes learns which mails everyone thinks is spam, which
mails everyone thinks is ham, and which mails are undeterminable.

GVLI> I'm not sure how resource efficient per-user BAYES would be. Will it kill
GVLI> the machine as the user base grows or the spam volume increases?

per-user Bayes lookups aren't bad -- don't worry about them. The
question revolves around per-user Bayes database storage (do you have
enough disk space), and how you manage the sa-learn process.

Bob Menschel



Reply via email to