on Fri Apr 27 2007, skip-AT-pobox.com wrote:

>     Dave> Good to know.  I suppose there's no reason not to do this with a
>     Dave> cron job, if you're that confident in it.
>
> Well, I do kill my fetchmail process so sb_bnfilter isn't trying to read the
> database while tte is trying to write it.

IIUC, tte builds a fresh DB each time.  I simply run it on a new file
and then copy it over the old file.
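
For illustration, here is a minimal Python sketch of that build-then-swap
step.  It is not part of tte: `rebuild_and_swap` and the `build_db`
callable are hypothetical names, with `build_db` standing in for whatever
runs tte against the new file.  It also swaps in a rename (`os.replace`)
in place of a plain copy, on the assumption that the new file lives on
the same filesystem as the old one, so readers never see a half-written
database.

```python
import os
import tempfile


def rebuild_and_swap(db_path, build_db):
    """Build a fresh database beside db_path, then swap it into place.

    build_db is a hypothetical callable that writes a complete database
    to the path it is given (for example, by running the training script).
    """
    db_dir = os.path.dirname(os.path.abspath(db_path))
    # Create the temporary file in the same directory so the final
    # rename stays on one filesystem.
    fd, tmp_path = tempfile.mkstemp(dir=db_dir, prefix=".new-db-")
    os.close(fd)
    try:
        build_db(tmp_path)
        # Replace the old file in a single step; a process that already
        # has the old file open keeps reading the old contents.
        os.replace(tmp_path, db_path)
    except BaseException:
        # Leave the old database untouched if the build fails.
        if os.path.exists(tmp_path):
            os.remove(tmp_path)
        raise
```

Whether this removes the need to stop fetchmail depends on how the
filter opens the database, which the sketch does not address.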

>     Dave> OK... but what will happen if the real ratio of ham to spam is
>     Dave> more like 412:379 and I pass a simple ratio of 3:2?  
>
> All the RATIO tells it is how many spams and hams to score in one shot.  3:2
> means (if I recall correctly) that it will pick the next three spams and
> next two hams to score.  It will then check their scores.  Any which are
> correctly scored won't be visited in the next round.  I believe those that
> are scored incorrectly will be used to update the training database at that
> point.

Yeah, that's what I thought.

>     Dave> I guess I'm saying that the ratio argument is good for training
>     Dave> some specific ratio of hams and spams... but does anyone really
>     Dave> want to train a specific ratio?  What's the use case? If you've
>     Dave> supplied the ratio argument to make it easy for people to train
>     Dave> everything in an unbalanced set, it's not a very good way of
>     Dave> getting there.
>
> Maybe, but it works for me.

Sounds like you don't really want to discuss this point?  Sorry, I
don't mean to press it.  Just tell me whether you understand my
argument and would be willing to accept a patch for an --unbalanced
argument that counts the messages and traces a nice quantized line
through the whole set.  
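
To make the "quantized line" idea concrete, here is a minimal Python
sketch of the ordering such an option might produce.  It is only an
illustration of the proposal, not existing tte behavior, and the
function name `interleave_proportionally` is made up for this example.
It counts both sets up front and then walks them so that, at every
point, the fraction of hams visited stays close to the fraction of
spams visited.

```python
def interleave_proportionally(hams, spams):
    """Yield (label, message) pairs, mixing both lists at their overall ratio."""
    n_ham, n_spam = len(hams), len(spams)
    i = j = 0  # how many hams and spams have been emitted so far
    while i < n_ham or j < n_spam:
        # Emit a ham when the ham side is no further along than the spam
        # side.  Cross-multiplying compares i/n_ham with j/n_spam without
        # division, so empty lists need no special case.
        if j >= n_spam or (i < n_ham and i * n_spam <= j * n_ham):
            yield "ham", hams[i]
            i += 1
        else:
            yield "spam", spams[j]
            j += 1


if __name__ == "__main__":
    order = [label for label, _ in interleave_proportionally("abc", "xy")]
    print(order)
```

With 412 hams and 379 spams this visits all 791 messages exactly once,
without anyone having to approximate the ratio by hand.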

>     Dave> Unfortunately, I want to keep my email address and my server, so
>     Dave> unless Google is going to make their spam blocking technology
>     Dave> public it means SB is going to have to take on the whole job.
>
> I still use [EMAIL PROTECTED] as my visible identity.  You could have
> [EMAIL PROTECTED] forward to Gmail and then use POP3 to pick up your
> mail from their server.  Then use your normal email client and use your
> boost-consulting address.  You would lose the IMAP capability, 

Not good for me.  I really need what IMAP offers, AFAICT.

-- 
Dave Abrahams
Boost Consulting
http://www.boost-consulting.com

Don't Miss BoostCon 2007! ==> http://www.boostcon.com
_______________________________________________
[email protected]
http://mail.python.org/mailman/listinfo/spambayes
Check the FAQ before asking: http://spambayes.sf.net/faq.html
