On Sat, Mar 06, 2004 at 09:11:41AM -0800, Tom Emerson <[EMAIL PROTECTED]> wrote:
Content-Description: signed data
> On Friday 27 February 2004 2:57 am, Matthew Hunter wrote:
> > FWIW, here's how I learn currently (learning from all read
> > messages, except known spam, and mailing lists dealing with
> > spam):
> >
> > find /home/matthew/Maildir/ -type d -name "cur" | grep -v -i spam | xargs
> > --max-args=1 -t sa-learn --no-rebuild --ham
> 
> I had to do the first couple of parts by hand to see what is happening, but 
> it 
> sort of makes sense: you're eliminating any mail directories that happen to 
> have the word "spam" as part of the NAME of the directory (folder).  At 
> first, I thought grep would scan the FILES in each directory passed to it, 
> rather than "the list of filenames" itself [really gotta learn piped 
> precedence :) ]
> 
> Unfortunately, this doesn't take into consideration "uncaught spam" -- i.e., 
> stuff that really is spam that is in the "wrong" folder.  In my case, I have 
> a bit of "history" built up, and although for the most part I've been pretty 
> diligent in removing the garbage, I wouldn't be 100% certain to use this 
> shotgun approach (but that's just me -- this will probably be fine for 
> others)

Actually, it does exactly that.  The "find" command returns only 
directories named "cur" within the tree, which (in a maildir 
folder tree) means "only directories with read mail".  New mail, 
before an email client sees it, is stored in a directory called 
"new".  

To make sure you don't learn missed spam, just make sure you move 
any spam you see into the spam folder the first time you see it; 
it will never appear in the cur directory if you move it directly 
to the spam folder as soon as you see it.

Note that the spam learning works the same way -- you won't learn 
from spam until you've checked the folder for false positives and 
then left that folder.  It won't learn anything you haven't 
at least glanced at.

It's been working very well.  I have maybe one or two spams a 
month that slip through.  Of course, I also use other measures.

-- 
Matthew Hunter ([EMAIL PROTECTED])
Public Key: http://matthew.infodancer.org/public_key.txt
Homepage: http://matthew.infodancer.org/index.jsp
Politics: http://www.triggerfinger.org/index.jsp

Reply via email to