On Tue, 2009-04-21 at 01:21 -0400, Gene Heskett wrote:
> You need to have it learn at least 200 messages of both 'ham' and 'spam' 
> before it has enough data to switch to working mode.  So sort them into 
> separate directories, and have it learn both a clean inbox as ham, and an all 
> spam directory.  [...]

Very true. There's one important word there, that /might/ bite one's
butt, though. *clean*.

With most backends, mail storage formats -- be it local or IMAP --
"moving spam out of the Inbox" isn't sufficient to have it *clean*. The
source message that's just "moved" out often still physically remains in
the folder, unnoticed, until one expunges (compact in TB lingo).

Without that step, training the Inbox as ham might learn all those
pesky, sneaky spam as ham too, which the user believes has been moved...

Just a side-note. :)


-- 
char *t="\10pse\0r\0dtu...@ghno\x4e\xc8\x79\xf4\xab\x51\x8a\x10\xf4\xf4\xc4";
main(){ char h,m=h=*t++,*x=t+2*h,c,i,l=*x,s=0; for (i=0;i<l;i++){ i%8? c<<=1:
(c=*++x); c&128 && (s+=h); if (!(h>>=1)||!t[s+h]){ putchar(t[s]);h=m;s=0; }}}

Reply via email to