On Tue, 13 Jul 2004 11:30:51 -0400 (EDT) Richard Welty <[EMAIL PROTECTED]> wrote:
RW> 0) a setup wizard might be a good idea; it could encourage people to
RW> use dspam in the mythical "right" way (e.g., don't send spam to
RW> the trash, but to a quarantine folder for review before disposal,
RW> that sort of thing.)

I've at least documented this in the manual but, of course, such a
wizard would be nice. I've even thought of showing one when the spam
filter options dialog is opened for the first time but I probably won't
have time to implement it.

RW> 1) if configure doesn't see sqlite, even if --with-dspam is specified,
RW> it builds without giving any indication that dspam was skipped,

Well, it does say "Cannot find sqlite header - support for DSPAM
disabled." but I guess a line in the summary for DSPAM could be a good
idea. Added.

RW> and the menu entries in Message|Spam still show up and when used, don't
RW> generate error messages.

This is by design. DSPAM is just one of the spam filters and there
could be more of them (currently there are 2: DSPAM and my old
home-brewed one). You'd get an error only if there are no filters at
all.

RW> 2) it would be nice if there were a way to corpus train from
RW> Message|Spam; the existing entries are for error training which
RW> isn't quite the same thing.

No, it isn't, but is it really common to train it like this? It doesn't
cost much to add these menu commands but I thought that it was wiser to
put them in a separate dialog because they're needed so rarely
(basically I've used them only once).

RW> 3) could we have error training on the right-click menu for
RW> messages?

Good idea, done!

RW> 4) it would be good to add a button for clearing statistics on
RW> the Edit|Spam Filters menu for dspam, as after you're satisfied
RW> with the training, you usually are well advised to clear stats,
RW> as errors in training tend to skew totals for a Very Long Time
RW> after dspam moves into production mode.

Yes, adding a "clear" button to the statistics would be nice but this
is really low priority, to be honest...
And I don't know how to do it either, to be honest (but then I didn't
even look).

RW> 5) it'd be nice if the displayed stats computed percentages
RW> for false positives, false negatives, etc.

I was quite confused by the dspam statistics so I just gave the same
output as dspam_stats. If you feel like improving it, just hack the
relevant code in DspamFilter.cpp and send me the patch, I'd be happy to
apply it.

RW> 6) there also should be a mechanism in Edit|Spam Filters
RW> for purging the database.

Yes, this will probably have to be done before the release, as my
database access has become much faster since I purged it.

RW> perhaps there should be some automation, e.g. mahogany periodically
RW> checks for the last purge date and if it's been a while, it runs one.
RW> this could be an option, e.g. AutoPurge.

Unfortunately there is no provision in M for doing stuff periodically.
This is an often requested feature (for folder archival, for example)
and it will be implemented sooner or later -- but not now.

RW> also, purge after training
RW> completion is important, as the early databases get quite large and
RW> once in production, they can be shrunk a lot.

It didn't shrink at all for me. But it has stopped growing, which is
already nice (as it was at 55 MB).

RW> 7) corpus training from a mbx file with 1000+ messages
RW> takes a while (at least, on my 750mhz linux laptop.) some
RW> sort of progress bar would be good, but this will likely
RW> require an API enhancement in libdspam. i can take this
RW> to jonz.

I'm almost surely using it incorrectly because I create and destroy a
DSPAM_CTX for each message. Apparently I can reuse the same one for all
the messages. Also, iterating over the messages is much slower than it
could be (although I suspect it doesn't play a big role here). In any
case, a progress dialog should definitely be added...

Thanks for your feedback!
VZ