Hi all, now that it's Sunday I'm finally getting around to setting up the mass check scripts. Thanks for setting up the account, by the way. :)
I've got three questions:

1. My work machine is a laptop that does not run continuously. What do I do if it happens to be sleeping at 9 a.m. UTC? Skip the mass check for that day, or just run it at the earliest point possible?

2. Do I understand the code correctly when I assume that I can just leave report_safe messages as they are? I.e. there's no need to remove the report_safe encapsulation before putting the messages in the spam corpus?

3. I am having trouble using corpus files in mbox format. I just started with a handful of messages to try things out, namely 108 ham messages and 288 spam messages. If I put the messages into maildir folders, the log files have 114 lines for ham (seeing that there are 6 header lines, that seems to be all right) and 291 lines for spam (so I assume there's a few duplicates left). However, if I put the same messages into two mbox files (and change the config file correspondingly), the files have 13 lines for ham and 291 lines for spam. Is there anything special I have to do to use mbox?

Thanks in advance!

Regards
Marc
