|
On 8/12/2012 2:06 PM, Marc Andre Selig
wrote:
I would say go ahead and run it. The worst that happens if you submit it too late is it isn't used.Hi all, now that it's Sunday I'm finally getting around to setting up the mass check scripts. Thanks for setting up the account, by the way. :)I've got three questions: 1. My work machine is a laptop that does not run continuously. What do I do if it happens to be sleeping at 9 a.m. UTC? Skip the mass check for that day, or just run it at the earliest point possible? That's my understanding.2. Do I understand the code correctly when I assume that I can just leave report_safe messages as they are? I.e. there's no need to remove the report_safe encapsulation before putting the messages in the spam corpus? I believe mbox format is broken from another ticket, sorry. Too many fronts being battled right now! we will get mbox working.3. I am having trouble using corpus files in mbox format. I just started with a handful of messages to try things out, namely 108 ham messages and 288 spam messages. If I put the messages into maildir folders, the log files have 114 lines for ham (seeing that there are 6 header lines, that seems to be all right) and 291 lines for spam (so I assume there's a few duplicates left). However, if I put the same messages into two mbox files (and change the config file correspondingly), the files have 13 lines for ham and 291 lines for spam. Is there anything special I have to do to use mbox? Thanks in advance! -- Kevin A. McGrail
President
Peregrine Computer Consultants Corporation
703-359-9700 x50 / 800-823-8402 (Toll-Free)
![]() |
- Timing; report_safe messages; mbox files Marc Andre Selig
- Re: Timing; report_safe messages; mbox files Kevin A. McGrail

