On Wednesday September 16 2009 22:03:17 Justin Mason wrote: > Who is running a mass-check that's still in progress? (fwiw, I am ;) > It'll be at least 5 users (with myself and John), but that's not a > great population of training data.
I spent a couple of afternoons cleaning up my corpus or 60.000 messages (of which 39000 is ham, checked and rechecked). I have already uploaded my results, although I will probably do another iteration of hand-weeding based on nightly ruleqa results - it will be there by the end of the day. Mark