On 7/18/2014 10:07 AM, Kevin Golding wrote:
Fair enough, although I would note that if, for example, I split my main corpus into subsets (which is tempting at times) then some messages would show in both the obsolete masscheck and the current ones.

Also if we're really using 2012 dated log files should I stop worrying about keeping everything within the past 6-7 months?
We are NOT using those files to my knowledge. I moved all the old logs to /home/updatesd/svn/new-rule-score-gen/corpus/corpus-logs-pre-2014-07-18/ and will delete if all looks good at 10PM~ EDT with the do-stable-update-with-scores job

One thing I will say is I've gradually moved my (working) masscheck later and later in the day. I know originally it said 9am UTC but I have to run mine after 10am UTC for some odd reason - perhaps that could explain some of the problems?

I think the key point is do you see the logs you are submitting listed as used? I didn't see your logs as being used with the correct revision. Looking at http://rsync.spamassassin.org/, I don't see your logs at all which means you haven't uploaded in quite a long time. Can you check your rsync output easily to see if you are uploading ok?

Otherwise, I think the last files we have from you are from 2012!

Yikes! I will check but I showed up as expected in the list you posted:

Checking corpus/usable-corpus-set1/spam-net-kpg-core.log for SVN 1609892...
# SVN revision: 1609892

And this was the output I got this morning:

rsync -qPcvz ham-kpg-core.log spam-kpg-core.log [email protected]::corpus/
This rsync lacks old-style --compress due to its external zlib. Try -zz.
Continuing without compression.

I've been seeing myself on the ruleqa site and I am in the rsync listing for today too, with my huge 32MB uncompressed file!
Ahh, you had uploaded some logs in 2012 under kgolding so that's what I was looking for. Sorry for the false alarm.

Regards,
KAM

Reply via email to