07.08.2012 18:58, Axb wrote:
> > Jari, are you seeing your corpus?
> >
> > Anyone else seeing missing corpora?
> > Is this possibly a problem where corpora are not being included?
>
> I'm watching my masscheck logs closely - all there.
My typical log here (I even increased the rsync verbosity to be sure):

Removing duplicates from HAM SPAM ... done.
Removing unwanted HAM mail from corpus 0
Removing Maildir/.Confirmed-HAM/cur/1344318666.M953201P4886V0000000000000806I0000000000B6331C_0.whirlwind,S=14993:2,S ... done
Removing unwanted SPAM mail from corpus
Syncing nightly_mass_check
+ ./mass-check --hamlog=ham-jarif.log --spamlog=spam-jarif.log -j 4 --progress --reuse ham:dir:/home/jarif/Maildir/.Confirmed-HAM spam:dir:/home/jarif/Maildir/.Confirmed-SPAM
status: starting scan stage now: 2012-08-07 12.01.56
status: completed scan stage, 13485 messages now: 2012-08-07 12.01.58
status: starting run stage now: 2012-08-07 12.01.58
status: 10% ham: 1185 spam: 164 date: 2012-05-09 now: 2012-08-07 12.04.17
status: 20% ham: 2373 spam: 325 date: 2011-04-22 now: 2012-08-07 12.08.12
status: 30% ham: 3560 spam: 487 date: 2011-06-17 now: 2012-08-07 12.12.36
status: 40% ham: 4748 spam: 648 date: 2011-09-05 now: 2012-08-07 12.16.53
status: 50% ham: 5937 spam: 808 date: 2011-11-01 now: 2012-08-07 12.21.11
status: 60% ham: 7125 spam: 969 date: 2012-01-02 now: 2012-08-07 12.25.46
status: 70% ham: 8313 spam: 1130 date: 2012-03-06 now: 2012-08-07 12.30.11
status: 80% ham: 9501 spam: 1291 date: 2012-05-04 now: 2012-08-07 12.34.39
status: 90% ham: 10689 spam: 1452 date: 2012-07-24 now: 2012-08-07 12.39.01
status: completed run stage now: 2012-08-07 12.43.01
+ LOGLIST=' ham-jarif.log spam-jarif.log'
+ set +x
rsync -Pcvz ham-jarif.log spam-jarif.log [email protected]::corpus/
This is the SpamAssassin Corpus rsync machine.

Modules that are available:

corpus      nightly mass-check result upload area. It is password
            protected. If you would like a password, please send a
            request to [email protected] and request a "nightly" username
            and password.
submit      Score generation mass-check result upload area. It is
            password protected. If you would like a password, please
            send a request to [email protected] and request a "score
            generation" username and password. Generally these are only
            granted after a mass-check announcement has been made on the
            spamassassin developer mailing list.
anoncorpus  mass-check result download area, available via anonymous
            access.

ham-jarif.log
       32768   0%    0.00kB/s    0:00:00
     2658417  15%    2.19MB/s    0:00:06
     5541500  32%    2.44MB/s    0:00:04
     7794435  45%    2.30MB/s    0:00:03
     9597411  56%    2.16MB/s    0:00:03
    11696723  68%    2.08MB/s    0:00:02
    13594276  79%    1.85MB/s    0:00:01
    15267861  89%    1.75MB/s    0:00:00
    17038423 100%    1.96MB/s    0:00:08 (xfer#1, to-check=1/2)
spam-jarif.log
        8280   0%    8.09kB/s    0:05:38
       11592   0%   11.32kB/s    0:04:01
     2746479 100%    1.73MB/s    0:00:01 (xfer#2, to-check=0/2)

sent 1021990 bytes  received 34822 bytes  72883.59 bytes/sec
total size is 19784902  speedup is 18.72
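
For anyone who wants to check their own run the same way, here is a minimal sketch (not part of the masscheck scripts) that pulls the ham/spam counts out of progress lines like the ones in the log above. It assumes the "status: NN% ham: N spam: N" format shown there; the function names parse_progress and corpus_looks_present are made up for illustration.

```python
import re

# Assumed format, taken from the progress lines in the log above, e.g.:
#   status: 10% ham: 1185 spam: 164 date: 2012-05-09 now: 2012-08-07 12.04.17
STATUS_RE = re.compile(r"status:\s+(\d+)%\s+ham:\s+(\d+)\s+spam:\s+(\d+)")


def parse_progress(log_text):
    """Return a list of (percent, ham, spam) tuples found in log_text."""
    return [tuple(int(g) for g in m.groups())
            for m in STATUS_RE.finditer(log_text)]


def corpus_looks_present(log_text):
    """True if the last progress line reports both ham and spam messages."""
    progress = parse_progress(log_text)
    if not progress:
        return False
    _, ham, spam = progress[-1]
    return ham > 0 and spam > 0


sample = """\
status: starting run stage now: 2012-08-07 12.01.58
status: 10% ham: 1185 spam: 164 date: 2012-05-09 now: 2012-08-07 12.04.17
status: 90% ham: 10689 spam: 1452 date: 2012-07-24 now: 2012-08-07 12.39.01
status: completed run stage now: 2012-08-07 12.43.01
"""

print(parse_progress(sample))
print(corpus_looks_present(sample))
```

A log with no progress lines at all, or with a zero ham or spam count on the last one, would be flagged as a possibly missing corpus.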
