On Mon, 2 Jul 2012 12:01:32 -0700 (PDT) John Hardin wrote: > On Mon, 2 Jul 2012, Jari Fredriksson wrote: > > > On 2.7.2012 19:23, [email protected] wrote: > >> On 07/02, Jari Fredriksson wrote: > >>> I follow the wiki page. I have now implemented the following > >> > >> It seems you are interpreting the wiki as a flawless authority, > >> when it would probably be more appropriate to consider it a crufty > >> guideline that one of us should get around to updating. > >> > >> http://wiki.apache.org/spamassassin/CorpusCleaning > >> > >> Which part of that page made you feel you should strip out > >> facebook? > >> > > > > http://wiki.apache.org/spamassassin/HandClassifiedCorpora?highlight=%28facebook%29 > > That says to not include any _spams_ received via those channels, not > to discard them _in toto_. > It actually says:
DO NOT include such mail in either ham or spam folder. Just delete it. Why? We don't want to count these as spam, causing false marks against highly safe whitelist rules like USER_IN_DEF_DKIM_WL. They do not count as ham either, because spam URL's or spam text would throw off the statistics if they show up in the ham folder. Simply delete them
