On Mon, 2 Jul 2012, RW wrote:
On Mon, 2 Jul 2012 12:01:32 -0700 (PDT)
John Hardin wrote:
On Mon, 2 Jul 2012, Jari Fredriksson wrote:
On 2.7.2012 19:23, [email protected] wrote:
On 07/02, Jari Fredriksson wrote:
I follow the wiki page. I have now implemented the following
It seems you are interpreting the wiki as a flawless authority,
when it would probably be more appropriate to consider it a crufty
guideline that one of us should get around to updating.
http://wiki.apache.org/spamassassin/CorpusCleaning
Which part of that page made you feel you should strip out
facebook?
http://wiki.apache.org/spamassassin/HandClassifiedCorpora?highlight=%28facebook%29
That says to not include any _spams_ received via those channels, not
to discard them _in toto_.
It actually says:
DO NOT include such mail in either ham or spam folder. Just delete it.
Why? We don't want to count these as spam, causing false marks against
highly safe whitelist rules like USER_IN_DEF_DKIM_WL. They do not count
as ham either, because spam URLs or spam text would throw off the
statistics if they show up in the ham folder. Simply delete them.
Also, by "discard them in toto" I was referring to the _channels_, not the
individual messages.
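
For anyone scripting the "just delete it" step, the rule quoted above could be sketched roughly as follows. This is only an illustration under assumptions: the wiki does not say how to identify such mail, so matching on the From: domain is my guess at a workable heuristic. The names EXCLUDED_DOMAINS, sender_domain and keep_in_corpus are hypothetical, and facebookmail.com is an assumed example domain that you would replace with whatever your own corpus actually shows.

```python
from email import message_from_string
from email.utils import parseaddr

# Hypothetical list of "channel" sender domains to drop from BOTH the ham
# and spam folders. Assumed for illustration; adjust to your own corpus.
EXCLUDED_DOMAINS = {"facebookmail.com"}


def sender_domain(raw_message: str) -> str:
    """Return the lowercased domain of the From: address, or '' if absent."""
    msg = message_from_string(raw_message)
    _, addr = parseaddr(msg.get("From", ""))
    return addr.rpartition("@")[2].lower()


def keep_in_corpus(raw_message: str) -> bool:
    """False if the message came via an excluded channel and should be deleted."""
    domain = sender_domain(raw_message)
    return not any(
        domain == excluded or domain.endswith("." + excluded)
        for excluded in EXCLUDED_DOMAINS
    )
```

A corpus-cleaning script would call keep_in_corpus() on each message and delete those for which it returns False, regardless of whether the message sat in the ham or the spam folder.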
--
John Hardin KA7OHZ http://www.impsec.org/~jhardin/
[email protected] FALaholic #11174 pgpk -a [email protected]
key: 0xB8732E79 -- 2D8C 34F4 6411 F507 136C AF76 D822 E6E6 B873 2E79