http://bugzilla.spamassassin.org/show_bug.cgi?id=2129





------- Additional Comments From [EMAIL PROTECTED]  2004-03-14 14:07 -------
Subject: Re:  Bayes tweaks to test 

-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1


>I like Comment 10, making invisible words metadata.  Perhaps the same
>could be done for low-contrast, tiny-font, and other near-invisible
>words.

I was considering this -- but I decided against it based on current
use of "invisible text".  Nowadays it's predominantly:

  1. random words from a dictionary
  2. random strings of letters and numbers
  3. "travesty" output from Project Gutenberg texts

Learning these tokens with an "I*" prefix would be actively bad for
cases 1 and 2, since they'd bloat the db and possibly cause good
tokens to be expired due to space pressure.  In case 3, it
might be marginally useful, but I'm not convinced.

IMO, it's better to just ignore them, for bayes at least.  (but 
if someone codes it up and checks it in, I'll test it ;)

- --j.
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.2.4 (GNU/Linux)
Comment: Exmh CVS

iD8DBQFAVNeHQTcbUG5Y7woRAuxWAJ92b56XemLh27QbmYwGMbkIhVbYhwCgk51Z
+wMEYt/F/RuA4LQztakGI+o=
=q0dY
-----END PGP SIGNATURE-----
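
To make the trade-off in the comment above concrete, here is a minimal
sketch of the two options for invisible words: learning them as
"I*"-prefixed metadata tokens versus ignoring them.  This is not
SpamAssassin's actual tokenizer (which is written in Perl); the
tokenize() function, its "invisible" parameter, and the
(word, is_invisible) input format are all hypothetical, and detecting
which words are invisible is assumed to happen elsewhere.

```python
from typing import Iterable, List, Tuple

# Hypothetical input format: (word, is_invisible) pairs produced by
# some earlier HTML-rendering step that is not shown here.
Word = Tuple[str, bool]


def tokenize(words: Iterable[Word], invisible: str = "ignore") -> List[str]:
    """Return the tokens that would be handed to the Bayes learner.

    invisible="ignore": drop invisible words entirely.
    invisible="prefix": keep them as metadata tokens with an "I*" prefix.
    """
    if invisible not in ("ignore", "prefix"):
        raise ValueError("invisible must be 'ignore' or 'prefix'")
    tokens = []
    for word, is_invisible in words:
        if not is_invisible:
            tokens.append(word)
        elif invisible == "prefix":
            tokens.append("I*" + word)
    return tokens


# Illustrative message: two visible words plus two random invisible strings.
message = [
    ("cheap", False),
    ("pills", False),
    ("xq7zr2", True),
    ("k9vvb1", True),
]

print(tokenize(message, "ignore"))
print(tokenize(message, "prefix"))
```

With "prefix", every random string becomes a new token that is unlikely
to be seen again, which is the db bloat described above; with "ignore",
only the visible words are learned.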





------- You are receiving this mail because: -------
You are the assignee for the bug, or are watching the assignee.
