https://bz.apache.org/SpamAssassin/show_bug.cgi?id=7141

--- Comment #3 from Mark Martinec <[email protected]> ---
I'm folding the above in. I think this follows the spirit of the original
code (which takes 7 characters). Back in 2002, single-byte encodings
(extended ASCII such as the Latin family) were much more prevalent, so
bytes and characters coincided. With today's UTF-8 they often do not
(even without normalize_charset turned on).
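
To illustrate the difference, here is a minimal sketch in Python. It is not the Bayes.pm code (which is Perl). The function names and the sample token are made up for illustration, and the limit of 7 is taken from the comment above.

```python
# Illustrative only: these helpers are hypothetical, not part of SpamAssassin.

def truncate_bytes(token: str, limit: int = 7) -> bytes:
    """Keep the first `limit` octets of the UTF-8 encoding.

    This can cut a multi-byte character in half.
    """
    return token.encode("utf-8")[:limit]


def truncate_chars(token: str, limit: int = 7) -> bytes:
    """Keep the first `limit` characters, then encode to UTF-8."""
    return token[:limit].encode("utf-8")


if __name__ == "__main__":
    token = "программирование"  # every letter takes 2 octets in UTF-8

    by_bytes = truncate_bytes(token)
    by_chars = truncate_chars(token)

    # Byte truncation keeps 3 whole letters plus a dangling lead octet.
    print(len(by_bytes), by_bytes.decode("utf-8", errors="replace"))
    # Character truncation keeps 7 whole letters.
    print(len(by_chars), by_chars.decode("utf-8"))
```

For a pure ASCII token both helpers return the same 7 octets, which is why the distinction did not matter much for single-byte encodings.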


Bug 7141: Bayes truncates ('skip') long tokens on bytes,
should it count characters instead?
  Sending lib/Mail/SpamAssassin/Plugin/Bayes.pm
Committed revision 1661633.



Re-assessing the usefulness of skips (and/or their length) might still be
a good thing to do. Anyone willing to take that on is most welcome.
