https://bz.apache.org/SpamAssassin/show_bug.cgi?id=7141
--- Comment #3 from Mark Martinec <[email protected]> --- I'm folding the above in. I think this follows the spirit of the original code (takes 7 characters), except that back in 2002 single-byte encodings (extended ASCII like Latin etc.) were much more prevalent than today's UTF-8, (even without normalize_charset turned on). Bug 7141: Bayes truncates ('skip') long tokens on bytes, should it count characters instead? Sending lib/Mail/SpamAssassin/Plugin/Bayes.pm Committed revision 1661633. Re-assessing usefulness of skips (and/or their length) might still be a good thing to do. If someone has a will to do so is most welcome. -- You are receiving this mail because: You are the assignee for the bug.
