https://issues.apache.org/SpamAssassin/show_bug.cgi?id=7133

--- Comment #13 from Mark Martinec <[email protected]> ---
> normalize_charset ( 0 | 1) (default: 0)
>   Whether to detect character sets and normalize message content to
>   Unicode. Requires the Encode::Detect module, HTML::Parser version
>   3.46 or later, and Perl 5.8.5 or later.

I need to update that text a bit. The Encode::Detect is no longer a requirement
(just optional bonus), and the result is in UTF-8 bytes, not Unicode
characters.


> > It's been a year since the last release so let's worry less about surprise
> > and what is more effective?
> 
> I say enable it (1) by default.

That would be my advise too.

Some rules would need adjusting though (like the drugs / Viagra spellings,
which imply Latin encoding) to remain effective. Or just abandoned and
replaced with something new.

-- 
You are receiving this mail because:
You are the assignee for the bug.

Reply via email to