https://issues.apache.org/SpamAssassin/show_bug.cgi?id=6674

Adam Katz <[email protected]> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
                 CC|                            |[email protected]

--- Comment #9 from Adam Katz <[email protected]> 2011-10-13 21:15:04 UTC ---
(In reply to comment #5)
> 1) I can start maintaining these rules as CustomRuleset, I see that "Polish
> Language Ruleset" is empty and Status is "?". I just need to know what shall I
> do.

Segregating rulesets by language is generally a bad idea because it limits
visibility (FPs get minimized and ignored) and the rules become impossible to
maintain.  There is nothing wrong with this approach as long as it is not part
of the main project, for example as a separate sa-update channel.

> or maybe SA could detect a language (which is not trivial) and load
> appropriate customruleset.
> 
> I see: Greek, German are active. Romanian is marked as active, but it is
> empty.

Language detection with TextCat is awful.  It's better than nothing, but it is
frequently wrong.

> 2) I'm thinking that maybe SA rules could be packaged with country-specific
> customrulesets and the postmaster would decide which rulesets are used,
> something like:
> 
> preload_rulesets pl de gr
> in local.cf

I believe language-specific rulesets are already possible in SA via locale
support (note that you can currently only have one locale).  I've never tried
it, but you can conceivably write rules like this:

  lang pl body PL_FOO /\btawerna\b/i

PL_FOO would then only be run if the system locale is Polish.  The "lang"
prefix is currently only used for "describe" lines.
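
As a rough, untested sketch of what a locale-gated rule might look like in
local.cf: the rule name and pattern are the ones from the example above, while
the description text and the score value are made up purely for illustration.
It assumes the "lang" prefix gates "body" and "score" lines the same way it
gates "describe" lines, which has not been verified here.

  # Hypothetical: all three lines are skipped unless the locale is Polish.
  lang pl body      PL_FOO  /\btawerna\b/i
  lang pl describe  PL_FOO  Body contains the word "tawerna"
  lang pl score     PL_FOO  0.1

Gating the "score" and "describe" lines along with the rule itself should keep
them from referring to a rule that was never defined under other locales.
Running "spamassassin --lint" afterwards would show whether the configuration
parses.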

However, I'd rather see this implemented as channels.

If we wanted to get more specific, I'd say the channels should be vetted
through mass-check (as my channels are), so that rules good enough to be
mainstream can be automatically promoted.  Note that the current ruleqa
system, with its current corpora, is not set up to properly evaluate rule
efficacy for Polish-language mail and would do an awful job.

