Re: UTF8 character in [] doesn't match

2018-12-24 Thread John Hardin
On Sun, 23 Dec 2018, Matus UHLAR - fantomas wrote: While it doesn't directly answer your question about normalize-charset, this might work a little better: ifplugin Mail::SpamAssassin::Plugin::ReplaceTags body LOCAL_JANO /\bjno?\b/i replace_rules LOCAL_JANO endif body

Re: UTF8 character in [] doesn't match

2018-12-24 Thread Henrik K
On Mon, Dec 24, 2018 at 06:48:51PM +, RW wrote: > On Mon, 24 Dec 2018 10:16:58 +0200 > Henrik K wrote: > > > On Sun, Dec 23, 2018 at 11:11:39PM +, RW wrote: > > > On Sun, 23 Dec 2018 20:04:28 +0100 > > > Matus UHLAR - fantomas wrote: > > > > > > > Hello, > > > > > > > > I have tried

Re: UTF8 character in [] doesn't match

2018-12-24 Thread RW
On Mon, 24 Dec 2018 10:16:58 +0200 Henrik K wrote: > On Sun, Dec 23, 2018 at 11:11:39PM +, RW wrote: > > On Sun, 23 Dec 2018 20:04:28 +0100 > > Matus UHLAR - fantomas wrote: > > > > > Hello, > > > > > > I have tried to create rule that will match names "ján" and > > > "jano" (john and

Re: Is the SA Bayes implementation mathematically sound?

2018-12-24 Thread Rick Macdougall
On 2018-12-24 12:39 p.m., Ian Zimmerman wrote: On 2018-12-23 17:02, Rick Macdougall wrote: I'm just going to jump in here and mention that I train my bayes in SA and in Thunderbird email client. Thunderbird catches 99%+ and SA catches under 60% with the same training data. Have you also

Re: Is the SA Bayes implementation mathematically sound?

2018-12-24 Thread Ian Zimmerman
On 2018-12-23 17:02, Rick Macdougall wrote: > I'm just going to jump in here and mention that I train my bayes in SA > and in Thunderbird email client. > > Thunderbird catches 99%+ and SA catches under 60% with the same > training data. Have you also compared the rates of False Positives? --

Re: UTF8 character in [] doesn't match

2018-12-24 Thread Matus UHLAR - fantomas
> On Sun, 23 Dec 2018 20:04:28 +0100 > Matus UHLAR - fantomas wrote: > > I have tried to create rule that will match names "ján" and > > "jano" (john and johnny in slovak languages). > > > > I have created rule: > > > > body LOCAL_JANO /\bJ[aá]no\b/i > > > > however, it does not match. >

Re: UTF8 character in [] doesn't match

2018-12-24 Thread Pedro David Marco
On Monday, December 24, 2018, 9:49:11 AM GMT+1, Henrik K wrote: >... so for general file portability this would be even better: > >(?:[a\xe1]|\xc3\xa1) I fully agree with Henrik, but would add a small detail... in some cases i have found problems using BODY to locate special chars  (most

Re: UTF8 character in [] doesn't match

2018-12-24 Thread Henrik K
On Mon, Dec 24, 2018 at 10:18:31AM +0200, Henrik K wrote: > On Mon, Dec 24, 2018 at 10:16:58AM +0200, Henrik K wrote: > > On Sun, Dec 23, 2018 at 11:11:39PM +, RW wrote: > > > On Sun, 23 Dec 2018 20:04:28 +0100 > > > Matus UHLAR - fantomas wrote: > > > > > > > Hello, > > > > > > > > I have

Re: UTF8 character in [] doesn't match

2018-12-24 Thread Henrik K
On Mon, Dec 24, 2018 at 10:16:58AM +0200, Henrik K wrote: > On Sun, Dec 23, 2018 at 11:11:39PM +, RW wrote: > > On Sun, 23 Dec 2018 20:04:28 +0100 > > Matus UHLAR - fantomas wrote: > > > > > Hello, > > > > > > I have tried to create rule that will match names "ján" and > > > "jano" (john and

Re: UTF8 character in [] doesn't match

2018-12-24 Thread Henrik K
On Sun, Dec 23, 2018 at 11:11:39PM +, RW wrote: > On Sun, 23 Dec 2018 20:04:28 +0100 > Matus UHLAR - fantomas wrote: > > > Hello, > > > > I have tried to create rule that will match names "ján" and > > "jano" (john and johnny in slovak languages). > > > > I have created rule: > > > > body