Re: Bayes Stopword

2023-12-28 Thread giovanni
Config line produces a syntax error for me: config: failed to parse line in /etc/mail/spamassassin/local.cf (line 1): bayes_stopword_th Could you share the word list in utf8 ? I tried adding "บาท" to https://raw.githubusercontent.com/stopwords-iso/stopwords-th/master/stopwords-th.txt and it

Re: missing something in new SA config

2023-12-28 Thread AJ Weber
what is in the /etc/mail/spamassassin/.razor/razor-agent.conf ? debuglevel = 3 identity   = identity ignorelist = 0 listfile_catalogue = servers.catalogue.lst listfile_discovery = servers.discovery.lst listfile_nomination    =

Re: Bayes Stopword

2023-12-28 Thread Jimmy
bayes_stopword_th https://pastebin.pl/view/0838138d Sample mail https://pastebin.pl/view/e5a2c5b8 Jimmy On Thu, Dec 28, 2023 at 10:59 PM wrote: > Could you share a config line and a sample you are using ? > Giovanni > > On 12/28/23 16:26, Jimmy wrote: > > Yes, I have done that, and I am

Re: Bayes Stopword

2023-12-28 Thread giovanni
Could you share a config line and a sample you are using ? Giovanni On 12/28/23 16:26, Jimmy wrote: Yes, I have done that, and I am also editing Plugin/Bayes.pm to investigate why it is not being skipped. I suspect that if words are not separated by spaces, longer words may not match those

Re: Bayes Stopword

2023-12-28 Thread Jimmy
Yes, I have done that, and I am also editing Plugin/Bayes.pm to investigate why it is not being skipped. I suspect that if words are not separated by spaces, longer words may not match those patterns. Jimmy On Thu, Dec 28, 2023 at 10:13 PM wrote: > "spamassassin -D bayes" will tell you, you

Re: Bayes Stopword

2023-12-28 Thread giovanni
"spamassassin -D bayes" will tell you, you should see a line like: bayes: skipped token 'from' because it's in stopword list for language 'en' Giovanni On 12/28/23 15:45, Jimmy wrote: The pattern has successfully passed the test script, but it needs to check whether Bayes learning will

Re: Bayes Stopword

2023-12-28 Thread Jimmy
The pattern has successfully passed the test script, but it needs to check whether Bayes learning will identify and possibly exclude the word from matching this pattern. Thank you. On Thu, Dec 28, 2023 at 9:22 PM wrote: > On 12/28/23 12:59, Jimmy wrote: > > Hi, > > > > I'm seeking assistance

Re: Bayes Stopword

2023-12-28 Thread giovanni
On 12/28/23 12:59, Jimmy wrote: Hi, I'm seeking assistance in incorporating a stopword for Asian languages in Unicode. Although I possess comprehensive word lists, my attempts to generate a regex pattern and test it have been unsuccessful; the pattern fails to match or skips tokens in the

Bayes Stopword

2023-12-28 Thread Jimmy
Hi, I'm seeking assistance in incorporating a stopword for Asian languages in Unicode. Although I possess comprehensive word lists, my attempts to generate a regex pattern and test it have been unsuccessful; the pattern fails to match or skips tokens in the newly added stopword list. I created