Re: are there any alternatives to textcat?

2010-07-20 Thread Matus UHLAR - fantomas
On 14.07.10 12:32, Jason Haar wrote: For some weird reason I seem to get a lot of Chinese spam - and even with TextCat enabled, SA is unable to recognise it as Chinese (ie I want to score on X-Spam-Languages:). I've Googled around and it looks like TextCat ceased development some time ago, so

Re: are there any alternatives to textcat?

2010-07-20 Thread Jason Haar
On 07/20/2010 11:36 PM, Matus UHLAR - fantomas wrote: did you set up ok_languages? Yup - in general it does work - it's just that textcat doesn't seem to be able to figure out Chinese from a 5 paragraph email containing nothing but Chinese and about 5 words of English. I had a similar problem

Re: are there any alternatives to textcat?

2010-07-14 Thread Henrik K
On Tue, Jul 13, 2010 at 07:35:36PM -0500, Chris Owen wrote: On Jul 13, 2010, at 7:32 PM, Jason Haar wrote: For some weird reason I seem to get a lot of Chinese spam - and even with TextCat enabled, SA is unable to recognise it as Chinese (ie I want to score on X-Spam-Languages:). I've

RE: are there any alternatives to textcat?

2010-07-14 Thread Giampaolo Tomassoni
It's more of the implementation that needs an update than TextCat algorithm itself. Charset/case awareness: https://issues.apache.org/SpamAssassin/show_bug.cgi?id=6229 Better database: https://issues.apache.org/SpamAssassin/show_bug.cgi?id=4152 Etc.. feel free to chime in.. There is

Re: are there any alternatives to textcat?

2010-07-14 Thread Benny Pedersen
On ons 14 jul 2010 02:32:36 CEST, Jason Haar wrote The idea behind TextCat seems sound, but the only alternative I've found is Google Translator - but sending your emails to it may not be an option ;-) relaycountry maybe ? or if one make a aspell/ispell plugin -- xpoint

Re: are there any alternatives to textcat?

2010-07-13 Thread Chris Owen
On Jul 13, 2010, at 7:32 PM, Jason Haar wrote: For some weird reason I seem to get a lot of Chinese spam - and even with TextCat enabled, SA is unable to recognise it as Chinese (ie I want to score on X-Spam-Languages:). I've Googled around and it looks like TextCat ceased development some