On Thu, 19 Nov 2009 11:12:33 +0100 Alexander Prinsier <aphe...@mailhaven.com> wrote:
> On 11/19/2009 01:45 AM, Stevan Bajić wrote: > >>>> There's also IBM's ITU (open source library) if you need something > >>>> heavier. > > > Alexander. What would you say about adding ICU and that character handling > > into DSPAM? You seem to be capable to do it. Would be a nice thing to do. I > > would not mind if you would take that task :) > > I think the performance penalty would be too big. I mean, I just say all > asian email is spam. That's so much more efficient *and* accurate than > trying to use ICU or something like it. > You! This might be true for you but it's not about you or me. It's about the users using DSPAM. And I don't think that we can say that no Asian user is using DSPAM as their Spam filter. > I don't think there are many asian people using dspam either... So this > is a feature nobody is waiting for I think. > Ahh... I would not count on that. Let me quickly check my bookmarks. I remember having bookmarked a bunch of links to a mailserver how-to in Asian that uses DSPAM. Give me a sec.... Yes. This here: http://www.extmail.org/ Their forum is full of articles about how to use DSPAM. For example: http://www.extmail.org/forum/archiver/tid-10393.html And btw there are users using Asian language and DSPAM: http://www.mail-archive.com/dspam-us...@lists.nuclearelephant.com/msg00191.html And regarding "...many Asian people using DSPAM...": No wonder they are not using DSPAM. It's broken for them. Terribly broken. Why should/would they use something that broken? > But sure, in theory we can support those languages by using ICU, and > using that to either tell us the word boundary or to tell us there are none. > IMHO it would be the right thing to do. DSPAM is about statistics and if I apply that on this topic then I can say that +/- 20% of all possible DSPAM users on this planet are using Chinese as their primary language. That is a significant number! We should not ignore them. Just because we have another culture and another language and are using letters instead of symbols does not mean that we are better then they are. A DSPAM user is a DSPAM user. They are our friends. The community part in "DSPAM community project" has a value/meaning. Or not? > Alexander > Steve ------------------------------------------------------------------------------ Let Crystal Reports handle the reporting - Free Crystal Reports 2008 30-Day trial. Simplify your report design, integration and deployment - and focus on what you do best, core application coding. Discover what's new with Crystal Reports now. http://p.sf.net/sfu/bobj-july _______________________________________________ Dspam-devel mailing list Dspam-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dspam-devel