Amedee> The word lengths in Dutch are somewhere between those of English Amedee> and German. Is this a "configurable"?
Not trivially, but it's not too hard either. Look toward the bottom of spambayes/tokenizer.py where there are a couple comparisons of n to 3. I can't quote you the correct chapter and verse because I'm using a version of tokenizer.py modified in just that region and SourceForge appears to be on-the-blink at the moment. It should be fairly easy to understand. Skip _______________________________________________ SpamBayes@python.org http://mail.python.org/mailman/listinfo/spambayes Check the FAQ before asking: http://spambayes.sf.net/faq.html