Amedee> The word lengths in Dutch are somewhere between those of English
Amedee> and German. Is this a "configurable"?
Not trivially, but it's not too hard either. Look toward the bottom of
spambayes/tokenizer.py where there are a couple comparisons of n to 3. I
can't quote you the correct chapter and verse because I'm using a version of
tokenizer.py modified in just that region and SourceForge appears to be
on-the-blink at the moment. It should be fairly easy to understand.
Skip
_______________________________________________
[email protected]
http://mail.python.org/mailman/listinfo/spambayes
Check the FAQ before asking: http://spambayes.sf.net/faq.html