Amedee> The word lengths in Dutch are somewhere between those of English
    Amedee> and German.  Is this a "configurable"?

Not trivially, but it's not too hard either.  Look toward the bottom of
spambayes/tokenizer.py where there are a couple comparisons of n to 3.  I
can't quote you the correct chapter and verse because I'm using a version of
tokenizer.py modified in just that region and SourceForge appears to be
on-the-blink at the moment.  It should be fairly easy to understand.

Skip
_______________________________________________
SpamBayes@python.org
http://mail.python.org/mailman/listinfo/spambayes
Check the FAQ before asking: http://spambayes.sf.net/faq.html

Reply via email to