On Thu, May 25, 2006 21:38, [EMAIL PROTECTED] said: > > Amedee> The word lengths in Dutch are somewhere between those of > English > Amedee> and German. Is this a "configurable"? > > Not trivially, but it's not too hard either. Look toward the bottom of > spambayes/tokenizer.py where there are a couple comparisons of n to 3. I > can't quote you the correct chapter and verse because I'm using a version > of > tokenizer.py modified in just that region and SourceForge appears to be > on-the-blink at the moment. It should be fairly easy to understand. > > Skip > > OK, I'll unleash my vi-fu and give it a try.
-- Disclaimer: By sending an email to ANY of my addresses you are agreeing that: 1. I am by definition, "the intended recipient" 2. All information in the email is mine to do with as I see fit and make such financial profit, political mileage, or good joke as it lends itself to. In particular, I may quote it on usenet. 3. I may take the contents as representing the views of your company. 4. This overrides any disclaimer or statement of confidentiality that may be included on your message. _______________________________________________ [email protected] http://mail.python.org/mailman/listinfo/spambayes Check the FAQ before asking: http://spambayes.sf.net/faq.html
