Hey all, I'm currently working on a research project at UC Berkeley. We would like to understand how the tokenizer interacts with the other modules. Is there any documentation that describes the different modules? We are particularly interested in how an email is represented after it has been tokenized and passed to the learner and classifier. In addition, we would like to isolate the tokenizer from the rest of the system. Any help would be appreciated. Thanks in advance for your response.
Kai Xia