A very good point! (Although augmented and log-average tf both do some kind of normalisation of the tf distribution before IDF weighting.)
_______________________________________________ scikit-learn mailing list scikit-learn@python.org https://mail.python.org/mailman/listinfo/scikit-learn