I made an observation similar to what was pointed out in this mailing list post here: http://comments.gmane.org/gmane.comp.apache.mahout.user/17819; that TF-IDF vectors do not seem to persist when generating them with normalization enabled.
According to Gokhan Capan: "It seems to have tf-idf vectors later, you need to create tf vectors (DictionaryVectorizer.createTermFrequencyVectors) with logNormalize option set to false, and normPower option set to -1.0f." Is there some reason for this? It would seem useful if they persisted. Can someone explain the reasoning behind them not? I figure there's a perfectly good reason, I just can't seem to figure out what it is.
