I have noticed the same behavior described in this mailing list post:
http://comments.gmane.org/gmane.comp.apache.mahout.user/17819

TF-IDF vectors do not seem to be persisted when they are generated with
normalization enabled.

According to Gokhan Capan:

"It seems to have tf-idf vectors later, you need to create tf vectors
(DictionaryVectorizer.createTermFrequencyVectors) with logNormalize option
set to false, and normPower option set to -1.0f."

Is there a reason for this? It would seem useful for the vectors to be
persisted. Can someone explain why they are not? I assume there is a
perfectly good reason; I just can't figure out what it is.
