Re: [Scikit-learn-general] IDF formula

2012-07-04 Thread Olivier Grisel
2012/7/4 Frédérique Passot:
> Hi,
>
> I am trying to use TFIDF weighting to extract significant keywords from
> a corpus of texts (and later to compute cosine similarity between texts).
>
> For testing purposes, I am not doing any stopword filtering prior to
> vectorizing my data. I am consistentl

[Scikit-learn-general] IDF formula

2012-07-03 Thread Frédérique Passot
Hi,

I am trying to use TFIDF weighting to extract significant keywords from a corpus of texts (and later to compute cosine similarity between texts).

For testing purposes, I am not doing any stopword filtering prior to vectorizing my data. I am consistently getting unexpected results, with the
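
For readers following the thread, here is a minimal sketch of the workflow described in the message: TF-IDF weighting to rank keywords, then cosine similarity between texts. It is written in plain Python and is not the scikit-learn implementation. The helper names (`tfidf_vectors`, `cosine`), the toy corpus, and the unsmoothed textbook formula idf = log(N / df) are all assumptions made for illustration; library implementations may smooth or offset the IDF term, so their weights can differ from these.

```python
import math
from collections import Counter


def tfidf_vectors(docs):
    """Return one {term: weight} dict per document, using tf * log(N / df).

    Assumption: whitespace tokenization and the unsmoothed textbook IDF.
    """
    tokenized = [doc.lower().split() for doc in docs]
    n_docs = len(tokenized)
    # Document frequency: number of documents that contain each term.
    df = Counter(term for tokens in tokenized for term in set(tokens))
    vectors = []
    for tokens in tokenized:
        tf = Counter(tokens)  # raw term counts within this document
        vectors.append({t: tf[t] * math.log(n_docs / df[t]) for t in tf})
    return vectors


def cosine(a, b):
    """Cosine similarity between two sparse {term: weight} vectors."""
    dot = sum(w * b.get(t, 0.0) for t, w in a.items())
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


# Hypothetical toy corpus, with no stopword filtering.
docs = [
    "the cat sat on the mat",
    "the dog sat on the log",
    "the cat chased the dog",
]
vectors = tfidf_vectors(docs)
# Two highest-weighted terms per document.
top_terms = [sorted(v, key=v.get, reverse=True)[:2] for v in vectors]
similarity = cosine(vectors[0], vectors[1])
```

With this unsmoothed variant, a term that occurs in every document (here "the") gets weight zero, because log(N / N) = 0. A smoothed variant would leave such terms with a nonzero weight, which is one reason keyword rankings can differ between formulas when no stopword filtering is applied.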