Re: [Scikit-learn-general] Using TFxIDF with HashingVectorizer

2014-10-09 Thread Lars Buitinck
2014-09-09 3:36 GMT+02:00 Apu Mishra :
> Lars Buitinck writes:
>
>> The way to combine HV and
>> Tfidf is
>>
>> hashing = HashingVectorizer(non_negative=True, norm=None)
>> tfidf = TfidfTransformer()
>> hashing_tfidf = Pipeline([("hashing", hashing), ("tidf", tfidf)])
>>
>
> I notice your use of t

Re: [Scikit-learn-general] Using TFxIDF with HashingVectorizer

2014-09-08 Thread Apu Mishra
Lars Buitinck writes:
> The way to combine HV and
> Tfidf is
>
> hashing = HashingVectorizer(non_negative=True, norm=None)
> tfidf = TfidfTransformer()
> hashing_tfidf = Pipeline([("hashing", hashing), ("tidf", tfidf)])
>

I notice your use of the non_negative option in HashingVectorizer(), whe

Re: [Scikit-learn-general] Using TFxIDF with HashingVectorizer

2013-10-03 Thread Lars Buitinck
2013/10/2 Minkoo :
> I have a question on using HashingVectorizer with TFxIDF. Currently, I'm
> trying to build a model to predict classes for large set of documents.
>
> On the other hand TfIdfVectorizer does not support processing documents in
> batch. It needs to load the entire feature vector i

Re: [Scikit-learn-general] Using TFxIDF with HashingVectorizer

2013-10-03 Thread Olivier Grisel
You can use a Pipeline to combine a TfidfTransformer with the HashingVectorizer.
-- Olivier
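
A minimal sketch of the Pipeline suggestion, for readers who want to try it. It assumes a recent scikit-learn release, where `alternate_sign=False` is the way to keep hashed counts non-negative (older releases used `non_negative=True`, as quoted elsewhere in this thread). The toy corpus and the `n_features` value are made up for illustration.

```python
from sklearn.feature_extraction.text import HashingVectorizer, TfidfTransformer
from sklearn.pipeline import Pipeline

# Toy corpus, invented for this example.
docs = [
    "the cat sat on the mat",
    "the dog chased the cat",
    "hashing keeps memory use fixed",
]

# alternate_sign=False keeps every hashed count non-negative (assumption:
# a scikit-learn version that has this parameter instead of non_negative).
# norm=None leaves raw counts so TfidfTransformer does the weighting and
# the final normalization.
hashing = HashingVectorizer(n_features=2**10, alternate_sign=False, norm=None)
tfidf = TfidfTransformer()
hashing_tfidf = Pipeline([("hashing", hashing), ("tfidf", tfidf)])

# fit_transform hashes the documents, learns IDF weights from them,
# and returns a sparse TF-IDF matrix with one row per document.
X = hashing_tfidf.fit_transform(docs)
print(X.shape)
```

Note that the hashing step is stateless, but the TfidfTransformer step still has to see the data passed to `fit` in order to learn its IDF weights.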

[Scikit-learn-general] Using TFxIDF with HashingVectorizer

2013-10-03 Thread Minkoo
Hi scikit learn. I have a question on using HashingVectorizer with TFxIDF. Currently, I'm trying to build a model to predict classes for a large set of documents. I'm using HashingVectorizer as my data is large. Because I can feed batches of documents to HashingVectorizer, it's a perfect fit for m
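
The batch-friendly behaviour of HashingVectorizer mentioned in this message can be sketched as follows. The batches are hypothetical stand-ins for chunks of a larger corpus, and `n_features=2**10` is an arbitrary choice for the example. Because the vectorizer learns no vocabulary, transforming batches separately should give the same rows as transforming everything in one call.

```python
from sklearn.feature_extraction.text import HashingVectorizer

# Hypothetical batches standing in for chunks read from a large corpus.
batch_1 = ["first document about cats", "second document about dogs"]
batch_2 = ["third document about hashing"]

vectorizer = HashingVectorizer(n_features=2**10)

# No fit() is needed: the hashing trick has no learned state.
X1 = vectorizer.transform(batch_1)
X2 = vectorizer.transform(batch_2)
X_all = vectorizer.transform(batch_1 + batch_2)

# Compare the per-batch rows with the rows from the single pass.
diff_1 = abs(X_all[:2] - X1).sum()
diff_2 = abs(X_all[2:] - X2).sum()
same = bool(diff_1 < 1e-12 and diff_2 < 1e-12)
print(X1.shape, X2.shape, same)
```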