Am 14.09.2012 14:53, schrieb Andreas Müller:
Hi Philipp.
Hey Andreas!
First, you should ensure that the features all have approximately the same
scale.
For example they should all be between zero and one - if the LDA features
are much smaller than the other ones, then they will probably not
I'd be interested in the outcome.
Let us know when you get it to work :)
- Ursprüngliche Mail -
Von: Philipp Singer kill...@gmail.com
An: scikit-learn-general@lists.sourceforge.net
Gesendet: Freitag, 14. September 2012 14:00:48
Betreff: Re: [Scikit-learn-general] Combining TFIDF and LDA
2012/9/14 Andreas Müller amuel...@ais.uni-bonn.de:
Hi Philipp.
First, you should ensure that the features all have approximately the same
scale.
For example they should all be between zero and one - if the LDA features
are much smaller than the other ones, then they will probably not be
Have you done tests using some other classifiers such as gradient boosting
which has a kind of internal feature selection?
On Fri, Sep 14, 2012 at 5:36 PM, Andreas Müller amuel...@ais.uni-bonn.dewrote:
I'd be interested in the outcome.
Let us know when you get it to work :)
-
Am 14.09.2012 15:10, schrieb amir rahimi:
Have you done tests using some other classifiers such as gradient
boosting which has a kind of internal feature selection?
Actually not, but I wanted to try that out, if the runtime allows it.
On Fri, Sep 14, 2012 at 5:36 PM, Andreas Müller
Okay, so I did a fast chi2 check and it seems like some LDA features
have high p-values, so they should be helpful at least.
Am 14.09.2012 15:06, schrieb Andreas Müller:
I'd be interested in the outcome.
Let us know when you get it to work :)
- Ursprüngliche Mail -
Von: Philipp
Hey!
Am 14.09.2012 15:10, schrieb Peter Prettenhofer:
I totally agree - I had such an issue in my research as well
(combining word presence features with SVD embeddings).
I followed Blitzer et. al 2006 and normalized** both feature groups
separately - e.g. you could normalize word presence
2012/9/14 Philipp Singer kill...@gmail.com:
Hey!
Am 14.09.2012 15:10, schrieb Peter Prettenhofer:
I totally agree - I had such an issue in my research as well
(combining word presence features with SVD embeddings).
I followed Blitzer et. al 2006 and normalized** both feature groups
Hi folks,
you may have already seen this, but in case you haven't, I'm thrilled
to share that the Python Software Foundation has just created its
newest and highest distinction, the Distinguished Service Award, and
has chosen John as its first recipient:
Hello,
The video of the scikit-learn tutorial from Scipy2012 has (finally) been
posted:
http://www.youtube.com/watch?v=33L_EXLtJPEfeature=plcp
The tutorial material can be found at this site:
http://astroml.github.com/sklearn_tutorial/
Enjoy!
Jake
10 matches
Mail list logo