Am 30.11.2012 16:58, schrieb Philipp Singer:
> Hey again!
>
> Today is my posting day, hope you don't bother, but I just stumbled upon
> a further problem.
>
> I currently use a grid search strtaifiedkfold approach that works on
> textual data. So I use a pipeline that does tfidf vectorization as well.
> The thing now is, that I want to append additional features that are not
> textual to the feature data.
This kind of (but not really) sounds like a job for FeatureUnion:
http://scikit-learn.sourceforge.net/dev/modules/pipeline.html#featureunion-combining-feature-extractors

Feature union applies to different transformers to the same input data.
But you already start with two kinds of data, right?
I guess you could make your data be a list of tuples (text, non-test).
Then you would still need a transformer that projects to the components, 
though.
This might not be ideal.

Though I have no better idea.

Cheers,
Andy


------------------------------------------------------------------------------
Keep yourself connected to Go Parallel: 
TUNE You got it built. Now make it sing. Tune shows you how.
http://goparallel.sourceforge.net
_______________________________________________
Scikit-learn-general mailing list
Scikit-learn-general@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/scikit-learn-general

Reply via email to