Am 30.11.2012 16:58, schrieb Philipp Singer: > Hey again! > > Today is my posting day, hope you don't bother, but I just stumbled upon > a further problem. > > I currently use a grid search strtaifiedkfold approach that works on > textual data. So I use a pipeline that does tfidf vectorization as well. > The thing now is, that I want to append additional features that are not > textual to the feature data. This kind of (but not really) sounds like a job for FeatureUnion: http://scikit-learn.sourceforge.net/dev/modules/pipeline.html#featureunion-combining-feature-extractors
Feature union applies to different transformers to the same input data. But you already start with two kinds of data, right? I guess you could make your data be a list of tuples (text, non-test). Then you would still need a transformer that projects to the components, though. This might not be ideal. Though I have no better idea. Cheers, Andy ------------------------------------------------------------------------------ Keep yourself connected to Go Parallel: TUNE You got it built. Now make it sing. Tune shows you how. http://goparallel.sourceforge.net _______________________________________________ Scikit-learn-general mailing list Scikit-learn-general@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/scikit-learn-general