Am 30.11.2012 17:31, schrieb Andreas Mueller:
> Am 30.11.2012 16:58, schrieb Philipp Singer:
>> Hey again!
>>
>> Today is my posting day, hope you don't bother, but I just stumbled upon
>> a further problem.
>>
>> I currently use a grid search strtaifiedkfold approach that works on
>> textual data. So I use a pipeline that does tfidf vectorization as well.
>> The thing now is, that I want to append additional features that are not
>> textual to the feature data.
> This kind of (but not really) sounds like a job for FeatureUnion:
> http://scikit-learn.sourceforge.net/dev/modules/pipeline.html#featureunion-combining-feature-extractors
>
> Feature union applies to different transformers to the same input data.
> But you already start with two kinds of data, right?
Yep exactly. One with textual data and the other with other kind of 
features.
> I guess you could make your data be a list of tuples (text, non-test).
> Then you would still need a transformer that projects to the components,
> though.
> This might not be ideal.
I thought about building a custom transformer, that I can pass to the 
pipeline that somehow appends the features to train and test data. But 
the problem is, that I don't know exactly which data is used for the 
splits (i.e., with sample). How would you do it with a list of tuples?
>
> Though I have no better idea.
>
> Cheers,
> Andy
Thanks, Philipp
>
>
> ------------------------------------------------------------------------------
> Keep yourself connected to Go Parallel:
> TUNE You got it built. Now make it sing. Tune shows you how.
> http://goparallel.sourceforge.net
> _______________________________________________
> Scikit-learn-general mailing list
> Scikit-learn-general@lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/scikit-learn-general


------------------------------------------------------------------------------
Keep yourself connected to Go Parallel: 
TUNE You got it built. Now make it sing. Tune shows you how.
http://goparallel.sourceforge.net
_______________________________________________
Scikit-learn-general mailing list
Scikit-learn-general@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/scikit-learn-general

Reply via email to