Re: [Scikit-learn-general] Inspection of Classifications

Andreas Mueller Mon, 24 Aug 2015 14:52:25 -0700

I think these are really easy to write for a single use case, and hard to make generally useful. Why do you think pipelines make it hard? You know you can extract the estimators from the steps, right?


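def feature_importances_pipeline(pipe):
    # Assumes a two-step pipeline: feature extractor, then linear model.
    extractor = pipe.steps[0][1]
    linear_model = pipe.steps[1][1]
    # get_feature_names() was renamed get_feature_names_out() in later releases.
    # ravel() flattens the (1, n_features) coef_ of a binary classifier.
    return dict(zip(extractor.get_feature_names(), linear_model.coef_.ravel()))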
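
For reference, here is a minimal, self-contained sketch of the same idea on a toy text pipeline. The helper name named_weights, the step names "vec" and "clf", and the four example documents are made up for illustration and are not from the thread. It assumes a two-step pipeline with a binary linear classifier, and it falls back to get_feature_names() when the extractor has no get_feature_names_out().

```python
from sklearn.feature_extraction.text import CountVectorizer
from sklearn.linear_model import LogisticRegression
from sklearn.pipeline import Pipeline


def named_weights(pipe):
    """Map feature names to weights for an (extractor, linear model) pipeline.

    Hypothetical helper: assumes exactly two steps and a binary classifier.
    """
    extractor = pipe.steps[0][1]
    linear_model = pipe.steps[1][1]
    # Newer scikit-learn releases expose get_feature_names_out();
    # older ones (as in this thread) only have get_feature_names().
    if hasattr(extractor, "get_feature_names_out"):
        names = extractor.get_feature_names_out()
    else:
        names = extractor.get_feature_names()
    # coef_ has shape (1, n_features) for a binary classifier; flatten it.
    weights = linear_model.coef_.ravel()
    return dict(zip(names, weights))


# Toy data, invented for this sketch.
docs = ["good great fine", "great good", "bad awful", "awful bad poor"]
labels = [1, 1, 0, 0]

pipe = Pipeline([("vec", CountVectorizer()), ("clf", LogisticRegression())])
pipe.fit(docs, labels)

weights = named_weights(pipe)
# Print features from most negative to most positive weight.
for name, weight in sorted(weights.items(), key=lambda kv: kv[1]):
    print(f"{name:>6s} {weight:+.3f}")
```

Longer or nested pipelines (e.g. a FeatureUnion or a feature selection step in the middle) would need extra handling, which is part of why a fully general version is harder to write.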



On 08/20/2015 02:47 AM, Christoph Sawade wrote:

Hey there!
Recently I have used the linear models and the feature extraction pipeline a lot. During feature engineering, I repeatedly ran into situations in which I wanted to inspect the classifications and understand which features had the highest influence; this is especially important if one uses different feature sources. In my case, I found it very helpful to have one inspection for the model (independent of the instance) and one for the prediction of a single example that gives some insight into the weights. E.g., an inspection function for a binary linear classifier could return a list of (feature name, weight) pairs sorted by weight:
mdl = LogisticRegression()
# training, tuning,...
inspect(mdl)
[(<sourcename_featname>, <weight>),...]

predict_and_inspect(mdl, example)
{
    'probability': <prob_of_positive_class>,
    'inspection': [(<sourcename_featname>, <model_weight*feature_weight>),...],
    'label': <ground_truth_label>,
    'prediction': <predicted_label>,
    'example_id': <id>
}
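
A rough sketch of what these two functions might look like for a binary linear classifier is given below. This is only one possible reading of the proposal, not an existing scikit-learn API. The extra feature_names argument is an assumption (a fitted model alone does not know the names), as are the label and example_id keyword arguments, the dense feature vector, the 0/1 label encoding with class 1 as the positive class, and the toy data.

```python
import numpy as np
from sklearn.linear_model import LogisticRegression


def inspect(mdl, feature_names):
    """Return (feature name, weight) pairs sorted by weight (hypothetical helper)."""
    weights = mdl.coef_.ravel()  # binary model: coef_ has shape (1, n_features)
    return sorted(zip(feature_names, weights), key=lambda pair: pair[1])


def predict_and_inspect(mdl, feature_names, x, label=None, example_id=None):
    """Predict one dense example and report per-feature contributions."""
    x = np.asarray(x, dtype=float).ravel()
    row = x.reshape(1, -1)
    # Contribution of each feature: model weight * feature value.
    contributions = mdl.coef_.ravel() * x
    return {
        # Column 1 is the probability of mdl.classes_[1], assumed positive.
        "probability": float(mdl.predict_proba(row)[0, 1]),
        "inspection": sorted(zip(feature_names, contributions),
                             key=lambda pair: pair[1]),
        "label": label,
        "prediction": mdl.predict(row)[0],
        "example_id": example_id,
    }


# Toy data, invented for this sketch: one feature per (made-up) source.
names = ["src_a_good", "src_b_bad"]
X = np.array([[1, 0], [2, 0], [0, 1], [0, 2]], dtype=float)
y = np.array([1, 1, 0, 0])

mdl = LogisticRegression().fit(X, y)
model_view = inspect(mdl, names)
result = predict_and_inspect(mdl, names, [2, 0], label=1, example_id=42)
print(model_view)
print(result)
```

The contributions sum (together with the intercept) to the decision value of the example, which is what makes this view useful for a linear model; it would not carry over directly to kernel machines or multi-class models.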
The pipeline framework encapsulates the whole feature encoding, which is 
typically very convenient. However, it is very hard to map the learned weights 
back to the actual feature names.
Is there an easy way to do that, or are there already initiatives to build something like this? If not, do you think that this makes sense in general? I know that this is tough to generalize over all possible model classes (linear vs. kernel machines, number of classes, ...), but I think it is worth trying, since it is necessary to iterate on a predictive model in practice.
Cheers, Christoph


------------------------------------------------------------------------------


_______________________________________________
Scikit-learn-general mailing list
Scikit-learn-general@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/scikit-learn-general
