On 04/25/2015 09:18 AM, Luca Puggini wrote:
I think it depends on the role of the sample weight during the construction of the forest. If I set sample_weight = 2 for one of my samples, is this equivalent to duplicating that row in the data?

During fitting, yes; during evaluation, currently not.
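
The fitting half of that answer can be checked on toy data. The following is only a sketch under stated assumptions: it uses a single DecisionTreeClassifier rather than a forest (so that the comparison is deterministic; with bootstrap resampling inside a forest the match would hold only approximately), the data are invented for the illustration, and the depth limit is arbitrary.

```python
import numpy as np
from sklearn.tree import DecisionTreeClassifier

# Toy unbalanced data, invented for this illustration.
rng = np.random.RandomState(0)
X = rng.normal(size=(60, 3))
y = (X[:, 0] + 0.5 * rng.normal(size=60) > 0.8).astype(int)

# Option A: keep the data as is, but give the first sample a weight of 2.
weights = np.ones(len(y))
weights[0] = 2.0
tree_weighted = DecisionTreeClassifier(max_depth=3, random_state=0)
tree_weighted.fit(X, y, sample_weight=weights)

# Option B: physically duplicate the first row and fit without weights.
X_dup = np.vstack([X, X[:1]])
y_dup = np.concatenate([y, y[:1]])
tree_duplicated = DecisionTreeClassifier(max_depth=3, random_state=0)
tree_duplicated.fit(X_dup, y_dup)

# Compare the two fitted trees on fresh points.
X_new = rng.normal(size=(200, 3))
same_probabilities = np.allclose(
    tree_weighted.predict_proba(X_new),
    tree_duplicated.predict_proba(X_new),
)
print("weight of 2 matches a duplicated row:", same_probabilities)
```

The comparison only covers fitting; how the weights are treated when scoring is a separate matter.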

On Fri, Apr 24, 2015 at 10:25 PM, Andreas Mueller <t3k...@gmail.com> wrote:

    The roc_auc scorer will not take sample_weight into account when
    used with cross_val_score.
    Thinking about it, I'm not sure if this is a bug or a feature.
    I'm not sure whether this was discussed before, so I opened an issue:
    https://github.com/scikit-learn/scikit-learn/issues/4632
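
One way to get a weighted ROC AUC regardless of what cross_val_score does is to write the cross-validation loop by hand and pass the weights to both fit and roc_auc_score. This is a minimal sketch, not a recommended recipe: the data and the per-sample weights are made up, and it assumes a recent scikit-learn in which StratifiedKFold lives in sklearn.model_selection.

```python
import numpy as np
from sklearn.ensemble import RandomForestClassifier
from sklearn.metrics import roc_auc_score
from sklearn.model_selection import StratifiedKFold

# Toy unbalanced {0,1} problem, invented for this illustration.
rng = np.random.RandomState(0)
X = rng.normal(size=(300, 4))
y = (X[:, 0] + rng.normal(size=300) > 1.2).astype(int)

# Hypothetical per-sample importance weights (any non-negative values work).
sample_weight = rng.uniform(1.0, 3.0, size=len(y))

unweighted_auc, weighted_auc = [], []
cv = StratifiedKFold(n_splits=5, shuffle=True, random_state=0)
for train_idx, test_idx in cv.split(X, y):
    clf = RandomForestClassifier(n_estimators=100, random_state=0)
    # Weights of the training rows are used while growing the forest.
    clf.fit(X[train_idx], y[train_idx], sample_weight=sample_weight[train_idx])
    scores = clf.predict_proba(X[test_idx])[:, 1]
    # Same predictions, scored without and with the held-out rows' weights.
    unweighted_auc.append(roc_auc_score(y[test_idx], scores))
    weighted_auc.append(
        roc_auc_score(y[test_idx], scores, sample_weight=sample_weight[test_idx])
    )

print("unweighted ROC AUC per fold:", np.round(unweighted_auc, 3))
print("weighted ROC AUC per fold:  ", np.round(weighted_auc, 3))
```

If the weights are constant within each class (for example, weights used only to rebalance the classes), the weighted and unweighted ROC AUC coincide, because the true positive rate and the false positive rate are each normalised within their own class. Weighted evaluation therefore changes the number only when weights vary between samples of the same class.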


    On 04/24/2015 12:29 PM, Luca Puggini wrote:
    Dear all,

    I am quite new to {0,1} classification problems.
    I have an unbalanced dataset and I am using a
    RandomForestMethod on it.

    To evaluate the performance of my estimator I am using the
    cross_val_score function with the roc_auc metric.

    My understanding is that to deal with an unbalanced problem I can
    pass the sample_weight argument to the random forest estimator.

    I do not understand whether I should also pass the sample_weight
    parameter in this case, or whether this will bias the result
    obtained with roc_auc.

    Is there a common way to do that? Do you have any advice?

    Thanks a lot!


    
------------------------------------------------------------------------------
    One dashboard for servers and applications across Physical-Virtual-Cloud
    Widest out-of-the-box monitoring support with 50+ applications
    Performance metrics, stats and reports that give you Actionable Insights
    Deep dive visibility with transaction tracing using APM Insight.
    http://ad.doubleclick.net/ddm/clk/290420510;117567292;y


    _______________________________________________
    Scikit-learn-general mailing list
    Scikit-learn-general@lists.sourceforge.net  
<mailto:Scikit-learn-general@lists.sourceforge.net>
    https://lists.sourceforge.net/lists/listinfo/scikit-learn-general


    




