Hi Alex,

I am forwarding you this question as I am not sure your are following
the mailing list.

2012/7/25 Kasper Thofte <[email protected]>:
> Hi
>
> I am using the DPGMM for clustering short sequences of integers.
>
> In my application, I need the datapoint that is in some sense closest to the
> cluster mean, for each cluster.
>
> Conforming to the interface of scikit-learn, I opted to use the
> predict_proba(X), where X is the data, then selecting for each component,
> the datum with highest probability.
>
> However, it seems that predict_proba (and apparently also eval(X)) returns
> the arrays of probabilities in decreasing order instead of corresponding to
> the order of the components? Is this really the order of the components?
>
> I am a little confused by this. Can someone clear this issue up?
>
> Kasper

-- 
Olivier
http://twitter.com/ogrisel - http://github.com/ogrisel

------------------------------------------------------------------------------
Live Security Virtual Conference
Exclusive live event will cover all the ways today's security and 
threat landscape has changed and how IT managers can respond. Discussions 
will include endpoint security, mobile security and the latest in malware 
threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/
_______________________________________________
Scikit-learn-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/scikit-learn-general

Reply via email to