Hi,
Is there a way (already existing) , whereby we can get the variance about
the mean/cluster center (chosen by clustering algorithm) of all the samples
belonging to that cluster.
If not, is the following approach optimal?
For each cluster label:
For all ids belonging to this cluster label:
Find variance from Sequence[ids] about mean/cluster center

Also, what is the best way to deal with outliers while performing
clustering. I saw that DBSCAN would inherently do that.
------------------------------------------------------------------------------
Everyone hates slow websites. So do we.
Make your web apps faster with AppDynamics
Download AppDynamics Lite for free today:
http://p.sf.net/sfu/appdyn_d2d_feb
_______________________________________________
Scikit-learn-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/scikit-learn-general

Reply via email to