Hi,

Birch does not remember which samples are fed to it in it and it each keeps
track of only the linear sum, squared sum and number of samples. (I think),
so you cannot do this directly.

You can either
1. Reduce the threshold so much to get a large number of subcluster
centers. (Note these are reduced instances of the original data, not the
original data itself) or.
2. Using the distances from of the original data to the subcluster centers,
compute the subclusters yourself. This is done at predict time in Birch.


HTH
------------------------------------------------------------------------------
_______________________________________________
Scikit-learn-general mailing list
Scikit-learn-general@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/scikit-learn-general

Reply via email to