Space: Apache Mahout (https://cwiki.apache.org/confluence/display/MAHOUT) Page: Top Down Clustering (https://cwiki.apache.org/confluence/display/MAHOUT/Top+Down+Clustering)
Edited by Paritosh Ranjan: --------------------------------------------------------------------- Top Down Clustering Top Down clustering is a type of Hierarchical Clustering. It tries to find bigger clusters first and then does fine grained clustering on these clusters. Hence the name Top Down. Any clustering algorithm can be used to perform the Top Level Clustering and the Bottom Level Clustering. So, all clustering algorithms available in Mahout, other than the MinHash Clustering algorithm ( which is a "Bottom Up" Clustering Algorithm ), are suitable to be used for Top Down Clustering, on both Top Level and Bottom Level. To perform Top Down Clustering, the top level clustering output needs to post processed, in order to identify all top level clusters, and, to group vectors in their respective clusters. So, that, the bottom level clustering can execute on each of them. Change your notification preferences: https://cwiki.apache.org/confluence/users/viewnotifications.action
