We probably should have ClusterDumper still handle Dirichlet jobs, so that users don't need to deal w/ more than one interface.
On Jan 26, 2010, at 11:25 PM, Jeff Eastman wrote: > Hi Jerry, > > DirichletClusters are not similar enough to ClusterBase to make that > workable, so you are correct that the utility won't dump them. Writing a dump > utility that can is a great idea, though it does tend to be rather Model > specific. Maybe Models should have some printable representation a-la > asFormatString(). > > Look at the code in > > /MahoutTrunk/utils/src/test/java/org/apache/mahout/clustering/dirichlet/TestL1ModelClustering.java > /MahoutTrunk/examples/src/main/java/org/apache/mahout/clustering/dirichlet/DisplayOutputState.java > > for ideas on how you might be able to dump out your DirichletClusters and > their Models. > > I've actually considered making ClusterBase into a Model and generalizing > DirichletCluster to be the root of all clusters. I think the distance > measures used by canopy and k-means could be cast as Model pdfs but the whole > idea is still only half-baked. > > Jeff > > Jerry Ye wrote: >> I'm trying to view the output of my experiment using Dirichlet Process >> Clustering. When attempting to use the ClusterDumper utility on the output >> directory, an exception is thrown. Upon looking closer, DirichletCluster >> does not extend ClusterBase. The error is below. >> >> Is there some other way that I can view the cluster labels? >> >> Thanks! >> >> - jerry >> >> -bash-3.1$ java -cp >> mahout-core-0.3-SNAPSHOT.jar:mahout-utils-0.3-SNAPSHOT.jar:$( echo >> dependency/*.jar . | sed 's/ /:/g') >> org.apache.mahout.utils.clustering.ClusterDumper -s mahoutout/state-0 >> Input Path: /homes/jerryye/mahout/mahoutout/state-0/part-0 >> Exception in thread "main" java.lang.ClassCastException: >> org.apache.mahout.clustering.dirichlet.DirichletCluster cannot be cast to >> org.apache.mahout.clustering.ClusterBase >> at >> org.apache.mahout.utils.clustering.ClusterDumper.printClusters(ClusterDumper.java:119) >> at >> org.apache.mahout.utils.clustering.ClusterDumper.main(ClusterDumper.java:251) >> >> > -------------------------- Grant Ingersoll http://www.lucidimagination.com/ Search the Lucene ecosystem using Solr/Lucene: http://www.lucidimagination.com/search
