What we really need is a nice utility to take clustered output and maybe label all of the vectors in the training set (and new vectors, if it's either a generative model or one which allows "folding in") with some labels in a Vector wrapper class, and maybe some sort of statistics generating utility, which prints out general data about the clustering (number of points per cluster, how wide they are, what the centroids are or other stuff like that).
This is really something true of all of the clustering classes / jobs, not just Dirichlet. -jake On Tue, Apr 6, 2010 at 10:30 AM, Ted Dunning <ted.dunn...@gmail.com> wrote: > This isn't far from true. I was just thinking something along the same > lines, but phrased a bit differently. > > My thought was that if the concept and output is sooo different, will users > be able to use it even if the dumper is made to work well? > > > On Tue, Apr 6, 2010 at 10:27 AM, Jake Mannix <jake.man...@gmail.com> > wrote: > > > > > Without this final step, this seems very much like an unfinished > feature, > > to the point of being unusable. >