On Tue, Aug 17, 2010 at 3:35 PM, Jeff Eastman <j...@windwardsolutions.com>wrote:

>  Ok, so if a new ClusterClassifier could read a set of Clusters from
> clusters-i storage into memory and do what is now being done in each
> driver/clusterer then this would facilitate the integration?


Absolutely.


> It's just a different slice of the current clusteredPoints output; a vector
> of probabilities for the given input vector and Clusters. Sounds like this
> would not need a command line interface at all, just a Path reference to the
> clusters to be read-in at initialization time.
>

Yes.


> Dirichlet models already implement pdf(Vector) methods which support
> classification from their persistent state. The other types of Cluster
> cannot since their persistent state does not include the DistanceMeasure
> they used. Some further refactoring of Cluster, AbstractCluster and Model
> along the lines I discussed earlier might make all this come together. I
> think the current set of Dirichlet models needs to be cleaned up anyway;
> AbstractClusters look way too much like AsymmetricSampledNormalDistributions
> to ignore the redundancy. I will noodle on this some more and see where it
> takes me.
>

Sounds great.

Reply via email to