Kmeans example with space delimited data
----------------------------------------

                 Key: MAHOUT-551
                 URL: https://issues.apache.org/jira/browse/MAHOUT-551
             Project: Mahout
          Issue Type: Improvement
          Components: Utils
    Affects Versions: 0.4
            Reporter: Djellel Eddine Difallah
            Priority: Minor


The provided example of Kmeans clustering on the synthetic control data 
asks for the t1 and t2 distance thresholds because it runs the Canopy Driver to 
determine the initial clusters. In its original form, Kmeans instead takes a 
parameter K and picks K random centers from the input data. I propose adding 
another example to the package that clusters any space-delimited numerical 
input with Kmeans in that original form, without Canopy. The modification is 
quite simple and is based mostly on the synthetic control Job.
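
To make the proposal concrete, here is a minimal in-memory sketch of the idea:
parse space-delimited numeric lines into vectors, pick K random input points as
the initial centers, and iterate. It is not the Mahout job and does not use the
Mahout API; the class name SpaceDelimitedKMeans, its method names and the seed
parameter are all hypothetical and chosen only for illustration. It assumes
every line has the same number of values.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Collections;
import java.util.List;
import java.util.Random;

/** Hypothetical stand-alone sketch; not part of Mahout and not a MapReduce job. */
public class SpaceDelimitedKMeans {

    /** Parses lines such as "1.0 2.5 3" into vectors; blank lines are skipped. */
    static double[][] parse(List<String> lines) {
        List<double[]> points = new ArrayList<>();
        for (String line : lines) {
            String trimmed = line.trim();
            if (trimmed.isEmpty()) {
                continue;
            }
            String[] tokens = trimmed.split("\\s+");
            double[] vector = new double[tokens.length];
            for (int i = 0; i < tokens.length; i++) {
                vector[i] = Double.parseDouble(tokens[i]);
            }
            points.add(vector);
        }
        return points.toArray(new double[0][]);
    }

    /** Picks k distinct input points as initial centers (no Canopy, no t1/t2). */
    static double[][] randomCenters(double[][] points, int k, long seed) {
        List<Integer> indices = new ArrayList<>();
        for (int i = 0; i < points.length; i++) {
            indices.add(i);
        }
        Collections.shuffle(indices, new Random(seed));
        double[][] centers = new double[k][];
        for (int c = 0; c < k; c++) {
            centers[c] = points[indices.get(c)].clone();
        }
        return centers;
    }

    static double squaredDistance(double[] a, double[] b) {
        double sum = 0.0;
        for (int i = 0; i < a.length; i++) {
            double d = a[i] - b[i];
            sum += d * d;
        }
        return sum;
    }

    /** Runs Lloyd iterations and returns the cluster index of each point. */
    static int[] cluster(double[][] points, int k, int maxIterations, long seed) {
        if (k < 1 || k > points.length) {
            throw new IllegalArgumentException("k must be between 1 and the number of points");
        }
        double[][] centers = randomCenters(points, k, seed);
        int[] assignment = new int[points.length];
        Arrays.fill(assignment, -1);
        int dim = points[0].length; // assumption: all vectors share this dimension
        for (int iter = 0; iter < maxIterations; iter++) {
            // Assignment step: attach every point to its nearest center.
            boolean changed = false;
            for (int p = 0; p < points.length; p++) {
                int best = 0;
                for (int c = 1; c < k; c++) {
                    if (squaredDistance(points[p], centers[c])
                            < squaredDistance(points[p], centers[best])) {
                        best = c;
                    }
                }
                if (assignment[p] != best) {
                    assignment[p] = best;
                    changed = true;
                }
            }
            if (!changed) {
                break; // converged: no point switched cluster
            }
            // Update step: move each center to the mean of its members.
            double[][] sums = new double[k][dim];
            int[] counts = new int[k];
            for (int p = 0; p < points.length; p++) {
                counts[assignment[p]]++;
                for (int i = 0; i < dim; i++) {
                    sums[assignment[p]][i] += points[p][i];
                }
            }
            for (int c = 0; c < k; c++) {
                if (counts[c] == 0) {
                    continue; // an empty cluster keeps its previous center
                }
                for (int i = 0; i < dim; i++) {
                    centers[c][i] = sums[c][i] / counts[c];
                }
            }
        }
        return assignment;
    }

    public static void main(String[] args) {
        List<String> lines = Arrays.asList("0 0", "1 1", "100 100", "101 101");
        // Prints one cluster index per input line; the labels depend on the seed.
        System.out.println(Arrays.toString(cluster(parse(lines), 2, 10, 42L)));
    }
}
```

The only clustering parameter the caller supplies is K (plus an iteration cap
and a seed for reproducibility), which is the point of the proposal: no t1 or
t2 has to be chosen. A real example in the package would presumably read the
input from files and run as a job rather than in memory.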

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
