Kmeans example with space delimited data
----------------------------------------
Key: MAHOUT-551
URL: https://issues.apache.org/jira/browse/MAHOUT-551
Project: Mahout
Issue Type: Improvement
Components: Utils
Affects Versions: 0.4
Reporter: Djellel Eddine Difallah
Priority: Minor
The provided example for Kmeans clustering using the synthetic control data
asks for t1 and t2 measures because it runs the Canopy Driver to determine the
initial clusters. Kmeans originally requires a K variable to generate random
centers from the input data. I propose to add another example in the package
which will serve for any space delimited numerical input to cluster with Kmeans
in its original form and not using Canopy. The modification is quite simple and
is mostly based on the synthetic control Job.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.