I do not have a good way of determining what the distribution of each variable is, and I believe it will change as I get more data from different sources. This means in my mind that there is not currently a good clustering algorithm to use in the mahout framework.
Thanks for the help! -- View this message in context: http://n3.nabble.com/Clustering-approach-tp716743p718711.html Sent from the Mahout User List mailing list archive at Nabble.com.