Hi, I am not aware of tool that directly takes csv files. However you could convert csv to arff files by just adding header in required format and then use
"mahout arff.vector" to convert arff to sequence file which could be used in canopy/kmeans. Thanks, Rajesh On Wed, Jun 12, 2013 at 3:41 PM, Neetha <[email protected]> wrote: > Hi, > > > I am using 1m movielens. > > I need to run the K-means clustering using mahout and hadoop. Actually, > 1st step in the clustering is to convert into a sequence file, then into > vector format and then apply the clustering algorithm. My doubt is, Is > there any need to convert the movielens rating.csv file into a sequence > file. If needed what are the commands for applying clustering technique > using mahout and the hadoop. > > Thanking you, > Neetha Suan Thampi >
