I have a file of vectors I formulated in csv format, and I want to use mahout to perform k-means clustering on the vectors in this file.
However, it seems mahout expects the input data to be formatted in a SequenceFile format, and I'm not sure if there's a way to easily do this (are there existing tools?)
