Vectorize the movielens 100K dataset for Mahout k-means clustering

Carlos Seminario Sun, 30 Jun 2013 17:31:32 -0700

Hi: I want to vectorize the movielens 100K dataset as a
RandomAccessSparseVector and use it to run Mahout k-means clustering. Has
anyone done this before? If not, any ideas on a how this can be done? (BTW,
movielens dataset contains ~100K records/lines with this format: userid,
itemid, rating, unix time.)


Thanks .. Carlos

Vectorize the movielens 100K dataset for Mahout k-means clustering

Reply via email to