I have all the data required for KMeans in a Dataset<Row> in memory. The standard approach to load this data from a file is spark.read().format("libsvm").load(filename), where the file has data in the format 0 1:0.0 2:0.0 3:0.0. How do I do this from an in-memory dataset that is already present? Any suggestions?

-Tony
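
For reference, one possible way to approach this is sketched below. The libsvm reader produces a "label" column and a "features" vector column, and KMeans reads only the features column (named "features" by default), so an in-memory Dataset<Row> with plain numeric columns can probably be passed to KMeans once those columns are packed into a vector with VectorAssembler. The column names f1, f2, f3, the sample rows, the class name InMemoryKMeans, and the helper withFeatures are all assumptions made up for illustration; the real dataset's schema is not shown in the question.

```java
import java.util.Arrays;
import java.util.List;

import org.apache.spark.ml.clustering.KMeans;
import org.apache.spark.ml.clustering.KMeansModel;
import org.apache.spark.ml.feature.VectorAssembler;
import org.apache.spark.sql.Dataset;
import org.apache.spark.sql.Row;
import org.apache.spark.sql.RowFactory;
import org.apache.spark.sql.SparkSession;
import org.apache.spark.sql.types.DataTypes;
import org.apache.spark.sql.types.StructField;
import org.apache.spark.sql.types.StructType;

public class InMemoryKMeans {

    // Hypothetical helper: packs the given numeric columns into a single
    // vector column named "features", which is the column KMeans reads by default.
    static Dataset<Row> withFeatures(Dataset<Row> data, String[] featureCols) {
        return new VectorAssembler()
                .setInputCols(featureCols)
                .setOutputCol("features")
                .transform(data);
    }

    public static void main(String[] args) {
        SparkSession spark = SparkSession.builder()
                .appName("InMemoryKMeans")
                .master("local[*]")
                .getOrCreate();

        // Stand-in for the Dataset<Row> that is already in memory.
        // Assumed schema: three double columns f1, f2, f3.
        StructType schema = DataTypes.createStructType(new StructField[] {
                DataTypes.createStructField("f1", DataTypes.DoubleType, false),
                DataTypes.createStructField("f2", DataTypes.DoubleType, false),
                DataTypes.createStructField("f3", DataTypes.DoubleType, false)
        });
        List<Row> rows = Arrays.asList(
                RowFactory.create(0.0, 0.0, 0.0),
                RowFactory.create(0.1, 0.1, 0.1),
                RowFactory.create(9.0, 9.0, 9.0),
                RowFactory.create(9.1, 9.1, 9.1)
        );
        Dataset<Row> data = spark.createDataFrame(rows, schema);

        // Add the "features" vector column; no file or libsvm format involved.
        Dataset<Row> features = withFeatures(data, new String[] {"f1", "f2", "f3"});

        // Fit KMeans directly on the in-memory dataset (k = 2 is an arbitrary choice here).
        KMeans kmeans = new KMeans().setK(2).setSeed(1L);
        KMeansModel model = kmeans.fit(features);

        // Adds a "prediction" column holding the cluster index of each row.
        model.transform(features).show();

        spark.stop();
    }
}
```

If the dataset already contains a vector column under a different name, calling setFeaturesCol on KMeans with that name should make the VectorAssembler step unnecessary.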