[ 
https://issues.apache.org/jira/browse/SPARK-1874?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14002535#comment-14002535
 ] 

Xiangrui Meng commented on SPARK-1874:
--------------------------------------

There are three files under `data/`: `kmeans_data.txt`, `lr_data.txt`, and 
`pagerank_data.txt`, while more files under `mllib/data`. It feels more natural 
to me to keep the sample data under `mllib/data`. Anyway, I will create sample 
data first.

> Clean up MLlib sample data
> --------------------------
>
>                 Key: SPARK-1874
>                 URL: https://issues.apache.org/jira/browse/SPARK-1874
>             Project: Spark
>          Issue Type: Bug
>          Components: MLlib
>            Reporter: Matei Zaharia
>             Fix For: 1.0.0
>
>
> - Replace logistic regression example data with linear to make 
> mllib.LinearRegression example easier to run
> - Move files from mllib/data into data/mllib to make them easier to find
> - Add a simple MovieLens data file



--
This message was sent by Atlassian JIRA
(v6.2#6252)

Reply via email to