[
https://issues.apache.org/jira/browse/SPARK-1874?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14002535#comment-14002535
]
Xiangrui Meng commented on SPARK-1874:
--------------------------------------
There are three files under `data/`: `kmeans_data.txt`, `lr_data.txt`, and
`pagerank_data.txt`, while more files under `mllib/data`. It feels more natural
to me to keep the sample data under `mllib/data`. Anyway, I will create sample
data first.
> Clean up MLlib sample data
> --------------------------
>
> Key: SPARK-1874
> URL: https://issues.apache.org/jira/browse/SPARK-1874
> Project: Spark
> Issue Type: Bug
> Components: MLlib
> Reporter: Matei Zaharia
> Fix For: 1.0.0
>
>
> - Replace logistic regression example data with linear to make
> mllib.LinearRegression example easier to run
> - Move files from mllib/data into data/mllib to make them easier to find
> - Add a simple MovieLens data file
--
This message was sent by Atlassian JIRA
(v6.2#6252)