Re: Dataset announcement

2015-04-15 Thread Simon Edelhaus
to assess and push further the > > scalability of Spark and MLlib. > > > > Cheers, > > Olivier > > > > > > > > -- > > View this message in context: > http://apache-spark-user-list.1001560.n3.nabble.com/Dataset-announcement-tp22507.html &

Re: Dataset announcement

2015-04-15 Thread Krishna Sankar
them taking millions of values. > - 4B rows > > Hopefully this dataset will be useful to assess and push further the > scalability of Spark and MLlib. > > Cheers, > Olivier > > > > -- > View this message in context: > http://apache-spark-user-list.1001560.n3.nabble.

Re: Dataset announcement

2015-04-15 Thread Matei Zaharia
> > Cheers, > Olivier > > > > -- > View this message in context: > http://apache-spark-user-list.1001560.n3.nabble.com/Dataset-announcement-tp22507.html > Sent from the Apache Spark User List mailing list archive at Nabble.com. > > -

Dataset announcement

2015-04-15 Thread Olivier Chapelle
Hopefully this dataset will be useful to assess and push further the scalability of Spark and MLlib. Cheers, Olivier -- View this message in context: http://apache-spark-user-list.1001560.n3.nabble.com/Dataset-announcement-tp22507.html Sent from the Apache Spark User List mailing list archive at