Re: why is spark + scala code so slow, compared to python?

2014-12-12 Thread rzykov
Try this https://github.com/RetailRocket/SparkMultiTool <https://github.com/RetailRocket/SparkMultiTool> This loader solved slow reading of a big data set of small files in hdfs. -- View this message in context: http://apache-spark-user-list.1001560.n3.nabble.com/why-is-spark-scala-c

Re: why is spark + scala code so slow, compared to python?

2014-12-11 Thread Andy Wagner
>>>> operations with breeze/blas, etc. i saw some improvements, but it's >>>> still a >>>> lot slower than my python code. >>>> >>>> why is that? >>>> >>>> how do you improve your spark

Re: why is spark + scala code so slow, compared to python?

2014-12-11 Thread Sean Owen
how do you improve your spark + scala performance today? >>> >>> or is spark + scala just not the right tool for small to medium datasets? >>> >>> when would you use spark + scala vs. python? >>> >>> thanks! >>> >>> >

Re: why is spark + scala code so slow, compared to python?

2014-12-11 Thread Duy Huynh
>>> why is that? >>> >>> how do you improve your spark + scala performance today? >>> >>> or is spark + scala just not the right tool for small to medium datasets? >>> >>> when would you

Re: why is spark + scala code so slow, compared to python?

2014-12-11 Thread Duy Huynh
gt;> or is spark + scala just not the right tool for small to medium datasets? >> >> when would you use spark + scala vs. python? >> >> thanks! >> >> >> >> -- >> View this message in con

Re: why is spark + scala code so slow, compared to python?

2014-12-11 Thread Natu Lauchande
asets? > > when would you use spark + scala vs. python? > > thanks! > > > > -- > View this message in context: > http://apache-spark-user-list.1001560.n3.nabble.com/why-is-spark-scala-code-so-slow-compared-

why is spark + scala code so slow, compared to python?

2014-12-11 Thread ll
r spark + scala performance today? or is spark + scala just not the right tool for small to medium datasets? when would you use spark + scala vs. python? thanks! -- View this message in context: http://apache-spark-user-list.1001560.n3.nabble.com/why-is-spark-scala-code-so-slow-compared-to-pyth