Hi Christopher,

The data is in HDFS (Hadoop-serialized, compressed files, so no parsing), and the HDFS nodes are also the Spark nodes. There is a bit of shuffling at the beginning and then at each iteration.

Good idea, I'll share it ASAP. It's still a bit dirty: I rewrote everything last Sunday, and I'm also trying alternative approaches to the matrix computations (specifically, I'm trying netlib-java, which has an MKL binding, to see if it helps), so it's not ready for a PR yet. I'll clean it up and send it to the list.

Thanks,
Guillaume
Re: DoubleMatrix vs Array[Array[Double]] : Question about debugging serialization performance issues

