Github user KyleLi1985 commented on the issue: https://github.com/apache/spark/pull/22893 I form the final test case for sparse case and dense case on realistic data to test new commit [SparkMLlibTest.txt](https://github.com/apache/spark/files/2561442/SparkMLlibTest.txt) **For Dense case**, we use data from http://archive.ics.uci.edu/ml/datasets/EEG+Steady-State+Visual+Evoked+Potential+Signals and use all the dense data file from this realistic data **Dense case Test Result time cost (milliseconds)** Before Enhance 211878 210845 215375 After Enhance 140827 149282 130691 **For Sparse case**, we use data from http://archive.ics.uci.edu/ml/datasets/Condition+monitoring+of+hydraulic+systems and extract all the sparse data file (PS1, PS2, PS3, PS4, PS5, PS6) from this realistic data **Sparse case Test Result time cost (milliseconds)** Before Enhance 108080 103582 103586 After Enhance 107652 107145 104768
--- --------------------------------------------------------------------- To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org