zhengruifeng edited a comment on issue #25829: [SPARK-29144][ML] Binarizer handle sparse vectors incorrectly with negative threshold
URL: https://github.com/apache/spark/pull/25829#issuecomment-532655598

Just noticed that existing ML algorithms deal with sparse datasets differently from scikit-learn: scikit-learn refuses to break the data sparsity and throws an exception, while ML converts the sparse vector to a dense one. I will follow ML's way and update this PR tomorrow.
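To illustrate the two behaviours described above, here is a minimal pure-Python sketch. It is not Spark's or scikit-learn's actual code: the `(size, indices, values)` tuple representation, the function name `binarize_sparse`, and the `on_negative` parameter are all hypothetical and exist only for this example. The point is that with a negative threshold every implicit zero satisfies `0 > threshold` and must become `1.0`, so the result can no longer stay sparse.

```python
from typing import List, Tuple, Union

# Hypothetical sparse representation for this sketch: (size, indices, values).
SparseVec = Tuple[int, List[int], List[float]]


def binarize_sparse(
    vec: SparseVec, threshold: float, on_negative: str = "densify"
) -> Union[SparseVec, List[float]]:
    """Map each entry to 1.0 if it is > threshold, else 0.0."""
    size, indices, values = vec

    if threshold >= 0:
        # Implicit zeros are not > threshold, so they stay 0.0 and the
        # output can remain sparse: keep only the stored entries that pass.
        kept = [i for i, v in zip(indices, values) if v > threshold]
        return (size, kept, [1.0] * len(kept))

    if on_negative == "raise":
        # The "refuse to break sparsity" behaviour described in the comment.
        raise ValueError("cannot binarize sparse input with threshold < 0")

    # The "convert to dense" behaviour: every implicit zero is > threshold,
    # so start from all ones and clear the stored entries that fail.
    dense = [1.0] * size
    for i, v in zip(indices, values):
        if v <= threshold:
            dense[i] = 0.0
    return dense


if __name__ == "__main__":
    v = (5, [1, 3], [2.0, -3.0])
    print(binarize_sparse(v, 1.0))
    print(binarize_sparse(v, -1.0))
```

Note that in this sketch the return type changes with the sign of the threshold (sparse tuple versus dense list); a real implementation would more likely return a single vector type.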
----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
[email protected]

With regards,
Apache Git Services

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]
