Egor Pakhomov created SPARK-3878:
------------------------------------
Summary: Benchmarks and common tests for mllib algorithm
Key: SPARK-3878
URL: https://issues.apache.org/jira/browse/SPARK-3878
Project: Spark
Issue Type: New Feature
Components: MLlib
Reporter: Egor Pakhomov
There is no common practice among MLlib for testing algorithms: every model
generates it's own random test data. There is no easy extractable test cases
applible to another algorithm. There is no benchmarks for comparing algorithms.
After implementing new algorithm it's very hard to understand how it should be
tested.
Lack of serialization testing: MLlib algorithms don't contain tests which test
that model work after serialization.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]