GitHub user BryanCutler opened a pull request:
https://github.com/apache/spark/pull/23056
[SPARK-26034][PYTHON][TESTS] Break large mllib/tests.py file into smaller
files
## What changes were proposed in this pull request?
This PR breaks the large mllib/tests.py file, which contains all Python
MLlib unit tests, into several smaller test files that are easier to read
and maintain.
The tests are broken down as follows:
```
pyspark
├── __init__.py
...
├── mllib
│   ├── __init__.py
...
│   ├── tests
│   │   ├── __init__.py
│   │   ├── test_algorithms.py
│   │   ├── test_feature.py
│   │   ├── test_linalg.py
│   │   ├── test_stat.py
│   │   ├── test_streaming_algorithms.py
│   │   └── test_util.py
...
├── testing
...
│   ├── mllibutils.py
...
```
## How was this patch tested?
Ran the tests manually, module by module, to confirm that the test count
was unchanged, and ran `python/run-tests --modules=pyspark-mllib` to verify
that all tests pass with Python 2.7 and Python 3.6. Also installed scipy so
that the optional tests in test_linalg were included.
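The test-count check described above could be sketched roughly as follows. This is only an illustration using the standard `unittest` loader, not the procedure actually used for this PR; the `LinalgTests` and `StatTests` classes and the `count_tests` helper are hypothetical stand-ins for the real MLlib test classes.

```python
import unittest


class LinalgTests(unittest.TestCase):
    """Hypothetical stand-in for test cases that moved to test_linalg.py."""

    def test_dot(self):
        self.assertEqual(sum(a * b for a, b in zip([1, 2], [3, 4])), 11)

    def test_abs(self):
        self.assertEqual(abs(-3), 3)


class StatTests(unittest.TestCase):
    """Hypothetical stand-in for test cases that moved to test_stat.py."""

    def test_mean(self):
        self.assertEqual(sum([1, 2, 3]) / 3, 2)


def count_tests(*case_classes):
    """Count the test methods the unittest loader discovers in the given classes."""
    loader = unittest.TestLoader()
    suite = unittest.TestSuite(
        loader.loadTestsFromTestCase(cls) for cls in case_classes
    )
    return suite.countTestCases()


# Count as if everything still lived in one file, then per split file.
count_before = count_tests(LinalgTests, StatTests)
count_after = count_tests(LinalgTests) + count_tests(StatTests)

if count_before != count_after:
    raise SystemExit("test count changed: %d -> %d" % (count_before, count_after))
print("test count unchanged: %d" % count_after)
```

Comparing counts this way only shows that no test was dropped or duplicated during the move; it does not replace running the suite itself.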
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/BryanCutler/spark python-test-breakup-mllib-SPARK-26034
Alternatively you can review and apply these changes as the patch at:
https://github.com/apache/spark/pull/23056.patch
To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:
This closes #23056
----
commit 2759521df7f2dffc9ddb9379e0b1dac6721da366
Author: Bryan Cutler <cutlerb@...>
Date: 2018-11-16T03:01:22Z
separated mllib tests
----