GitHub user BryanCutler opened a pull request:

    https://github.com/apache/spark/pull/23056

    [SPARK-26034][PYTHON][TESTS] Break large mllib/tests.py file into smaller 
files

    ## What changes were proposed in this pull request?
    
    This PR breaks down the large mllib/tests.py file that contains all Python 
MLlib unit tests into several smaller test files to be easier to read and 
maintain.
    
    The tests are broken down as follows:
    ```
    pyspark
    ├── __init__.py
    ...
    ├── mllib
    │   ├── __init__.py
    ...
    │   ├── tests
    │   │   ├── __init__.py
    │   │   ├── test_algorithms.py
    │   │   ├── test_feature.py
    │   │   ├── test_linalg.py
    │   │   ├── test_stat.py
    │   │   ├── test_streaming_algorithms.py
    │   │   └── test_util.py
    ...
    ├── testing
    ...
    │   ├── mllibutils.py
    ...
    ```
    
    ## How was this patch tested?
    
    Ran tests manually by module to ensure test count was the same, and ran 
`python/run-tests --modules=pyspark-mllib` to verify all passing with Python 
2.7 and Python 3.6. Also installed scipy to include optional tests in 
test_linalg.

You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/BryanCutler/spark 
python-test-breakup-mllib-SPARK-26034

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/spark/pull/23056.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #23056
    
----
commit 2759521df7f2dffc9ddb9379e0b1dac6721da366
Author: Bryan Cutler <cutlerb@...>
Date:   2018-11-16T03:01:22Z

    separated mllib tests

----


---

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to