GitHub user WeichenXu123 opened a pull request:

    https://github.com/apache/spark/pull/19110

    [SPARK-21027][ML][PYTHON] Added tunable parallelism to one vs. rest in both 
Scala mllib and Pyspark

    ## What changes were proposed in this pull request?
    
    Added tunable parallelism to the pyspark implementation of one vs. rest 
classification. Added a parallelism parameter to the Scala implementation of 
one vs. rest along with functionality for using the parameter to tune the level 
of parallelism.
    
    I take this PR #18281 over because the original author is busy but we need 
merge this PR soon.
    
    ## How was this patch tested?
    
    Test suite added.

You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/WeichenXu123/spark spark-21027

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/spark/pull/19110.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #19110
    
----
commit b69f201bc51f8de87adc3869d4843e3df6750972
Author: Ajay Saini <[email protected]>
Date:   2017-06-12T21:46:28Z

    Added tunable parallelism to the pyspark implementation of one vs. rest 
classification. Added a parallelism parameter to the scala implementation of 
one vs. rest for python persistence but have not yet used it to tune the scala 
parallelism implementation.

commit e750d3ee8eaf75ed8732f5cf1a1904acb1666793
Author: Ajay Saini <[email protected]>
Date:   2017-06-12T22:35:20Z

    Fixed python style.

commit 81d458be99cf4f195b761eaa9bcb48ea086cdf61
Author: Ajay Saini <[email protected]>
Date:   2017-06-13T00:10:23Z

    Added functionality for tuning parellelism in the Scala implementation of 
the one vs. rest algorithm.

commit 213337882a40c63c3a3ef5741c17a6eebd63df0b
Author: Ajay Saini <[email protected]>
Date:   2017-06-13T20:28:49Z

    Fixed code according to comments. Added both annotations and unit tests for 
testing that parallelism doesn't affect the output.

commit c59b1d897c24d88753f478243ac8428598108da3
Author: Ajay Saini <[email protected]>
Date:   2017-06-22T23:41:15Z

    Modified parallel one vs rest to use futures.

commit 5f635a2e3746bb6f1c503ebf48a485c8217f555e
Author: Ajay Saini <[email protected]>
Date:   2017-06-23T00:00:48Z

    Put the parallelism parameter as well as the function for getting an 
executor service with a given level of parallelism in a separat trait that 
OneVsRest inherits from.

commit 4431ffcd38843798db907842799fd81a30c7a5be
Author: Ajay Saini <[email protected]>
Date:   2017-06-23T18:54:32Z

    Responded to pull request comments.

commit a841b3ea3c674b790948425c884661063f5e3520
Author: Ajay Saini <[email protected]>
Date:   2017-07-06T23:35:15Z

    Made changes based on pull request comments.

commit a95a8af2073b29aac751ae58489b737a3d7a39ae
Author: Ajay Saini <[email protected]>
Date:   2017-07-14T23:49:00Z

    Fixed based on pull request comments

commit d45bc233751e63f15d242da702fdad6102f8bd65
Author: Ajay Saini <[email protected]>
Date:   2017-07-18T18:25:50Z

    Fixed based on comments

commit 30ac62d476e72fd2f70ef2f4a74a735ea5d98509
Author: Ajay Saini <[email protected]>
Date:   2017-07-19T20:24:48Z

    Reverting merge and adding change that would fix merge conflict (making 
OneVsRest and OneVsRest model JavaMLReadable and JavaMLWritable)

commit cc634d2034db3144ff54fd9012abc15c57b1a226
Author: Ajay Saini <[email protected]>
Date:   2017-07-19T21:03:43Z

    Merge branch 'master' into spark-21027

commit ce14172711b51a4321ed02a3cf8450a54374d4f5
Author: Ajay Saini <[email protected]>
Date:   2017-07-20T00:32:08Z

    Style fix with docstring

commit 1c9de16d2919205542712b57de8bf4d866e17d95
Author: Ajay Saini <[email protected]>
Date:   2017-07-27T01:56:31Z

    Fixed based on comments.

commit 9f3440412515df1630072c01ef642ee5e3723eb0
Author: Ajay Saini <[email protected]>
Date:   2017-07-27T02:04:43Z

    Fixed style issue.

commit 585a3f8ea21359f11cd5a19ba195df88e091d9e0
Author: Ajay Saini <[email protected]>
Date:   2017-08-12T00:22:56Z

    Fixed merge conflict

commit f65381afe20f5458ff7d706c6d803355c1b89f00
Author: Ajay Saini <[email protected]>
Date:   2017-08-23T17:51:01Z

    Fixed remaining part of merge conflict.

commit 2a335fec1a4527e4970e88c10555d7d3f0375c35
Author: Ajay Saini <[email protected]>
Date:   2017-08-23T18:34:48Z

    Fixed style problem

commit 049f371738eddd99d8261aaef02a5b9030843174
Author: WeichenXu <[email protected]>
Date:   2017-09-02T15:43:55Z

    Merge branch 'master' into spark-21027

commit ddc2ff4f775358336914e65e228c74af8056e641
Author: WeichenXu <[email protected]>
Date:   2017-09-03T03:13:50Z

    address review feedback issues

commit 24f4499702dd6efd80867e7d51a30fc08f2a51cc
Author: WeichenXu <[email protected]>
Date:   2017-09-03T04:26:01Z

    update migration guide

----


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at [email protected] or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to