GitHub user Yunni opened a pull request:
https://github.com/apache/spark/pull/16715
[Spark-18080][ML] Python API & Examples for Locality Sensitive Hashing
## What changes were proposed in this pull request?
This pull request includes python API and examples for LSH. The API changes
was based on @yanboliang 's PR #15768 and resolved conflicts and API changes on
the Scala API. The examples are consistent with Scala examples of MinHashLSH
and BucketedRandomProjectionLSH.
## How was this patch tested?
API and examples are tested using spark-submit:
bin/spark-submit examples/src/main/python/ml/min_hash_lsh.py
bin/spark-submit
examples/src/main/python/ml/bucketed_random_projection_lsh.py
User guide changes are generated and manually inspected:
`SKIP_API=1 jekyll build`
Please review http://spark.apache.org/contributing.html before opening a
pull request.
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/Yunni/spark spark-18080
Alternatively you can review and apply these changes as the patch at:
https://github.com/apache/spark/pull/16715.patch
To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:
This closes #16715
----
commit 85d22c37d3fe0b907f2eaf892729d087f9efb76c
Author: Yanbo Liang <[email protected]>
Date: 2016-11-04T14:22:23Z
Locality Sensitive Hashing (LSH) Python API.
commit cdeca1cdd8ed61274137c3012ba49ff57d459190
Author: Yanbo Liang <[email protected]>
Date: 2016-11-04T14:44:52Z
Fix typos.
commit 66d308bb6d5d254057b0de9217f87391f269aaed
Author: Yun Ni <[email protected]>
Date: 2017-01-25T21:11:57Z
Merge branch 'spark-18080' of https://github.com/yanboliang/spark into
spark-18080
commit d62a2d0d6cdd1e4cb0626bacfe389274db42a11c
Author: Yun Ni <[email protected]>
Date: 2017-01-26T00:59:15Z
Merge branch 'master' of https://github.com/apache/spark into spark-18080
commit dafc4d120c0606ccd2be892fb2618a1df676ccd3
Author: Yun Ni <[email protected]>
Date: 2017-01-26T01:23:53Z
Changes to fix LSH Python API
commit ac1f4f7190192a3ee6fd8a311a0036e1546e4592
Author: Yunni <[email protected]>
Date: 2017-01-26T05:08:47Z
Merge branch 'spark-18080' of https://github.com/Yunni/spark into
spark-18080
commit 3a21f2666c907d6d520771b4343af7d877d689bb
Author: Yunni <[email protected]>
Date: 2017-01-26T07:20:12Z
Fix examples and class definition
commit 65dab3ec32f423936f2cb310bbfbc312ece8ac54
Author: Yun Ni <[email protected]>
Date: 2017-01-26T20:19:22Z
Add python examples and updated the user guide
----
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at [email protected] or file a JIRA ticket
with INFRA.
---
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]