GitHub user Yunni opened a pull request: https://github.com/apache/spark/pull/16715
[Spark-18080][ML] Python API & Examples for Locality Sensitive Hashing

## What changes were proposed in this pull request?

This pull request adds the Python API and examples for LSH. The API changes are based on @yanboliang's PR #15768, with merge conflicts resolved and the code updated to match later changes to the Scala API. The examples are consistent with the Scala examples for MinHashLSH and BucketedRandomProjectionLSH.

## How was this patch tested?

The API and examples were tested using spark-submit:

    bin/spark-submit examples/src/main/python/ml/min_hash_lsh.py
    bin/spark-submit examples/src/main/python/ml/bucketed_random_projection_lsh.py

The user guide changes were generated with `SKIP_API=1 jekyll build` and manually inspected.

Please review http://spark.apache.org/contributing.html before opening a pull request.

You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/Yunni/spark spark-18080

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/spark/pull/16715.patch

To close this pull request, make a commit to your master/trunk branch with (at least) the following in the commit message:

    This closes #16715

----
commit 85d22c37d3fe0b907f2eaf892729d087f9efb76c
Author: Yanbo Liang <yblia...@gmail.com>
Date: 2016-11-04T14:22:23Z
    Locality Sensitive Hashing (LSH) Python API.
commit cdeca1cdd8ed61274137c3012ba49ff57d459190
Author: Yanbo Liang <yblia...@gmail.com>
Date: 2016-11-04T14:44:52Z
    Fix typos.
commit 66d308bb6d5d254057b0de9217f87391f269aaed
Author: Yun Ni <y...@uber.com>
Date: 2017-01-25T21:11:57Z
    Merge branch 'spark-18080' of https://github.com/yanboliang/spark into spark-18080
commit d62a2d0d6cdd1e4cb0626bacfe389274db42a11c
Author: Yun Ni <y...@uber.com>
Date: 2017-01-26T00:59:15Z
    Merge branch 'master' of https://github.com/apache/spark into spark-18080
commit dafc4d120c0606ccd2be892fb2618a1df676ccd3
Author: Yun Ni <y...@uber.com>
Date: 2017-01-26T01:23:53Z
    Changes to fix LSH Python API
commit ac1f4f7190192a3ee6fd8a311a0036e1546e4592
Author: Yunni <euler57...@gmail.com>
Date: 2017-01-26T05:08:47Z
    Merge branch 'spark-18080' of https://github.com/Yunni/spark into spark-18080
commit 3a21f2666c907d6d520771b4343af7d877d689bb
Author: Yunni <euler57...@gmail.com>
Date: 2017-01-26T07:20:12Z
    Fix examples and class definition
commit 65dab3ec32f423936f2cb310bbfbc312ece8ac54
Author: Yun Ni <y...@uber.com>
Date: 2017-01-26T20:19:22Z
    Add python examples and updated the user guide
----