[GitHub] spark pull request #16715: [Spark-18080][ML] Python API & Examples for Local...

2017-02-09 Thread sethah
Github user sethah commented on a diff in the pull request: https://github.com/apache/spark/pull/16715#discussion_r100426821 --- Diff: examples/src/main/java/org/apache/spark/examples/ml/JavaMinHashLSHExample.java --- @@ -44,25 +45,67 @@ public static void main(String[] args) {

[GitHub] spark pull request #16715: [Spark-18080][ML] Python API & Examples for Local...

2017-02-09 Thread sethah
Github user sethah commented on a diff in the pull request: https://github.com/apache/spark/pull/16715#discussion_r100424220 --- Diff: examples/src/main/python/ml/bucketed_random_projection_lsh_example.py --- @@ -0,0 +1,86 @@ +# +# Licensed to the Apache Software

[GitHub] spark pull request #16715: [Spark-18080][ML] Python API & Examples for Local...

2017-02-09 Thread sethah
Github user sethah commented on a diff in the pull request: https://github.com/apache/spark/pull/16715#discussion_r100422395 --- Diff: examples/src/main/python/ml/bucketed_random_projection_lsh_example.py --- @@ -0,0 +1,86 @@ +# +# Licensed to the Apache Software

[GitHub] spark pull request #16715: [Spark-18080][ML] Python API & Examples for Local...

2017-02-09 Thread sethah
Github user sethah commented on a diff in the pull request: https://github.com/apache/spark/pull/16715#discussion_r100427531 --- Diff: python/pyspark/ml/feature.py --- @@ -120,6 +122,196 @@ def getThreshold(self): return self.getOrDefault(self.threshold)

[GitHub] spark pull request #16715: [Spark-18080][ML] Python API & Examples for Local...

2017-02-09 Thread sethah
Github user sethah commented on a diff in the pull request: https://github.com/apache/spark/pull/16715#discussion_r100427045 --- Diff: examples/src/main/python/ml/min_hash_lsh_example.py --- @@ -0,0 +1,85 @@ +# +# Licensed to the Apache Software Foundation (ASF) under one

[GitHub] spark pull request #16715: [Spark-18080][ML] Python API & Examples for Local...

2017-02-09 Thread sethah
Github user sethah commented on a diff in the pull request: https://github.com/apache/spark/pull/16715#discussion_r100421903 --- Diff: examples/src/main/python/ml/bucketed_random_projection_lsh_example.py --- @@ -0,0 +1,86 @@ +# +# Licensed to the Apache Software

[GitHub] spark pull request #16715: [Spark-18080][ML] Python API & Examples for Local...

2017-02-09 Thread sethah
Github user sethah commented on a diff in the pull request: https://github.com/apache/spark/pull/16715#discussion_r100426756 --- Diff: examples/src/main/java/org/apache/spark/examples/ml/JavaMinHashLSHExample.java --- @@ -44,25 +45,67 @@ public static void main(String[] args) {

[GitHub] spark pull request #16715: [Spark-18080][ML] Python API & Examples for Local...

2017-02-09 Thread sethah
Github user sethah commented on a diff in the pull request: https://github.com/apache/spark/pull/16715#discussion_r100426237 --- Diff: examples/src/main/java/org/apache/spark/examples/ml/JavaBucketedRandomProjectionLSHExample.java --- @@ -71,25 +71,32 @@ public static void

[GitHub] spark pull request #16715: [Spark-18080][ML] Python API & Examples for Local...

2017-02-09 Thread sethah
Github user sethah commented on a diff in the pull request: https://github.com/apache/spark/pull/16715#discussion_r100427633 --- Diff: python/pyspark/ml/feature.py --- @@ -755,6 +947,101 @@ def maxAbs(self): @inherit_doc +class MinHashLSH(JavaEstimator,

[GitHub] spark pull request #16715: [Spark-18080][ML] Python API & Examples for Local...

2017-02-09 Thread sethah
Github user sethah commented on a diff in the pull request: https://github.com/apache/spark/pull/16715#discussion_r100426683 --- Diff: examples/src/main/java/org/apache/spark/examples/ml/JavaMinHashLSHExample.java --- @@ -44,25 +45,67 @@ public static void main(String[] args) {

[GitHub] spark pull request #16715: [Spark-18080][ML] Python API & Examples for Local...

2017-02-09 Thread sethah
Github user sethah commented on a diff in the pull request: https://github.com/apache/spark/pull/16715#discussion_r100428492 --- Diff: examples/src/main/java/org/apache/spark/examples/ml/JavaMinHashLSHExample.java --- @@ -44,25 +45,67 @@ public static void main(String[] args) {

[GitHub] spark pull request #16715: [Spark-18080][ML] Python API & Examples for Local...

2017-02-09 Thread sethah
Github user sethah commented on a diff in the pull request: https://github.com/apache/spark/pull/16715#discussion_r100420489 --- Diff: examples/src/main/python/ml/bucketed_random_projection_lsh_example.py --- @@ -0,0 +1,86 @@ +# +# Licensed to the Apache Software

[GitHub] spark pull request #16715: [Spark-18080][ML] Python API & Examples for Local...

2017-02-09 Thread sethah
Github user sethah commented on a diff in the pull request: https://github.com/apache/spark/pull/16715#discussion_r100426020 --- Diff: examples/src/main/java/org/apache/spark/examples/ml/JavaBucketedRandomProjectionLSHExample.java --- @@ -71,25 +71,32 @@ public static void

[GitHub] spark pull request #16715: [Spark-18080][ML] Python API & Examples for Local...

2017-02-08 Thread Yunni
Github user Yunni commented on a diff in the pull request: https://github.com/apache/spark/pull/16715#discussion_r100199037 --- Diff: python/pyspark/ml/feature.py --- @@ -120,6 +122,200 @@ def getThreshold(self): return self.getOrDefault(self.threshold)

[GitHub] spark pull request #16715: [Spark-18080][ML] Python API & Examples for Local...

2017-02-08 Thread Yunni
Github user Yunni commented on a diff in the pull request: https://github.com/apache/spark/pull/16715#discussion_r100198559 --- Diff: python/pyspark/ml/feature.py --- @@ -120,6 +122,200 @@ def getThreshold(self): return self.getOrDefault(self.threshold)

[GitHub] spark pull request #16715: [Spark-18080][ML] Python API & Examples for Local...

2017-02-08 Thread Yunni
Github user Yunni commented on a diff in the pull request: https://github.com/apache/spark/pull/16715#discussion_r100192059 --- Diff: python/pyspark/ml/feature.py --- @@ -120,6 +122,200 @@ def getThreshold(self): return self.getOrDefault(self.threshold)

[GitHub] spark pull request #16715: [Spark-18080][ML] Python API & Examples for Local...

2017-02-08 Thread Yunni
Github user Yunni commented on a diff in the pull request: https://github.com/apache/spark/pull/16715#discussion_r100193058 --- Diff: python/pyspark/ml/feature.py --- @@ -755,6 +951,102 @@ def maxAbs(self): @inherit_doc +class MinHashLSH(JavaEstimator,

[GitHub] spark pull request #16715: [Spark-18080][ML] Python API & Examples for Local...

2017-02-08 Thread Yunni
Github user Yunni commented on a diff in the pull request: https://github.com/apache/spark/pull/16715#discussion_r100193020 --- Diff: python/pyspark/ml/feature.py --- @@ -755,6 +951,102 @@ def maxAbs(self): @inherit_doc +class MinHashLSH(JavaEstimator,

[GitHub] spark pull request #16715: [Spark-18080][ML] Python API & Examples for Local...

2017-02-08 Thread Yunni
Github user Yunni commented on a diff in the pull request: https://github.com/apache/spark/pull/16715#discussion_r100193043 --- Diff: python/pyspark/ml/feature.py --- @@ -755,6 +951,102 @@ def maxAbs(self): @inherit_doc +class MinHashLSH(JavaEstimator,

[GitHub] spark pull request #16715: [Spark-18080][ML] Python API & Examples for Local...

2017-02-08 Thread Yunni
Github user Yunni commented on a diff in the pull request: https://github.com/apache/spark/pull/16715#discussion_r100192985 --- Diff: examples/src/main/python/ml/bucketed_random_projection_lsh.py --- @@ -0,0 +1,76 @@ +# +# Licensed to the Apache Software Foundation (ASF)

[GitHub] spark pull request #16715: [Spark-18080][ML] Python API & Examples for Local...

2017-02-08 Thread Yunni
Github user Yunni commented on a diff in the pull request: https://github.com/apache/spark/pull/16715#discussion_r100192933 --- Diff: examples/src/main/python/ml/bucketed_random_projection_lsh.py --- @@ -0,0 +1,76 @@ +# +# Licensed to the Apache Software Foundation (ASF)

[GitHub] spark pull request #16715: [Spark-18080][ML] Python API & Examples for Local...

2017-02-08 Thread Yunni
Github user Yunni commented on a diff in the pull request: https://github.com/apache/spark/pull/16715#discussion_r100192881 --- Diff: examples/src/main/python/ml/min_hash_lsh.py --- @@ -0,0 +1,75 @@ +# +# Licensed to the Apache Software Foundation (ASF) under one or more

[GitHub] spark pull request #16715: [Spark-18080][ML] Python API & Examples for Local...

2017-02-08 Thread Yunni
Github user Yunni commented on a diff in the pull request: https://github.com/apache/spark/pull/16715#discussion_r100192685 --- Diff: examples/src/main/python/ml/min_hash_lsh.py --- @@ -0,0 +1,75 @@ +# +# Licensed to the Apache Software Foundation (ASF) under one or more

[GitHub] spark pull request #16715: [Spark-18080][ML] Python API & Examples for Local...

2017-02-08 Thread Yunni
Github user Yunni commented on a diff in the pull request: https://github.com/apache/spark/pull/16715#discussion_r100192402 --- Diff: python/pyspark/ml/feature.py --- @@ -120,6 +122,200 @@ def getThreshold(self): return self.getOrDefault(self.threshold)

[GitHub] spark pull request #16715: [Spark-18080][ML] Python API & Examples for Local...

2017-02-08 Thread Yunni
Github user Yunni commented on a diff in the pull request: https://github.com/apache/spark/pull/16715#discussion_r100192347 --- Diff: python/pyspark/ml/feature.py --- @@ -120,6 +122,200 @@ def getThreshold(self): return self.getOrDefault(self.threshold)

[GitHub] spark pull request #16715: [Spark-18080][ML] Python API & Examples for Local...

2017-02-08 Thread Yunni
Github user Yunni commented on a diff in the pull request: https://github.com/apache/spark/pull/16715#discussion_r100192333 --- Diff: python/pyspark/ml/feature.py --- @@ -120,6 +122,200 @@ def getThreshold(self): return self.getOrDefault(self.threshold)

[GitHub] spark pull request #16715: [Spark-18080][ML] Python API & Examples for Local...

2017-02-08 Thread Yunni
Github user Yunni commented on a diff in the pull request: https://github.com/apache/spark/pull/16715#discussion_r100192298 --- Diff: python/pyspark/ml/feature.py --- @@ -120,6 +122,200 @@ def getThreshold(self): return self.getOrDefault(self.threshold)

[GitHub] spark pull request #16715: [Spark-18080][ML] Python API & Examples for Local...

2017-02-08 Thread Yunni
Github user Yunni commented on a diff in the pull request: https://github.com/apache/spark/pull/16715#discussion_r100192314 --- Diff: python/pyspark/ml/feature.py --- @@ -120,6 +122,200 @@ def getThreshold(self): return self.getOrDefault(self.threshold)

[GitHub] spark pull request #16715: [Spark-18080][ML] Python API & Examples for Local...

2017-02-08 Thread Yunni
Github user Yunni commented on a diff in the pull request: https://github.com/apache/spark/pull/16715#discussion_r100192074 --- Diff: python/pyspark/ml/feature.py --- @@ -120,6 +122,200 @@ def getThreshold(self): return self.getOrDefault(self.threshold)

[GitHub] spark pull request #16715: [Spark-18080][ML] Python API & Examples for Local...

2017-02-08 Thread Yunni
Github user Yunni commented on a diff in the pull request: https://github.com/apache/spark/pull/16715#discussion_r100192026 --- Diff: python/pyspark/ml/feature.py --- @@ -120,6 +122,200 @@ def getThreshold(self): return self.getOrDefault(self.threshold)

[GitHub] spark pull request #16715: [Spark-18080][ML] Python API & Examples for Local...

2017-02-07 Thread sethah
Github user sethah commented on a diff in the pull request: https://github.com/apache/spark/pull/16715#discussion_r99940031 --- Diff: examples/src/main/python/ml/bucketed_random_projection_lsh.py --- @@ -0,0 +1,76 @@ +# +# Licensed to the Apache Software Foundation (ASF)

[GitHub] spark pull request #16715: [Spark-18080][ML] Python API & Examples for Local...

2017-02-07 Thread sethah
Github user sethah commented on a diff in the pull request: https://github.com/apache/spark/pull/16715#discussion_r99943935 --- Diff: examples/src/main/python/ml/min_hash_lsh.py --- @@ -0,0 +1,75 @@ +# +# Licensed to the Apache Software Foundation (ASF) under one or more

[GitHub] spark pull request #16715: [Spark-18080][ML] Python API & Examples for Local...

2017-02-07 Thread sethah
Github user sethah commented on a diff in the pull request: https://github.com/apache/spark/pull/16715#discussion_r99940201 --- Diff: examples/src/main/python/ml/bucketed_random_projection_lsh.py --- @@ -0,0 +1,76 @@ +# +# Licensed to the Apache Software Foundation (ASF)

[GitHub] spark pull request #16715: [Spark-18080][ML] Python API & Examples for Local...

2017-02-07 Thread sethah
Github user sethah commented on a diff in the pull request: https://github.com/apache/spark/pull/16715#discussion_r99943986 --- Diff: examples/src/main/python/ml/min_hash_lsh.py --- @@ -0,0 +1,75 @@ +# +# Licensed to the Apache Software Foundation (ASF) under one or more

[GitHub] spark pull request #16715: [Spark-18080][ML] Python API & Examples for Local...

2017-02-07 Thread sethah
Github user sethah commented on a diff in the pull request: https://github.com/apache/spark/pull/16715#discussion_r99912784 --- Diff: python/pyspark/ml/feature.py --- @@ -120,6 +122,200 @@ def getThreshold(self): return self.getOrDefault(self.threshold)

[GitHub] spark pull request #16715: [Spark-18080][ML] Python API & Examples for Local...

2017-02-07 Thread sethah
Github user sethah commented on a diff in the pull request: https://github.com/apache/spark/pull/16715#discussion_r99923657 --- Diff: python/pyspark/ml/feature.py --- @@ -120,6 +122,200 @@ def getThreshold(self): return self.getOrDefault(self.threshold)

[GitHub] spark pull request #16715: [Spark-18080][ML] Python API & Examples for Local...

2017-02-07 Thread sethah
Github user sethah commented on a diff in the pull request: https://github.com/apache/spark/pull/16715#discussion_r99901401 --- Diff: python/pyspark/ml/feature.py --- @@ -120,6 +122,200 @@ def getThreshold(self): return self.getOrDefault(self.threshold)

[GitHub] spark pull request #16715: [Spark-18080][ML] Python API & Examples for Local...

2017-02-07 Thread sethah
Github user sethah commented on a diff in the pull request: https://github.com/apache/spark/pull/16715#discussion_r99929663 --- Diff: python/pyspark/ml/feature.py --- @@ -120,6 +122,200 @@ def getThreshold(self): return self.getOrDefault(self.threshold)

[GitHub] spark pull request #16715: [Spark-18080][ML] Python API & Examples for Local...

2017-02-07 Thread sethah
Github user sethah commented on a diff in the pull request: https://github.com/apache/spark/pull/16715#discussion_r99909930 --- Diff: python/pyspark/ml/feature.py --- @@ -120,6 +122,200 @@ def getThreshold(self): return self.getOrDefault(self.threshold)

[GitHub] spark pull request #16715: [Spark-18080][ML] Python API & Examples for Local...

2017-02-07 Thread sethah
Github user sethah commented on a diff in the pull request: https://github.com/apache/spark/pull/16715#discussion_r99871800 --- Diff: python/pyspark/ml/feature.py --- @@ -755,6 +951,102 @@ def maxAbs(self): @inherit_doc +class MinHashLSH(JavaEstimator,

[GitHub] spark pull request #16715: [Spark-18080][ML] Python API & Examples for Local...

2017-02-07 Thread sethah
Github user sethah commented on a diff in the pull request: https://github.com/apache/spark/pull/16715#discussion_r99871507 --- Diff: python/pyspark/ml/feature.py --- @@ -120,6 +122,200 @@ def getThreshold(self): return self.getOrDefault(self.threshold)

[GitHub] spark pull request #16715: [Spark-18080][ML] Python API & Examples for Local...

2017-02-07 Thread sethah
Github user sethah commented on a diff in the pull request: https://github.com/apache/spark/pull/16715#discussion_r99928739 --- Diff: python/pyspark/ml/feature.py --- @@ -755,6 +951,102 @@ def maxAbs(self): @inherit_doc +class MinHashLSH(JavaEstimator,

[GitHub] spark pull request #16715: [Spark-18080][ML] Python API & Examples for Local...

2017-02-07 Thread sethah
Github user sethah commented on a diff in the pull request: https://github.com/apache/spark/pull/16715#discussion_r99929977 --- Diff: python/pyspark/ml/feature.py --- @@ -120,6 +122,200 @@ def getThreshold(self): return self.getOrDefault(self.threshold)

[GitHub] spark pull request #16715: [Spark-18080][ML] Python API & Examples for Local...

2017-02-07 Thread sethah
Github user sethah commented on a diff in the pull request: https://github.com/apache/spark/pull/16715#discussion_r99870133 --- Diff: python/pyspark/ml/feature.py --- @@ -120,6 +122,200 @@ def getThreshold(self): return self.getOrDefault(self.threshold)

[GitHub] spark pull request #16715: [Spark-18080][ML] Python API & Examples for Local...

2017-02-07 Thread sethah
Github user sethah commented on a diff in the pull request: https://github.com/apache/spark/pull/16715#discussion_r99870243 --- Diff: python/pyspark/ml/feature.py --- @@ -120,6 +122,200 @@ def getThreshold(self): return self.getOrDefault(self.threshold)

[GitHub] spark pull request #16715: [Spark-18080][ML] Python API & Examples for Local...

2017-02-07 Thread sethah
Github user sethah commented on a diff in the pull request: https://github.com/apache/spark/pull/16715#discussion_r99872069 --- Diff: python/pyspark/ml/feature.py --- @@ -755,6 +951,102 @@ def maxAbs(self): @inherit_doc +class MinHashLSH(JavaEstimator,

[GitHub] spark pull request #16715: [Spark-18080][ML] Python API & Examples for Local...

2017-02-07 Thread sethah
Github user sethah commented on a diff in the pull request: https://github.com/apache/spark/pull/16715#discussion_r99870914 --- Diff: python/pyspark/ml/feature.py --- @@ -120,6 +122,200 @@ def getThreshold(self): return self.getOrDefault(self.threshold)

[GitHub] spark pull request #16715: [Spark-18080][ML] Python API & Examples for Local...

2017-01-26 Thread Yunni
GitHub user Yunni opened a pull request: https://github.com/apache/spark/pull/16715 [Spark-18080][ML] Python API & Examples for Locality Sensitive Hashing ## What changes were proposed in this pull request? This pull request includes python API and examples for LSH. The API