Github user sethah commented on a diff in the pull request:
https://github.com/apache/spark/pull/16715#discussion_r100426821
--- Diff: examples/src/main/java/org/apache/spark/examples/ml/JavaMinHashLSHExample.java ---
@@ -44,25 +45,67 @@ public static void main(String[] args) {
Github user sethah commented on a diff in the pull request:
https://github.com/apache/spark/pull/16715#discussion_r100424220
--- Diff: examples/src/main/python/ml/bucketed_random_projection_lsh_example.py ---
@@ -0,0 +1,86 @@
+#
+# Licensed to the Apache Software
Github user sethah commented on a diff in the pull request:
https://github.com/apache/spark/pull/16715#discussion_r100422395
--- Diff: examples/src/main/python/ml/bucketed_random_projection_lsh_example.py ---
@@ -0,0 +1,86 @@
+#
+# Licensed to the Apache Software
Github user sethah commented on a diff in the pull request:
https://github.com/apache/spark/pull/16715#discussion_r100427531
--- Diff: python/pyspark/ml/feature.py ---
@@ -120,6 +122,196 @@ def getThreshold(self):
return self.getOrDefault(self.threshold)
Github user sethah commented on a diff in the pull request:
https://github.com/apache/spark/pull/16715#discussion_r100427045
--- Diff: examples/src/main/python/ml/min_hash_lsh_example.py ---
@@ -0,0 +1,85 @@
+#
+# Licensed to the Apache Software Foundation (ASF) under one
Github user sethah commented on a diff in the pull request:
https://github.com/apache/spark/pull/16715#discussion_r100421903
--- Diff: examples/src/main/python/ml/bucketed_random_projection_lsh_example.py ---
@@ -0,0 +1,86 @@
+#
+# Licensed to the Apache Software
Github user sethah commented on a diff in the pull request:
https://github.com/apache/spark/pull/16715#discussion_r100426756
--- Diff: examples/src/main/java/org/apache/spark/examples/ml/JavaMinHashLSHExample.java ---
@@ -44,25 +45,67 @@ public static void main(String[] args) {
Github user sethah commented on a diff in the pull request:
https://github.com/apache/spark/pull/16715#discussion_r100426237
--- Diff: examples/src/main/java/org/apache/spark/examples/ml/JavaBucketedRandomProjectionLSHExample.java ---
@@ -71,25 +71,32 @@ public static void
Github user sethah commented on a diff in the pull request:
https://github.com/apache/spark/pull/16715#discussion_r100427633
--- Diff: python/pyspark/ml/feature.py ---
@@ -755,6 +947,101 @@ def maxAbs(self):
@inherit_doc
+class MinHashLSH(JavaEstimator,
Github user sethah commented on a diff in the pull request:
https://github.com/apache/spark/pull/16715#discussion_r100426683
--- Diff: examples/src/main/java/org/apache/spark/examples/ml/JavaMinHashLSHExample.java ---
@@ -44,25 +45,67 @@ public static void main(String[] args) {
Github user sethah commented on a diff in the pull request:
https://github.com/apache/spark/pull/16715#discussion_r100428492
--- Diff: examples/src/main/java/org/apache/spark/examples/ml/JavaMinHashLSHExample.java ---
@@ -44,25 +45,67 @@ public static void main(String[] args) {
Github user sethah commented on a diff in the pull request:
https://github.com/apache/spark/pull/16715#discussion_r100420489
--- Diff: examples/src/main/python/ml/bucketed_random_projection_lsh_example.py ---
@@ -0,0 +1,86 @@
+#
+# Licensed to the Apache Software
Github user sethah commented on a diff in the pull request:
https://github.com/apache/spark/pull/16715#discussion_r100426020
--- Diff: examples/src/main/java/org/apache/spark/examples/ml/JavaBucketedRandomProjectionLSHExample.java ---
@@ -71,25 +71,32 @@ public static void
Github user Yunni commented on a diff in the pull request:
https://github.com/apache/spark/pull/16715#discussion_r100199037
--- Diff: python/pyspark/ml/feature.py ---
@@ -120,6 +122,200 @@ def getThreshold(self):
return self.getOrDefault(self.threshold)
Github user Yunni commented on a diff in the pull request:
https://github.com/apache/spark/pull/16715#discussion_r100198559
--- Diff: python/pyspark/ml/feature.py ---
@@ -120,6 +122,200 @@ def getThreshold(self):
return self.getOrDefault(self.threshold)
Github user Yunni commented on a diff in the pull request:
https://github.com/apache/spark/pull/16715#discussion_r100192059
--- Diff: python/pyspark/ml/feature.py ---
@@ -120,6 +122,200 @@ def getThreshold(self):
return self.getOrDefault(self.threshold)
Github user Yunni commented on a diff in the pull request:
https://github.com/apache/spark/pull/16715#discussion_r100193058
--- Diff: python/pyspark/ml/feature.py ---
@@ -755,6 +951,102 @@ def maxAbs(self):
@inherit_doc
+class MinHashLSH(JavaEstimator,
Github user Yunni commented on a diff in the pull request:
https://github.com/apache/spark/pull/16715#discussion_r100193020
--- Diff: python/pyspark/ml/feature.py ---
@@ -755,6 +951,102 @@ def maxAbs(self):
@inherit_doc
+class MinHashLSH(JavaEstimator,
Github user Yunni commented on a diff in the pull request:
https://github.com/apache/spark/pull/16715#discussion_r100193043
--- Diff: python/pyspark/ml/feature.py ---
@@ -755,6 +951,102 @@ def maxAbs(self):
@inherit_doc
+class MinHashLSH(JavaEstimator,
Github user Yunni commented on a diff in the pull request:
https://github.com/apache/spark/pull/16715#discussion_r100192985
--- Diff: examples/src/main/python/ml/bucketed_random_projection_lsh.py ---
@@ -0,0 +1,76 @@
+#
+# Licensed to the Apache Software Foundation (ASF)
Github user Yunni commented on a diff in the pull request:
https://github.com/apache/spark/pull/16715#discussion_r100192933
--- Diff: examples/src/main/python/ml/bucketed_random_projection_lsh.py ---
@@ -0,0 +1,76 @@
+#
+# Licensed to the Apache Software Foundation (ASF)
Github user Yunni commented on a diff in the pull request:
https://github.com/apache/spark/pull/16715#discussion_r100192881
--- Diff: examples/src/main/python/ml/min_hash_lsh.py ---
@@ -0,0 +1,75 @@
+#
+# Licensed to the Apache Software Foundation (ASF) under one or more
Github user Yunni commented on a diff in the pull request:
https://github.com/apache/spark/pull/16715#discussion_r100192685
--- Diff: examples/src/main/python/ml/min_hash_lsh.py ---
@@ -0,0 +1,75 @@
+#
+# Licensed to the Apache Software Foundation (ASF) under one or more
Github user Yunni commented on a diff in the pull request:
https://github.com/apache/spark/pull/16715#discussion_r100192402
--- Diff: python/pyspark/ml/feature.py ---
@@ -120,6 +122,200 @@ def getThreshold(self):
return self.getOrDefault(self.threshold)
Github user Yunni commented on a diff in the pull request:
https://github.com/apache/spark/pull/16715#discussion_r100192347
--- Diff: python/pyspark/ml/feature.py ---
@@ -120,6 +122,200 @@ def getThreshold(self):
return self.getOrDefault(self.threshold)
Github user Yunni commented on a diff in the pull request:
https://github.com/apache/spark/pull/16715#discussion_r100192333
--- Diff: python/pyspark/ml/feature.py ---
@@ -120,6 +122,200 @@ def getThreshold(self):
return self.getOrDefault(self.threshold)
Github user Yunni commented on a diff in the pull request:
https://github.com/apache/spark/pull/16715#discussion_r100192298
--- Diff: python/pyspark/ml/feature.py ---
@@ -120,6 +122,200 @@ def getThreshold(self):
return self.getOrDefault(self.threshold)
Github user Yunni commented on a diff in the pull request:
https://github.com/apache/spark/pull/16715#discussion_r100192314
--- Diff: python/pyspark/ml/feature.py ---
@@ -120,6 +122,200 @@ def getThreshold(self):
return self.getOrDefault(self.threshold)
Github user Yunni commented on a diff in the pull request:
https://github.com/apache/spark/pull/16715#discussion_r100192074
--- Diff: python/pyspark/ml/feature.py ---
@@ -120,6 +122,200 @@ def getThreshold(self):
return self.getOrDefault(self.threshold)
Github user Yunni commented on a diff in the pull request:
https://github.com/apache/spark/pull/16715#discussion_r100192026
--- Diff: python/pyspark/ml/feature.py ---
@@ -120,6 +122,200 @@ def getThreshold(self):
return self.getOrDefault(self.threshold)
Github user sethah commented on a diff in the pull request:
https://github.com/apache/spark/pull/16715#discussion_r99940031
--- Diff: examples/src/main/python/ml/bucketed_random_projection_lsh.py ---
@@ -0,0 +1,76 @@
+#
+# Licensed to the Apache Software Foundation (ASF)
Github user sethah commented on a diff in the pull request:
https://github.com/apache/spark/pull/16715#discussion_r99943935
--- Diff: examples/src/main/python/ml/min_hash_lsh.py ---
@@ -0,0 +1,75 @@
+#
+# Licensed to the Apache Software Foundation (ASF) under one or more
Github user sethah commented on a diff in the pull request:
https://github.com/apache/spark/pull/16715#discussion_r99940201
--- Diff: examples/src/main/python/ml/bucketed_random_projection_lsh.py ---
@@ -0,0 +1,76 @@
+#
+# Licensed to the Apache Software Foundation (ASF)
Github user sethah commented on a diff in the pull request:
https://github.com/apache/spark/pull/16715#discussion_r99943986
--- Diff: examples/src/main/python/ml/min_hash_lsh.py ---
@@ -0,0 +1,75 @@
+#
+# Licensed to the Apache Software Foundation (ASF) under one or more
Github user sethah commented on a diff in the pull request:
https://github.com/apache/spark/pull/16715#discussion_r99912784
--- Diff: python/pyspark/ml/feature.py ---
@@ -120,6 +122,200 @@ def getThreshold(self):
return self.getOrDefault(self.threshold)
Github user sethah commented on a diff in the pull request:
https://github.com/apache/spark/pull/16715#discussion_r99923657
--- Diff: python/pyspark/ml/feature.py ---
@@ -120,6 +122,200 @@ def getThreshold(self):
return self.getOrDefault(self.threshold)
Github user sethah commented on a diff in the pull request:
https://github.com/apache/spark/pull/16715#discussion_r99901401
--- Diff: python/pyspark/ml/feature.py ---
@@ -120,6 +122,200 @@ def getThreshold(self):
return self.getOrDefault(self.threshold)
Github user sethah commented on a diff in the pull request:
https://github.com/apache/spark/pull/16715#discussion_r99929663
--- Diff: python/pyspark/ml/feature.py ---
@@ -120,6 +122,200 @@ def getThreshold(self):
return self.getOrDefault(self.threshold)
Github user sethah commented on a diff in the pull request:
https://github.com/apache/spark/pull/16715#discussion_r99909930
--- Diff: python/pyspark/ml/feature.py ---
@@ -120,6 +122,200 @@ def getThreshold(self):
return self.getOrDefault(self.threshold)
Github user sethah commented on a diff in the pull request:
https://github.com/apache/spark/pull/16715#discussion_r99871800
--- Diff: python/pyspark/ml/feature.py ---
@@ -755,6 +951,102 @@ def maxAbs(self):
@inherit_doc
+class MinHashLSH(JavaEstimator,
Github user sethah commented on a diff in the pull request:
https://github.com/apache/spark/pull/16715#discussion_r99871507
--- Diff: python/pyspark/ml/feature.py ---
@@ -120,6 +122,200 @@ def getThreshold(self):
return self.getOrDefault(self.threshold)
Github user sethah commented on a diff in the pull request:
https://github.com/apache/spark/pull/16715#discussion_r99928739
--- Diff: python/pyspark/ml/feature.py ---
@@ -755,6 +951,102 @@ def maxAbs(self):
@inherit_doc
+class MinHashLSH(JavaEstimator,
Github user sethah commented on a diff in the pull request:
https://github.com/apache/spark/pull/16715#discussion_r99929977
--- Diff: python/pyspark/ml/feature.py ---
@@ -120,6 +122,200 @@ def getThreshold(self):
return self.getOrDefault(self.threshold)
Github user sethah commented on a diff in the pull request:
https://github.com/apache/spark/pull/16715#discussion_r99870133
--- Diff: python/pyspark/ml/feature.py ---
@@ -120,6 +122,200 @@ def getThreshold(self):
return self.getOrDefault(self.threshold)
Github user sethah commented on a diff in the pull request:
https://github.com/apache/spark/pull/16715#discussion_r99870243
--- Diff: python/pyspark/ml/feature.py ---
@@ -120,6 +122,200 @@ def getThreshold(self):
return self.getOrDefault(self.threshold)
Github user sethah commented on a diff in the pull request:
https://github.com/apache/spark/pull/16715#discussion_r99872069
--- Diff: python/pyspark/ml/feature.py ---
@@ -755,6 +951,102 @@ def maxAbs(self):
@inherit_doc
+class MinHashLSH(JavaEstimator,
Github user sethah commented on a diff in the pull request:
https://github.com/apache/spark/pull/16715#discussion_r99870914
--- Diff: python/pyspark/ml/feature.py ---
@@ -120,6 +122,200 @@ def getThreshold(self):
return self.getOrDefault(self.threshold)
GitHub user Yunni opened a pull request:
https://github.com/apache/spark/pull/16715
[Spark-18080][ML] Python API & Examples for Locality Sensitive Hashing
## What changes were proposed in this pull request?
This pull request includes Python API and examples for LSH. The API