[GitHub] spark pull request #15777: [SPARK-18282][ML][PYSPARK] Add python clustering ...

2016-11-21 Thread asfgit
Github user asfgit closed the pull request at: https://github.com/apache/spark/pull/15777 --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is

[GitHub] spark pull request #15777: [SPARK-18282][ML][PYSPARK] Add python clustering ...

2016-11-20 Thread sethah
Github user sethah commented on a diff in the pull request: https://github.com/apache/spark/pull/15777#discussion_r88833295 --- Diff: mllib/src/main/scala/org/apache/spark/ml/clustering/BisectingKMeans.scala --- @@ -95,8 +95,7 @@ class BisectingKMeansModel private[ml] (

[GitHub] spark pull request #15777: [SPARK-18282][ML][PYSPARK] Add python clustering ...

2016-11-19 Thread yanboliang
Github user yanboliang commented on a diff in the pull request: https://github.com/apache/spark/pull/15777#discussion_r88778015 --- Diff: mllib/src/main/scala/org/apache/spark/ml/clustering/BisectingKMeans.scala --- @@ -95,8 +95,7 @@ class BisectingKMeansModel private[ml] (

[GitHub] spark pull request #15777: [SPARK-18282][ML][PYSPARK] Add python clustering ...

2016-11-17 Thread sethah
Github user sethah commented on a diff in the pull request: https://github.com/apache/spark/pull/15777#discussion_r88495818 --- Diff: python/pyspark/ml/tests.py --- @@ -1097,6 +1097,44 @@ def test_logistic_regression_summary(self): sameSummary = model.evaluate(df)

[GitHub] spark pull request #15777: [SPARK-18282][ML][PYSPARK] Add python clustering ...

2016-11-17 Thread yanboliang
Github user yanboliang commented on a diff in the pull request: https://github.com/apache/spark/pull/15777#discussion_r88485754 --- Diff: python/pyspark/ml/tests.py --- @@ -1097,6 +1097,44 @@ def test_logistic_regression_summary(self): sameSummary = model.evaluate(df)

[GitHub] spark pull request #15777: [SPARK-18282][ML][PYSPARK] Add python clustering ...

2016-11-17 Thread sethah
Github user sethah commented on a diff in the pull request: https://github.com/apache/spark/pull/15777#discussion_r88483092 --- Diff: python/pyspark/ml/tests.py --- @@ -1097,6 +1097,44 @@ def test_logistic_regression_summary(self): sameSummary = model.evaluate(df)

[GitHub] spark pull request #15777: [SPARK-18282][ML][PYSPARK] Add python clustering ...

2016-11-16 Thread sethah
Github user sethah commented on a diff in the pull request: https://github.com/apache/spark/pull/15777#discussion_r88269525 --- Diff: python/pyspark/ml/clustering.py --- @@ -346,6 +453,27 @@ def computeCost(self, dataset): """ return

[GitHub] spark pull request #15777: [SPARK-18282][ML][PYSPARK] Add python clustering ...

2016-11-16 Thread yanboliang
Github user yanboliang commented on a diff in the pull request: https://github.com/apache/spark/pull/15777#discussion_r88252426 --- Diff: mllib/src/main/scala/org/apache/spark/ml/clustering/BisectingKMeans.scala --- @@ -132,7 +132,7 @@ class BisectingKMeansModel private[ml] (

[GitHub] spark pull request #15777: [SPARK-18282][ML][PYSPARK] Add python clustering ...

2016-11-16 Thread yanboliang
Github user yanboliang commented on a diff in the pull request: https://github.com/apache/spark/pull/15777#discussion_r88253073 --- Diff: python/pyspark/ml/clustering.py --- @@ -17,16 +17,75 @@ from pyspark import since, keyword_only from pyspark.ml.util import *

[GitHub] spark pull request #15777: [SPARK-18282][ML][PYSPARK] Add python clustering ...

2016-11-16 Thread yanboliang
Github user yanboliang commented on a diff in the pull request: https://github.com/apache/spark/pull/15777#discussion_r88256003 --- Diff: python/pyspark/ml/clustering.py --- @@ -346,6 +453,27 @@ def computeCost(self, dataset): """ return

[GitHub] spark pull request #15777: [SPARK-18282][ML][PYSPARK] Add python clustering ...

2016-11-16 Thread yanboliang
Github user yanboliang commented on a diff in the pull request: https://github.com/apache/spark/pull/15777#discussion_r88255016 --- Diff: python/pyspark/ml/classification.py --- @@ -309,13 +309,16 @@ def interceptVector(self): @since("2.0.0") def summary(self):

[GitHub] spark pull request #15777: [SPARK-18282][ML][PYSPARK] Add python clustering ...

2016-11-16 Thread yanboliang
Github user yanboliang commented on a diff in the pull request: https://github.com/apache/spark/pull/15777#discussion_r88261887 --- Diff: python/pyspark/ml/tests.py --- @@ -1097,6 +1097,44 @@ def test_logistic_regression_summary(self): sameSummary = model.evaluate(df)

[GitHub] spark pull request #15777: [SPARK-18282][ML][PYSPARK] Add python clustering ...

2016-11-15 Thread zhengruifeng
Github user zhengruifeng commented on a diff in the pull request: https://github.com/apache/spark/pull/15777#discussion_r87966225 --- Diff: python/pyspark/ml/clustering.py --- @@ -17,16 +17,75 @@ from pyspark import since, keyword_only from pyspark.ml.util import *

[GitHub] spark pull request #15777: [SPARK-18282][ML][PYSPARK] Add python clustering ...

2016-11-14 Thread sethah
Github user sethah commented on a diff in the pull request: https://github.com/apache/spark/pull/15777#discussion_r87841411 --- Diff: mllib/src/main/scala/org/apache/spark/ml/clustering/BisectingKMeans.scala --- @@ -132,7 +132,7 @@ class BisectingKMeansModel private[ml] (

[GitHub] spark pull request #15777: [SPARK-18282][ML][PYSPARK] Add python clustering ...

2016-11-05 Thread holdenk
Github user holdenk commented on a diff in the pull request: https://github.com/apache/spark/pull/15777#discussion_r86676702 --- Diff: python/pyspark/ml/tests.py --- @@ -1097,6 +1097,42 @@ def test_logistic_regression_summary(self): sameSummary = model.evaluate(df)

[GitHub] spark pull request #15777: [SPARK-18282][ML][PYSPARK] Add python clustering ...

2016-11-05 Thread holdenk
Github user holdenk commented on a diff in the pull request: https://github.com/apache/spark/pull/15777#discussion_r86676679 --- Diff: python/pyspark/ml/classification.py --- @@ -309,13 +309,16 @@ def interceptVector(self): @since("2.0.0") def summary(self):

[GitHub] spark pull request #15777: [SPARK-18282][ML][PYSPARK] Add python clustering ...

2016-11-04 Thread sethah
Github user sethah commented on a diff in the pull request: https://github.com/apache/spark/pull/15777#discussion_r86653312 --- Diff: python/pyspark/ml/classification.py --- @@ -309,13 +309,16 @@ def interceptVector(self): @since("2.0.0") def summary(self):

[GitHub] spark pull request #15777: [SPARK-18282][ML][PYSPARK] Add python clustering ...

2016-11-04 Thread sethah
Github user sethah commented on a diff in the pull request: https://github.com/apache/spark/pull/15777#discussion_r86653456 --- Diff: python/pyspark/ml/tests.py --- @@ -1097,6 +1097,42 @@ def test_logistic_regression_summary(self): sameSummary = model.evaluate(df)

[GitHub] spark pull request #15777: [SPARK-18282][ML][PYSPARK] Add python clustering ...

2016-11-04 Thread sethah
GitHub user sethah opened a pull request: https://github.com/apache/spark/pull/15777 [SPARK-18282][ML][PYSPARK] Add python clustering summaries for GMM and BKM ## What changes were proposed in this pull request? Add model summary APIs for `GaussianMixtureModel` and