[GitHub] spark pull request: [SPARK-14464] [MLLIB] Better support for logis...

2016-04-28 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/12761#issuecomment-215592560 Merged build finished. Test FAILed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your projec

[GitHub] spark pull request: [SPARK-14972] Improve performance of JSON sche...

2016-04-28 Thread JoshRosen
Github user JoshRosen commented on the pull request: https://github.com/apache/spark/pull/12750#issuecomment-215592580 Maybe, but my hunch is that it's going to be slower and won't save much code.

[GitHub] spark pull request: [SPARK-14987] [SQL] inline hive-service (cli) ...

2016-04-28 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/12764#issuecomment-215592206 **[Test build #2912 has started](https://amplab.cs.berkeley.edu/jenkins/job/NewSparkPullRequestBuilder/2912/consoleFull)** for PR 12764 at commit [`a07440b`](https://g

[GitHub] spark pull request: [SPARK-14989][BUILD] Upgrade Jackson from 2.5....

2016-04-28 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/12766#issuecomment-215592028 Merged build finished. Test PASSed.

[GitHub] spark pull request: [SPARK-14989][BUILD] Upgrade Jackson from 2.5....

2016-04-28 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/12766#issuecomment-215592032 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/

[GitHub] spark pull request: [SPARK-14989][BUILD] Upgrade Jackson from 2.5....

2016-04-28 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/12766#issuecomment-215591871 **[Test build #57282 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/57282/consoleFull)** for PR 12766 at commit [`264f76f`](https://g

[GitHub] spark pull request: [SPARK-14991][SQL] Remove HiveNativeCommand

2016-04-28 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/12769#issuecomment-215591840 **[Test build #57284 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/57284/consoleFull)** for PR 12769 at commit [`b0eed5e`](https://gi

[GitHub] spark pull request: [SPARK-14991][SQL] Remove HiveNativeCommand

2016-04-28 Thread rxin
Github user rxin commented on the pull request: https://github.com/apache/spark/pull/12769#issuecomment-215591665 Note that HiveCompatibilitySuite will fail because I haven't removed the unsupported commands yet.

[GitHub] spark pull request: [SPARK-14991][SQL] Remove HiveNativeCommand

2016-04-28 Thread rxin
GitHub user rxin opened a pull request: https://github.com/apache/spark/pull/12769 [SPARK-14991][SQL] Remove HiveNativeCommand ## What changes were proposed in this pull request? This patch removes HiveNativeCommand, so we can continue to remove the dependency on Hive.

[GitHub] spark pull request: [SPARK-14862][ML] Updated Classifiers to not r...

2016-04-28 Thread asfgit
Github user asfgit closed the pull request at: https://github.com/apache/spark/pull/12663

[GitHub] spark pull request: [SPARK-14862][ML] Updated Classifiers to not r...

2016-04-28 Thread jkbradley
Github user jkbradley commented on the pull request: https://github.com/apache/spark/pull/12663#issuecomment-215590524 Thanks @MLnick @sethah @thunterdb for reviewing! Merging with master

[GitHub] spark pull request: [SPARK-14613][ML] Add @Since into the matrix a...

2016-04-28 Thread dbtsai
Github user dbtsai commented on the pull request: https://github.com/apache/spark/pull/12416#issuecomment-215587478 Thanks. Merged into master.

[GitHub] spark pull request: [SPARK-14732][ML] spark.ml GaussianMixture sho...

2016-04-28 Thread dbtsai
Github user dbtsai commented on the pull request: https://github.com/apache/spark/pull/12593#issuecomment-215588256 @jkbradley The `@Since` annotation was merged https://github.com/apache/spark/pull/12416 Could you submit a followup PR? Thanks.

[GitHub] spark pull request: [SPARK-14613][ML] Add @Since into the matrix a...

2016-04-28 Thread asfgit
Github user asfgit closed the pull request at: https://github.com/apache/spark/pull/12416

[GitHub] spark pull request: [SPARK-14613][ML] Add @Since into the matrix a...

2016-04-28 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/12416#issuecomment-215587557 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/

[GitHub] spark pull request: [SPARK-14613][ML] Add @Since into the matrix a...

2016-04-28 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/12416#issuecomment-215587555 Merged build finished. Test PASSed.

[GitHub] spark pull request: [SPARK-14613][ML] Add @Since into the matrix a...

2016-04-28 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/12416#issuecomment-215587391 **[Test build #57276 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/57276/consoleFull)** for PR 12416 at commit [`8b57dc0`](https://g

[GitHub] spark pull request: [SPARK-14987] [SQL] inline hive-service (cli) ...

2016-04-28 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/12764#issuecomment-215585506 Merged build finished. Test FAILed.

[GitHub] spark pull request: [SPARK-14987] [SQL] inline hive-service (cli) ...

2016-04-28 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/12764#issuecomment-215585508 Test FAILed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/

[GitHub] spark pull request: [SPARK-14987] [SQL] inline hive-service (cli) ...

2016-04-28 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/12764#issuecomment-215585359 **[Test build #57279 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/57279/consoleFull)** for PR 12764 at commit [`a07440b`](https://g

[GitHub] spark pull request: [SPARK-14972] Improve performance of JSON sche...

2016-04-28 Thread NathanHowell
Github user NathanHowell commented on the pull request: https://github.com/apache/spark/pull/12750#issuecomment-215585269 Would Guava's `Iterables.mergeSorted[T]` help out here?

[GitHub] spark pull request: [SPARK-14988][PYTHON] SparkSession catalog and...

2016-04-28 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/12765#issuecomment-215584504 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/

[GitHub] spark pull request: [SPARK-14988][PYTHON] SparkSession catalog and...

2016-04-28 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/12765#issuecomment-215584501 Merged build finished. Test PASSed.

[GitHub] spark pull request: [SPARK-14988][PYTHON] SparkSession catalog and...

2016-04-28 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/12765#issuecomment-215584321 **[Test build #57280 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/57280/consoleFull)** for PR 12765 at commit [`d32ee8c`](https://g

[GitHub] spark pull request: [SPARK-14988][PYTHON] SparkSession catalog and...

2016-04-28 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/12765#issuecomment-215583865 Merged build finished. Test PASSed.

[GitHub] spark pull request: [SPARK-14988][PYTHON] SparkSession catalog and...

2016-04-28 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/12765#issuecomment-215583867 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/

[GitHub] spark pull request: [SPARK-14988][PYTHON] SparkSession catalog and...

2016-04-28 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/12765#issuecomment-215583680 **[Test build #57278 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/57278/consoleFull)** for PR 12765 at commit [`a263f74`](https://g

[GitHub] spark pull request: [SPARK-14613][ML] Add @Since into the matrix a...

2016-04-28 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/12416#issuecomment-215583165 Merged build finished. Test PASSed.

[GitHub] spark pull request: [SPARK-14613][ML] Add @Since into the matrix a...

2016-04-28 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/12416#issuecomment-215583167 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/

[GitHub] spark pull request: [SPARK-14613][ML] Add @Since into the matrix a...

2016-04-28 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/12416#issuecomment-215582963 **[Test build #57273 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/57273/consoleFull)** for PR 12416 at commit [`f81616e`](https://g

[GitHub] spark pull request: [SPARK-11940][PYSPARK][ML] Python API for ml.c...

2016-04-28 Thread jkbradley
Github user jkbradley commented on the pull request: https://github.com/apache/spark/pull/12723#issuecomment-215581769 Ready now?

[GitHub] spark pull request: [SPARK-14555] Second cut of Python API for Str...

2016-04-28 Thread asfgit
Github user asfgit closed the pull request at: https://github.com/apache/spark/pull/12673

[GitHub] spark pull request: [SPARK-14716][SQL] Added support for partition...

2016-04-28 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/12409#issuecomment-215581159 **[Test build #2911 has started](https://amplab.cs.berkeley.edu/jenkins/job/NewSparkPullRequestBuilder/2911/consoleFull)** for PR 12409 at commit [`baf837f`](https://g

[GitHub] spark pull request: [SPARK-14716][SQL] Added support for partition...

2016-04-28 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/12409#issuecomment-215581123 **[Test build #2910 has started](https://amplab.cs.berkeley.edu/jenkins/job/NewSparkPullRequestBuilder/2910/consoleFull)** for PR 12409 at commit [`baf837f`](https://g

[GitHub] spark pull request: [SPARK-14555] Second cut of Python API for Str...

2016-04-28 Thread tdas
Github user tdas commented on the pull request: https://github.com/apache/spark/pull/12673#issuecomment-215579990 Merging this. Thank you very much @brkyvz

[GitHub] spark pull request: [SPARK-14555] Second cut of Python API for Str...

2016-04-28 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/12673#issuecomment-215578529 **[Test build #2909 has finished](https://amplab.cs.berkeley.edu/jenkins/job/NewSparkPullRequestBuilder/2909/consoleFull)** for PR 12673 at commit [`37da1e1`](https://

[GitHub] spark pull request: [SPARK-14555] Second cut of Python API for Str...

2016-04-28 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/12673#issuecomment-215578315 **[Test build #2908 has finished](https://amplab.cs.berkeley.edu/jenkins/job/NewSparkPullRequestBuilder/2908/consoleFull)** for PR 12673 at commit [`37da1e1`](https://

[GitHub] spark pull request: [SPARK-14990][SQL] nvl, coalesce, array with p...

2016-04-28 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/12768#issuecomment-215577290 Can one of the admins verify this patch?

[GitHub] spark pull request: [SPARK-14990][SQL] nvl, coalesce, array with p...

2016-04-28 Thread dosoft
GitHub user dosoft opened a pull request: https://github.com/apache/spark/pull/12768 [SPARK-14990][SQL] nvl, coalesce, array with parameter of type 'array' ## What changes were proposed in this pull request? (Please fill in changes proposed in this fix) ## How

[GitHub] spark pull request: [SPARK-11940][PYSPARK][ML] Python API for ml.c...

2016-04-28 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/12723#issuecomment-215576476 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/

[GitHub] spark pull request: [SPARK-11940][PYSPARK][ML] Python API for ml.c...

2016-04-28 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/12723#issuecomment-215576473 Merged build finished. Test PASSed.

[GitHub] spark pull request: [SPARK-11940][PYSPARK][ML] Python API for ml.c...

2016-04-28 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/12723#issuecomment-215576378 **[Test build #57281 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/57281/consoleFull)** for PR 12723 at commit [`f37c1c1`](https://g

[GitHub] spark pull request: [SPARK-14837][SQL][STREAMING] Added support in...

2016-04-28 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/12616#issuecomment-215576137 Merged build finished. Test PASSed.

[GitHub] spark pull request: [SPARK-14837][SQL][STREAMING] Added support in...

2016-04-28 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/12616#issuecomment-215576141 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/

[GitHub] spark pull request: [SPARK-14837][SQL][STREAMING] Added support in...

2016-04-28 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/12616#issuecomment-215575885 **[Test build #57275 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/57275/consoleFull)** for PR 12616 at commit [`b011b5e`](https://g

[GitHub] spark pull request: [SPARK-14464] [MLLIB] Better support for logis...

2016-04-28 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/12761#issuecomment-215575102 **[Test build #57283 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/57283/consoleFull)** for PR 12761 at commit [`aa7585a`](https://gi

[GitHub] spark pull request: [SPARK-14802] [SQL] [WIP] Disable Passing to H...

2016-04-28 Thread gatorsmile
Github user gatorsmile commented on the pull request: https://github.com/apache/spark/pull/12692#issuecomment-215574915 Another option is to use a different API to drop multiple partitions by a single command. ```JAVA public List dropPartitions(String dbName, String tblNa

[GitHub] spark pull request: [SPARK-14555] Second cut of Python API for Str...

2016-04-28 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/12673#issuecomment-215574717 **[Test build #2909 has started](https://amplab.cs.berkeley.edu/jenkins/job/NewSparkPullRequestBuilder/2909/consoleFull)** for PR 12673 at commit [`37da1e1`](https://g

[GitHub] spark pull request: [SPARK-14850][ML] convert primitive array from...

2016-04-28 Thread mengxr
Github user mengxr commented on a diff in the pull request: https://github.com/apache/spark/pull/12640#discussion_r61509114 --- Diff: sql/catalyst/src/main/java/org/apache/spark/sql/catalyst/expressions/UnsafeArrayData.java --- @@ -336,4 +336,78 @@ public UnsafeArrayData copy() {

[GitHub] spark pull request: [SPARK-14850][ML] convert primitive array from...

2016-04-28 Thread mengxr
Github user mengxr commented on a diff in the pull request: https://github.com/apache/spark/pull/12640#discussion_r61509122 --- Diff: sql/catalyst/src/main/java/org/apache/spark/sql/catalyst/expressions/UnsafeArrayData.java --- @@ -336,4 +336,62 @@ public UnsafeArrayData copy() {

[GitHub] spark pull request: [SPARK-14555] Second cut of Python API for Str...

2016-04-28 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/12673#issuecomment-215574347 **[Test build #2908 has started](https://amplab.cs.berkeley.edu/jenkins/job/NewSparkPullRequestBuilder/2908/consoleFull)** for PR 12673 at commit [`37da1e1`](https://g

[GitHub] spark pull request: [SPARK-14464] [MLLIB] Better support for logis...

2016-04-28 Thread dbtsai
Github user dbtsai commented on the pull request: https://github.com/apache/spark/pull/12761#issuecomment-215573982 Jenkins, retest this please

[GitHub] spark pull request: [SPARK-14464] [MLLIB] Better support for logis...

2016-04-28 Thread dbtsai
Github user dbtsai commented on the pull request: https://github.com/apache/spark/pull/12761#issuecomment-215573910 Jenkins, test this please

[GitHub] spark pull request: [SPARK-14464] [MLLIB] Better support for logis...

2016-04-28 Thread dbtsai
Github user dbtsai commented on the pull request: https://github.com/apache/spark/pull/12761#issuecomment-215573882 test this please

[GitHub] spark pull request: [SPARK-14464] [MLLIB] Better support for logis...

2016-04-28 Thread dbtsai
Github user dbtsai commented on the pull request: https://github.com/apache/spark/pull/12761#issuecomment-215573656 Jenkins, please test it again.

[GitHub] spark pull request: [SPARK-14978][PySpark] PySpark TrainValidation...

2016-04-28 Thread vectorijk
Github user vectorijk commented on a diff in the pull request: https://github.com/apache/spark/pull/12767#discussion_r61508511 --- Diff: python/pyspark/ml/tests.py --- @@ -586,10 +589,13 @@ def test_fit_maximize_metric(self): tvsModel = tvs.fit(dataset) bes

[GitHub] spark pull request: [SPARK-14978][PySpark] PySpark TrainValidation...

2016-04-28 Thread vectorijk
Github user vectorijk commented on a diff in the pull request: https://github.com/apache/spark/pull/12767#discussion_r61508260 --- Diff: python/pyspark/ml/tests.py --- @@ -616,6 +622,7 @@ def test_save_load(self): tvsModel.save(tvsModelPath) loadedModel = T

[GitHub] spark pull request: [SPARK-14464] [MLLIB] Better support for logis...

2016-04-28 Thread daniel-siegmann-aol
Github user daniel-siegmann-aol commented on the pull request: https://github.com/apache/spark/pull/12761#issuecomment-215572538 Thanks. I'm pretty confident this does what it's supposed to do; my main concern is to make sure performance doesn't degrade for anything else. The t

[GitHub] spark pull request: [SPARK-14830][SQL] Add RemoveRepetitionFromGro...

2016-04-28 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/12590#issuecomment-215572360 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/

[GitHub] spark pull request: [SPARK-14830][SQL] Add RemoveRepetitionFromGro...

2016-04-28 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/12590#issuecomment-215572352 Merged build finished. Test PASSed.

[GitHub] spark pull request: [SPARK-14830][SQL] Add RemoveRepetitionFromGro...

2016-04-28 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/12590#issuecomment-215572100 **[Test build #57274 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/57274/consoleFull)** for PR 12590 at commit [`e3bdc16`](https://g

[GitHub] spark pull request: [SPARK-14978][PySpark] PySpark TrainValidation...

2016-04-28 Thread vectorijk
Github user vectorijk commented on a diff in the pull request: https://github.com/apache/spark/pull/12767#discussion_r61507716 --- Diff: python/pyspark/ml/tuning.py --- @@ -613,7 +615,9 @@ def copy(self, extra=None): """ if extra is None: e

[GitHub] spark pull request: [SPARK-14858][SQL] Enable subquery pushdown

2016-04-28 Thread davies
Github user davies commented on a diff in the pull request: https://github.com/apache/spark/pull/12720#discussion_r61507610 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/optimizer/Optimizer.scala --- @@ -644,9 +643,10 @@ object InferFiltersFromConstraints ext

[GitHub] spark pull request: [SPARK-14555] Second cut of Python API for Str...

2016-04-28 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/12673#issuecomment-215571604 **[Test build #2907 has finished](https://amplab.cs.berkeley.edu/jenkins/job/NewSparkPullRequestBuilder/2907/consoleFull)** for PR 12673 at commit [`37da1e1`](https://

[GitHub] spark pull request: [SPARK-14555] Second cut of Python API for Str...

2016-04-28 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/12673#issuecomment-215570990 **[Test build #2906 has finished](https://amplab.cs.berkeley.edu/jenkins/job/NewSparkPullRequestBuilder/2906/consoleFull)** for PR 12673 at commit [`37da1e1`](https://

[GitHub] spark pull request: [SPARK-14464] [MLLIB] Better support for logis...

2016-04-28 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/12761#issuecomment-215570785 Merged build finished. Test FAILed.

[GitHub] spark pull request: [SPARK-14464] [MLLIB] Better support for logis...

2016-04-28 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/12761#issuecomment-215570788 Test FAILed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/

[GitHub] spark pull request: [SPARK-14989][BUILD] Upgrade Jackson from 2.5....

2016-04-28 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/12766#issuecomment-215570639 **[Test build #57282 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/57282/consoleFull)** for PR 12766 at commit [`264f76f`](https://gi

[GitHub] spark pull request: [SPARK-14464] [MLLIB] Better support for logis...

2016-04-28 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/12761#issuecomment-215570576 **[Test build #57272 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/57272/consoleFull)** for PR 12761 at commit [`aa7585a`](https://g

[GitHub] spark pull request: [SPARK-14978][PySpark] PySpark TrainValidation...

2016-04-28 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/12767#issuecomment-215570555 Can one of the admins verify this patch?

[GitHub] spark pull request: [SPARK-14978][PySpark] PySpark TrainValidation...

2016-04-28 Thread taku-k
GitHub user taku-k opened a pull request: https://github.com/apache/spark/pull/12767 [SPARK-14978][PySpark] PySpark TrainValidationSplitModel should support validationMetrics ## What changes were proposed in this pull request? This pull request includes supporting validatio

[GitHub] spark pull request: [SPARK-14555] Second cut of Python API for Str...

2016-04-28 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/12673#issuecomment-215569984 **[Test build #2905 has finished](https://amplab.cs.berkeley.edu/jenkins/job/NewSparkPullRequestBuilder/2905/consoleFull)** for PR 12673 at commit [`37da1e1`](https://

[GitHub] spark pull request: [SPARK-14972] Improve performance of JSON sche...

2016-04-28 Thread JoshRosen
Github user JoshRosen commented on the pull request: https://github.com/apache/spark/pull/12750#issuecomment-215569915 /cc @NathanHowell, FYI.

[GitHub] spark pull request: [SPARK-14989] Upgrade Jackson from 2.5.3 to 2....

2016-04-28 Thread JoshRosen
GitHub user JoshRosen opened a pull request: https://github.com/apache/spark/pull/12766 [SPARK-14989] Upgrade Jackson from 2.5.3 to 2.7.3 This patch upgrades Jackson from 2.5.3 to 2.7.3. I'd like to upgrade now in order to take advantage of new performance improvements and features,

[GitHub] spark pull request: [SPARK-14858][SQL] Enable subquery pushdown

2016-04-28 Thread davies
Github user davies commented on a diff in the pull request: https://github.com/apache/spark/pull/12720#discussion_r61505828 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/analysis/Analyzer.scala --- @@ -442,7 +443,7 @@ class Analyzer( */ object

[GitHub] spark pull request: [SPARK-14850][ML] convert primitive array from...

2016-04-28 Thread mengxr
Github user mengxr commented on a diff in the pull request: https://github.com/apache/spark/pull/12640#discussion_r61505845 --- Diff: mllib/src/test/scala/org/apache/spark/mllib/linalg/UDTSerializationBenchmark.scala --- @@ -0,0 +1,70 @@ +/* + * Licensed to the Apache Soft

[GitHub] spark pull request: [SPARK-11940][PYSPARK][ML] Python API for ml.c...

2016-04-28 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/12723#issuecomment-215566571 **[Test build #57281 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/57281/consoleFull)** for PR 12723 at commit [`f37c1c1`](https://gi

[GitHub] spark pull request: [SPARK-14464] [MLLIB] Better support for logis...

2016-04-28 Thread dbtsai
Github user dbtsai commented on the pull request: https://github.com/apache/spark/pull/12761#issuecomment-215564655 Seems to be very promising. Since the 2.0 window will close soon, it's unlikely to get into 2.0. Let's target 2.1.

[GitHub] spark pull request: [SPARK-12810][PySpark] PySpark CrossValidatorM...

2016-04-28 Thread asfgit
Github user asfgit closed the pull request at: https://github.com/apache/spark/pull/12464

[GitHub] spark pull request: [SPARK-14939][SQL] Improve EliminateSorts opti...

2016-04-28 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/12719#issuecomment-215564542 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/

[GitHub] spark pull request: [SPARK-11940][PYSPARK][ML] Python API for ml.c...

2016-04-28 Thread jkbradley
Github user jkbradley commented on the pull request: https://github.com/apache/spark/pull/12723#issuecomment-215566123 test this please

[GitHub] spark pull request: [SPARK-14555] Second cut of Python API for Str...

2016-04-28 Thread tdas
Github user tdas commented on the pull request: https://github.com/apache/spark/pull/12673#issuecomment-215565677 LGTM. I am testing this for flakiness, after which I will merge it. The lack of ctx.streams is causing flakiness and blocking other PRs.

[GitHub] spark pull request: [SPARK-12810][PySpark] PySpark CrossValidatorM...

2016-04-28 Thread jkbradley
Github user jkbradley commented on the pull request: https://github.com/apache/spark/pull/12464#issuecomment-215565488 LGTM. Merging with master. Thanks! --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your proje

[GitHub] spark pull request: [SPARK-14555] Second cut of Python API for Str...

2016-04-28 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/12673#issuecomment-215565423 **[Test build #2907 has started](https://amplab.cs.berkeley.edu/jenkins/job/NewSparkPullRequestBuilder/2907/consoleFull)** for PR 12673 at commit [`37da1e1`](https://g

[GitHub] spark pull request: [SPARK-12810][PySpark] PySpark CrossValidatorM...

2016-04-28 Thread jkbradley
Github user jkbradley commented on a diff in the pull request: https://github.com/apache/spark/pull/12464#discussion_r61504179 --- Diff: python/pyspark/ml/tests.py --- @@ -461,6 +461,31 @@ def _fit(self, dataset): class CrossValidatorTests(PySparkTestCase): +

[GitHub] spark pull request: [SPARK-14555] Second cut of Python API for Str...

2016-04-28 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/12673#issuecomment-215565371 **[Test build #2906 has started](https://amplab.cs.berkeley.edu/jenkins/job/NewSparkPullRequestBuilder/2906/consoleFull)** for PR 12673 at commit [`37da1e1`](https://g

[GitHub] spark pull request: [SPARK-14555] Second cut of Python API for Str...

2016-04-28 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/12673#issuecomment-215565327 **[Test build #2905 has started](https://amplab.cs.berkeley.edu/jenkins/job/NewSparkPullRequestBuilder/2905/consoleFull)** for PR 12673 at commit [`37da1e1`](https://g

[GitHub] spark pull request: [SPARK-14987] [SQL] inline hive-service (cli) ...

2016-04-28 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/12764#issuecomment-215565289 **[Test build #57279 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/57279/consoleFull)** for PR 12764 at commit [`a07440b`](https://gi

[GitHub] spark pull request: [SPARK-14988][PYTHON] SparkSession catalog and...

2016-04-28 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/12765#issuecomment-215565286 **[Test build #57280 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/57280/consoleFull)** for PR 12765 at commit [`d32ee8c`](https://gi

[GitHub] spark pull request: [SPARK-14939][SQL] Improve EliminateSorts opti...

2016-04-28 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/12719#issuecomment-215564535 Merged build finished. Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your projec

[GitHub] spark pull request: [SPARK-14939][SQL] Improve EliminateSorts opti...

2016-04-28 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/12719#issuecomment-215564274 **[Test build #57271 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/57271/consoleFull)** for PR 12719 at commit [`b7deb89`](https://g

[GitHub] spark pull request: [SPARK-14464] [MLLIB] Better support for logis...

2016-04-28 Thread daniel-siegmann-aol
Github user daniel-siegmann-aol commented on the pull request: https://github.com/apache/spark/pull/12761#issuecomment-215564208 Note those results above are the results from my production data. For comparison, I'm told by one of our data scientists the training can be done locally wi

[GitHub] spark pull request: [SPARK-14988][PYTHON] SparkSession catalog and...

2016-04-28 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/12765#issuecomment-215564024 **[Test build #57278 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/57278/consoleFull)** for PR 12765 at commit [`a263f74`](https://gi

[GitHub] spark pull request: [SPARK-14882] [DOCS] Clarify that Spark can be...

2016-04-28 Thread srowen
Github user srowen commented on a diff in the pull request: https://github.com/apache/spark/pull/12757#discussion_r61503201 --- Diff: docs/programming-guide.md --- @@ -24,7 +24,8 @@ along with if you launch Spark's interactive shell -- either `bin/spark-shell` f

[GitHub] spark pull request: [SPARK-14988][Python] Python SparkSession cata...

2016-04-28 Thread andrewor14
GitHub user andrewor14 opened a pull request: https://github.com/apache/spark/pull/12765 [SPARK-14988][Python] Python SparkSession catalog and conf API ## What changes were proposed in this pull request? The `catalog` and `conf` APIs were exposed in `SparkSession` in #12713

[GitHub] spark pull request: [SPARK-14891][ML] Add schema validation for AL...

2016-04-28 Thread BenFradet
Github user BenFradet commented on the pull request: https://github.com/apache/spark/pull/12762#issuecomment-215563782 LGTM except for a few minors. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not

[GitHub] spark pull request: [SPARK-14837][SQL][STREAMING] Added support in...

2016-04-28 Thread tdas
Github user tdas commented on the pull request: https://github.com/apache/spark/pull/12616#issuecomment-215562885 @marmbrus Please take a look once again. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project do

[GitHub] spark pull request: [SPARK-14802] [SQL] [WIP] Disable Passing to H...

2016-04-28 Thread gatorsmile
Github user gatorsmile commented on the pull request: https://github.com/apache/spark/pull/12692#issuecomment-215562689 Found a serious issue in the existing `Drop Partition`. Now, we are able to drop multiple partitions using a single call. However, this could break atomicity. I thin

[GitHub] spark pull request: [SPARK-14891][ML] Add schema validation for AL...

2016-04-28 Thread BenFradet
Github user BenFradet commented on a diff in the pull request: https://github.com/apache/spark/pull/12762#discussion_r61502301 --- Diff: mllib/src/test/scala/org/apache/spark/ml/util/MLTestingUtils.scala --- @@ -58,6 +58,30 @@ object MLTestingUtils extends SparkFunSuite {

[GitHub] spark pull request: [SPARK-14464] [MLLIB] Better support for logis...

2016-04-28 Thread daniel-siegmann-aol
Github user daniel-siegmann-aol commented on the pull request: https://github.com/apache/spark/pull/12761#issuecomment-215561740 I'll give the results of my own training flow too. Testing was done on EMR 4.4.0 with Spark 1.6.0. The cluster was configured with six r3.8xlarge nodes: one
