[GitHub] spark pull request #15149: [SPARK-17057] [ML] ProbabilisticClassifierModels'...

2016-09-20 Thread srowen
Github user srowen commented on a diff in the pull request: https://github.com/apache/spark/pull/15149#discussion_r79631445 --- Diff: mllib/src/main/scala/org/apache/spark/ml/classification/ProbabilisticClassifier.scala --- @@ -200,26 +200,9 @@ abstract class

[GitHub] spark issue #15160: [SPARK-17609][SQL] SessionCatalog.tableExists should not...

2016-09-20 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/15160 **[Test build #65654 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/65654/consoleFull)** for PR 15160 at commit

[GitHub] spark pull request #12819: [SPARK-14077][ML] Refactor NaiveBayes to support ...

2016-09-20 Thread sethah
Github user sethah commented on a diff in the pull request: https://github.com/apache/spark/pull/12819#discussion_r79632172 --- Diff: mllib/src/test/scala/org/apache/spark/ml/classification/NaiveBayesSuite.scala --- @@ -150,6 +150,75 @@ class NaiveBayesSuite extends SparkFunSuite

[GitHub] spark issue #15149: [SPARK-17057] [ML] ProbabilisticClassifierModels' thresh...

2016-09-20 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/15149 **[Test build #65660 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/65660/consoleFull)** for PR 15149 at commit

[GitHub] spark issue #15150: [SPARK-17595] [MLLib] Use a bounded priority queue to fi...

2016-09-20 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/15150 **[Test build #65661 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/65661/consoleFull)** for PR 15150 at commit

[GitHub] spark issue #15005: [SPARK-17421] [DOCS] Documenting the current treatment o...

2016-09-20 Thread frreiss
Github user frreiss commented on the issue: https://github.com/apache/spark/pull/15005 I've about narrowed down the options that work for OpenJDK 7 and 8 on Mac and Linux. Working on IBM Java on Linux. I can have an update in by EOD today. BTW, one thing that's been slowing me

[GitHub] spark issue #15150: [SPARK-17595] [MLLib] Use a bounded priority queue to fi...

2016-09-20 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/15150 **[Test build #65661 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/65661/consoleFull)** for PR 15150 at commit

[GitHub] spark issue #15150: [SPARK-17595] [MLLib] Use a bounded priority queue to fi...

2016-09-20 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/15150 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/65661/ Test PASSed. ---

[GitHub] spark issue #15150: [SPARK-17595] [MLLib] Use a bounded priority queue to fi...

2016-09-20 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/15150 Merged build finished. Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature

[GitHub] spark issue #14634: [SPARK-17051][SQL] we should use hadoopConf in InsertInt...

2016-09-20 Thread yhuai
Github user yhuai commented on the issue: https://github.com/apache/spark/pull/14634 LGTM. Merging to master and branch 2.0.

[GitHub] spark issue #13152: [SPARK-15353] [CORE] Making peer selection for block rep...

2016-09-20 Thread shubhamchopra
Github user shubhamchopra commented on the issue: https://github.com/apache/spark/pull/13152 Rebased to master to resolve merge conflicts

[GitHub] spark issue #13513: [SPARK-15698][SQL][Streaming] Add the ability to remove ...

2016-09-20 Thread zsxwing
Github user zsxwing commented on the issue: https://github.com/apache/spark/pull/13513 LGTM. Thanks! Merging to master and 2.0.

[GitHub] spark issue #15161: [SPARK-17611][yarn][test] Make shuffle service test real...

2016-09-20 Thread vanzin
Github user vanzin commented on the issue: https://github.com/apache/spark/pull/15161 @tgravescs

[GitHub] spark issue #14808: [SPARK-17156][ML][EXAMPLE] Add multiclass logistic regre...

2016-09-20 Thread sethah
Github user sethah commented on the issue: https://github.com/apache/spark/pull/14808 I think we should add an example for multiclass classification, to demonstrate that it works in 2.1+. We also need to update the documentation that says we only support binary.

[GitHub] spark issue #14897: [SPARK-17338][SQL] add global temp view

2016-09-20 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/14897 **[Test build #65662 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/65662/consoleFull)** for PR 14897 at commit

[GitHub] spark issue #15140: [SPARK-17585][PySpark][Core] PySpark SparkContext.addFil...

2016-09-20 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/15140 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/65652/ Test PASSed. ---

[GitHub] spark issue #15148: [SPARK-5992][ML] Locality Sensitive Hashing

2016-09-20 Thread Yunni
Github user Yunni commented on the issue: https://github.com/apache/spark/pull/15148 Hi @sethah, Thanks for the comments. - I agree. I have moved `lsh` package to be under `feature` - In "Similarity search in high dimensions via hashing", there is an algorithm in the

[GitHub] spark issue #15140: [SPARK-17585][PySpark][Core] PySpark SparkContext.addFil...

2016-09-20 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/15140 Merged build finished. Test PASSed.

[GitHub] spark issue #14426: [SPARK-16475][SQL] Broadcast Hint for SQL Queries

2016-09-20 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/14426 **[Test build #65663 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/65663/consoleFull)** for PR 14426 at commit

[GitHub] spark issue #14116: [SPARK-16452][SQL] Support basic INFORMATION_SCHEMA

2016-09-20 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/14116 **[Test build #65664 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/65664/consoleFull)** for PR 14116 at commit

[GitHub] spark issue #15149: [SPARK-17057] [ML] ProbabilisticClassifierModels' thresh...

2016-09-20 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/15149 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/65659/ Test PASSed. ---

[GitHub] spark issue #15149: [SPARK-17057] [ML] ProbabilisticClassifierModels' thresh...

2016-09-20 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/15149 Merged build finished. Test PASSed.

[GitHub] spark issue #14638: [SPARK-11374][SQL] Support `skip.header.line.count` opti...

2016-09-20 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/14638 **[Test build #65666 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/65666/consoleFull)** for PR 14638 at commit

[GitHub] spark issue #14623: [SPARK-17044][SQL] Make test files for window functions ...

2016-09-20 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/14623 **[Test build #65667 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/65667/consoleFull)** for PR 14623 at commit

[GitHub] spark pull request #13680: [SPARK-15962][SQL] Introduce implementation with ...

2016-09-20 Thread cloud-fan
Github user cloud-fan commented on a diff in the pull request: https://github.com/apache/spark/pull/13680#discussion_r79653637 --- Diff: sql/catalyst/src/main/java/org/apache/spark/sql/catalyst/expressions/UnsafeArrayData.java --- @@ -43,10 +43,11 @@ * * In the

[GitHub] spark issue #15149: [SPARK-17057] [ML] ProbabilisticClassifierModels' thresh...

2016-09-20 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/15149 **[Test build #65660 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/65660/consoleFull)** for PR 15149 at commit

[GitHub] spark issue #13680: [SPARK-15962][SQL] Introduce implementation with a dense...

2016-09-20 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/13680 **[Test build #65657 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/65657/consoleFull)** for PR 13680 at commit

[GitHub] spark pull request #14467: [SPARK-16861][PYSPARK][CORE] Refactor PySpark acc...

2016-09-20 Thread holdenk
Github user holdenk commented on a diff in the pull request: https://github.com/apache/spark/pull/14467#discussion_r79661108 --- Diff: core/src/main/scala/org/apache/spark/api/python/PythonRDD.scala --- @@ -866,11 +866,14 @@ class BytesToString extends

[GitHub] spark issue #13822: [SPARK-16115][SQL] Improve error message on non-existent...

2016-09-20 Thread cloud-fan
Github user cloud-fan commented on the issue: https://github.com/apache/spark/pull/13822 does the problem still exist?

[GitHub] spark issue #14818: [SPARK-17157][SPARKR][WIP]: Add multiclass logistic regr...

2016-09-20 Thread wangmiao1981
Github user wangmiao1981 commented on the issue: https://github.com/apache/spark/pull/14818 Thanks! I will update the PR accordingly.

[GitHub] spark issue #15149: [SPARK-17057] [ML] ProbabilisticClassifierModels' thresh...

2016-09-20 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/15149 **[Test build #65659 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/65659/consoleFull)** for PR 15149 at commit

[GitHub] spark pull request #15149: [SPARK-17057] [ML] ProbabilisticClassifierModels'...

2016-09-20 Thread srowen
Github user srowen commented on a diff in the pull request: https://github.com/apache/spark/pull/15149#discussion_r79635467 --- Diff: mllib/src/main/scala/org/apache/spark/ml/classification/ProbabilisticClassifier.scala --- @@ -200,26 +200,9 @@ abstract class

[GitHub] spark issue #15152: [SPARK-17365][Core] Remove/Kill multiple executors toget...

2016-09-20 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/15152 Test FAILed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/65655/ Test FAILed. ---

[GitHub] spark pull request #15148: [SPARK-5992][ML] Locality Sensitive Hashing

2016-09-20 Thread Yunni
Github user Yunni commented on a diff in the pull request: https://github.com/apache/spark/pull/15148#discussion_r79639164 --- Diff: mllib/src/main/scala/org/apache/spark/ml/lsh/LSH.scala --- @@ -0,0 +1,270 @@ +/* + * Licensed to the Apache Software Foundation (ASF) under

[GitHub] spark issue #15152: [SPARK-17365][Core] Remove/Kill multiple executors toget...

2016-09-20 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/15152 Merged build finished. Test FAILed.

[GitHub] spark issue #15149: [SPARK-17057] [ML] ProbabilisticClassifierModels' thresh...

2016-09-20 Thread sethah
Github user sethah commented on the issue: https://github.com/apache/spark/pull/15149 +1 for not changing the sum requirement. I agree that we need to restrict them to sum to something non-zero and all positive. Thanks for the clarification.

[GitHub] spark issue #15140: [SPARK-17585][PySpark][Core] PySpark SparkContext.addFil...

2016-09-20 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/15140 **[Test build #65652 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/65652/consoleFull)** for PR 15140 at commit

[GitHub] spark issue #14897: [SPARK-17338][SQL][WIP] add global temp view

2016-09-20 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/14897 **[Test build #65668 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/65668/consoleFull)** for PR 14897 at commit

[GitHub] spark pull request #15067: [SPARK-17513] [STREAMING] [SQL] Make StreamExecut...

2016-09-20 Thread frreiss
Github user frreiss closed the pull request at: https://github.com/apache/spark/pull/15067

[GitHub] spark issue #15149: [SPARK-17057] [ML] ProbabilisticClassifierModels' thresh...

2016-09-20 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/15149 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/65660/ Test PASSed. ---

[GitHub] spark issue #15149: [SPARK-17057] [ML] ProbabilisticClassifierModels' thresh...

2016-09-20 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/15149 Merged build finished. Test PASSed.

[GitHub] spark issue #15157: Revert "[SPARK-17549][SQL] Only collect table size stat ...

2016-09-20 Thread yhuai
Github user yhuai commented on the issue: https://github.com/apache/spark/pull/15157 I will merge this PR to master and branch 2.0 once it passes jenkins.

[GitHub] spark issue #14597: [SPARK-17017][MLLIB][ML] add a chiSquare Selector based ...

2016-09-20 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/14597 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/65658/ Test PASSed. ---

[GitHub] spark issue #14597: [SPARK-17017][MLLIB][ML] add a chiSquare Selector based ...

2016-09-20 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/14597 **[Test build #65658 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/65658/consoleFull)** for PR 14597 at commit

[GitHub] spark pull request #15149: [SPARK-17057] [ML] ProbabilisticClassifierModels'...

2016-09-20 Thread sethah
Github user sethah commented on a diff in the pull request: https://github.com/apache/spark/pull/15149#discussion_r79635015 --- Diff: mllib/src/main/scala/org/apache/spark/ml/classification/ProbabilisticClassifier.scala --- @@ -200,26 +200,9 @@ abstract class

[GitHub] spark pull request #15071: [SPARK-17517][SQL]Improve generated Code for Broa...

2016-09-20 Thread yaooqinn
Github user yaooqinn commented on a diff in the pull request: https://github.com/apache/spark/pull/15071#discussion_r79637407 --- Diff: sql/core/src/test/scala/org/apache/spark/sql/execution/benchmark/BroadcastHashJoinBenchmark.scala --- @@ -0,0 +1,84 @@ +/* + * Licensed

[GitHub] spark pull request #15150: [SPARK-17595] [MLLib] Use a bounded priority queu...

2016-09-20 Thread willb
Github user willb commented on a diff in the pull request: https://github.com/apache/spark/pull/15150#discussion_r79637489 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/feature/Word2Vec.scala --- @@ -580,10 +581,14 @@ class Word2VecModel private[spark] ( ind +=

[GitHub] spark issue #15149: [SPARK-17057] [ML] ProbabilisticClassifierModels' thresh...

2016-09-20 Thread sethah
Github user sethah commented on the issue: https://github.com/apache/spark/pull/15149 Requiring these thresholds to sum <= 1 seems entirely arbitrary. I don't know why thresholds that sum to `0.347` are any more valid than thresholds that sum to `347`. If these are not meant to

[GitHub] spark issue #15157: Revert "[SPARK-17549][SQL] Only collect table size stat ...

2016-09-20 Thread vanzin
Github user vanzin commented on the issue: https://github.com/apache/spark/pull/15157 If it's not too much trouble, keeping the workaround in CodeGenerator.scala would be nice. Otherwise LGTM.

[GitHub] spark issue #14116: [SPARK-16452][SQL] Support basic INFORMATION_SCHEMA

2016-09-20 Thread dongjoon-hyun
Github user dongjoon-hyun commented on the issue: https://github.com/apache/spark/pull/14116 Rebased to resolve conflicts.

[GitHub] spark issue #14897: [SPARK-17338][SQL][WIP] add global temp view

2016-09-20 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/14897 **[Test build #65665 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/65665/consoleFull)** for PR 14897 at commit

[GitHub] spark pull request #14634: [SPARK-17051][SQL] we should use hadoopConf in In...

2016-09-20 Thread asfgit
Github user asfgit closed the pull request at: https://github.com/apache/spark/pull/14634

[GitHub] spark issue #13680: [SPARK-15962][SQL] Introduce implementation with a dense...

2016-09-20 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/13680 Merged build finished. Test PASSed.

[GitHub] spark issue #13680: [SPARK-15962][SQL] Introduce implementation with a dense...

2016-09-20 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/13680 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/65657/ Test PASSed. ---

[GitHub] spark issue #14597: [SPARK-17017][MLLIB][ML] add a chiSquare Selector based ...

2016-09-20 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/14597 Merged build finished. Test PASSed.

[GitHub] spark issue #14623: [SPARK-17044][SQL] Make test files for window functions ...

2016-09-20 Thread dongjoon-hyun
Github user dongjoon-hyun commented on the issue: https://github.com/apache/spark/pull/14623 Regenerate the answer set.

[GitHub] spark pull request #13680: [SPARK-15962][SQL] Introduce implementation with ...

2016-09-20 Thread cloud-fan
Github user cloud-fan commented on a diff in the pull request: https://github.com/apache/spark/pull/13680#discussion_r79654687 --- Diff: sql/catalyst/src/main/java/org/apache/spark/sql/catalyst/expressions/codegen/UnsafeArrayWriter.java --- @@ -33,134 +37,213 @@ // The

[GitHub] spark issue #15157: Revert "[SPARK-17549][SQL] Only collect table size stat ...

2016-09-20 Thread yhuai
Github user yhuai commented on the issue: https://github.com/apache/spark/pull/15157 Done.

[GitHub] spark pull request #13680: [SPARK-15962][SQL] Introduce implementation with ...

2016-09-20 Thread cloud-fan
Github user cloud-fan commented on a diff in the pull request: https://github.com/apache/spark/pull/13680#discussion_r79654723 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/ScalaReflection.scala --- @@ -690,6 +690,7 @@ object ScalaReflection extends

[GitHub] spark pull request #15158: [SPARK-17603] [SQL] Utilize Hive-generated Statis...

2016-09-20 Thread cloud-fan
Github user cloud-fan commented on a diff in the pull request: https://github.com/apache/spark/pull/15158#discussion_r79656655 --- Diff: sql/hive/src/main/scala/org/apache/spark/sql/hive/MetastoreRelation.scala --- @@ -109,39 +109,59 @@ private[hive] case class MetastoreRelation(

[GitHub] spark issue #14897: [SPARK-17338][SQL] add global temp view

2016-09-20 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/14897 Merged build finished. Test FAILed.

[GitHub] spark issue #14897: [SPARK-17338][SQL] add global temp view

2016-09-20 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/14897 **[Test build #65662 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/65662/consoleFull)** for PR 14897 at commit

[GitHub] spark issue #14959: [SPARK-17387][PYSPARK] Creating SparkContext() from pyth...

2016-09-20 Thread vanzin
Github user vanzin commented on the issue: https://github.com/apache/spark/pull/14959 Are these tests flaky or is the failure related to this change? (Other PRs seem to be passing, so probably the latter?)

[GitHub] spark issue #14897: [SPARK-17338][SQL] add global temp view

2016-09-20 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/14897 Test FAILed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/65662/ Test FAILed. ---

[GitHub] spark issue #14897: [SPARK-17338][SQL][WIP] add global temp view

2016-09-20 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/14897 Test FAILed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/65665/ Test FAILed. ---

[GitHub] spark issue #14897: [SPARK-17338][SQL][WIP] add global temp view

2016-09-20 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/14897 **[Test build #65665 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/65665/consoleFull)** for PR 14897 at commit

[GitHub] spark issue #14897: [SPARK-17338][SQL][WIP] add global temp view

2016-09-20 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/14897 Merged build finished. Test FAILed.

[GitHub] spark issue #15157: Revert "[SPARK-17549][SQL] Only collect table size stat ...

2016-09-20 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/15157 **[Test build #65669 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/65669/consoleFull)** for PR 15157 at commit

[GitHub] spark issue #15102: [SPARK-17346][SQL] Add Kafka source for Structured Strea...

2016-09-20 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/15102 **[Test build #65636 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/65636/consoleFull)** for PR 15102 at commit

[GitHub] spark issue #15102: [SPARK-17346][SQL] Add Kafka source for Structured Strea...

2016-09-20 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/15102 Test FAILed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/65636/ Test FAILed. ---

[GitHub] spark issue #14959: [SPARK-17387][PYSPARK] Creating SparkContext() from pyth...

2016-09-20 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/14959 Merged build finished. Test FAILed.

[GitHub] spark issue #14959: [SPARK-17387][PYSPARK] Creating SparkContext() from pyth...

2016-09-20 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/14959 Test FAILed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/65639/ Test FAILed. ---

[GitHub] spark issue #14959: [SPARK-17387][PYSPARK] Creating SparkContext() from pyth...

2016-09-20 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/14959 **[Test build #65639 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/65639/consoleFull)** for PR 14959 at commit

[GitHub] spark issue #15146: [SPARK-17590][SQL] Analyze CTE definitions at once and a...

2016-09-20 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/15146 Merged build finished. Test PASSed.

[GitHub] spark issue #15146: [SPARK-17590][SQL] Analyze CTE definitions at once and a...

2016-09-20 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/15146 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/65630/ Test PASSed. ---

[GitHub] spark issue #15146: [SPARK-17590][SQL] Analyze CTE definitions at once and a...

2016-09-20 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/15146 **[Test build #65630 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/65630/consoleFull)** for PR 15146 at commit

[GitHub] spark issue #15135: [pyspark][group]pyspark GroupedData can't apply agg func...

2016-09-20 Thread citoubest
Github user citoubest commented on the issue: https://github.com/apache/spark/pull/15135 OK. Because pandas DataFrames support this approach to agg, I assumed Spark DataFrames should support it as well, but they do not, so I tried to add this patch. If you think this patch is not

[GitHub] spark issue #15054: [SPARK-17502] [SQL] Fix Multiple Bugs in DDL Statements ...

2016-09-20 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/15054 **[Test build #65638 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/65638/consoleFull)** for PR 15054 at commit

[GitHub] spark issue #14959: [SPARK-17387][PYSPARK] Creating SparkContext() from pyth...

2016-09-20 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/14959 **[Test build #65639 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/65639/consoleFull)** for PR 14959 at commit

[GitHub] spark issue #15102: [SPARK-17346][SQL] Add Kafka source for Structured Strea...

2016-09-20 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/15102 Merged build finished. Test FAILed.

[GitHub] spark issue #15135: [pyspark][group]pyspark GroupedData can't apply agg func...

2016-09-20 Thread rxin
Github user rxin commented on the issue: https://github.com/apache/spark/pull/15135 Pandas doesn't support this, does it? ``` >>> pd.read_csv('test.csv').groupby('a').agg('sum', 'avg') Traceback (most recent call last): File "", line 1, in File

[GitHub] spark issue #15102: [SPARK-17346][SQL] Add Kafka source for Structured Strea...

2016-09-20 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/15102 **[Test build #65640 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/65640/consoleFull)** for PR 15102 at commit

[GitHub] spark pull request #15090: [SPARK-17073] [SQL] generate column-level statist...

2016-09-20 Thread gatorsmile
Github user gatorsmile commented on a diff in the pull request: https://github.com/apache/spark/pull/15090#discussion_r79544029 --- Diff: sql/hive/src/test/scala/org/apache/spark/sql/hive/StatisticsColumnSuite.scala --- @@ -0,0 +1,343 @@ +/* + * Licensed to the Apache

[GitHub] spark issue #15090: [SPARK-17073] [SQL] generate column-level statistics

2016-09-20 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/15090 **[Test build #65633 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/65633/consoleFull)** for PR 15090 at commit

[GitHub] spark issue #15158: [SPARK-17603] [SQL] Utilize Hive-generated Statistics Fo...

2016-09-20 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/15158 **[Test build #65634 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/65634/consoleFull)** for PR 15158 at commit

[GitHub] spark pull request #15090: [SPARK-17073] [SQL] generate column-level statist...

2016-09-20 Thread gatorsmile
Github user gatorsmile commented on a diff in the pull request: https://github.com/apache/spark/pull/15090#discussion_r79543613 --- Diff: sql/hive/src/test/scala/org/apache/spark/sql/hive/StatisticsColumnSuite.scala --- @@ -0,0 +1,343 @@ +/* + * Licensed to the Apache

[GitHub] spark issue #15090: [SPARK-17073] [SQL] generate column-level statistics

2016-09-20 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/15090 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/65633/ Test PASSed. ---

[GitHub] spark issue #15158: [SPARK-17603] [SQL] Utilize Hive-generated Statistics Fo...

2016-09-20 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/15158 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/65634/ Test PASSed. ---

[GitHub] spark issue #15158: [SPARK-17603] [SQL] Utilize Hive-generated Statistics Fo...

2016-09-20 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/15158 Merged build finished. Test PASSed.

[GitHub] spark issue #15090: [SPARK-17073] [SQL] generate column-level statistics

2016-09-20 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/15090 Merged build finished. Test PASSed.

[GitHub] spark pull request #15159: [SPARK-17605][SPARK_SUBMIT] Add option spark.useP...

2016-09-20 Thread zjffdu
GitHub user zjffdu opened a pull request: https://github.com/apache/spark/pull/15159 [SPARK-17605][SPARK_SUBMIT] Add option spark.usePython and spark.useR for applications that use both pyspark and sparkr ## What changes were proposed in this pull request? It is mostly for

[GitHub] spark issue #12819: [SPARK-14077][ML] Refactor NaiveBayes to support weighte...

2016-09-20 Thread yanboliang
Github user yanboliang commented on the issue: https://github.com/apache/spark/pull/12819 @sethah It's a good question, and we have some concerns: * The Dataset-based implementation does not bring much performance improvement according to the test results (it is even worse in some cases). *

[GitHub] spark issue #15000: [SPARK-17437] Add uiWebUrl to JavaSparkContext and pyspa...

2016-09-20 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/15000 **[Test build #3280 has started](https://amplab.cs.berkeley.edu/jenkins/job/NewSparkPullRequestBuilder/3280/consoleFull)** for PR 15000 at commit

[GitHub] spark issue #15150: [SPARK-17595] [MLLib] Use a bounded priority queue to fi...

2016-09-20 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/15150 **[Test build #3281 has started](https://amplab.cs.berkeley.edu/jenkins/job/NewSparkPullRequestBuilder/3281/consoleFull)** for PR 15150 at commit

[GitHub] spark pull request #15090: [SPARK-17073] [SQL] generate column-level statist...

2016-09-20 Thread gatorsmile
Github user gatorsmile commented on a diff in the pull request: https://github.com/apache/spark/pull/15090#discussion_r79546259 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/command/AnalyzeColumnCommand.scala --- @@ -0,0 +1,168 @@ +/* + * Licensed to

[GitHub] spark pull request #15090: [SPARK-17073] [SQL] generate column-level statist...

2016-09-20 Thread gatorsmile
Github user gatorsmile commented on a diff in the pull request: https://github.com/apache/spark/pull/15090#discussion_r79550718 --- Diff: sql/hive/src/test/scala/org/apache/spark/sql/hive/StatisticsTest.scala --- @@ -0,0 +1,139 @@ +/* + * Licensed to the Apache Software

[GitHub] spark issue #13705: [SPARK-15472][SQL] Add support for writing in `csv` form...

2016-09-20 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/13705 **[Test build #65635 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/65635/consoleFull)** for PR 13705 at commit

[GitHub] spark issue #15054: [SPARK-17502] [SQL] Fix Multiple Bugs in DDL Statements ...

2016-09-20 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/15054 **[Test build #65642 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/65642/consoleFull)** for PR 15054 at commit

[GitHub] spark issue #15054: [SPARK-17502] [SQL] Fix Multiple Bugs in DDL Statements ...

2016-09-20 Thread cloud-fan
Github user cloud-fan commented on the issue: https://github.com/apache/spark/pull/15054 retest this please

[GitHub] spark issue #15159: [SPARK-17605][SPARK_SUBMIT] Add option spark.usePython a...

2016-09-20 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/15159 **[Test build #65644 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/65644/consoleFull)** for PR 15159 at commit
