[GitHub] spark pull request #13394: [SPARK-15490][R][DOC] SparkR 2.0 QA: New R APIs a...

2016-06-07 Thread jkbradley
Github user jkbradley commented on a diff in the pull request: https://github.com/apache/spark/pull/13394#discussion_r66177097 --- Diff: R/pkg/R/mllib.R --- @@ -197,11 +197,10 @@ print.summary.GeneralizedLinearRegressionModel <- function(x, ...) { invisible(x) }

[GitHub] spark issue #12938: [SPARK-15162][SPARK-15164][PySpark][DOCS][ML] update som...

2016-06-07 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/12938 **[Test build #60139 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/60139/consoleFull)** for PR 12938 at commit [`7b634b6`](https://github.com/apache/spark/commit/

[GitHub] spark issue #12938: [SPARK-15162][SPARK-15164][PySpark][DOCS][ML] update som...

2016-06-07 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/12938 Test FAILed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/60139/ Test FAILed. ---

[GitHub] spark issue #12938: [SPARK-15162][SPARK-15164][PySpark][DOCS][ML] update som...

2016-06-07 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/12938 Merged build finished. Test FAILed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature e

[GitHub] spark issue #13526: [SPARK-15780][SQL] Support mapValues on KeyValueGroupedD...

2016-06-07 Thread cloud-fan
Github user cloud-fan commented on the issue: https://github.com/apache/spark/pull/13526 A possible approach may be to just keep the function given by `mapValues` and apply it before calling the function given by `mapGroups`. By doing this, we at least won't make the performance worse, a
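The idea in this comment (remember the `mapValues` function and only apply it when the group function runs) can be sketched with plain Scala collections. This is a minimal illustration, not Spark's implementation: `GroupedSketch` and `GroupedSketchDemo` are hypothetical names, and an in-memory `Seq` stands in for a Dataset.

```scala
// Hypothetical stand-in for a grouped dataset; this is NOT Spark's KeyValueGroupedDataset.
final class GroupedSketch[K, T, V](rows: Seq[T], keyFn: T => K, valueFn: T => V) {

  // mapValues does no work on the rows: it only composes the new function
  // onto the value function that is already stored.
  def mapValues[W](f: V => W): GroupedSketch[K, T, W] =
    new GroupedSketch[K, T, W](rows, keyFn, valueFn.andThen(f))

  // mapGroups applies the stored value function to each row of a group
  // right before handing the values to the user's group function.
  def mapGroups[R](f: (K, Iterator[V]) => R): Seq[R] =
    rows.groupBy(keyFn).toSeq.map { case (key, group) =>
      f(key, group.iterator.map(valueFn))
    }
}

object GroupedSketchDemo {
  def main(args: Array[String]): Unit = {
    val rows = Seq(("a", 1), ("b", 2), ("a", 3))
    val grouped =
      new GroupedSketch[String, (String, Int), (String, Int)](rows, _._1, identity)
    // Keep only the Int part of each row, then sum the values per key.
    val sums = grouped.mapValues(_._2).mapGroups((k, vs) => (k, vs.sum)).toMap
    println(sums)
  }
}
```

Because the value function is only composed, no extra pass over the data is introduced, which matches the point that this should not make performance worse.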

[GitHub] spark issue #12824: [SPARK-15046] When running hive-thriftserver with yarn o...

2016-06-07 Thread vanzin
Github user vanzin commented on the issue: https://github.com/apache/spark/pull/12824 @tgravescs the problem is this code in Client.scala: sparkConf.set(TOKEN_RENEWAL_INTERVAL, renewalInterval) That will write the value to the config with the `ms` suffix. I think i

[GitHub] spark issue #13439: [SPARK-15701][SQL] Modify ColumnVector to reduce memory ...

2016-06-07 Thread viirya
Github user viirya commented on the issue: https://github.com/apache/spark/pull/13439 @rxin I've updated this to a simpler approach that doesn't introduce new classes. The main change is to check whether the current vector is constant and access the data accordingly. Please take a loo

[GitHub] spark issue #13543: [SPARK-15806] [Documentation] update doc for SPARK_MASTE...

2016-06-07 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/13543 **[Test build #60142 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/60142/consoleFull)** for PR 13543 at commit [`6f29181`](https://github.com/apache/spark/commit/

[GitHub] spark issue #13543: [SPARK-15806] [Documentation] update doc for SPARK_MASTE...

2016-06-07 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/13543 Test FAILed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/60142/ Test FAILed. ---

[GitHub] spark issue #13534: [SPARK-15789][SQL] Allow reserved keywords in most place...

2016-06-07 Thread cloud-fan
Github user cloud-fan commented on the issue: https://github.com/apache/spark/pull/13534 LGTM, merging to master and 2.0, thanks!

[GitHub] spark issue #13543: [SPARK-15806] [Documentation] update doc for SPARK_MASTE...

2016-06-07 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/13543 Merged build finished. Test FAILed.

[GitHub] spark issue #13543: [SPARK-15806] [Documentation] update doc for SPARK_MASTE...

2016-06-07 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/13543 **[Test build #60142 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/60142/consoleFull)** for PR 13543 at commit [`6f29181`](https://github.com/apache/spark/commit/6

[GitHub] spark issue #13189: [SPARK-14670][SQL] allow updating driver side sql metric...

2016-06-07 Thread cloud-fan
Github user cloud-fan commented on the issue: https://github.com/apache/spark/pull/13189 `QueryExecution.hiveResultString` will call `SparkPlan.executeCollect` without setting an execution id. This method is only used in tests; should we just stop reporting metrics for this case, or cr

[GitHub] spark pull request #13439: [SPARK-15701][SQL] Constant ColumnVector only nee...

2016-06-07 Thread viirya
Github user viirya commented on a diff in the pull request: https://github.com/apache/spark/pull/13439#discussion_r66174085 --- Diff: sql/core/src/main/java/org/apache/spark/sql/execution/vectorized/OnHeapColumnVector.java --- @@ -70,26 +71,106 @@ public long nullsNativeAddress()

[GitHub] spark pull request #13534: [SPARK-15789][SQL] Allow reserved keywords in mos...

2016-06-07 Thread asfgit
Github user asfgit closed the pull request at: https://github.com/apache/spark/pull/13534

[GitHub] spark issue #13439: [SPARK-15701][SQL] Constant ColumnVector only needs to p...

2016-06-07 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/13439 **[Test build #60141 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/60141/consoleFull)** for PR 13439 at commit [`2226efc`](https://github.com/apache/spark/commit/2

[GitHub] spark issue #13439: [SPARK-15701][SQL] Constant ColumnVector only needs to p...

2016-06-07 Thread viirya
Github user viirya commented on the issue: https://github.com/apache/spark/pull/13439 The latest benchmark is run individually for each type of column vector. As stated in `ColumnarBatchBenchmark`, it is hard to reason about the JIT. If we put these 4 cases together to run the benchmark,

[GitHub] spark issue #13439: [SPARK-15701][SQL] Constant ColumnVector only needs to p...

2016-06-07 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/13439 **[Test build #60140 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/60140/consoleFull)** for PR 13439 at commit [`07ef523`](https://github.com/apache/spark/commit/0

[GitHub] spark issue #13439: [SPARK-15701][SQL] Constant ColumnVector only needs to p...

2016-06-07 Thread viirya
Github user viirya commented on the issue: https://github.com/apache/spark/pull/13439 Benchmark again on new change: Environment: Java HotSpot(TM) 64-Bit Server VM 1.8.0_71-b15 on Linux 3.19.0-25-generic Intel(R) Core(TM) i7-5557U CPU @ 3.10GHz On

[GitHub] spark issue #13495: [SPARK-15751][MLLIB][PYSPARK] Add generateAssociationRul...

2016-06-07 Thread zjffdu
Github user zjffdu commented on the issue: https://github.com/apache/spark/pull/13495 \cc @yanboliang

[GitHub] spark issue #13530: [SPARK-14279][BUILD] Pick the spark version from pom

2016-06-07 Thread vanzin
Github user vanzin commented on the issue: https://github.com/apache/spark/pull/13530 @dhruve could you close the PR? The bot doesn't do it automatically for backports. thx

[GitHub] spark issue #9207: [SPARK-11171][SPARK-11237][SPARK-11241][ML] Try adding PM...

2016-06-07 Thread holdenk
Github user holdenk commented on the issue: https://github.com/apache/spark/pull/9207 So I guess I'm wondering what our plans for PMML look like - I'm happy to update this or go in the direction @MLnick suggested if that's what we want?

[GitHub] spark issue #12938: [SPARK-15162][SPARK-15164][PySpark][DOCS][ML] update som...

2016-06-07 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/12938 **[Test build #60139 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/60139/consoleFull)** for PR 12938 at commit [`7b634b6`](https://github.com/apache/spark/commit/7

[GitHub] spark pull request #13335: [SPARK-15580][SQL]Add ContinuousQueryInfo to make...

2016-06-07 Thread asfgit
Github user asfgit closed the pull request at: https://github.com/apache/spark/pull/13335

[GitHub] spark pull request #13534: [SPARK-15789][SQL] Allow reserved keywords in mos...

2016-06-07 Thread yhuai
Github user yhuai commented on a diff in the pull request: https://github.com/apache/spark/pull/13534#discussion_r66171754 --- Diff: sql/catalyst/src/test/scala/org/apache/spark/sql/catalyst/parser/DataTypeParserSuite.scala --- @@ -133,4 +133,8 @@ class CatalystQlDataTypeParserSui

[GitHub] spark issue #13065: [SPARK-15214][SQL] Code-generation for Generate

2016-06-07 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/13065 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/60137/ Test PASSed. ---

[GitHub] spark issue #13065: [SPARK-15214][SQL] Code-generation for Generate

2016-06-07 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/13065 Merged build finished. Test PASSed.

[GitHub] spark issue #13335: [SPARK-15580][SQL]Add ContinuousQueryInfo to make Contin...

2016-06-07 Thread tdas
Github user tdas commented on the issue: https://github.com/apache/spark/pull/13335 Merging this to master and 2.0. Thanks @zsxwing

[GitHub] spark issue #13065: [SPARK-15214][SQL] Code-generation for Generate

2016-06-07 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/13065 **[Test build #60137 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/60137/consoleFull)** for PR 13065 at commit [`60da24e`](https://github.com/apache/spark/commit/

[GitHub] spark pull request #12258: [SPARK-14485][CORE] ignore task finished for exec...

2016-06-07 Thread asfgit
Github user asfgit closed the pull request at: https://github.com/apache/spark/pull/12258

[GitHub] spark issue #12258: [SPARK-14485][CORE] ignore task finished for executor lo...

2016-06-07 Thread vanzin
Github user vanzin commented on the issue: https://github.com/apache/spark/pull/12258 @zhonghaihua let me know your JIRA username and I'll assign the bug to you.

[GitHub] spark pull request #13534: [SPARK-15789][SQL] Allow reserved keywords in mos...

2016-06-07 Thread cloud-fan
Github user cloud-fan commented on a diff in the pull request: https://github.com/apache/spark/pull/13534#discussion_r66170917 --- Diff: sql/catalyst/src/test/scala/org/apache/spark/sql/catalyst/parser/DataTypeParserSuite.scala --- @@ -133,4 +133,8 @@ class CatalystQlDataTypeParse

[GitHub] spark issue #12258: [SPARK-14485][CORE] ignore task finished for executor lo...

2016-06-07 Thread vanzin
Github user vanzin commented on the issue: https://github.com/apache/spark/pull/12258 LGTM, merging to master / 2.0.

[GitHub] spark issue #12258: [SPARK-14485][CORE] ignore task finished for executor lo...

2016-06-07 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/12258 Merged build finished. Test PASSed.

[GitHub] spark issue #12258: [SPARK-14485][CORE] ignore task finished for executor lo...

2016-06-07 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/12258 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/60136/ Test PASSed. ---

[GitHub] spark issue #12258: [SPARK-14485][CORE] ignore task finished for executor lo...

2016-06-07 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/12258 **[Test build #60136 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/60136/consoleFull)** for PR 12258 at commit [`b719166`](https://github.com/apache/spark/commit/

[GitHub] spark issue #13543: [SPARK-15806] [Documentation] update doc for SPARK_MASTE...

2016-06-07 Thread bomeng
Github user bomeng commented on the issue: https://github.com/apache/spark/pull/13543 Yes. I can add a warning if SPARK_MASTER_IP is set. Ideally we should use SPARK_MASTER_HOST in all places to avoid confusion.
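A rough sketch of the warning proposed here, assuming (as this comment suggests, and as is still being questioned elsewhere in the thread) that `SPARK_MASTER_HOST` is the preferred variable and `SPARK_MASTER_IP` the legacy one. `MasterHostSketch` and `resolveMasterHost` are hypothetical names, the precedence order is an assumption, and this is not the code in `MasterArguments.scala`.

```scala
object MasterHostSketch {
  // Returns the resolved host (if any) and an optional warning message.
  // Assumption: SPARK_MASTER_HOST wins when both variables are set.
  def resolveMasterHost(env: Map[String, String]): (Option[String], Option[String]) = {
    val host = env.get("SPARK_MASTER_HOST")
    val legacyIp = env.get("SPARK_MASTER_IP")
    // Warn whenever the legacy variable is present, even if it is not the one used.
    val warning = legacyIp.map(_ => "SPARK_MASTER_IP is set; consider using SPARK_MASTER_HOST instead")
    (host.orElse(legacyIp), warning)
  }

  def main(args: Array[String]): Unit = {
    val (host, warning) = resolveMasterHost(sys.env)
    warning.foreach(msg => Console.err.println(s"WARN: $msg"))
    println(host.getOrElse("<no master host configured>"))
  }
}
```

Taking the environment as a `Map` argument rather than reading `sys.env` inside the function keeps the lookup easy to exercise with test values.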

[GitHub] spark issue #13522: [SPARK-14321][SQL] Reduce date format cost and string-to...

2016-06-07 Thread cloud-fan
Github user cloud-fan commented on the issue: https://github.com/apache/spark/pull/13522 Some more ideas: I think we should create a subclass of `UnixTime` to handle the `left.dataType == StringType && right.foldable` case. In optimizer, we can replace it with null literal if the `for

[GitHub] spark pull request #13548: [DO NOT MERGE] lots of blacklist testing

2016-06-07 Thread squito
Github user squito closed the pull request at: https://github.com/apache/spark/pull/13548

[GitHub] spark issue #13548: [DO NOT MERGE] lots of blacklist testing

2016-06-07 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/13548 **[Test build #60138 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/60138/consoleFull)** for PR 13548 at commit [`5bc48f2`](https://github.com/apache/spark/commit/5

[GitHub] spark pull request #12731: [SPARK-13590] [ML] [Doc] Document spark.ml LiR, L...

2016-06-07 Thread asfgit
Github user asfgit closed the pull request at: https://github.com/apache/spark/pull/12731

[GitHub] spark issue #13526: [SPARK-15780][SQL] Support mapValues on KeyValueGroupedD...

2016-06-07 Thread koertkuipers
Github user koertkuipers commented on the issue: https://github.com/apache/spark/pull/13526 ``` scala> val x = Seq(("a", 1), ("b", 2)).toDS x: org.apache.spark.sql.Dataset[(String, Int)] = [_1: string, _2: int] scala> x.groupByKey(_._1).mapValues(_._2).reduceGroups(_ +

[GitHub] spark pull request #13548: [DO NOT MERGE] lots of blacklist testing

2016-06-07 Thread squito
GitHub user squito opened a pull request: https://github.com/apache/spark/pull/13548 [DO NOT MERGE] lots of blacklist testing making jenkins run the scheduler tests a lot You can merge this pull request into a Git repository by running: $ git pull https://github.com/squito/spar

[GitHub] spark pull request #13414: [SPARK-15674][SQL] Deprecates "CREATE TEMPORARY T...

2016-06-07 Thread asfgit
Github user asfgit closed the pull request at: https://github.com/apache/spark/pull/13414

[GitHub] spark issue #12731: [SPARK-13590] [ML] [Doc] Document spark.ml LiR, LoR and ...

2016-06-07 Thread mengxr
Github user mengxr commented on the issue: https://github.com/apache/spark/pull/12731 LGTM

[GitHub] spark issue #13414: [SPARK-15674][SQL] Deprecates "CREATE TEMPORARY TABLE US...

2016-06-07 Thread hvanhovell
Github user hvanhovell commented on the issue: https://github.com/apache/spark/pull/13414 Thanks! Merging to master/2.0

[GitHub] spark issue #13460: [SPARK-15615] [SQL] Support Json input from Dataset[Stri...

2016-06-07 Thread pjfanning
Github user pjfanning commented on the issue: https://github.com/apache/spark/pull/13460 @HyukjinKwon all the JsonSuite tests pass for me on my laptop - would it be feasible to get this reviewed again?

[GitHub] spark pull request #13534: [SPARK-15789][SQL] Allow reserved keywords in mos...

2016-06-07 Thread hvanhovell
Github user hvanhovell commented on a diff in the pull request: https://github.com/apache/spark/pull/13534#discussion_r66161676 --- Diff: sql/catalyst/src/main/antlr4/org/apache/spark/sql/catalyst/parser/SqlBase.g4 --- @@ -594,16 +594,13 @@ qualifiedName : identifier ('.'

[GitHub] spark pull request #13534: [SPARK-15789][SQL] Allow reserved keywords in mos...

2016-06-07 Thread cloud-fan
Github user cloud-fan commented on a diff in the pull request: https://github.com/apache/spark/pull/13534#discussion_r66160129 --- Diff: sql/catalyst/src/main/antlr4/org/apache/spark/sql/catalyst/parser/SqlBase.g4 --- @@ -594,16 +594,13 @@ qualifiedName : identifier ('.' i

[GitHub] spark issue #13543: [SPARK-15806] [Documentation] update doc for SPARK_MASTE...

2016-06-07 Thread srowen
Github user srowen commented on the issue: https://github.com/apache/spark/pull/13543 The thing is, `SPARK_MASTER_HOST` is not used anywhere in the code but this file. It's not documented. I would imagine it is the one that's deprecated, but, have a look at https://issues.apache.org/j

[GitHub] spark issue #13534: [SPARK-15789][SQL] Allow reserved keywords in most place...

2016-06-07 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/13534 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/60135/ Test PASSed. ---

[GitHub] spark issue #13065: [SPARK-15214][SQL] Code-generation for Generate

2016-06-07 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/13065 **[Test build #60137 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/60137/consoleFull)** for PR 13065 at commit [`60da24e`](https://github.com/apache/spark/commit/6

[GitHub] spark issue #13534: [SPARK-15789][SQL] Allow reserved keywords in most place...

2016-06-07 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/13534 Merged build finished. Test PASSed.

[GitHub] spark issue #13534: [SPARK-15789][SQL] Allow reserved keywords in most place...

2016-06-07 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/13534 **[Test build #60135 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/60135/consoleFull)** for PR 13534 at commit [`2c18705`](https://github.com/apache/spark/commit/

[GitHub] spark pull request #13545: [SPARK-15807][SQL] Support varargs for distinct/d...

2016-06-07 Thread dongjoon-hyun
Github user dongjoon-hyun commented on a diff in the pull request: https://github.com/apache/spark/pull/13545#discussion_r66156310 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/Dataset.scala --- @@ -2262,6 +2275,19 @@ class Dataset[T] private[sql]( def distinct(): Da

[GitHub] spark issue #12675: [SPARK-14894][PySpark] Add result summary api to Gaussia...

2016-06-07 Thread GayathriMurali
Github user GayathriMurali commented on the issue: https://github.com/apache/spark/pull/12675 @jkbradley @holdenk Can you please help review?

[GitHub] spark issue #12258: [SPARK-14485][CORE] ignore task finished for executor lo...

2016-06-07 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/12258 **[Test build #60136 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/60136/consoleFull)** for PR 12258 at commit [`b719166`](https://github.com/apache/spark/commit/b

[GitHub] spark issue #12258: [SPARK-14485][CORE] ignore task finished for executor lo...

2016-06-07 Thread vanzin
Github user vanzin commented on the issue: https://github.com/apache/spark/pull/12258 flaky test? retest this please

[GitHub] spark issue #13530: [SPARK-14279][BUILD] Pick the spark version from pom

2016-06-07 Thread vanzin
Github user vanzin commented on the issue: https://github.com/apache/spark/pull/13530 Merging to 2.0.

[GitHub] spark issue #13543: [SPARK-15806] [Documentation] update doc for SPARK_MASTE...

2016-06-07 Thread bomeng
Github user bomeng commented on the issue: https://github.com/apache/spark/pull/13543 Here is the link: [MasterArguments.scala](https://github.com/bomeng/spark/blob/master/core/src/main/scala/org/apache/spark/deploy/master/MasterArguments.scala#L56-L59)

[GitHub] spark issue #13493: [SPARK-15750][MLLib][PYSPARK] Constructing FPGrowth fail...

2016-06-07 Thread MechCoder
Github user MechCoder commented on the issue: https://github.com/apache/spark/pull/13493 lgtm cc: @MLnick

[GitHub] spark pull request #13545: [SPARK-15807][SQL] Support varargs for distinct/d...

2016-06-07 Thread dongjoon-hyun
Github user dongjoon-hyun commented on a diff in the pull request: https://github.com/apache/spark/pull/13545#discussion_r66152341 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/Dataset.scala --- @@ -2262,6 +2275,19 @@ class Dataset[T] private[sql]( def distinct(): Da

[GitHub] spark issue #13546: [SPARK-15808] [SQL] File Format Checking When Appending ...

2016-06-07 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/13546 Merged build finished. Test PASSed.

[GitHub] spark issue #13546: [SPARK-15808] [SQL] File Format Checking When Appending ...

2016-06-07 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/13546 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/60134/ Test PASSed. ---

[GitHub] spark issue #13546: [SPARK-15808] [SQL] File Format Checking When Appending ...

2016-06-07 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/13546 **[Test build #60134 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/60134/consoleFull)** for PR 13546 at commit [`9d9d263`](https://github.com/apache/spark/commit/

[GitHub] spark issue #13545: [SPARK-15807][SQL] Support varargs for distinct/dropDupl...

2016-06-07 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/13545 Merged build finished. Test PASSed.

[GitHub] spark issue #13545: [SPARK-15807][SQL] Support varargs for distinct/dropDupl...

2016-06-07 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/13545 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/60133/ Test PASSed. ---

[GitHub] spark issue #13545: [SPARK-15807][SQL] Support varargs for distinct/dropDupl...

2016-06-07 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/13545 **[Test build #60133 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/60133/consoleFull)** for PR 13545 at commit [`33f446f`](https://github.com/apache/spark/commit/

[GitHub] spark issue #13300: [SPARK-15463][SQL] support creating dataframe out of Dat...

2016-06-07 Thread pjfanning
Github user pjfanning commented on the issue: https://github.com/apache/spark/pull/13300 @HyukjinKwon @rxin @falaki Would it be feasible to get this merged for Spark 2.0 release?

[GitHub] spark issue #13534: [SPARK-15789][SQL] Allow reserved keywords in most place...

2016-06-07 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/13534 **[Test build #60135 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/60135/consoleFull)** for PR 13534 at commit [`2c18705`](https://github.com/apache/spark/commit/2

[GitHub] spark issue #13543: [SPARK-15806] [Documentation] update doc for SPARK_MASTE...

2016-06-07 Thread srowen
Github user srowen commented on the issue: https://github.com/apache/spark/pull/13543 Pardon, where do you see that SPARK_MASTER_IP is deprecated?

[GitHub] spark issue #13534: [SPARK-15789][SQL] Allow reserved keywords in most place...

2016-06-07 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/13534 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/60132/ Test PASSed. ---

[GitHub] spark issue #13534: [SPARK-15789][SQL] Allow reserved keywords in most place...

2016-06-07 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/13534 Build finished. Test PASSed.

[GitHub] spark issue #13534: [SPARK-15789][SQL] Allow reserved keywords in most place...

2016-06-07 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/13534 **[Test build #60132 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/60132/consoleFull)** for PR 13534 at commit [`4ccd65f`](https://github.com/apache/spark/commit/

[GitHub] spark issue #13065: [SPARK-15214][SQL] Code-generation for Generate

2016-06-07 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/13065 Build finished. Test PASSed.

[GitHub] spark issue #13065: [SPARK-15214][SQL] Code-generation for Generate

2016-06-07 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/13065 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/60131/ Test PASSed. ---

[GitHub] spark issue #13065: [SPARK-15214][SQL] Code-generation for Generate

2016-06-07 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/13065 **[Test build #60131 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/60131/consoleFull)** for PR 13065 at commit [`f86da0f`](https://github.com/apache/spark/commit/

[GitHub] spark issue #13547: Update KafkaWordCount.scala

2016-06-07 Thread srowen
Github user srowen commented on the issue: https://github.com/apache/spark/pull/13547 Please close this and read https://cwiki.apache.org/confluence/display/SPARK/Contributing+to+Spark

[GitHub] spark pull request #13545: [SPARK-15807][SQL] Support varargs for distinct/d...

2016-06-07 Thread rxin
Github user rxin commented on a diff in the pull request: https://github.com/apache/spark/pull/13545#discussion_r66135714 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/Dataset.scala --- @@ -2262,6 +2275,19 @@ class Dataset[T] private[sql]( def distinct(): Dataset[T]

[GitHub] spark pull request #13547: Update KafkaWordCount.scala

2016-06-07 Thread ShreyasFadnavis
GitHub user ShreyasFadnavis opened a pull request: https://github.com/apache/spark/pull/13547 Update KafkaWordCount.scala ## What changes were proposed in this pull request? (Please fill in changes proposed in this fix) ## How was this patch tested? (P

[GitHub] spark issue #13547: Update KafkaWordCount.scala

2016-06-07 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/13547 Can one of the admins verify this patch?

[GitHub] spark issue #13530: [SPARK-14279][BUILD] Pick the spark version from pom

2016-06-07 Thread rxin
Github user rxin commented on the issue: https://github.com/apache/spark/pull/13530 sounds good go ahead --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or

[GitHub] spark issue #13544: [SPARK-15805][SQL][Documents] update sql programming gui...

2016-06-07 Thread rxin
Github user rxin commented on the issue: https://github.com/apache/spark/pull/13544 cc @liancheng --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if th

[GitHub] spark issue #13546: [SPARK-15808] [SQL] File Format Checking When Appending ...

2016-06-07 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/13546 **[Test build #60134 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/60134/consoleFull)** for PR 13546 at commit [`9d9d263`](https://github.com/apache/spark/commit/9

[GitHub] spark pull request #13546: [SPARK-15808] [SQL] File Format Checking When App...

2016-06-07 Thread gatorsmile
GitHub user gatorsmile opened a pull request: https://github.com/apache/spark/pull/13546 [SPARK-15808] [SQL] File Format Checking When Appending Data What changes were proposed in this pull request? **Issue:** Got wrong results or strange errors when appending data to a table

[GitHub] spark issue #13540: [SPARK-15788][PYSPARK][ML] PySpark IDFModel missing "idf...

2016-06-07 Thread MLnick
Github user MLnick commented on the issue: https://github.com/apache/spark/pull/13540 We should make it a property On Tue, 7 Jun 2016 at 11:28, Manoj Kumar wrote: > LGTM as well. pending the nitpick by @BryanCutler > > > Not rel

[GitHub] spark issue #13540: [SPARK-15788][PYSPARK][ML] PySpark IDFModel missing "idf...

2016-06-07 Thread MLnick
Github user MLnick commented on the issue: https://github.com/apache/spark/pull/13540 Also, yes we should actually look at copying docs from Scala side where appropriate. We can add that to the QA JIRAs for Python doc. On Tue, 7 Jun 2016 at 11:28, Manoj Kumar wrote:

[GitHub] spark issue #13545: [SPARK-15807][SQL] Support varargs for distinct/dropDupl...

2016-06-07 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/13545 **[Test build #60133 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/60133/consoleFull)** for PR 13545 at commit [`33f446f`](https://github.com/apache/spark/commit/3

[GitHub] spark issue #13415: [SPARK-15676] [SQL] Disallow Column Names as Partition C...

2016-06-07 Thread gatorsmile
Github user gatorsmile commented on the issue: https://github.com/apache/spark/pull/13415 Thank you! @cloud-fan --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishe

[GitHub] spark pull request #13545: [SPARK-15807][SQL] Support varargs for distinct/d...

2016-06-07 Thread dongjoon-hyun
GitHub user dongjoon-hyun opened a pull request: https://github.com/apache/spark/pull/13545 [SPARK-15807][SQL] Support varargs for distinct/dropDuplicates in Dataset/DataFrame ## What changes were proposed in this pull request? This PR adds `varargs`-types `distinct/dropDuplicat

[GitHub] spark issue #13540: [SPARK-15788][PYSPARK][ML] PySpark IDFModel missing "idf...

2016-06-07 Thread MechCoder
Github user MechCoder commented on the issue: https://github.com/apache/spark/pull/13540 LGTM as well, pending the nitpick by @BryanCutler. Not related, but it's been a while since I hacked on Spark or PySpark: at some point, do we need better docs for PySpark? I couldn't fi

[GitHub] spark issue #13534: [SPARK-15789][SQL] Allow reserved keywords in most place...

2016-06-07 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/13534 **[Test build #60132 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/60132/consoleFull)** for PR 13534 at commit [`4ccd65f`](https://github.com/apache/spark/commit/4

[GitHub] spark issue #12258: [SPARK-14485][CORE] ignore task finished for executor lo...

2016-06-07 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/12258 Merged build finished. Test FAILed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature e

[GitHub] spark issue #12258: [SPARK-14485][CORE] ignore task finished for executor lo...

2016-06-07 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/12258 Test FAILed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/60128/ Test FAILed. ---

[GitHub] spark issue #12258: [SPARK-14485][CORE] ignore task finished for executor lo...

2016-06-07 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/12258 **[Test build #60128 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/60128/consoleFull)** for PR 12258 at commit [`b719166`](https://github.com/apache/spark/commit/

[GitHub] spark issue #13540: [SPARK-15788][PYSPARK][ML] PySpark IDFModel missing "idf...

2016-06-07 Thread BryanCutler
Github user BryanCutler commented on the issue: https://github.com/apache/spark/pull/13540 Good catch! Not too big of a deal, but should this be a property? Otherwise, LGTM cc @MLnick --- If your project is set up for it, you can reply to this email and have your reply appear on

[GitHub] spark issue #13065: [SPARK-15214][SQL] Code-generation for Generate [WIP]

2016-06-07 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/13065 **[Test build #60131 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/60131/consoleFull)** for PR 13065 at commit [`f86da0f`](https://github.com/apache/spark/commit/f

[GitHub] spark pull request #13502: [SPARK-15760][docs] Add documentation for package...

2016-06-07 Thread asfgit
Github user asfgit closed the pull request at: https://github.com/apache/spark/pull/13502 --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is ena

[GitHub] spark issue #13502: [SPARK-15760][docs] Add documentation for package-relate...

2016-06-07 Thread vanzin
Github user vanzin commented on the issue: https://github.com/apache/spark/pull/13502 Merging to master / 2.0. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes

[GitHub] spark issue #10896: [SPARK-12978][SQL] Skip unnecessary final group-by when ...

2016-06-07 Thread maropu
Github user maropu commented on the issue: https://github.com/apache/spark/pull/10896 @hvanhovell Yes, I'm checking whether we could implement the same logic in `EnsureRequirements#ensureDistributionAndOrdering`. As @yhuai suggested, the new approach will make the PR bigger. So, wh
