[GitHub] spark issue #15094: [SPARK-17534] [TESTS] Increase timeouts for DirectKafkaS...

2016-09-14 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/15094 **[Test build #65367 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/65367/consoleFull)** for PR 15094 at commit

[GitHub] spark issue #13513: [SPARK-15698][SQL][Streaming] Add the ability to remove ...

2016-09-14 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/13513 **[Test build #65368 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/65368/consoleFull)** for PR 13513 at commit

[GitHub] spark issue #15077: [SPARK-17521]Error when I use sparkContext.makeRDD(Seq()...

2016-09-14 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/15077 Merged build finished. Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature

[GitHub] spark issue #15077: [SPARK-17521]Error when I use sparkContext.makeRDD(Seq()...

2016-09-14 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/15077 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/65362/ Test PASSed. ---

[GitHub] spark issue #15059: [SPARK-17506][SQL] Improve the check double values equal...

2016-09-14 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/15059 Merged build finished. Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature

[GitHub] spark issue #15077: [SPARK-17521]Error when I use sparkContext.makeRDD(Seq()...

2016-09-14 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/15077 **[Test build #65362 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/65362/consoleFull)** for PR 15077 at commit

[GitHub] spark issue #14981: [SPARK-17418] Remove Kinesis artifacts from Spark releas...

2016-09-14 Thread koeninger
Github user koeninger commented on the issue: https://github.com/apache/spark/pull/14981 Yeah, I don't know of an easier workaround. My understanding is also that the asf concern is tied to distribution, so not publishing to maven should be sufficient. On Sep 14,

[GitHub] spark issue #15059: [SPARK-17506][SQL] Improve the check double values equal...

2016-09-14 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/15059 **[Test build #65364 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/65364/consoleFull)** for PR 15059 at commit

[GitHub] spark issue #15059: [SPARK-17506][SQL] Improve the check double values equal...

2016-09-14 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/15059 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/65364/ Test PASSed. ---

[GitHub] spark issue #15059: [SPARK-17506][SQL] Improve the check double values equal...

2016-09-14 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/15059 Merged build finished. Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature

[GitHub] spark issue #15060: [SPARK-17507][ML][MLLib] check weight vector size in ANN

2016-09-14 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/15060 **[Test build #65370 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/65370/consoleFull)** for PR 15060 at commit

[GitHub] spark issue #14444: [SPARK-16839] [SQL] redundant aliases after cleanupAlias...

2016-09-14 Thread HyukjinKwon
Github user HyukjinKwon commented on the issue: https://github.com/apache/spark/pull/1 I just opened a pull request against your forked repo - https://github.com/eyalfa/spark/pull/1 :) --- If your project is set up for it, you can reply to this email and have your reply appear

[GitHub] spark issue #14961: [SPARK-17379] [BUILD] Upgrade netty-all to 4.0.41 final ...

2016-09-14 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/14961 **[Test build #65361 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/65361/consoleFull)** for PR 14961 at commit

[GitHub] spark issue #14444: [SPARK-16839] [SQL] redundant aliases after cleanupAlias...

2016-09-14 Thread eyalfa
Github user eyalfa commented on the issue: https://github.com/apache/spark/pull/1 @HyukjinKwon thanks, looking at it as we speak :-) can I merge it, or is it still pending for sign-off by @shivaram? --- If your project is set up for it, you can reply to this email and

[GitHub] spark issue #15059: [SPARK-17506][SQL] Improve the check double values equal...

2016-09-14 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/15059 **[Test build #65363 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/65363/consoleFull)** for PR 15059 at commit

[GitHub] spark issue #15059: [SPARK-17506][SQL] Improve the check double values equal...

2016-09-14 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/15059 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/65363/ Test PASSed. ---

[GitHub] spark pull request #15090: [SPARK-17073] [SQL] generate column-level statist...

2016-09-14 Thread hvanhovell
Github user hvanhovell commented on a diff in the pull request: https://github.com/apache/spark/pull/15090#discussion_r78732371 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/command/AnalyzeColumnCommand.scala --- @@ -0,0 +1,209 @@ +/* + * Licensed to

[GitHub] spark pull request #15093: [SPARK-17480][SQL][FOLLOWUP] Fix more instances w...

2016-09-14 Thread HyukjinKwon
GitHub user HyukjinKwon opened a pull request: https://github.com/apache/spark/pull/15093 [SPARK-17480][SQL][FOLLOWUP] Fix more instances which calls List.length/size which is O(n) ## What changes were proposed in this pull request? This PR fixes all the instances which

[GitHub] spark issue #15093: [SPARK-17480][SQL][FOLLOWUP] Fix more instances which ca...

2016-09-14 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/15093 **[Test build #65366 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/65366/consoleFull)** for PR 15093 at commit

[GitHub] spark issue #15093: [SPARK-17480][SQL][FOLLOWUP] Fix more instances which ca...

2016-09-14 Thread HyukjinKwon
Github user HyukjinKwon commented on the issue: https://github.com/apache/spark/pull/15093 @srowen Could you please review this? --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature

[GitHub] spark pull request #15093: [SPARK-17480][SQL][FOLLOWUP] Fix more instances w...

2016-09-14 Thread srowen
Github user srowen commented on a diff in the pull request: https://github.com/apache/spark/pull/15093#discussion_r78746309 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/analysis/Analyzer.scala --- @@ -1659,7 +1659,8 @@ class Analyzer( // child of

[GitHub] spark pull request #15059: [SPARK-17506][SQL] Improve the check double value...

2016-09-14 Thread jiangxb1987
Github user jiangxb1987 commented on a diff in the pull request: https://github.com/apache/spark/pull/15059#discussion_r78725648 --- Diff: sql/catalyst/src/test/scala/org/apache/spark/sql/catalyst/expressions/ExpressionEvalHelper.scala --- @@ -289,13 +290,32 @@ trait

[GitHub] spark issue #15059: [SPARK-17506][SQL] Improve the check double values equal...

2016-09-14 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/15059 **[Test build #65364 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/65364/consoleFull)** for PR 15059 at commit

[GitHub] spark pull request #15090: [SPARK-17073] [SQL] generate column-level statist...

2016-09-14 Thread hvanhovell
Github user hvanhovell commented on a diff in the pull request: https://github.com/apache/spark/pull/15090#discussion_r78731826 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/command/AnalyzeColumnCommand.scala --- @@ -0,0 +1,209 @@ +/* + * Licensed to

[GitHub] spark issue #13513: [SPARK-15698][SQL][Streaming] Add the ability to remove ...

2016-09-14 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/13513 **[Test build #65365 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/65365/consoleFull)** for PR 13513 at commit

[GitHub] spark pull request #15094: [SPARK-17534] [TESTS] Increase timeouts for Direc...

2016-09-14 Thread a-roberts
GitHub user a-roberts opened a pull request: https://github.com/apache/spark/pull/15094 [SPARK-17534] [TESTS] Increase timeouts for DirectKafkaStreamSuite tests **## What changes were proposed in this pull request?** There are two tests in this suite that are particularly flaky

[GitHub] spark pull request #15093: [SPARK-17480][SQL][FOLLOWUP] Fix more instances w...

2016-09-14 Thread HyukjinKwon
Github user HyukjinKwon commented on a diff in the pull request: https://github.com/apache/spark/pull/15093#discussion_r78746718 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/linalg/distributed/RowMatrix.scala --- @@ -586,7 +586,7 @@ class RowMatrix @Since("1.0.0") (

[GitHub] spark issue #14971: [SPARK-17410] [SPARK-17284] Move Hive-generated Stats In...

2016-09-14 Thread cloud-fan
Github user cloud-fan commented on the issue: https://github.com/apache/spark/pull/14971 Will this change break existing behaviour? The `MetastoreRelation` can have table statistics if the hive table is already analyzed. BTW, I'd like to have this behaviour: 1.

[GitHub] spark issue #15093: [SPARK-17480][SQL][FOLLOWUP] Fix more instances which ca...

2016-09-14 Thread HyukjinKwon
Github user HyukjinKwon commented on the issue: https://github.com/apache/spark/pull/15093 @srowen Please let me check this out again tomorrow. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not

[GitHub] spark pull request #15090: [SPARK-17073] [SQL] generate column-level statist...

2016-09-14 Thread hvanhovell
Github user hvanhovell commented on a diff in the pull request: https://github.com/apache/spark/pull/15090#discussion_r78731287 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/plans/logical/Statistics.scala --- @@ -32,19 +34,70 @@ package

[GitHub] spark pull request #15090: [SPARK-17073] [SQL] generate column-level statist...

2016-09-14 Thread hvanhovell
Github user hvanhovell commented on a diff in the pull request: https://github.com/apache/spark/pull/15090#discussion_r78734109 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/command/AnalyzeTableCommand.scala --- @@ -129,3 +87,51 @@ case class

[GitHub] spark pull request #15090: [SPARK-17073] [SQL] generate column-level statist...

2016-09-14 Thread hvanhovell
Github user hvanhovell commented on a diff in the pull request: https://github.com/apache/spark/pull/15090#discussion_r78740027 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/command/AnalyzeColumnCommand.scala --- @@ -0,0 +1,209 @@ +/* + * Licensed to

[GitHub] spark issue #13513: [SPARK-15698][SQL][Streaming] Add the ability to remove ...

2016-09-14 Thread jerryshao
Github user jerryshao commented on the issue: https://github.com/apache/spark/pull/13513 Thanks a lot @zsxwing and @frreiss for your comments. For the slow scan problem of compact batch. Originally I planned to to not merge the latest batch as I did before, also suggested

[GitHub] spark pull request #15060: [SPARK-17507][ML][MLLib] check weight vector size...

2016-09-14 Thread WeichenXu123
Github user WeichenXu123 commented on a diff in the pull request: https://github.com/apache/spark/pull/15060#discussion_r78749771 --- Diff: mllib/src/main/scala/org/apache/spark/ml/classification/MultilayerPerceptronClassifier.scala --- @@ -235,6 +235,7 @@ class

[GitHub] spark issue #15093: [SPARK-17480][SQL][FOLLOWUP] Fix more instances which ca...

2016-09-14 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/15093 **[Test build #65369 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/65369/consoleFull)** for PR 15093 at commit

[GitHub] spark issue #14961: [SPARK-17379] [BUILD] Upgrade netty-all to 4.0.41 final ...

2016-09-14 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/14961 Merged build finished. Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature

[GitHub] spark issue #14961: [SPARK-17379] [BUILD] Upgrade netty-all to 4.0.41 final ...

2016-09-14 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/14961 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/65361/ Test PASSed. ---

[GitHub] spark pull request #15093: [SPARK-17480][SQL][FOLLOWUP] Fix more instances w...

2016-09-14 Thread srowen
Github user srowen commented on a diff in the pull request: https://github.com/apache/spark/pull/15093#discussion_r78746161 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/linalg/distributed/RowMatrix.scala --- @@ -586,7 +586,7 @@ class RowMatrix @Since("1.0.0") (

[GitHub] spark issue #14444: [SPARK-16839] [SQL] redundant aliases after cleanupAlias...

2016-09-14 Thread HyukjinKwon
Github user HyukjinKwon commented on the issue: https://github.com/apache/spark/pull/1 I think it'd be okay to merge it? If there are some more fixes to be done, I will post another pull request :). I am pretty much always online. --- If your project is set up for it, you can

[GitHub] spark pull request #15090: [SPARK-17073] [SQL] generate column-level statist...

2016-09-14 Thread hvanhovell
Github user hvanhovell commented on a diff in the pull request: https://github.com/apache/spark/pull/15090#discussion_r78730873 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/command/AnalyzeTableCommand.scala --- @@ -129,3 +87,51 @@ case class

[GitHub] spark pull request #15090: [SPARK-17073] [SQL] generate column-level statist...

2016-09-14 Thread hvanhovell
Github user hvanhovell commented on a diff in the pull request: https://github.com/apache/spark/pull/15090#discussion_r78731424 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/command/AnalyzeColumnCommand.scala --- @@ -0,0 +1,209 @@ +/* + * Licensed to

[GitHub] spark pull request #15093: [SPARK-17480][SQL][FOLLOWUP] Fix more instances w...

2016-09-14 Thread HyukjinKwon
Github user HyukjinKwon commented on a diff in the pull request: https://github.com/apache/spark/pull/15093#discussion_r78746774 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/analysis/Analyzer.scala --- @@ -1659,7 +1659,8 @@ class Analyzer( //

[GitHub] spark issue #13513: [SPARK-15698][SQL][Streaming] Add the ability to remove ...

2016-09-14 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/13513 Merged build finished. Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature

[GitHub] spark pull request #15095: [SQL][minor] correct the comment of SortBasedAggr...

2016-09-14 Thread cloud-fan
GitHub user cloud-fan opened a pull request: https://github.com/apache/spark/pull/15095 [SQL][minor] correct the comment of SortBasedAggregationIterator.safeProj ## What changes were proposed in this pull request? This comment went stale long time ago, this PR fixes it

[GitHub] spark issue #15004: [SPARK-17440] [SPARK-17441] Fixed Multiple Bugs in ALTER...

2016-09-14 Thread cloud-fan
Github user cloud-fan commented on the issue: https://github.com/apache/spark/pull/15004 retest this please --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes

[GitHub] spark pull request #14926: [SPARK-17365][Core] Remove/Kill multiple executor...

2016-09-14 Thread tgravescs
Github user tgravescs commented on a diff in the pull request: https://github.com/apache/spark/pull/14926#discussion_r78772038 --- Diff: core/src/main/scala/org/apache/spark/SparkContext.scala --- @@ -36,18 +36,15 @@ import com.google.common.collect.MapMaker import

[GitHub] spark issue #13513: [SPARK-15698][SQL][Streaming] Add the ability to remove ...

2016-09-14 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/13513 **[Test build #65368 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/65368/consoleFull)** for PR 13513 at commit

[GitHub] spark pull request #14659: [SPARK-16757] Set up Spark caller context to HDFS

2016-09-14 Thread tgravescs
Github user tgravescs commented on a diff in the pull request: https://github.com/apache/spark/pull/14659#discussion_r78773807 --- Diff: core/src/main/scala/org/apache/spark/scheduler/Task.scala --- @@ -79,6 +82,13 @@ private[spark] abstract class Task[T]( metrics)

[GitHub] spark issue #15093: [SPARK-17480][SQL][FOLLOWUP] Fix more instances which ca...

2016-09-14 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/15093 **[Test build #65369 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/65369/consoleFull)** for PR 15093 at commit

[GitHub] spark issue #14444: [SPARK-16839] [SQL] redundant aliases after cleanupAlias...

2016-09-14 Thread shivaram
Github user shivaram commented on the issue: https://github.com/apache/spark/pull/1 Yeah the R change looks fine. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled

[GitHub] spark issue #14444: [SPARK-16839] [SQL] redundant aliases after cleanupAlias...

2016-09-14 Thread shivaram
Github user shivaram commented on the issue: https://github.com/apache/spark/pull/1 Jenkins, retest this please --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and

[GitHub] spark issue #15095: [SQL][minor] correct the comment of SortBasedAggregation...

2016-09-14 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/15095 **[Test build #65372 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/65372/consoleFull)** for PR 15095 at commit

[GitHub] spark issue #14834: [SPARK-17163][ML] Unified LogisticRegression interface

2016-09-14 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/14834 **[Test build #65374 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/65374/consoleFull)** for PR 14834 at commit

[GitHub] spark pull request #14659: [SPARK-16757] Set up Spark caller context to HDFS

2016-09-14 Thread tgravescs
Github user tgravescs commented on a diff in the pull request: https://github.com/apache/spark/pull/14659#discussion_r78768851 --- Diff: yarn/src/main/scala/org/apache/spark/deploy/yarn/ApplicationMaster.scala --- @@ -184,6 +184,9 @@ private[spark] class ApplicationMaster(

[GitHub] spark issue #15048: [SPARK-17409] [SQL] Do Not Optimize Query in CTAS More T...

2016-09-14 Thread cloud-fan
Github user cloud-fan commented on the issue: https://github.com/apache/spark/pull/15048 thanks, merging to master! --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and

[GitHub] spark issue #15004: [SPARK-17440] [SPARK-17441] Fixed Multiple Bugs in ALTER...

2016-09-14 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/15004 **[Test build #65375 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/65375/consoleFull)** for PR 15004 at commit

[GitHub] spark issue #15094: [SPARK-17534] [TESTS] Increase timeouts for DirectKafkaS...

2016-09-14 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/15094 Merged build finished. Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature

[GitHub] spark issue #15051: [SPARK-17499][SparkR][ML][MLLib] make the default params...

2016-09-14 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/15051 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/65373/ Test PASSed. ---

[GitHub] spark issue #15051: [SPARK-17499][ML][MLLib] make the default params in spar...

2016-09-14 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/15051 Test FAILed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/65371/ Test FAILed. ---

[GitHub] spark issue #15051: [SPARK-17499][ML][MLLib] make the default params in spar...

2016-09-14 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/15051 **[Test build #65371 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/65371/consoleFull)** for PR 15051 at commit

[GitHub] spark pull request #14659: [SPARK-16757] Set up Spark caller context to HDFS

2016-09-14 Thread tgravescs
Github user tgravescs commented on a diff in the pull request: https://github.com/apache/spark/pull/14659#discussion_r78758857 --- Diff: core/src/main/scala/org/apache/spark/scheduler/Task.scala --- @@ -54,7 +54,10 @@ private[spark] abstract class Task[T]( val partitionId:

[GitHub] spark pull request #14659: [SPARK-16757] Set up Spark caller context to HDFS

2016-09-14 Thread tgravescs
Github user tgravescs commented on a diff in the pull request: https://github.com/apache/spark/pull/14659#discussion_r78758789 --- Diff: core/src/main/scala/org/apache/spark/scheduler/ShuffleMapTask.scala --- @@ -51,8 +51,12 @@ private[spark] class ShuffleMapTask(

[GitHub] spark issue #15051: [SPARK-17499][ML][MLLib] make the default params in spar...

2016-09-14 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/15051 **[Test build #65373 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/65373/consoleFull)** for PR 15051 at commit

[GitHub] spark issue #15060: [SPARK-17507][ML][MLLib] check weight vector size in ANN

2016-09-14 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/15060 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/65370/ Test PASSed. ---

[GitHub] spark pull request #14659: [SPARK-16757] Set up Spark caller context to HDFS

2016-09-14 Thread tgravescs
Github user tgravescs commented on a diff in the pull request: https://github.com/apache/spark/pull/14659#discussion_r78769775 --- Diff: core/src/main/scala/org/apache/spark/scheduler/Task.scala --- @@ -79,6 +82,13 @@ private[spark] abstract class Task[T]( metrics)

[GitHub] spark issue #15094: [SPARK-17534] [TESTS] Increase timeouts for DirectKafkaS...

2016-09-14 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/15094 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/65367/ Test PASSed. ---

[GitHub] spark issue #15082: [SPARK-17528][SQL] MutableProjection should not cache co...

2016-09-14 Thread cloud-fan
Github user cloud-fan commented on the issue: https://github.com/apache/spark/pull/15082 After thinking more about it, it seems not a problem and we don't need to fix it. Currently `MutableProjection` is used in 3 places: 1. hash based aggregate. It's fine because hash

[GitHub] spark issue #15060: [SPARK-17507][ML][MLLib] check weight vector size in ANN

2016-09-14 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/15060 Merged build finished. Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature

[GitHub] spark pull request #15054: [SPARK-17502] [SQL] Fix Multiple Bugs in DDL Stat...

2016-09-14 Thread cloud-fan
Github user cloud-fan commented on a diff in the pull request: https://github.com/apache/spark/pull/15054#discussion_r78766111 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/command/ddl.scala --- @@ -266,6 +267,12 @@ case class AlterTableUnsetPropertiesCommand(

[GitHub] spark pull request #15054: [SPARK-17502] [SQL] Fix Multiple Bugs in DDL Stat...

2016-09-14 Thread cloud-fan
Github user cloud-fan commented on a diff in the pull request: https://github.com/apache/spark/pull/15054#discussion_r78765976 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/command/AnalyzeTableCommand.scala --- @@ -91,6 +91,9 @@ case class

[GitHub] spark pull request #14659: [SPARK-16757] Set up Spark caller context to HDFS

2016-09-14 Thread tgravescs
Github user tgravescs commented on a diff in the pull request: https://github.com/apache/spark/pull/14659#discussion_r78767418 --- Diff: core/src/main/scala/org/apache/spark/util/Utils.scala --- @@ -2418,6 +2418,21 @@ private[spark] object Utils extends Logging {

[GitHub] spark issue #15051: [SPARK-17499][SparkR][ML][MLLib] make the default params...

2016-09-14 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/15051 **[Test build #65373 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/65373/consoleFull)** for PR 15051 at commit

[GitHub] spark issue #15097: [SPARK-17540][SparkR][Spark Core] fix SparkR array serde...

2016-09-14 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/15097 **[Test build #65376 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/65376/consoleFull)** for PR 15097 at commit

[GitHub] spark issue #15096: [SPARK-17537] [SQL] Reading parquet schema from driver d...

2016-09-14 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/15096 Can one of the admins verify this patch? --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this

[GitHub] spark issue #15093: [SPARK-17480][SQL][FOLLOWUP] Fix more instances which ca...

2016-09-14 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/15093 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/65366/ Test PASSed. ---

[GitHub] spark issue #13513: [SPARK-15698][SQL][Streaming] Add the ability to remove ...

2016-09-14 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/13513 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/65368/ Test PASSed. ---

[GitHub] spark issue #13513: [SPARK-15698][SQL][Streaming] Add the ability to remove ...

2016-09-14 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/13513 Merged build finished. Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature

[GitHub] spark issue #15094: [SPARK-17534] [TESTS] Increase timeouts for DirectKafkaS...

2016-09-14 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/15094 **[Test build #65367 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/65367/consoleFull)** for PR 15094 at commit

[GitHub] spark issue #15051: [SPARK-17499][SparkR][ML][MLLib] make the default params...

2016-09-14 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/15051 Merged build finished. Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature

[GitHub] spark issue #13513: [SPARK-15698][SQL][Streaming] Add the ability to remove ...

2016-09-14 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/13513 **[Test build #65365 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/65365/consoleFull)** for PR 13513 at commit

[GitHub] spark issue #15051: [SPARK-17499][ML][MLLib] make the default params in spar...

2016-09-14 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/15051 Merged build finished. Test FAILed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature

[GitHub] spark pull request #14659: [SPARK-16757] Set up Spark caller context to HDFS

2016-09-14 Thread tgravescs
Github user tgravescs commented on a diff in the pull request: https://github.com/apache/spark/pull/14659#discussion_r78758711 --- Diff: core/src/main/scala/org/apache/spark/scheduler/ResultTask.scala --- @@ -51,8 +51,12 @@ private[spark] class ResultTask[T, U]( locs:

[GitHub] spark issue #15080: [SPARK-17526][Web UI]: Display the executor log links wi...

2016-09-14 Thread zhzhan
Github user zhzhan commented on the issue: https://github.com/apache/spark/pull/15080 @srowen Thanks for reviewing this. Any suggestion to improve it are welcomed. It does bother us a lot without being able to locate the debug log quickly in production. --- If your project is set

[GitHub] spark pull request #15096: [SPARK-17537] [SQL] Reading parquet schema from d...

2016-09-14 Thread yangw1234
GitHub user yangw1234 opened a pull request: https://github.com/apache/spark/pull/15096 [SPARK-17537] [SQL] Reading parquet schema from driver directly when there is only one file to touch ## What changes were proposed in this pull request?

[GitHub] spark issue #15093: [SPARK-17480][SQL][FOLLOWUP] Fix more instances which ca...

2016-09-14 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/15093 **[Test build #65366 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/65366/consoleFull)** for PR 15093 at commit

[GitHub] spark pull request #14926: [SPARK-17365][Core] Remove/Kill multiple executor...

2016-09-14 Thread tgravescs
Github user tgravescs commented on a diff in the pull request: https://github.com/apache/spark/pull/14926#discussion_r78772541 --- Diff: core/src/main/scala/org/apache/spark/SparkContext.scala --- @@ -2535,7 +2539,7 @@ object SparkContext extends Logging { private def

[GitHub] spark pull request #14834: [SPARK-17163][ML] Unified LogisticRegression inte...

2016-09-14 Thread sethah
Github user sethah commented on a diff in the pull request: https://github.com/apache/spark/pull/14834#discussion_r78774078 --- Diff: mllib/src/main/scala/org/apache/spark/ml/classification/LogisticRegression.scala --- @@ -676,39 +936,54 @@ object LogisticRegressionModel extends

[GitHub] spark issue #14980: [SPARK-17317][SparkR] Add SparkR vignette

2016-09-14 Thread junyangq
Github user junyangq commented on the issue: https://github.com/apache/spark/pull/14980 @shivaram Yeah sure :) --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes

[GitHub] spark pull request #15097: [SPARK-17540][SparkR][Spark Core] fix SparkR arra...

2016-09-14 Thread WeichenXu123
GitHub user WeichenXu123 opened a pull request: https://github.com/apache/spark/pull/15097 [SPARK-17540][SparkR][Spark Core] fix SparkR array serde type problem when length == 0 ## What changes were proposed in this pull request? fix SparkR array serde type problem when

[GitHub] spark issue #15095: [SQL][minor] correct the comment of SortBasedAggregation...

2016-09-14 Thread cloud-fan
Github user cloud-fan commented on the issue: https://github.com/apache/spark/pull/15095 cc @yhuai @davies @liancheng @clockfly --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature

[GitHub] spark issue #13513: [SPARK-15698][SQL][Streaming] Add the ability to remove ...

2016-09-14 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/13513 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/65365/ Test PASSed. ---

[GitHub] spark pull request #15076: [SPARK-17114][SQL] Fix aggregates grouped by lite...

2016-09-14 Thread cloud-fan
Github user cloud-fan commented on a diff in the pull request: https://github.com/apache/spark/pull/15076#discussion_r78764150 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/plans/logical/basicLogicalOperators.scala --- @@ -484,7 +494,10 @@ case class

[GitHub] spark issue #15093: [SPARK-17480][SQL][FOLLOWUP] Fix more instances which ca...

2016-09-14 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/15093 Merged build finished. Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature

[GitHub] spark pull request #15048: [SPARK-17409] [SQL] Do Not Optimize Query in CTAS...

2016-09-14 Thread asfgit
Github user asfgit closed the pull request at: https://github.com/apache/spark/pull/15048 --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is

[GitHub] spark issue #15051: [SPARK-17499][ML][MLLib] make the default params in spar...

2016-09-14 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/15051 **[Test build #65371 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/65371/consoleFull)** for PR 15051 at commit

[GitHub] spark issue #15060: [SPARK-17507][ML][MLLib] check weight vector size in ANN

2016-09-14 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/15060 **[Test build #65370 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/65370/consoleFull)** for PR 15060 at commit

[GitHub] spark pull request #15082: [SPARK-17528][SQL] MutableProjection should not c...

2016-09-14 Thread cloud-fan
Github user cloud-fan closed the pull request at: https://github.com/apache/spark/pull/15082 --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is

[GitHub] spark issue #15082: [SPARK-17528][SQL] MutableProjection should not cache co...

2016-09-14 Thread liancheng
Github user liancheng commented on the issue: https://github.com/apache/spark/pull/15082 My understanding of the main concern of closing this PR is that: 1. Although this issue can be potentially dangerous, the current code work fine without fixing this issue. 1. We still

[GitHub] spark pull request #14659: [SPARK-16757] Set up Spark caller context to HDFS

2016-09-14 Thread tgravescs
Github user tgravescs commented on a diff in the pull request: https://github.com/apache/spark/pull/14659#discussion_r78767924 --- Diff: core/src/main/scala/org/apache/spark/util/Utils.scala --- @@ -2418,6 +2418,21 @@ private[spark] object Utils extends Logging {

[GitHub] spark pull request #14659: [SPARK-16757] Set up Spark caller context to HDFS

2016-09-14 Thread tgravescs
Github user tgravescs commented on a diff in the pull request: https://github.com/apache/spark/pull/14659#discussion_r78767856 --- Diff: core/src/main/scala/org/apache/spark/util/Utils.scala --- @@ -2418,6 +2418,21 @@ private[spark] object Utils extends Logging {

  1   2   3   4   5   6   >