date:20170825

[GitHub] spark issue #19048: [SPARK-21834] Incorrect executor request in case of dyna...

2017-08-25 Thread vanzin

Github user vanzin commented on the issue: https://github.com/apache/spark/pull/19048 I think I'm starting to understand what you're getting at, but I still don't see why this has anything to do with the CGSB. What I understand from your comment is that the EAM may reduce its target

[GitHub] spark issue #19047: [SPARK-21798]: No config to replace deprecated SPARK_CLA...

2017-08-25 Thread tgravescs

Github user tgravescs commented on the issue: https://github.com/apache/spark/pull/19047 test this please --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so,

[GitHub] spark issue #19047: [SPARK-21798]: No config to replace deprecated SPARK_CLA...

2017-08-25 Thread AmplabJenkins

Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/19047 Merged build finished. Test FAILed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature

[GitHub] spark issue #19047: [SPARK-21798]: No config to replace deprecated SPARK_CLA...

2017-08-25 Thread AmplabJenkins

Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/19047 Test FAILed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/81133/ Test FAILed. ---

[GitHub] spark issue #19047: [SPARK-21798]: No config to replace deprecated SPARK_CLA...

2017-08-25 Thread SparkQA

Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/19047 **[Test build #81133 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/81133/testReport)** for PR 19047 at commit

[GitHub] spark issue #19055: [SPARK-21839][SQL] Support SQL config for ORC compressio...

2017-08-25 Thread SparkQA

Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/19055 **[Test build #81139 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/81139/testReport)** for PR 19055 at commit

[GitHub] spark pull request #19055: [SPARK-21839][SQL] Support SQL config for ORC com...

2017-08-25 Thread dongjoon-hyun

GitHub user dongjoon-hyun opened a pull request: https://github.com/apache/spark/pull/19055 [SPARK-21839][SQL] Support SQL config for ORC compression ## What changes were proposed in this pull request? This PR aims to support `spark.sql.orc.compression.codec` like Parquet's

[GitHub] spark issue #19048: [SPARK-21834] Incorrect executor request in case of dyna...

2017-08-25 Thread sitalkedia

Github user sitalkedia commented on the issue: https://github.com/apache/spark/pull/19048 That's not really true. The EAM uses the `requestTotalExecutors` api to set the target for the scheduler. - 10 executors are running, each executor can run 4 tasks at max. - 20

[GitHub] spark pull request #18966: [SPARK-21751][SQL] CodeGeneraor.splitExpressions ...

2017-08-25 Thread gatorsmile

Github user gatorsmile commented on a diff in the pull request: https://github.com/apache/spark/pull/18966#discussion_r135332868 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/internal/SQLConf.scala --- @@ -582,6 +582,15 @@ object SQLConf { .intConf

[GitHub] spark issue #19048: [SPARK-21834] Incorrect executor request in case of dyna...

2017-08-25 Thread AmplabJenkins

Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/19048 Merged build finished. Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature

[GitHub] spark issue #19048: [SPARK-21834] Incorrect executor request in case of dyna...

2017-08-25 Thread AmplabJenkins

Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/19048 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/81132/ Test PASSed. ---

[GitHub] spark issue #19048: [SPARK-21834] Incorrect executor request in case of dyna...

2017-08-25 Thread SparkQA

Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/19048 **[Test build #81132 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/81132/testReport)** for PR 19048 at commit

[GitHub] spark issue #19048: [SPARK-21834] Incorrect executor request in case of dyna...

2017-08-25 Thread vanzin

Github user vanzin commented on the issue: https://github.com/apache/spark/pull/19048 > This is when things get out of sync because now the scheduler will set the number of total executors needed from 4 to 1. Have you actually observed that behavior? The way I

[GitHub] spark issue #19024: [SPARK-21469][ML][EXAMPLES] Adding Examples for FeatureH...

2017-08-25 Thread AmplabJenkins

Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/19024 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/81136/ Test PASSed. ---

[GitHub] spark issue #19024: [SPARK-21469][ML][EXAMPLES] Adding Examples for FeatureH...

2017-08-25 Thread AmplabJenkins

Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/19024 Merged build finished. Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature

[GitHub] spark issue #19024: [SPARK-21469][ML][EXAMPLES] Adding Examples for FeatureH...

2017-08-25 Thread SparkQA

Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/19024 **[Test build #81136 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/81136/testReport)** for PR 19024 at commit

[GitHub] spark issue #18193: [SPARK-15616] [SQL] CatalogRelation should fallback to H...

2017-08-25 Thread AmplabJenkins

Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/18193 Merged build finished. Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature

[GitHub] spark issue #18193: [SPARK-15616] [SQL] CatalogRelation should fallback to H...

2017-08-25 Thread AmplabJenkins

Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/18193 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/81134/ Test PASSed. ---

[GitHub] spark issue #18193: [SPARK-15616] [SQL] CatalogRelation should fallback to H...

2017-08-25 Thread SparkQA

Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/18193 **[Test build #81134 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/81134/testReport)** for PR 18193 at commit

[GitHub] spark pull request #18966: [SPARK-21751][SQL] CodeGeneraor.splitExpressions ...

2017-08-25 Thread gatorsmile

Github user gatorsmile commented on a diff in the pull request: https://github.com/apache/spark/pull/18966#discussion_r135327262 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/internal/SQLConf.scala --- @@ -582,6 +582,15 @@ object SQLConf { .intConf

[GitHub] spark issue #19048: [SPARK-21834] Incorrect executor request in case of dyna...

2017-08-25 Thread sitalkedia

Github user sitalkedia commented on the issue: https://github.com/apache/spark/pull/19048 Looking at the scheduler and the dynamic executor allocator code, this is what my understanding, correct me if I am wrong. Let's say the dynamic executor allocator is ramping down the

[GitHub] spark pull request #18966: [SPARK-21751][SQL] CodeGeneraor.splitExpressions ...

2017-08-25 Thread gatorsmile

Github user gatorsmile commented on a diff in the pull request: https://github.com/apache/spark/pull/18966#discussion_r135326229 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/internal/SQLConf.scala --- @@ -582,6 +582,15 @@ object SQLConf { .intConf

[GitHub] spark issue #18659: [SPARK-21404][PYSPARK][WIP] Simple Python Vectorized UDF...

2017-08-25 Thread SparkQA

Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/18659 **[Test build #81138 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/81138/testReport)** for PR 18659 at commit

[GitHub] spark pull request #18966: [SPARK-21751][SQL] CodeGeneraor.splitExpressions ...

2017-08-25 Thread gatorsmile

Github user gatorsmile commented on a diff in the pull request: https://github.com/apache/spark/pull/18966#discussion_r135324695 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/codegen/CodeGenerator.scala --- @@ -769,16 +769,21 @@ class

[GitHub] spark issue #19048: [SPARK-21834] Incorrect executor request in case of dyna...

2017-08-25 Thread vanzin

Github user vanzin commented on the issue: https://github.com/apache/spark/pull/19048 I'm not sure I understand why is this a problem. What is the undesired behavior that happens because of this? That's not explained either in the PR nor in the bug. The way I understand the

[GitHub] spark issue #18837: [Spark-20812][Mesos] Add secrets support to the dispatch...

2017-08-25 Thread SparkQA

Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/18837 **[Test build #3904 has finished](https://amplab.cs.berkeley.edu/jenkins/job/NewSparkPullRequestBuilder/3904/testReport)** for PR 18837 at commit

[GitHub] spark issue #19054: [SPARK-18067] Avoid shuffling child if join keys are sup...

2017-08-25 Thread tejasapatil

Github user tejasapatil commented on the issue: https://github.com/apache/spark/pull/19054 cc @hvanhovell @cloud-fan for review --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature

[GitHub] spark issue #19053: [SPARK-21837][SQL][TESTS] UserDefinedTypeSuite Local UDT...

2017-08-25 Thread SparkQA

Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/19053 **[Test build #81137 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/81137/testReport)** for PR 19053 at commit

[GitHub] spark issue #19054: [SPARK-18067] Avoid shuffling child if join keys are sup...

2017-08-25 Thread AmplabJenkins

Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/19054 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/81131/ Test PASSed. ---

[GitHub] spark issue #19054: [SPARK-18067] Avoid shuffling child if join keys are sup...

2017-08-25 Thread AmplabJenkins

Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/19054 Merged build finished. Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature

[GitHub] spark pull request #18581: [SPARK-21289][SQL][ML] Supports custom line separ...

2017-08-25 Thread HyukjinKwon

Github user HyukjinKwon commented on a diff in the pull request: https://github.com/apache/spark/pull/18581#discussion_r135316064 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/HadoopFileLinesReader.scala --- @@ -32,7 +32,9 @@ import

[GitHub] spark issue #19054: [SPARK-18067] Avoid shuffling child if join keys are sup...

2017-08-25 Thread SparkQA

Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/19054 **[Test build #81131 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/81131/testReport)** for PR 19054 at commit

[GitHub] spark issue #18941: [SPARK-21715][WebUI] History Server should not respond h...

2017-08-25 Thread vanzin

Github user vanzin commented on the issue: https://github.com/apache/spark/pull/18941 The PR title does not match what the PR summary says. The title is about one change, the summary is about a different change, and the code seems to handle both. It's all pretty confusing.

[GitHub] spark issue #19053: [SPARK-21837][SQL][TESTS] UserDefinedTypeSuite Local UDT...

2017-08-25 Thread SparkQA

Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/19053 **[Test build #3903 has finished](https://amplab.cs.berkeley.edu/jenkins/job/NewSparkPullRequestBuilder/3903/testReport)** for PR 19053 at commit

[GitHub] spark issue #19024: [SPARK-21469][ML][EXAMPLES] Adding Examples for FeatureH...

2017-08-25 Thread SparkQA

Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/19024 **[Test build #81136 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/81136/testReport)** for PR 19024 at commit

[GitHub] spark issue #19053: [SPARK-21837][SQL][TESTS] UserDefinedTypeSuite Local UDT...

2017-08-25 Thread gatorsmile

Github user gatorsmile commented on the issue: https://github.com/apache/spark/pull/19053 LGTM except one comment. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and

[GitHub] spark pull request #19053: [SPARK-21837][SQL][TESTS] UserDefinedTypeSuite Lo...

2017-08-25 Thread gatorsmile

Github user gatorsmile commented on a diff in the pull request: https://github.com/apache/spark/pull/19053#discussion_r135313526 --- Diff: sql/core/src/test/scala/org/apache/spark/sql/UserDefinedTypeSuite.scala --- @@ -203,12 +203,14 @@ class UserDefinedTypeSuite extends QueryTest

[GitHub] spark pull request #19049: [WEB-UI]Add the 'master' column to identify the t...

2017-08-25 Thread ajbozarth

Github user ajbozarth commented on a diff in the pull request: https://github.com/apache/spark/pull/19049#discussion_r135313226 --- Diff: core/src/main/resources/org/apache/spark/ui/static/historypage.js --- @@ -136,6 +136,16 @@ $(document).ready(function() {

[GitHub] spark issue #19049: [WEB-UI]Add the 'master' column to identify the type of ...

2017-08-25 Thread ajbozarth

Github user ajbozarth commented on the issue: https://github.com/apache/spark/pull/19049 I'll try to clarify @srowen issue for you @guoxiaolongzte For most use-cases each Spark cluster has it's own history server and also uses one type of resource manager. Therefore for most

[GitHub] spark issue #19013: [SPARK-21728][core] Allow SparkSubmit to use Logging.

2017-08-25 Thread SparkQA

Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/19013 **[Test build #81135 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/81135/testReport)** for PR 19013 at commit

[GitHub] spark pull request #19008: [SPARK-21756][SQL]Add JSON option to allow unquot...

2017-08-25 Thread asfgit

Github user asfgit closed the pull request at: https://github.com/apache/spark/pull/19008 --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is

[GitHub] spark issue #19008: [SPARK-21756][SQL]Add JSON option to allow unquoted cont...

2017-08-25 Thread gatorsmile

Github user gatorsmile commented on the issue: https://github.com/apache/spark/pull/19008 Thanks! Merging to master. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and

[GitHub] spark issue #19008: [SPARK-21756][SQL]Add JSON option to allow unquoted cont...

2017-08-25 Thread gatorsmile

Github user gatorsmile commented on the issue: https://github.com/apache/spark/pull/19008 LGTM --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the

[GitHub] spark pull request #19024: [SPARK-21469][ML][EXAMPLES] Adding Examples for F...

2017-08-25 Thread BryanCutler

Github user BryanCutler commented on a diff in the pull request: https://github.com/apache/spark/pull/19024#discussion_r135307673 --- Diff: docs/ml-features.md --- @@ -211,6 +211,89 @@ for more details on the API. +## FeatureHasher + +Feature hashing

[GitHub] spark pull request #19024: [SPARK-21469][ML][EXAMPLES] Adding Examples for F...

2017-08-25 Thread BryanCutler

Github user BryanCutler commented on a diff in the pull request: https://github.com/apache/spark/pull/19024#discussion_r135307551 --- Diff: docs/ml-features.md --- @@ -53,9 +53,9 @@ are calculated based on the mapped indices. This approach avoids the need to com term-to-index

[GitHub] spark pull request #19012: [SPARK-17742][core] Fail launcher app handle if c...

2017-08-25 Thread asfgit

Github user asfgit closed the pull request at: https://github.com/apache/spark/pull/19012 --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is

[GitHub] spark pull request #19050: [SPARK-21835][SQL][WIP] RewritePredicateSubquery ...

2017-08-25 Thread dilipbiswal

Github user dilipbiswal commented on a diff in the pull request: https://github.com/apache/spark/pull/19050#discussion_r135307335 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/optimizer/subquery.scala --- @@ -98,6 +99,11 @@ object RewritePredicateSubquery

[GitHub] spark issue #19012: [SPARK-17742][core] Fail launcher app handle if child pr...

2017-08-25 Thread vanzin

Github user vanzin commented on the issue: https://github.com/apache/spark/pull/19012 Merging to master. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so,

[GitHub] spark pull request #18962: [SPARK-21714][CORE][YARN] Avoiding re-uploading r...

2017-08-25 Thread asfgit

Github user asfgit closed the pull request at: https://github.com/apache/spark/pull/18962 --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is

[GitHub] spark issue #18962: [SPARK-21714][CORE][YARN] Avoiding re-uploading remote r...

2017-08-25 Thread vanzin

Github user vanzin commented on the issue: https://github.com/apache/spark/pull/18962 @jerryshao there are conflicts in 2.2, will need a separate PR. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does

[GitHub] spark pull request #18581: [SPARK-21289][SQL][ML] Supports custom line separ...

2017-08-25 Thread gatorsmile

Github user gatorsmile commented on a diff in the pull request: https://github.com/apache/spark/pull/18581#discussion_r135305729 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/HadoopFileLinesReader.scala --- @@ -32,7 +32,9 @@ import

[GitHub] spark issue #18962: [SPARK-21714][CORE][YARN] Avoiding re-uploading remote r...

2017-08-25 Thread vanzin

Github user vanzin commented on the issue: https://github.com/apache/spark/pull/18962 Merging to master, will also try 2.2. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature

[GitHub] spark issue #18791: [SPARK-21571][Scheduler] Spark history server leaves inc...

2017-08-25 Thread ericvandenbergfb

Github user ericvandenbergfb commented on the issue: https://github.com/apache/spark/pull/18791 The default is off, so people can opt-in to more aggressive clean up. Is this okay to be merged? --- If your project is set up for it, you can reply to this email and have your

[GitHub] spark issue #18193: [SPARK-15616] [SQL] CatalogRelation should fallback to H...

2017-08-25 Thread lianhuiwang

Github user lianhuiwang commented on the issue: https://github.com/apache/spark/pull/18193 @cloud-fan I have address your comments. Thanks. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have

[GitHub] spark issue #18193: [SPARK-15616] [SQL] CatalogRelation should fallback to H...

2017-08-25 Thread SparkQA

Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/18193 **[Test build #81134 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/81134/testReport)** for PR 18193 at commit

[GitHub] spark issue #19016: [SPARK-21805][SPARKR] Disable R vignettes code on Window...

2017-08-25 Thread shivaram

Github user shivaram commented on the issue: https://github.com/apache/spark/pull/19016 Thats great ! I will also run this by winbuilder later today. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does

[GitHub] spark issue #18991: [SPARK-21783][SQL] Turn on ORC filter push-down by defau...

2017-08-25 Thread dongjoon-hyun

Github user dongjoon-hyun commented on the issue: https://github.com/apache/spark/pull/18991 +1, I cannot agree anymore. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled

[GitHub] spark issue #18991: [SPARK-21783][SQL] Turn on ORC filter push-down by defau...

2017-08-25 Thread gatorsmile

Github user gatorsmile commented on the issue: https://github.com/apache/spark/pull/18991 Yes. The commercial DBMS products have a very good/comprehensive test coverage. So far, it is missing in Apache Spark. Basically, we simply trust the underlying data sources, which are

[GitHub] spark issue #19047: [SPARK-21798]: No config to replace deprecated SPARK_CLA...

2017-08-25 Thread SparkQA

Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/19047 **[Test build #81133 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/81133/testReport)** for PR 19047 at commit

[GitHub] spark issue #18991: [SPARK-21783][SQL] Turn on ORC filter push-down by defau...

2017-08-25 Thread dongjoon-hyun

Github user dongjoon-hyun commented on the issue: https://github.com/apache/spark/pull/18991 Wow. It's real commercial spec. Thank you! I understand. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does

[GitHub] spark issue #18991: [SPARK-21783][SQL] Turn on ORC filter push-down by defau...

2017-08-25 Thread gatorsmile

Github user gatorsmile commented on the issue: https://github.com/apache/spark/pull/18991 Since I saw you are also working on the enhancement of ORC reader/writer, we need to check all the limits (value ranges). I am not sure how good Apache ORC/Parquet did in their test case design.

[GitHub] spark issue #19047: [SPARK-21798]: No config to replace deprecated SPARK_CLA...

2017-08-25 Thread tgravescs

Github user tgravescs commented on the issue: https://github.com/apache/spark/pull/19047 ok to test --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if

[GitHub] spark issue #19053: [SPARK-21837][SQL][TESTS] UserDefinedTypeSuite local UDF...

2017-08-25 Thread srowen

Github user srowen commented on the issue: https://github.com/apache/spark/pull/19053 Oops, I meant UDT. Just referring to the tests name in the code. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does

[GitHub] spark pull request #19047: [SPARK-21798]: No config to replace deprecated SP...

2017-08-25 Thread pgandhi999

Github user pgandhi999 commented on a diff in the pull request: https://github.com/apache/spark/pull/19047#discussion_r135297693 --- Diff: launcher/src/main/java/org/apache/spark/launcher/AbstractCommandBuilder.java --- @@ -136,7 +136,8 @@ void addOptionString(List cmd, String

[GitHub] spark issue #18991: [SPARK-21783][SQL] Turn on ORC filter push-down by defau...

2017-08-25 Thread dongjoon-hyun

Github user dongjoon-hyun commented on the issue: https://github.com/apache/spark/pull/18991 Thank you for the comments and directions. Definitely, I'll try! Since we depends on Apache Spark 1.4.0, I think I can add raw level test case somewhere for evaluation purpose only. ---

[GitHub] spark issue #18991: [SPARK-21783][SQL] Turn on ORC filter push-down by defau...

2017-08-25 Thread gatorsmile

Github user gatorsmile commented on the issue: https://github.com/apache/spark/pull/18991 If ORC incorrectly filters out the extra rows, we might get incorrect results. In addition, we do not know whether the push down could get the performance gain. We saw the performance regression

[GitHub] spark issue #18991: [SPARK-21783][SQL] Turn on ORC filter push-down by defau...

2017-08-25 Thread dongjoon-hyun

Github user dongjoon-hyun commented on the issue: https://github.com/apache/spark/pull/18991 Hi, @gatorsmile . Could you review this ORC PPD default configuration? Our data source doesn't trust any data sources including Parquet/ORC. I think ORC PPD do no harm on Spark. --- If

[GitHub] spark pull request #19044: [SPARK-21832][TEST] Merge SQLBuilderTest into Exp...

2017-08-25 Thread asfgit

Github user asfgit closed the pull request at: https://github.com/apache/spark/pull/19044 --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is

[GitHub] spark issue #19044: [SPARK-21832][TEST] Merge SQLBuilderTest into Expression...

2017-08-25 Thread dongjoon-hyun

Github user dongjoon-hyun commented on the issue: https://github.com/apache/spark/pull/19044 Thank you for review and merging, @gatorsmile . --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have

[GitHub] spark issue #19044: [SPARK-21832][TEST] Merge SQLBuilderTest into Expression...

2017-08-25 Thread gatorsmile

Github user gatorsmile commented on the issue: https://github.com/apache/spark/pull/19044 Thanks! Merging to master. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and

[GitHub] spark issue #19053: [SPARK-21837][SQL][TESTS] UserDefinedTypeSuite local UDF...

2017-08-25 Thread gatorsmile

Github user gatorsmile commented on the issue: https://github.com/apache/spark/pull/19053 UDF? --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the

[GitHub] spark issue #19048: [SPARK-21834] Incorrect executor request in case of dyna...

2017-08-25 Thread SparkQA

Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/19048 **[Test build #81132 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/81132/testReport)** for PR 19048 at commit

[GitHub] spark issue #19048: [SPARK-21834] Incorrect executor request in case of dyna...

2017-08-25 Thread sitalkedia

Github user sitalkedia commented on the issue: https://github.com/apache/spark/pull/19048 Jenkins retest this please. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled

[GitHub] spark issue #17461: [SPARK-20082][ml] LDA incremental model learning

2017-08-25 Thread hhbyyh

Github user hhbyyh commented on the issue: https://github.com/apache/spark/pull/17461 Got it. Will make a pass today. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and

[GitHub] spark issue #15605: [WIP] [SPARK-18067] [SQL] SortMergeJoin adds shuffle if ...

2017-08-25 Thread tejasapatil

Github user tejasapatil commented on the issue: https://github.com/apache/spark/pull/15605 This is superseded by https://github.com/apache/spark/pull/19054 Closing --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your

[GitHub] spark pull request #15605: [WIP] [SPARK-18067] [SQL] SortMergeJoin adds shuf...

2017-08-25 Thread tejasapatil

Github user tejasapatil closed the pull request at: https://github.com/apache/spark/pull/15605 --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature

[GitHub] spark issue #19054: [SPARK-18067] Avoid shuffling child if join keys are sup...

2017-08-25 Thread SparkQA

Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/19054 **[Test build #81131 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/81131/testReport)** for PR 19054 at commit

[GitHub] spark pull request #19054: [SPARK-18067] Avoid shuffling child if join keys ...

2017-08-25 Thread tejasapatil

GitHub user tejasapatil opened a pull request: https://github.com/apache/spark/pull/19054 [SPARK-18067] Avoid shuffling child if join keys are superset of child's partitioning keys Jira : https://issues.apache.org/jira/browse/SPARK-18067 ## What problem is being addressed

[GitHub] spark pull request #19050: [SPARK-21835][SQL][WIP] RewritePredicateSubquery ...

2017-08-25 Thread viirya

Github user viirya commented on a diff in the pull request: https://github.com/apache/spark/pull/19050#discussion_r135283339 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/optimizer/subquery.scala --- @@ -98,6 +99,11 @@ object RewritePredicateSubquery extends

[GitHub] spark pull request #19051: [MINOR][BUILD] Fix build warnings and Java lint e...

2017-08-25 Thread srowen

Github user srowen closed the pull request at: https://github.com/apache/spark/pull/19051 --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is

[GitHub] spark issue #19051: [MINOR][BUILD] Fix build warnings and Java lint errors

2017-08-25 Thread srowen

Github user srowen commented on the issue: https://github.com/apache/spark/pull/19051 Merged to master --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or

[GitHub] spark issue #18837: [Spark-20812][Mesos] Add secrets support to the dispatch...

2017-08-25 Thread SparkQA

Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/18837 **[Test build #3904 has started](https://amplab.cs.berkeley.edu/jenkins/job/NewSparkPullRequestBuilder/3904/testReport)** for PR 18837 at commit

[GitHub] spark issue #19053: [SPARK-21837][SQL][TESTS] UserDefinedTypeSuite local UDF...

2017-08-25 Thread SparkQA

Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/19053 **[Test build #3903 has started](https://amplab.cs.berkeley.edu/jenkins/job/NewSparkPullRequestBuilder/3903/testReport)** for PR 19053 at commit

[GitHub] spark issue #18730: [SPARK-21527][CORE] Use buffer limit in order to use JAV...

2017-08-25 Thread cloud-fan

Github user cloud-fan commented on the issue: https://github.com/apache/spark/pull/18730 thanks, merging to master! --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and

[GitHub] spark pull request #18730: [SPARK-21527][CORE] Use buffer limit in order to ...

2017-08-25 Thread asfgit

Github user asfgit closed the pull request at: https://github.com/apache/spark/pull/18730 --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is

[GitHub] spark pull request #19050: [SPARK-21835][SQL][WIP] RewritePredicateSubquery ...

2017-08-25 Thread viirya

Github user viirya commented on a diff in the pull request: https://github.com/apache/spark/pull/19050#discussion_r135280321 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/analysis/Analyzer.scala --- @@ -2502,3 +2373,140 @@ object UpdateOuterReferences

[GitHub] spark pull request #19050: [SPARK-21835][SQL][WIP] RewritePredicateSubquery ...

2017-08-25 Thread hvanhovell

Github user hvanhovell commented on a diff in the pull request: https://github.com/apache/spark/pull/19050#discussion_r135271400 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/optimizer/subquery.scala --- @@ -98,6 +99,11 @@ object RewritePredicateSubquery

[GitHub] spark pull request #19050: [SPARK-21835][SQL][WIP] RewritePredicateSubquery ...

2017-08-25 Thread hvanhovell

Github user hvanhovell commented on a diff in the pull request: https://github.com/apache/spark/pull/19050#discussion_r135270779 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/analysis/Analyzer.scala --- @@ -2502,3 +2373,140 @@ object UpdateOuterReferences

[GitHub] spark issue #18837: [Spark-20812][Mesos] Add secrets support to the dispatch...

2017-08-25 Thread ArtRand

Github user ArtRand commented on the issue: https://github.com/apache/spark/pull/18837 Hello @srowen, thanks for taking a look at this. You're correct in that this change does not require users to have a Mesos 1.3+ cluster, we do not change or omit any required records in the proto

[GitHub] spark pull request #19024: [SPARK-21469][ML][EXAMPLES] Adding Examples for F...

2017-08-25 Thread MLnick

Github user MLnick commented on a diff in the pull request: https://github.com/apache/spark/pull/19024#discussion_r135261324 --- Diff: docs/ml-features.md --- @@ -211,6 +211,89 @@ for more details on the API. +## FeatureHasher + +Feature hashing

[GitHub] spark pull request #19024: [SPARK-21469][ML][EXAMPLES] Adding Examples for F...

2017-08-25 Thread MLnick

Github user MLnick commented on a diff in the pull request: https://github.com/apache/spark/pull/19024#discussion_r135261228 --- Diff: docs/ml-features.md --- @@ -53,9 +53,9 @@ are calculated based on the mapped indices. This approach avoids the need to com term-to-index map,

[GitHub] spark issue #18730: [SPARK-21527][CORE] Use buffer limit in order to use JAV...

2017-08-25 Thread caneGuy

Github user caneGuy commented on the issue: https://github.com/apache/spark/pull/18730 @cloud-fan Jekins done! --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes

[GitHub] spark issue #19053: [SPARK-21837][SQL][TESTS] UserDefinedTypeSuite local UDF...

2017-08-25 Thread AmplabJenkins

Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/19053 Merged build finished. Test FAILed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature

[GitHub] spark issue #19053: [SPARK-21837][SQL][TESTS] UserDefinedTypeSuite local UDF...

2017-08-25 Thread AmplabJenkins

Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/19053 Test FAILed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/81130/ Test FAILed. ---

[GitHub] spark issue #19012: [SPARK-17742][core] Fail launcher app handle if child pr...

2017-08-25 Thread jerryshao

Github user jerryshao commented on the issue: https://github.com/apache/spark/pull/19012 LGTM, I tried locally. Looks like now the NPE is gone in yarn UT, thanks for the fix. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as

[GitHub] spark issue #19053: [SPARK-21837][SQL][TESTS] UserDefinedTypeSuite local UDF...

2017-08-25 Thread SparkQA

Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/19053 **[Test build #81130 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/81130/testReport)** for PR 19053 at commit

[GitHub] spark issue #19051: [MINOR][BUILD] Fix build warnings and Java lint errors

2017-08-25 Thread SparkQA

Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/19051 **[Test build #3902 has finished](https://amplab.cs.berkeley.edu/jenkins/job/NewSparkPullRequestBuilder/3902/testReport)** for PR 19051 at commit

[GitHub] spark issue #18730: [SPARK-21527][CORE] Use buffer limit in order to use JAV...

2017-08-25 Thread AmplabJenkins

Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/18730 Merged build finished. Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature

[GitHub] spark issue #18730: [SPARK-21527][CORE] Use buffer limit in order to use JAV...

2017-08-25 Thread AmplabJenkins

Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/18730 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/81128/ Test PASSed. ---

[GitHub] spark issue #18730: [SPARK-21527][CORE] Use buffer limit in order to use JAV...

2017-08-25 Thread SparkQA

Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/18730 **[Test build #81128 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/81128/testReport)** for PR 18730 at commit

< 1 2 3 4 >

101 - 200 of 300 matches

Mail list logo