[GitHub] spark issue #14867: [SPARK-17296][SQL] Simplify parser join processing.

2016-09-02 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/14867 **[Test build #64880 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/64880/consoleFull)** for PR 14867 at commit

[GitHub] spark pull request #14517: [SPARK-16931][PYTHON] PySpark APIS for bucketBy a...

2016-09-02 Thread davies
Github user davies commented on a diff in the pull request: https://github.com/apache/spark/pull/14517#discussion_r77420246 --- Diff: python/pyspark/sql/readwriter.py --- @@ -747,16 +800,25 @@ def _test(): except py4j.protocol.Py4JError: spark =

[GitHub] spark issue #14866: [SPARK-17298][SQL] Require explicit CROSS join for carte...

2016-09-02 Thread davies
Github user davies commented on the issue: https://github.com/apache/spark/pull/14866 LGTM --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the

[GitHub] spark pull request #14866: [SPARK-17298][SQL] Require explicit CROSS join fo...

2016-09-02 Thread asfgit
Github user asfgit closed the pull request at: https://github.com/apache/spark/pull/14866

[GitHub] spark issue #14866: [SPARK-17298][SQL] Require explicit CROSS join for carte...

2016-09-02 Thread hvanhovell
Github user hvanhovell commented on the issue: https://github.com/apache/spark/pull/14866 Merging to master. Thanks!

[GitHub] spark pull request #14690: [SPARK-16980][SQL] Load only catalog table partit...

2016-09-02 Thread ericl
Github user ericl commented on a diff in the pull request: https://github.com/apache/spark/pull/14690#discussion_r77419498 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/TableFileCatalog.scala --- @@ -0,0 +1,102 @@ +/* + * Licensed to the

[GitHub] spark pull request #14690: [SPARK-16980][SQL] Load only catalog table partit...

2016-09-02 Thread ericl
Github user ericl commented on a diff in the pull request: https://github.com/apache/spark/pull/14690#discussion_r77419464 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/FileSourceStrategy.scala --- @@ -79,8 +79,16 @@ object FileSourceStrategy

[GitHub] spark pull request #14690: [SPARK-16980][SQL] Load only catalog table partit...

2016-09-02 Thread ericl
Github user ericl commented on a diff in the pull request: https://github.com/apache/spark/pull/14690#discussion_r77419510 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/fileSourceInterfaces.scala --- @@ -346,11 +340,30 @@ trait FileCatalog {

[GitHub] spark pull request #14690: [SPARK-16980][SQL] Load only catalog table partit...

2016-09-02 Thread ericl
Github user ericl commented on a diff in the pull request: https://github.com/apache/spark/pull/14690#discussion_r77419448 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/DataSourceScanExec.scala --- @@ -184,7 +184,7 @@ case class FileSourceScanExec(

[GitHub] spark pull request #14690: [SPARK-16980][SQL] Load only catalog table partit...

2016-09-02 Thread ericl
Github user ericl commented on a diff in the pull request: https://github.com/apache/spark/pull/14690#discussion_r77419496 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/TableFileCatalog.scala --- @@ -0,0 +1,102 @@ +/* + * Licensed to the

[GitHub] spark pull request #14941: [SPARK-16334] Reusing same dictionary column for ...

2016-09-02 Thread asfgit
Github user asfgit closed the pull request at: https://github.com/apache/spark/pull/14941

[GitHub] spark pull request #14690: [SPARK-16980][SQL] Load only catalog table partit...

2016-09-02 Thread ericl
Github user ericl commented on a diff in the pull request: https://github.com/apache/spark/pull/14690#discussion_r77419455 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/Dataset.scala --- @@ -2531,6 +2531,8 @@ class Dataset[T] private[sql]( */ def inputFiles:

[GitHub] spark issue #14797: [SPARK-17230] [SQL] Should not pass optimized query into...

2016-09-02 Thread davies
Github user davies commented on the issue: https://github.com/apache/spark/pull/14797 Merged this into master and 2.0 branch, thanks!

[GitHub] spark issue #14941: [SPARK-16334] Reusing same dictionary column for decodin...

2016-09-02 Thread davies
Github user davies commented on the issue: https://github.com/apache/spark/pull/14941 Merging this into master and 2.0 branch, thanks!

[GitHub] spark issue #14941: [SPARK-16334] Reusing same dictionary column for decodin...

2016-09-02 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/14941 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/64870/ Test PASSed.

[GitHub] spark issue #14941: [SPARK-16334] Reusing same dictionary column for decodin...

2016-09-02 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/14941 Merged build finished. Test PASSed.

[GitHub] spark pull request #14797: [SPARK-17230] [SQL] Should not pass optimized que...

2016-09-02 Thread asfgit
Github user asfgit closed the pull request at: https://github.com/apache/spark/pull/14797

[GitHub] spark issue #14941: [SPARK-16334] Reusing same dictionary column for decodin...

2016-09-02 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/14941 **[Test build #64870 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/64870/consoleFull)** for PR 14941 at commit

[GitHub] spark pull request #14867: [SPARK-17296][SQL] Simplify parser join processin...

2016-09-02 Thread srinathshankar
Github user srinathshankar commented on a diff in the pull request: https://github.com/apache/spark/pull/14867#discussion_r77418488 --- Diff: sql/catalyst/src/test/scala/org/apache/spark/sql/catalyst/parser/PlanParserSuite.scala --- @@ -360,10 +360,25 @@ class PlanParserSuite

[GitHub] spark issue #14941: [SPARK-16334] Reusing same dictionary column for decodin...

2016-09-02 Thread heroldus
Github user heroldus commented on the issue: https://github.com/apache/spark/pull/14941 @davies Fine, thx.

[GitHub] spark issue #14638: [SPARK-11374][SQL] Support `skip.header.line.count` opti...

2016-09-02 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/14638 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/64872/ Test PASSed.

[GitHub] spark issue #14638: [SPARK-11374][SQL] Support `skip.header.line.count` opti...

2016-09-02 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/14638 Merged build finished. Test PASSed.

[GitHub] spark issue #14638: [SPARK-11374][SQL] Support `skip.header.line.count` opti...

2016-09-02 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/14638 **[Test build #64872 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/64872/consoleFull)** for PR 14638 at commit

[GitHub] spark pull request #14872: [SPARK-3162][MLlib][WIP] Add local tree training ...

2016-09-02 Thread smurching
Github user smurching closed the pull request at: https://github.com/apache/spark/pull/14872

[GitHub] spark issue #14931: [SPARK-17370] Shuffle service files not invalidated when...

2016-09-02 Thread vanzin
Github user vanzin commented on the issue: https://github.com/apache/spark/pull/14931 The issue I see is how easy is it for the driver to know that? Adding a new flag to the `SlaveLost` class doesn't mean that you know how to set its value. I'm pretty sure, on the YARN side,

[GitHub] spark issue #14866: [SPARK-17298][SQL] Require explicit CROSS join for carte...

2016-09-02 Thread sameeragarwal
Github user sameeragarwal commented on the issue: https://github.com/apache/spark/pull/14866 LGTM

[GitHub] spark issue #14942: [SparkR][Minor] Fix docs for sparkR.session and count

2016-09-02 Thread shivaram
Github user shivaram commented on the issue: https://github.com/apache/spark/pull/14942 cc @felixcheung

[GitHub] spark issue #14866: [SPARK-17298][SQL] Require explicit CROSS join for carte...

2016-09-02 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/14866 Merged build finished. Test PASSed.

[GitHub] spark issue #14866: [SPARK-17298][SQL] Require explicit CROSS join for carte...

2016-09-02 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/14866 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/64869/ Test PASSed.

[GitHub] spark issue #14866: [SPARK-17298][SQL] Require explicit CROSS join for carte...

2016-09-02 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/14866 **[Test build #64869 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/64869/consoleFull)** for PR 14866 at commit

[GitHub] spark issue #14872: [SPARK-3162][MLlib][WIP] Add local tree training for dec...

2016-09-02 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/14872 Test FAILed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/64879/ Test FAILed.

[GitHub] spark issue #14872: [SPARK-3162][MLlib][WIP] Add local tree training for dec...

2016-09-02 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/14872 **[Test build #64879 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/64879/consoleFull)** for PR 14872 at commit

[GitHub] spark issue #14872: [SPARK-3162][MLlib][WIP] Add local tree training for dec...

2016-09-02 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/14872 Merged build finished. Test FAILed.

[GitHub] spark issue #14797: [SPARK-17230] [SQL] Should not pass optimized query into...

2016-09-02 Thread srinathshankar
Github user srinathshankar commented on the issue: https://github.com/apache/spark/pull/14797 Looks fine.

[GitHub] spark issue #14872: [SPARK-3162][MLlib][WIP] Add local tree training for dec...

2016-09-02 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/14872 **[Test build #64879 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/64879/consoleFull)** for PR 14872 at commit

[GitHub] spark pull request #14867: [SPARK-17296][SQL] Simplify parser join processin...

2016-09-02 Thread srinathshankar
Github user srinathshankar commented on a diff in the pull request: https://github.com/apache/spark/pull/14867#discussion_r77417316 --- Diff: sql/catalyst/src/test/scala/org/apache/spark/sql/catalyst/parser/PlanParserSuite.scala --- @@ -360,10 +360,25 @@ class PlanParserSuite

[GitHub] spark issue #14797: [SPARK-17230] [SQL] Should not pass optimized query into...

2016-09-02 Thread sameeragarwal
Github user sameeragarwal commented on the issue: https://github.com/apache/spark/pull/14797 LGTM

[GitHub] spark issue #14866: [SPARK-17298][SQL] Require explicit CROSS join for carte...

2016-09-02 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/14866 **[Test build #3245 has finished](https://amplab.cs.berkeley.edu/jenkins/job/NewSparkPullRequestBuilder/3245/consoleFull)** for PR 14866 at commit

[GitHub] spark pull request #14887: [SPARK-17321][YARN] YARN shuffle service should u...

2016-09-02 Thread zhaoyunjiong
Github user zhaoyunjiong closed the pull request at: https://github.com/apache/spark/pull/14887

[GitHub] spark pull request #14929: [SPARK-17374][SQL] Better error messages when par...

2016-09-02 Thread clockfly
Github user clockfly commented on a diff in the pull request: https://github.com/apache/spark/pull/14929#discussion_r77416301 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/json/JacksonParser.scala --- @@ -62,8 +68,39 @@ class JacksonParser(

[GitHub] spark issue #11119: [SPARK-10780][ML] Add an initial model to kmeans

2016-09-02 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/11119 **[Test build #64878 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/64878/consoleFull)** for PR 11119 at commit

[GitHub] spark issue #11119: [SPARK-10780][ML] Add an initial model to kmeans

2016-09-02 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/11119 Merged build finished. Test FAILed.

[GitHub] spark issue #11119: [SPARK-10780][ML] Add an initial model to kmeans

2016-09-02 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/11119 Test FAILed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/64878/ Test FAILed.

[GitHub] spark issue #11119: [SPARK-10780][ML] Add an initial model to kmeans

2016-09-02 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/11119 **[Test build #64878 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/64878/consoleFull)** for PR 11119 at commit

[GitHub] spark pull request #11119: [SPARK-10780][ML] Add an initial model to kmeans

2016-09-02 Thread yinxusen
Github user yinxusen commented on a diff in the pull request: https://github.com/apache/spark/pull/11119#discussion_r77414688 --- Diff: mllib/src/test/scala/org/apache/spark/ml/clustering/KMeansSuite.scala --- @@ -139,16 +145,32 @@ class KMeansSuite extends SparkFunSuite with

[GitHub] spark issue #11119: [SPARK-10780][ML] Add an initial model to kmeans

2016-09-02 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/11119 **[Test build #64877 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/64877/consoleFull)** for PR 11119 at commit

[GitHub] spark issue #14938: [SPARK-17335][SQL] Fix ArrayType and MapType CatalogStri...

2016-09-02 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/14938 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/64867/ Test PASSed.

[GitHub] spark issue #14938: [SPARK-17335][SQL] Fix ArrayType and MapType CatalogStri...

2016-09-02 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/14938 Merged build finished. Test PASSed.

[GitHub] spark issue #14938: [SPARK-17335][SQL] Fix ArrayType and MapType CatalogStri...

2016-09-02 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/14938 **[Test build #64867 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/64867/consoleFull)** for PR 14938 at commit

[GitHub] spark issue #14941: [SPARK-16334] Reusing same dictionary column for decodin...

2016-09-02 Thread davies
Github user davies commented on the issue: https://github.com/apache/spark/pull/14941 @heroldus decodeDictionaryIds() is only used when a batch spans pages with different encodings (dictionary or plain), so it's not in the hot path; I think the performance impact should be fine.

[GitHub] spark issue #14942: [SparkR][Minor] Fix docs for sparkR.session and count

2016-09-02 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/14942 Merged build finished. Test PASSed.

[GitHub] spark issue #14942: [SparkR][Minor] Fix docs for sparkR.session and count

2016-09-02 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/14942 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/64871/ Test PASSed.

[GitHub] spark issue #14942: [SparkR][Minor] Fix docs for sparkR.session and count

2016-09-02 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/14942 **[Test build #64871 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/64871/consoleFull)** for PR 14942 at commit

[GitHub] spark issue #14854: [SPARK-17283][Core] Cancel job in RDD.take() as soon as ...

2016-09-02 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/14854 Merged build finished. Test PASSed.

[GitHub] spark issue #14854: [SPARK-17283][Core] Cancel job in RDD.take() as soon as ...

2016-09-02 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/14854 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/64866/ Test PASSed.

[GitHub] spark issue #14854: [SPARK-17283][Core] Cancel job in RDD.take() as soon as ...

2016-09-02 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/14854 **[Test build #64866 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/64866/consoleFull)** for PR 14854 at commit

[GitHub] spark issue #14941: [SPARK-16334] Reusing same dictionary column for decodin...

2016-09-02 Thread heroldus
Github user heroldus commented on the issue: https://github.com/apache/spark/pull/14941 @sameeragarwal: Do you expect any performance impact of this commit? It's an additional `if (!column.isNullAt(i))` for every single value read.

[GitHub] spark issue #14527: [SPARK-16938][SQL] `drop/dropDuplicate` should handle th...

2016-09-02 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/14527 **[Test build #64874 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/64874/consoleFull)** for PR 14527 at commit

[GitHub] spark issue #14426: [SPARK-16475][SQL] Broadcast Hint for SQL Queries

2016-09-02 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/14426 **[Test build #64875 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/64875/consoleFull)** for PR 14426 at commit

[GitHub] spark issue #14116: [SPARK-16452][SQL] Support basic INFORMATION_SCHEMA

2016-09-02 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/14116 **[Test build #64876 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/64876/consoleFull)** for PR 14116 at commit

[GitHub] spark issue #14623: [SPARK-17044][SQL] Make test files for window functions ...

2016-09-02 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/14623 **[Test build #64873 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/64873/consoleFull)** for PR 14623 at commit

[GitHub] spark issue #14638: [SPARK-11374][SQL] Support `skip.header.line.count` opti...

2016-09-02 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/14638 **[Test build #64872 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/64872/consoleFull)** for PR 14638 at commit

[GitHub] spark issue #14941: [SPARK-16334] Reusing same dictionary column for decodin...

2016-09-02 Thread davies
Github user davies commented on the issue: https://github.com/apache/spark/pull/14941 LGTM, pending jenkins.

[GitHub] spark issue #14942: [SparkR][Minor] Fix docs for sparkR.session and count

2016-09-02 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/14942 **[Test build #64871 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/64871/consoleFull)** for PR 14942 at commit

[GitHub] spark issue #14941: [SPARK-16334] Reusing same dictionary column for decodin...

2016-09-02 Thread sameeragarwal
Github user sameeragarwal commented on the issue: https://github.com/apache/spark/pull/14941 cc @davies

[GitHub] spark pull request #14942: [SparkR][Minor] Fix docs for sparkR.session and c...

2016-09-02 Thread junyangq
GitHub user junyangq opened a pull request: https://github.com/apache/spark/pull/14942 [SparkR][Minor] Fix docs for sparkR.session and count ## What changes were proposed in this pull request? This PR tries to add some more explanation to `sparkR.session`. It also modifies

[GitHub] spark issue #12436: [SPARK-14649][CORE] DagScheduler should not run duplicat...

2016-09-02 Thread davies
Github user davies commented on the issue: https://github.com/apache/spark/pull/12436 @sitalkedia Have a quick look at this one, the use case sounds good, we should improve the stability for long running tasks. Could you explain a bit more how the current patch works? (in the PR

[GitHub] spark issue #14941: [SPARK-16334] Reusing same dictionary column for decodin...

2016-09-02 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/14941 **[Test build #64870 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/64870/consoleFull)** for PR 14941 at commit

[GitHub] spark pull request #14941: [SPARK-16334] Reusing same dictionary column for ...

2016-09-02 Thread sameeragarwal
GitHub user sameeragarwal opened a pull request: https://github.com/apache/spark/pull/14941 [SPARK-16334] Reusing same dictionary column for decoding consecutive row groups shouldn't throw an error ## What changes were proposed in this pull request? This patch fixes a bug

[GitHub] spark issue #14882: [SPARK-17316][Core] Make CoarseGrainedSchedulerBackend.r...

2016-09-02 Thread zsxwing
Github user zsxwing commented on the issue: https://github.com/apache/spark/pull/14882 I just cherry-picked this one into branch 1.6

[GitHub] spark issue #14854: [SPARK-17283][Core] Cancel job in RDD.take() as soon as ...

2016-09-02 Thread JoshRosen
Github user JoshRosen commented on the issue: https://github.com/apache/spark/pull/14854 I'm actually going to close this now and will revisit later; the scheduling complexity may not be warranted now given benefits of simpler approaches.

[GitHub] spark pull request #14854: [SPARK-17283][Core] Cancel job in RDD.take() as s...

2016-09-02 Thread JoshRosen
Github user JoshRosen closed the pull request at: https://github.com/apache/spark/pull/14854

[GitHub] spark issue #14938: [SPARK-17335][SQL] Fix ArrayType and MapType CatalogStri...

2016-09-02 Thread hvanhovell
Github user hvanhovell commented on the issue: https://github.com/apache/spark/pull/14938 I just compared the wide schema benchmark on master with this patch and there do not seem to be performance regressions.

[GitHub] spark issue #14866: [SPARK-17298][SQL] Require explicit CROSS join for carte...

2016-09-02 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/14866 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/64863/ Test PASSed.

[GitHub] spark issue #14866: [SPARK-17298][SQL] Require explicit CROSS join for carte...

2016-09-02 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/14866 Merged build finished. Test PASSed.

[GitHub] spark issue #14866: [SPARK-17298][SQL] Require explicit CROSS join for carte...

2016-09-02 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/14866 **[Test build #64863 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/64863/consoleFull)** for PR 14866 at commit

[GitHub] spark issue #14866: [SPARK-17298][SQL] Require explicit CROSS join for carte...

2016-09-02 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/14866 **[Test build #64869 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/64869/consoleFull)** for PR 14866 at commit

[GitHub] spark issue #14866: [SPARK-17298][SQL] Require explicit CROSS join for carte...

2016-09-02 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/14866 **[Test build #3245 has started](https://amplab.cs.berkeley.edu/jenkins/job/NewSparkPullRequestBuilder/3245/consoleFull)** for PR 14866 at commit

[GitHub] spark issue #14881: [SPARK-17315][SparkR] Kolmogorov-Smirnov test SparkR wra...

2016-09-02 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/14881 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/64865/ Test PASSed.

[GitHub] spark issue #14881: [SPARK-17315][SparkR] Kolmogorov-Smirnov test SparkR wra...

2016-09-02 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/14881 Merged build finished. Test PASSed.

[GitHub] spark issue #14881: [SPARK-17315][SparkR] Kolmogorov-Smirnov test SparkR wra...

2016-09-02 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/14881 **[Test build #64865 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/64865/consoleFull)** for PR 14881 at commit

[GitHub] spark issue #14854: [SPARK-17283][Core] Cancel job in RDD.take() as soon as ...

2016-09-02 Thread JoshRosen
Github user JoshRosen commented on the issue: https://github.com/apache/spark/pull/14854 @davies brought up a reasonable point that we might be able to achieve similar benefits with less complexity by replacing the exponential ramp-up with something that's linearly proportional to

[GitHub] spark issue #14866: [SPARK-17298][SQL] Require explicit CROSS join for carte...

2016-09-02 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/14866 Merged build finished. Test FAILed.

[GitHub] spark issue #14866: [SPARK-17298][SQL] Require explicit CROSS join for carte...

2016-09-02 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/14866 **[Test build #64868 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/64868/consoleFull)** for PR 14866 at commit

[GitHub] spark issue #14866: [SPARK-17298][SQL] Require explicit CROSS join for carte...

2016-09-02 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/14866 **[Test build #64868 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/64868/consoleFull)** for PR 14866 at commit

[GitHub] spark issue #14866: [SPARK-17298][SQL] Require explicit CROSS join for carte...

2016-09-02 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/14866 Test FAILed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/64868/ Test FAILed. ---

[GitHub] spark issue #14940: [SPARK-17383][GRAPHX]LabelPropagation

2016-09-02 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/14940 Can one of the admins verify this patch? --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this

[GitHub] spark issue #14866: [SPARK-17298][SQL] Require explicit CROSS join for carte...

2016-09-02 Thread srinathshankar
Github user srinathshankar commented on the issue: https://github.com/apache/spark/pull/14866 I'll update the Python and R APIs in a follow-up. Right now, in Python and R, a cross join is done if no join expressions/columns and join types are specified. It would be good to require explicit
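
To make the behavior discussed in this comment concrete, here is a minimal pure-Python sketch of a join that refuses to fall back silently to a cartesian product. It is only an illustration under stated assumptions: the function `join`, its parameters `on` and `how`, and the list-of-dicts data model are all hypothetical and are not the PySpark or SparkR API.

```python
from itertools import product


def join(left, right, on=None, how=None):
    """Toy join over lists of dicts (hypothetical; not the PySpark API).

    If no join column is given, a cartesian product is produced only
    when the caller explicitly asks for it with how="cross".
    """
    if on is None:
        if how != "cross":
            # Refuse to produce a cartesian product by accident.
            raise ValueError(
                "no join condition given; pass how='cross' "
                "to request a cartesian product explicitly"
            )
        return [{**l, **r} for l, r in product(left, right)]
    # Simple equi-join on a single shared column name.
    return [{**l, **r} for l, r in product(left, right) if l[on] == r[on]]
```

The design choice sketched here is that an omitted condition becomes an error rather than an expensive default, so the caller has to opt in to the cross join.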

[GitHub] spark pull request #14940: [SPARK-17383][GRAPHX]LabelPropagation

2016-09-02 Thread bookling
GitHub user bookling opened a pull request: https://github.com/apache/spark/pull/14940 [SPARK-17383][GRAPHX]LabelPropagation In the LabelPropagation implementation of the GraphX library, each node is initialized with a unique label, and at every step each node adopts the label that most of its neighbors
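
The label propagation step described in this pull request can be sketched in a few lines of plain Python. This is a rough illustration, not the GraphX implementation: the function name `label_propagation`, the `max_steps` parameter, the adjacency-dict input, and the rule of breaking ties by the smallest label are all assumptions made for the sketch.

```python
from collections import Counter


def label_propagation(adjacency, max_steps=10):
    """Synchronous label propagation on an undirected graph.

    adjacency maps each node to a list of its neighbors.
    Ties are broken by picking the smallest label, which is an
    assumption of this sketch.
    """
    # Every node starts with its own unique label.
    labels = {node: node for node in adjacency}
    for _ in range(max_steps):
        new_labels = {}
        for node, neighbors in adjacency.items():
            counts = Counter(labels[n] for n in neighbors)
            if not counts:
                # An isolated node keeps its current label.
                new_labels[node] = labels[node]
                continue
            best = max(counts.values())
            # Adopt the most frequent neighbor label.
            new_labels[node] = min(l for l, c in counts.items() if c == best)
        if new_labels == labels:
            break  # Labels have stopped changing.
        labels = new_labels
    return labels
```

Because all nodes update at once from the previous step's labels, the result can depend on how ties are broken, which is why the tie-breaking rule is stated explicitly above.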

[GitHub] spark issue #14931: [SPARK-17370] Shuffle service files not invalidated when...

2016-09-02 Thread ericl
Github user ericl commented on the issue: https://github.com/apache/spark/pull/14931 What if we added a flag to SlaveLost indicating if we think the entire host is lost? In many cases that should be true, if the event originated from worker loss or Mesos slave loss events. --- If

[GitHub] spark issue #14938: [SPARK-17335][SQL] Fix ArrayType and MapType CatalogStri...

2016-09-02 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/14938 **[Test build #64867 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/64867/consoleFull)** for PR 14938 at commit

[GitHub] spark issue #14931: [SPARK-17370] Shuffle service files not invalidated when...

2016-09-02 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/14931 Test FAILed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/64862/ Test FAILed. ---

[GitHub] spark issue #14931: [SPARK-17370] Shuffle service files not invalidated when...

2016-09-02 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/14931 Merged build finished. Test FAILed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature

[GitHub] spark issue #14931: [SPARK-17370] Shuffle service files not invalidated when...

2016-09-02 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/14931 **[Test build #64862 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/64862/consoleFull)** for PR 14931 at commit

[GitHub] spark pull request #14797: [SPARK-17230] [SQL] Should not pass optimized que...

2016-09-02 Thread davies
Github user davies commented on a diff in the pull request: https://github.com/apache/spark/pull/14797#discussion_r77394975 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/DataSource.scala --- @@ -479,13 +480,23 @@ case class DataSource(

[GitHub] spark issue #14939: [SPARK-17376][SPARKR] followup - change since version

2016-09-02 Thread shivaram
Github user shivaram commented on the issue: https://github.com/apache/spark/pull/14939 Ah thanks - I didn't notice this while merging the earlier PR. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does

[GitHub] spark issue #14854: [SPARK-17283][Core] Cancel job in RDD.take() as soon as ...

2016-09-02 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/14854 **[Test build #64866 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/64866/consoleFull)** for PR 14854 at commit

[GitHub] spark issue #14881: [SPARK-17315][SparkR] Kolmogorov-Smirnov test SparkR wra...

2016-09-02 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/14881 **[Test build #64865 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/64865/consoleFull)** for PR 14881 at commit

[GitHub] spark issue #14828: [SPARK-17258][SQL] Parse scientific decimal literals as ...

2016-09-02 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/14828 Merged build finished. Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature

[GitHub] spark issue #14828: [SPARK-17258][SQL] Parse scientific decimal literals as ...

2016-09-02 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/14828 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/64861/ Test PASSed. ---
