[GitHub] spark issue #17923: [SPARK-20591][WEB UI] Succeeded tasks num not equal in a...

2017-05-14 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/17923 **[Test build #3712 has started](https://amplab.cs.berkeley.edu/jenkins/job/NewSparkPullRequestBuilder/3712/testReport)** for PR 17923 at commit [`d80f140`](https://github.com/apache/spark/commit/d

[GitHub] spark pull request #17961: [SPARK-20720][WEB-UI]'Executor Summary' should sh...

2017-05-14 Thread asfgit
Github user asfgit closed the pull request at: https://github.com/apache/spark/pull/17961 --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is ena

[GitHub] spark pull request #17952: [SPARK-20705][WEB-UI]The sort function can not be...

2017-05-14 Thread asfgit
Github user asfgit closed the pull request at: https://github.com/apache/spark/pull/17952 --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is ena

[GitHub] spark issue #17961: [SPARK-20720][WEB-UI]'Executor Summary' should show the ...

2017-05-14 Thread srowen
Github user srowen commented on the issue: https://github.com/apache/spark/pull/17961 Merged to master --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or i

[GitHub] spark issue #17952: [SPARK-20705][WEB-UI]The sort function can not be used i...

2017-05-14 Thread srowen
Github user srowen commented on the issue: https://github.com/apache/spark/pull/17952 Merged to master/2.2/2.1 --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes

[GitHub] spark issue #17982: [SPARK-20395][BUILD] Update Scala to 2.11.11 and zinc to...

2017-05-14 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/17982 **[Test build #76933 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/76933/testReport)** for PR 17982 at commit [`2028dbd`](https://github.com/apache/spark/commit/20

[GitHub] spark pull request #17982: [SPARK-20395][BUILD] Update Scala to 2.11.11 and ...

2017-05-14 Thread srowen
GitHub user srowen opened a pull request: https://github.com/apache/spark/pull/17982 [SPARK-20395][BUILD] Update Scala to 2.11.11 and zinc to 0.3.15 ## What changes were proposed in this pull request? Update Scala to 2.11.11 and zinc to 0.3.15 ## How was this patch

[GitHub] spark issue #17308: [SPARK-19968][SS] Use a cached instance of `KafkaProduce...

2017-05-14 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/17308 **[Test build #76932 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/76932/testReport)** for PR 17308 at commit [`e07e77e`](https://github.com/apache/spark/commit/e

[GitHub] spark issue #17308: [SPARK-19968][SS] Use a cached instance of `KafkaProduce...

2017-05-14 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/17308 Merged build finished. Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature e

[GitHub] spark issue #17308: [SPARK-19968][SS] Use a cached instance of `KafkaProduce...

2017-05-14 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/17308 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/76932/ Test PASSed. ---

[GitHub] spark issue #17981: [SPARK-15767][ML][SparkR] Decision Tree wrapper in Spark...

2017-05-14 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/17981 Merged build finished. Test FAILed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature e

[GitHub] spark issue #17981: [SPARK-15767][ML][SparkR] Decision Tree wrapper in Spark...

2017-05-14 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/17981 Test FAILed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/76930/ Test FAILed. ---

[GitHub] spark issue #17981: [SPARK-15767][ML][SparkR] Decision Tree wrapper in Spark...

2017-05-14 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/17981 **[Test build #76930 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/76930/testReport)** for PR 17981 at commit [`7e383a2`](https://github.com/apache/spark/commit/7

[GitHub] spark issue #17308: [SPARK-19968][SS] Use a cached instance of `KafkaProduce...

2017-05-14 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/17308 **[Test build #76932 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/76932/testReport)** for PR 17308 at commit [`e07e77e`](https://github.com/apache/spark/commit/e0

[GitHub] spark issue #17308: [SPARK-19968][SS] Use a cached instance of `KafkaProduce...

2017-05-14 Thread ScrapCodes
Github user ScrapCodes commented on the issue: https://github.com/apache/spark/pull/17308 SPARK-20737 is created to look into cleanup mechanism in a separate JIRA. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your p

[GitHub] spark issue #16989: [SPARK-19659] Fetch big blocks to disk when shuffle-read...

2017-05-14 Thread jinxing64
Github user jinxing64 commented on the issue: https://github.com/apache/spark/pull/16989 Very gentle ping to @cloud-fan and @mridulm How do you think about the current change :) ? --- If your project is set up for it, you can reply to this email and have your reply appear on GitH

[GitHub] spark pull request #17644: [SPARK-17729] [SQL] Enable creating hive bucketed...

2017-05-14 Thread tejasapatil
Github user tejasapatil commented on a diff in the pull request: https://github.com/apache/spark/pull/17644#discussion_r116414803 --- Diff: sql/hive/src/main/scala/org/apache/spark/sql/hive/execution/InsertIntoHiveTable.scala --- @@ -307,6 +307,27 @@ case class InsertIntoHiveTable

[GitHub] spark pull request #17980: [SPARK-20728][SQL] Make ORCFileFormat configurabl...

2017-05-14 Thread dongjoon-hyun
Github user dongjoon-hyun commented on a diff in the pull request: https://github.com/apache/spark/pull/17980#discussion_r116414546 --- Diff: sql/core/src/test/scala/org/apache/spark/sql/sources/DDLSourceLoadSuite.scala --- @@ -55,10 +56,12 @@ class DDLSourceLoadSuite extends Data

[GitHub] spark issue #16989: [SPARK-19659] Fetch big blocks to disk when shuffle-read...

2017-05-14 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/16989 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/76925/ Test PASSed. ---

[GitHub] spark issue #16989: [SPARK-19659] Fetch big blocks to disk when shuffle-read...

2017-05-14 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/16989 Merged build finished. Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature e

[GitHub] spark issue #16989: [SPARK-19659] Fetch big blocks to disk when shuffle-read...

2017-05-14 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/16989 **[Test build #76925 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/76925/testReport)** for PR 16989 at commit [`80b3154`](https://github.com/apache/spark/commit/8

[GitHub] spark issue #17980: [SPARK-20728][SQL] Make ORCFileFormat configurable betwe...

2017-05-14 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/17980 **[Test build #76931 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/76931/testReport)** for PR 17980 at commit [`73d56f2`](https://github.com/apache/spark/commit/73

[GitHub] spark pull request #17644: [SPARK-17729] [SQL] Enable creating hive bucketed...

2017-05-14 Thread cloud-fan
Github user cloud-fan commented on a diff in the pull request: https://github.com/apache/spark/pull/17644#discussion_r116412797 --- Diff: sql/hive/src/main/scala/org/apache/spark/sql/hive/execution/InsertIntoHiveTable.scala --- @@ -307,6 +307,27 @@ case class InsertIntoHiveTable(

[GitHub] spark issue #17981: [SPARK-15767][ML][SparkR] Decision Tree wrapper in Spark...

2017-05-14 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/17981 **[Test build #76930 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/76930/testReport)** for PR 17981 at commit [`7e383a2`](https://github.com/apache/spark/commit/7e

[GitHub] spark issue #17981: [SPARK-15767][ML][SparkR] Decision Tree wrapper in Spark...

2017-05-14 Thread zhengruifeng
Github user zhengruifeng commented on the issue: https://github.com/apache/spark/pull/17981 Jenkins, please retest this please --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature ena

[GitHub] spark pull request #17978: [SPARK-20736][Python] PySpark StringIndexer suppo...

2017-05-14 Thread actuaryzhang
Github user actuaryzhang commented on a diff in the pull request: https://github.com/apache/spark/pull/17978#discussion_r116411579 --- Diff: python/pyspark/ml/feature.py --- @@ -2115,22 +2115,32 @@ class StringIndexer(JavaEstimator, HasInputCol, HasOutputCol, HasHandleInvalid,

[GitHub] spark issue #17978: [SPARK-20736][Python] PySpark StringIndexer supports Str...

2017-05-14 Thread actuaryzhang
Github user actuaryzhang commented on the issue: https://github.com/apache/spark/pull/17978 @viirya Thanks much for your review. I corrected the typo and added some tests. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well.

[GitHub] spark issue #17980: [SPARK-20728][SQL] Make ORCFileFormat configurable betwe...

2017-05-14 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/17980 Merged build finished. Test FAILed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature e

[GitHub] spark pull request #16598: [SPARK-19236][Core] Added createOrReplaceGlobalTe...

2017-05-14 Thread arman1371
Github user arman1371 commented on a diff in the pull request: https://github.com/apache/spark/pull/16598#discussion_r116410932 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/Dataset.scala --- @@ -2603,6 +2603,21 @@ class Dataset[T] private[sql]( def createGlobalTempV

[GitHub] spark issue #17980: [SPARK-20728][SQL] Make ORCFileFormat configurable betwe...

2017-05-14 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/17980 Test FAILed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/76924/ Test FAILed. ---

[GitHub] spark issue #17980: [SPARK-20728][SQL] Make ORCFileFormat configurable betwe...

2017-05-14 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/17980 **[Test build #76924 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/76924/testReport)** for PR 17980 at commit [`7716234`](https://github.com/apache/spark/commit/7

[GitHub] spark issue #17978: [SPARK-20736][Python] PySpark StringIndexer supports Str...

2017-05-14 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/17978 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/76929/ Test PASSed. ---

[GitHub] spark issue #17978: [SPARK-20736][Python] PySpark StringIndexer supports Str...

2017-05-14 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/17978 Merged build finished. Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature e

[GitHub] spark issue #17978: [SPARK-20736][Python] PySpark StringIndexer supports Str...

2017-05-14 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/17978 **[Test build #76929 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/76929/testReport)** for PR 17978 at commit [`f66a445`](https://github.com/apache/spark/commit/f

[GitHub] spark pull request #17973: [SPARK-20731][SQL] Add ability to change or omit ...

2017-05-14 Thread gatorsmile
Github user gatorsmile commented on a diff in the pull request: https://github.com/apache/spark/pull/17973#discussion_r116408851 --- Diff: sql/core/src/test/scala/org/apache/spark/sql/execution/datasources/csv/CSVSuite.scala --- @@ -622,6 +622,31 @@ class CSVSuite extends QueryTes

[GitHub] spark issue #17933: [SPARK-20588][SQL] Cache TimeZone instances.

2017-05-14 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/17933 Merged build finished. Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature e

[GitHub] spark issue #17981: [SPARK-15767][ML][SparkR] Decision Tree wrapper in Spark...

2017-05-14 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/17981 Merged build finished. Test FAILed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature e

[GitHub] spark pull request #17910: [SPARK-20669][ML] LogisticRegression family shoul...

2017-05-14 Thread sethah
Github user sethah commented on a diff in the pull request: https://github.com/apache/spark/pull/17910#discussion_r116408136 --- Diff: mllib/src/test/scala/org/apache/spark/ml/classification/LogisticRegressionSuite.scala --- @@ -2318,8 +2319,8 @@ class LogisticRegressionSuite

[GitHub] spark issue #17933: [SPARK-20588][SQL] Cache TimeZone instances.

2017-05-14 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/17933 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/76923/ Test PASSed. ---

[GitHub] spark issue #17981: [SPARK-15767][ML][SparkR] Decision Tree wrapper in Spark...

2017-05-14 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/17981 Test FAILed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/76927/ Test FAILed. ---

[GitHub] spark issue #17933: [SPARK-20588][SQL] Cache TimeZone instances.

2017-05-14 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/17933 **[Test build #76923 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/76923/testReport)** for PR 17933 at commit [`3cdbb3a`](https://github.com/apache/spark/commit/3

[GitHub] spark issue #17981: [SPARK-15767][ML][SparkR] Decision Tree wrapper in Spark...

2017-05-14 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/17981 **[Test build #76927 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/76927/testReport)** for PR 17981 at commit [`7e383a2`](https://github.com/apache/spark/commit/7

[GitHub] spark issue #17978: [SPARK-20736][Python] PySpark StringIndexer supports Str...

2017-05-14 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/17978 **[Test build #76929 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/76929/testReport)** for PR 17978 at commit [`f66a445`](https://github.com/apache/spark/commit/f6

[GitHub] spark issue #17848: [SPARK-20586] [SQL] Add deterministic and distinctLike t...

2017-05-14 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/17848 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/76922/ Test PASSed. ---

[GitHub] spark issue #17848: [SPARK-20586] [SQL] Add deterministic and distinctLike t...

2017-05-14 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/17848 Merged build finished. Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature e

[GitHub] spark issue #17848: [SPARK-20586] [SQL] Add deterministic and distinctLike t...

2017-05-14 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/17848 **[Test build #76922 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/76922/testReport)** for PR 17848 at commit [`d276b44`](https://github.com/apache/spark/commit/d

[GitHub] spark issue #17978: [SPARK-20736][Python] PySpark StringIndexer supports Str...

2017-05-14 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/17978 Test FAILed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/76928/ Test FAILed. ---

[GitHub] spark issue #17978: [SPARK-20736][Python] PySpark StringIndexer supports Str...

2017-05-14 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/17978 **[Test build #76928 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/76928/testReport)** for PR 17978 at commit [`44f0a36`](https://github.com/apache/spark/commit/4

[GitHub] spark issue #17978: [SPARK-20736][Python] PySpark StringIndexer supports Str...

2017-05-14 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/17978 Merged build finished. Test FAILed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature e

[GitHub] spark issue #17924: [SPARK-20682][SQL] Support a new faster ORC data source ...

2017-05-14 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/17924 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/76920/ Test PASSed. ---

[GitHub] spark issue #17978: [SPARK-20736][Python] PySpark StringIndexer supports Str...

2017-05-14 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/17978 **[Test build #76928 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/76928/testReport)** for PR 17978 at commit [`44f0a36`](https://github.com/apache/spark/commit/44

[GitHub] spark issue #17924: [SPARK-20682][SQL] Support a new faster ORC data source ...

2017-05-14 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/17924 Merged build finished. Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature e

[GitHub] spark issue #17924: [SPARK-20682][SQL] Support a new faster ORC data source ...

2017-05-14 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/17924 **[Test build #76920 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/76920/testReport)** for PR 17924 at commit [`85ef731`](https://github.com/apache/spark/commit/8

[GitHub] spark issue #17981: [SPARK-15767][ML][SparkR] Decision Tree wrapper in Spark...

2017-05-14 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/17981 **[Test build #76927 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/76927/testReport)** for PR 17981 at commit [`7e383a2`](https://github.com/apache/spark/commit/7e

[GitHub] spark issue #17933: [SPARK-20588][SQL] Cache TimeZone instances.

2017-05-14 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/17933 Merged build finished. Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature e

[GitHub] spark issue #17933: [SPARK-20588][SQL] Cache TimeZone instances.

2017-05-14 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/17933 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/76921/ Test PASSed. ---

[GitHub] spark issue #17933: [SPARK-20588][SQL] Cache TimeZone instances.

2017-05-14 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/17933 **[Test build #76921 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/76921/testReport)** for PR 17933 at commit [`7935a1a`](https://github.com/apache/spark/commit/7

[GitHub] spark issue #17981: [SPARK-15767][ML][SparkR] Decision Tree wrapper in Spark...

2017-05-14 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/17981 **[Test build #76926 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/76926/testReport)** for PR 17981 at commit [`68041a0`](https://github.com/apache/spark/commit/6

[GitHub] spark issue #17981: [SPARK-15767][ML][SparkR] Decision Tree wrapper in Spark...

2017-05-14 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/17981 Test FAILed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/76926/ Test FAILed. ---

[GitHub] spark issue #17981: [SPARK-15767][ML][SparkR] Decision Tree wrapper in Spark...

2017-05-14 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/17981 Merged build finished. Test FAILed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature e

[GitHub] spark issue #17981: [SPARK-15767][ML][SparkR] Decision Tree wrapper in Spark...

2017-05-14 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/17981 **[Test build #76926 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/76926/testReport)** for PR 17981 at commit [`68041a0`](https://github.com/apache/spark/commit/68

[GitHub] spark pull request #17981: [SPARK-15767][ML][SparkR] Decision Tree wrapper i...

2017-05-14 Thread zhengruifeng
GitHub user zhengruifeng opened a pull request: https://github.com/apache/spark/pull/17981 [SPARK-15767][ML][SparkR] Decision Tree wrapper in SparkR ## What changes were proposed in this pull request? support decision tree in R ## How was this patch tested? added tes

[GitHub] spark issue #16989: [SPARK-19659] Fetch big blocks to disk when shuffle-read...

2017-05-14 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/16989 **[Test build #76925 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/76925/testReport)** for PR 16989 at commit [`80b3154`](https://github.com/apache/spark/commit/80

[GitHub] spark issue #17910: [SPARK-20669][ML] LogisticRegression family should be ca...

2017-05-14 Thread zhengruifeng
Github user zhengruifeng commented on the issue: https://github.com/apache/spark/pull/17910 Ping @yanboliang --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes s

[GitHub] spark issue #17980: [SPARK-20728][SQL] Make ORCFileFormat configurable betwe...

2017-05-14 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/17980 **[Test build #76924 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/76924/testReport)** for PR 17980 at commit [`7716234`](https://github.com/apache/spark/commit/77

[GitHub] spark pull request #17980: [SPARK-20728][SQL] Make ORCFileFormat configurabl...

2017-05-14 Thread dongjoon-hyun
GitHub user dongjoon-hyun opened a pull request: https://github.com/apache/spark/pull/17980 [SPARK-20728][SQL] Make ORCFileFormat configurable between sql/hive and sql/core ## What changes were proposed in this pull request? [SPARK-20682](https://issues.apache.org/jira/brow

[GitHub] spark pull request #17978: [SPARK-20736][Python] PySpark StringIndexer suppo...

2017-05-14 Thread viirya
Github user viirya commented on a diff in the pull request: https://github.com/apache/spark/pull/17978#discussion_r116400199 --- Diff: python/pyspark/ml/feature.py --- @@ -2115,22 +2115,32 @@ class StringIndexer(JavaEstimator, HasInputCol, HasOutputCol, HasHandleInvalid, .

[GitHub] spark issue #17978: [SPARK-20736][Python] PySpark StringIndexer supports Str...

2017-05-14 Thread viirya
Github user viirya commented on the issue: https://github.com/apache/spark/pull/17978 Code changes looks good. But we need to add test for this. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not hav

[GitHub] spark issue #17848: [SPARK-20586] [SQL] Add deterministic and distinctLike t...

2017-05-14 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/17848 Test FAILed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/76919/ Test FAILed. ---

[GitHub] spark issue #17848: [SPARK-20586] [SQL] Add deterministic and distinctLike t...

2017-05-14 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/17848 Merged build finished. Test FAILed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature e

[GitHub] spark issue #17848: [SPARK-20586] [SQL] Add deterministic and distinctLike t...

2017-05-14 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/17848 **[Test build #76919 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/76919/testReport)** for PR 17848 at commit [`387af4b`](https://github.com/apache/spark/commit/3

[GitHub] spark issue #17848: [SPARK-20586] [SQL] Add deterministic and distinctLike t...

2017-05-14 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/17848 Merged build finished. Test FAILed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature e

[GitHub] spark issue #17848: [SPARK-20586] [SQL] Add deterministic and distinctLike t...

2017-05-14 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/17848 Test FAILed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/76918/ Test FAILed. ---

[GitHub] spark issue #17848: [SPARK-20586] [SQL] Add deterministic and distinctLike t...

2017-05-14 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/17848 **[Test build #76918 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/76918/testReport)** for PR 17848 at commit [`c496b62`](https://github.com/apache/spark/commit/c

[GitHub] spark issue #17933: [SPARK-20588][SQL] Cache TimeZone instances.

2017-05-14 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/17933 **[Test build #76923 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/76923/testReport)** for PR 17933 at commit [`3cdbb3a`](https://github.com/apache/spark/commit/3c

[GitHub] spark issue #17933: [SPARK-20588][SQL] Cache TimeZone instances.

2017-05-14 Thread viirya
Github user viirya commented on the issue: https://github.com/apache/spark/pull/17933 LGTM --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the featur

[GitHub] spark pull request #17933: [SPARK-20588][SQL] Cache TimeZone instances.

2017-05-14 Thread ueshin
Github user ueshin commented on a diff in the pull request: https://github.com/apache/spark/pull/17933#discussion_r116398935 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/util/DateTimeUtils.scala --- @@ -20,9 +20,12 @@ package org.apache.spark.sql.catalyst.ut

[GitHub] spark pull request #17933: [SPARK-20588][SQL] Cache TimeZone instances.

2017-05-14 Thread ueshin
Github user ueshin commented on a diff in the pull request: https://github.com/apache/spark/pull/17933#discussion_r116398915 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/util/DateTimeUtils.scala --- @@ -98,6 +101,15 @@ object DateTimeUtils { sdf

[GitHub] spark issue #17941: [SPARK-20684][R] Expose createGlobalTempView and dropGlo...

2017-05-14 Thread falaki
Github user falaki commented on the issue: https://github.com/apache/spark/pull/17941 @felixcheung we all know that SparkR (and in general R) API is not perfect when it comes to ETLing unstructured data. For example we don't have a great story for nested data, etc. To overcome these l

[GitHub] spark issue #17848: [SPARK-20586] [SQL] Add deterministic and distinctLike t...

2017-05-14 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/17848 **[Test build #76922 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/76922/testReport)** for PR 17848 at commit [`d276b44`](https://github.com/apache/spark/commit/d2

[GitHub] spark pull request #17933: [SPARK-20588][SQL] Cache TimeZone instances.

2017-05-14 Thread viirya
Github user viirya commented on a diff in the pull request: https://github.com/apache/spark/pull/17933#discussion_r116398563 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/util/DateTimeUtils.scala --- @@ -20,9 +20,12 @@ package org.apache.spark.sql.catalyst.ut

[GitHub] spark pull request #17933: [SPARK-20588][SQL] Cache TimeZone instances.

2017-05-14 Thread viirya
Github user viirya commented on a diff in the pull request: https://github.com/apache/spark/pull/17933#discussion_r116398454 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/util/DateTimeUtils.scala --- @@ -98,6 +101,15 @@ object DateTimeUtils { sdf

[GitHub] spark issue #17936: [SPARK-20638][Core]Optimize the CartesianRDD to reduce r...

2017-05-14 Thread ConeyLiu
Github user ConeyLiu commented on the issue: https://github.com/apache/spark/pull/17936 Yeah, I can test it. You see, the `ALS` is an pratical use case. So, choose it as a test case more convincing. And I also want to see the improvement of this `pr` even after merged #17742. --- I

[GitHub] spark pull request #17933: [SPARK-20588][SQL] Cache TimeZone instances.

2017-05-14 Thread viirya
Github user viirya commented on a diff in the pull request: https://github.com/apache/spark/pull/17933#discussion_r116398184 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/util/DateTimeUtils.scala --- @@ -98,6 +99,14 @@ object DateTimeUtils { sdf

[GitHub] spark issue #17858: [SPARK-20594][SQL]The staging directory should be a chil...

2017-05-14 Thread zuotingbing
Github user zuotingbing commented on the issue: https://github.com/apache/spark/pull/17858 Thank you all. Delete the branch. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabl

[GitHub] spark issue #17979: [SPARK-19320][MESOS][WIP]allow specifying a hard limit o...

2017-05-14 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/17979 Can one of the admins verify this patch? --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feat

[GitHub] spark pull request #17979: [SPARK-19320][MESOS][WIP]allow specifying a hard ...

2017-05-14 Thread yanji84
GitHub user yanji84 opened a pull request: https://github.com/apache/spark/pull/17979 [SPARK-19320][MESOS][WIP]allow specifying a hard limit on number of gpus required in each spark executor when running on mesos ## What changes were proposed in this pull request? Currently

[GitHub] spark issue #17936: [SPARK-20638][Core]Optimize the CartesianRDD to reduce r...

2017-05-14 Thread jtengyp
Github user jtengyp commented on the issue: https://github.com/apache/spark/pull/17936 I think you@ConeyLiu should directly test the Cartesian phase with the following patch. val user = model.userFeatures val item = model.productFeatures val start = System.nanoTime()

[GitHub] spark pull request #17898: [SPARK-20638][Core]Optimize the CartesianRDD to r...

2017-05-14 Thread jtengyp
Github user jtengyp closed the pull request at: https://github.com/apache/spark/pull/17898 --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is en

[GitHub] spark issue #17933: [SPARK-20588][SQL] Cache TimeZone instances.

2017-05-14 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/17933 **[Test build #76921 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/76921/testReport)** for PR 17933 at commit [`7935a1a`](https://github.com/apache/spark/commit/79

[GitHub] spark pull request #17933: [SPARK-20588][SQL] Cache TimeZone instances per t...

2017-05-14 Thread ueshin
Github user ueshin commented on a diff in the pull request: https://github.com/apache/spark/pull/17933#discussion_r116396203 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/util/DateTimeUtils.scala --- @@ -98,6 +99,14 @@ object DateTimeUtils { sdf

[GitHub] spark issue #17924: [SPARK-20682][SQL] Support a new faster ORC data source ...

2017-05-14 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/17924 **[Test build #76920 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/76920/testReport)** for PR 17924 at commit [`85ef731`](https://github.com/apache/spark/commit/85

[GitHub] spark issue #17924: [SPARK-20682][SQL] Support a new faster ORC data source ...

2017-05-14 Thread dongjoon-hyun
Github user dongjoon-hyun commented on the issue: https://github.com/apache/spark/pull/17924 Retest this please. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishe

[GitHub] spark issue #17978: [SPARK-20736][Python] PySpark StringIndexer supports Str...

2017-05-14 Thread HyukjinKwon
Github user HyukjinKwon commented on the issue: https://github.com/apache/spark/pull/17978 (I am not used to ML. I just left a trivial comment for Python.) --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project d

[GitHub] spark pull request #17978: [SPARK-20736][Python] PySpark StringIndexer suppo...

2017-05-14 Thread HyukjinKwon
Github user HyukjinKwon commented on a diff in the pull request: https://github.com/apache/spark/pull/17978#discussion_r116395911 --- Diff: python/pyspark/ml/feature.py --- @@ -2115,22 +2115,32 @@ class StringIndexer(JavaEstimator, HasInputCol, HasOutputCol, HasHandleInvalid,

[GitHub] spark pull request #17978: [SPARK-20736][Python] PySpark StringIndexer suppo...

2017-05-14 Thread HyukjinKwon
Github user HyukjinKwon commented on a diff in the pull request: https://github.com/apache/spark/pull/17978#discussion_r116395876 --- Diff: python/pyspark/ml/feature.py --- @@ -2115,22 +2115,32 @@ class StringIndexer(JavaEstimator, HasInputCol, HasOutputCol, HasHandleInvalid,

[GitHub] spark issue #17848: [SPARK-20586] [SQL] Add deterministic and distinctLike t...

2017-05-14 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/17848 **[Test build #76919 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/76919/testReport)** for PR 17848 at commit [`387af4b`](https://github.com/apache/spark/commit/38

[GitHub] spark issue #17848: [SPARK-20586] [SQL] Add deterministic and distinctLike t...

2017-05-14 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/17848 Merged build finished. Test FAILed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature e

[GitHub] spark issue #17848: [SPARK-20586] [SQL] Add deterministic and distinctLike t...

2017-05-14 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/17848 Test FAILed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/76915/ Test FAILed. ---

[GitHub] spark issue #17848: [SPARK-20586] [SQL] Add deterministic and distinctLike t...

2017-05-14 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/17848 **[Test build #76915 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/76915/testReport)** for PR 17848 at commit [`00b4dff`](https://github.com/apache/spark/commit/0

  1   2   >