[jira] [Updated] (SPARK-22814) JDBC support date/timestamp type as partitionColumn

2017-12-15 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22814?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li updated SPARK-22814: Component/s: (was: Input/Output) SQL > JDBC support date/timestamp type as

[jira] [Updated] (SPARK-22814) JDBC support date/timestamp type as partitionColumn

2017-12-15 Thread Yuechen Chen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22814?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuechen Chen updated SPARK-22814: - Docs Text: (was: https://github.com/apache/spark/pull/1)

[jira] [Issue Comment Deleted] (SPARK-22814) JDBC support date/timestamp type as partitionColumn

2017-12-15 Thread Yuechen Chen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22814?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuechen Chen updated SPARK-22814: - Comment: was deleted (was: https://github.com/apache/spark/pull/1) > JDBC support

[jira] [Updated] (SPARK-22814) JDBC support date/timestamp type as partitionColumn

2017-12-15 Thread Yuechen Chen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22814?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuechen Chen updated SPARK-22814: - Docs Text: https://github.com/apache/spark/pull/1 External issue URL: (was:

[jira] [Updated] (SPARK-22814) JDBC support date/timestamp type as partitionColumn

2017-12-15 Thread Yuechen Chen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22814?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuechen Chen updated SPARK-22814: - External issue URL: https://github.com/apache/spark/pull/1 > JDBC support date/timestamp

[jira] [Commented] (SPARK-22814) JDBC support date/timestamp type as partitionColumn

2017-12-15 Thread Yuechen Chen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22814?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16293680#comment-16293680 ] Yuechen Chen commented on SPARK-22814: -- https://github.com/apache/spark/pull/1 > JDBC support

[jira] [Created] (SPARK-22814) JDBC support date/timestamp type as partitionColumn

2017-12-15 Thread Yuechen Chen (JIRA)
Yuechen Chen created SPARK-22814: Summary: JDBC support date/timestamp type as partitionColumn Key: SPARK-22814 URL: https://issues.apache.org/jira/browse/SPARK-22814 Project: Spark Issue

[jira] [Reopened] (SPARK-22496) beeline display operation log

2017-12-15 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22496?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon reopened SPARK-22496: -- It was reverted in

[jira] [Commented] (SPARK-22812) Failing cran-check on master

2017-12-15 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22812?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16293648#comment-16293648 ] Felix Cheung commented on SPARK-22812: -- Not exactly... what’s the environment? Seems like something

[jira] [Commented] (SPARK-22796) Add multiple column support to PySpark QuantileDiscretizer

2017-12-15 Thread Huaxin Gao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22796?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16293619#comment-16293619 ] Huaxin Gao commented on SPARK-22796: I will work on this. > Add multiple column support to PySpark

[jira] [Assigned] (SPARK-22813) run-tests.py fails when /usr/sbin/lsof does not exist

2017-12-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22813?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22813: Assignee: Apache Spark > run-tests.py fails when /usr/sbin/lsof does not exist >

[jira] [Commented] (SPARK-22813) run-tests.py fails when /usr/sbin/lsof does not exist

2017-12-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22813?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16293617#comment-16293617 ] Apache Spark commented on SPARK-22813: -- User 'kiszk' has created a pull request for this issue:

[jira] [Assigned] (SPARK-22813) run-tests.py fails when /usr/sbin/lsof does not exist

2017-12-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22813?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22813: Assignee: (was: Apache Spark) > run-tests.py fails when /usr/sbin/lsof does not exist

[jira] [Commented] (SPARK-22377) Maven nightly snapshot jenkins jobs are broken on multiple workers due to lsof

2017-12-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22377?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16293589#comment-16293589 ] Apache Spark commented on SPARK-22377: -- User 'kiszk' has created a pull request for this issue:

[jira] [Assigned] (SPARK-22811) pyspark.ml.tests is missing a py4j import.

2017-12-15 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22811?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon reassigned SPARK-22811: Assignee: Bago Amirbekian > pyspark.ml.tests is missing a py4j import. >

[jira] [Commented] (SPARK-22813) run-tests.py fails when /usr/sbin/lsof does not exist

2017-12-15 Thread Kazuaki Ishizaki (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22813?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16293570#comment-16293570 ] Kazuaki Ishizaki commented on SPARK-22813: -- Thank you for pointing out the PR that we worked on. I was

[jira] [Updated] (SPARK-22813) run-tests.py fails when /usr/sbin/lsof does not exist

2017-12-15 Thread Kazuaki Ishizaki (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22813?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kazuaki Ishizaki updated SPARK-22813: - Description: Running ./dev/run-tests.py for mvn on OS that does not have /usr/sbin/lsof

[jira] [Commented] (SPARK-22813) run-tests.py fails when /usr/sbin/lsof does not exist

2017-12-15 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22813?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16293564#comment-16293564 ] Sean Owen commented on SPARK-22813: --- https://issues.apache.org/jira/browse/SPARK-22377 > run-tests.py

[jira] [Created] (SPARK-22813) run-tests.py fails when /usr/sbin/lsof does not exist

2017-12-15 Thread Kazuaki Ishizaki (JIRA)
Kazuaki Ishizaki created SPARK-22813: Summary: run-tests.py fails when /usr/sbin/lsof does not exist Key: SPARK-22813 URL: https://issues.apache.org/jira/browse/SPARK-22813 Project: Spark

[jira] [Resolved] (SPARK-22811) pyspark.ml.tests is missing a py4j import.

2017-12-15 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22811?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-22811. -- Resolution: Fixed Fix Version/s: 2.3.0 Issue resolved by pull request 19997

[jira] [Created] (SPARK-22812) Failing cran-check on master

2017-12-15 Thread Hossein Falaki (JIRA)
Hossein Falaki created SPARK-22812: -- Summary: Failing cran-check on master Key: SPARK-22812 URL: https://issues.apache.org/jira/browse/SPARK-22812 Project: Spark Issue Type: Bug

[jira] [Updated] (SPARK-22811) pyspark.ml.tests is missing a py4j import.

2017-12-15 Thread Bago Amirbekian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22811?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Bago Amirbekian updated SPARK-22811: Priority: Minor (was: Major) > pyspark.ml.tests is missing a py4j import. >

[jira] [Assigned] (SPARK-22810) PySpark supports LinearRegression with huber loss

2017-12-15 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22810?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang reassigned SPARK-22810: --- Assignee: Yanbo Liang > PySpark supports LinearRegression with huber loss >

[jira] [Commented] (SPARK-22805) Use aliases for StorageLevel in event logs

2017-12-15 Thread Sergei Lebedev (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22805?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16293369#comment-16293369 ] Sergei Lebedev commented on SPARK-22805: After some investigation, it turns out that the majority

[jira] [Assigned] (SPARK-22811) pyspark.ml.tests is missing a py4j import.

2017-12-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22811?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22811: Assignee: Apache Spark > pyspark.ml.tests is missing a py4j import. >

[jira] [Commented] (SPARK-22811) pyspark.ml.tests is missing a py4j import.

2017-12-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22811?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16293364#comment-16293364 ] Apache Spark commented on SPARK-22811: -- User 'MrBago' has created a pull request for this issue:

[jira] [Assigned] (SPARK-22811) pyspark.ml.tests is missing a py4j import.

2017-12-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22811?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22811: Assignee: (was: Apache Spark) > pyspark.ml.tests is missing a py4j import. >

[jira] [Created] (SPARK-22811) pyspark.ml.tests is missing a py4j import.

2017-12-15 Thread Bago Amirbekian (JIRA)
Bago Amirbekian created SPARK-22811: --- Summary: pyspark.ml.tests is missing a py4j import. Key: SPARK-22811 URL: https://issues.apache.org/jira/browse/SPARK-22811 Project: Spark Issue Type:

[jira] [Commented] (SPARK-19804) HiveClientImpl does not work with Hive 2.2.0 metastore

2017-12-15 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19804?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16293307#comment-16293307 ] Marcelo Vanzin commented on SPARK-19804: Spark still doesn't have explicit support for Hive 2.2.

[jira] [Commented] (SPARK-19804) HiveClientImpl does not work with Hive 2.2.0 metastore

2017-12-15 Thread Zhongting Hu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19804?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16293302#comment-16293302 ] Zhongting Hu commented on SPARK-19804: -- [~smilegator] , I've made some comments on above PR, so for

[jira] [Comment Edited] (SPARK-22805) Use aliases for StorageLevel in event logs

2017-12-15 Thread Sergei Lebedev (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22805?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16292758#comment-16292758 ] Sergei Lebedev edited comment on SPARK-22805 at 12/15/17 9:44 PM: -- I

[jira] [Comment Edited] (SPARK-22805) Use aliases for StorageLevel in event logs

2017-12-15 Thread Sergei Lebedev (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22805?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16292758#comment-16292758 ] Sergei Lebedev edited comment on SPARK-22805 at 12/15/17 9:43 PM: -- I

[jira] [Commented] (SPARK-22809) pyspark is sensitive to imports with dots

2017-12-15 Thread Cricket Temple (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22809?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16293209#comment-16293209 ] Cricket Temple commented on SPARK-22809: Outputs: When I run it, it plots a picture and prints

[jira] [Assigned] (SPARK-22807) Change configuration options to use "container" instead of "docker"

2017-12-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22807?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22807: Assignee: Apache Spark > Change configuration options to use "container" instead of

[jira] [Commented] (SPARK-22807) Change configuration options to use "container" instead of "docker"

2017-12-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22807?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16293201#comment-16293201 ] Apache Spark commented on SPARK-22807: -- User 'foxish' has created a pull request for this issue:

[jira] [Assigned] (SPARK-22807) Change configuration options to use "container" instead of "docker"

2017-12-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22807?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22807: Assignee: (was: Apache Spark) > Change configuration options to use "container"

[jira] [Assigned] (SPARK-22810) PySpark supports LinearRegression with huber loss

2017-12-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22810?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22810: Assignee: (was: Apache Spark) > PySpark supports LinearRegression with huber loss >

[jira] [Assigned] (SPARK-22810) PySpark supports LinearRegression with huber loss

2017-12-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22810?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22810: Assignee: Apache Spark > PySpark supports LinearRegression with huber loss >

[jira] [Commented] (SPARK-22810) PySpark supports LinearRegression with huber loss

2017-12-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22810?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16293133#comment-16293133 ] Apache Spark commented on SPARK-22810: -- User 'yanboliang' has created a pull request for this issue:

[jira] [Created] (SPARK-22810) PySpark supports LinearRegression with huber loss

2017-12-15 Thread Yanbo Liang (JIRA)
Yanbo Liang created SPARK-22810: --- Summary: PySpark supports LinearRegression with huber loss Key: SPARK-22810 URL: https://issues.apache.org/jira/browse/SPARK-22810 Project: Spark Issue Type:

[jira] [Updated] (SPARK-22809) pyspark is sensitive to imports with dots

2017-12-15 Thread Cricket Temple (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22809?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cricket Temple updated SPARK-22809: --- Description: User code can fail with dotted imports. Here's a repro script. {noformat}

[jira] [Commented] (SPARK-22809) pyspark is sensitive to imports with dots

2017-12-15 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22809?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16293103#comment-16293103 ] Sean Owen commented on SPARK-22809: --- It's not clear what the problem is. What does this output? Spark

[jira] [Created] (SPARK-22809) pyspark is sensitive to imports with dots

2017-12-15 Thread Cricket Temple (JIRA)
Cricket Temple created SPARK-22809: -- Summary: pyspark is sensitive to imports with dots Key: SPARK-22809 URL: https://issues.apache.org/jira/browse/SPARK-22809 Project: Spark Issue Type:

[jira] [Created] (SPARK-22808) saveAsTable() should be marked as deprecated

2017-12-15 Thread Jason Vaccaro (JIRA)
Jason Vaccaro created SPARK-22808: - Summary: saveAsTable() should be marked as deprecated Key: SPARK-22808 URL: https://issues.apache.org/jira/browse/SPARK-22808 Project: Spark Issue Type:

[jira] [Commented] (SPARK-22683) DynamicAllocation wastes resources by allocating containers that will barely be used

2017-12-15 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22683?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16293012#comment-16293012 ] Sean Owen commented on SPARK-22683: --- I don't think the pros/cons have changed here. I don't doubt there

[jira] [Updated] (SPARK-22807) Change configuration options to use "container" instead of "docker"

2017-12-15 Thread Anirudh Ramanathan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22807?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Anirudh Ramanathan updated SPARK-22807: --- Summary: Change configuration options to use "container" instead of "docker" (was:

[jira] [Created] (SPARK-22807) Change commandline options to use "container" instead of "docker"

2017-12-15 Thread Anirudh Ramanathan (JIRA)
Anirudh Ramanathan created SPARK-22807: -- Summary: Change commandline options to use "container" instead of "docker" Key: SPARK-22807 URL: https://issues.apache.org/jira/browse/SPARK-22807

[jira] [Comment Edited] (SPARK-22806) Window Aggregate functions: unexpected result at ordered partition

2017-12-15 Thread Attila Zsolt Piros (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22806?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16292995#comment-16292995 ] Attila Zsolt Piros edited comment on SPARK-22806 at 12/15/17 6:38 PM:

[jira] [Updated] (SPARK-22806) Window Aggregate functions: unexpected result at ordered partition

2017-12-15 Thread Attila Zsolt Piros (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22806?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Attila Zsolt Piros updated SPARK-22806: --- Attachment: WindowFunctionsWithGroupByError.scala Test to reproduce the error >

[jira] [Updated] (SPARK-22806) Window Aggregate functions: unexpected result at ordered partition

2017-12-15 Thread Attila Zsolt Piros (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22806?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Attila Zsolt Piros updated SPARK-22806: --- Description: I got different results for aggregate functions (even for sum and

[jira] [Updated] (SPARK-22806) Window Aggregate functions: unexpected result at ordered partition

2017-12-15 Thread Attila Zsolt Piros (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22806?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Attila Zsolt Piros updated SPARK-22806: --- Description: I got different results for aggregate functions (even for sum and

[jira] [Updated] (SPARK-22806) Window Aggregate functions: unexpected result at ordered partition

2017-12-15 Thread Attila Zsolt Piros (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22806?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Attila Zsolt Piros updated SPARK-22806: --- Description: I got different results for aggregate functions (even for sum and

[jira] [Updated] (SPARK-22806) Window Aggregate functions: unexpected result at ordered partition

2017-12-15 Thread Attila Zsolt Piros (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22806?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Attila Zsolt Piros updated SPARK-22806: --- Description: I got different results for aggregate functions (even for sum and

[jira] [Created] (SPARK-22806) Window Aggregate functions: unexpected result at ordered partition

2017-12-15 Thread Attila Zsolt Piros (JIRA)
Attila Zsolt Piros created SPARK-22806: -- Summary: Window Aggregate functions: unexpected result at ordered partition Key: SPARK-22806 URL: https://issues.apache.org/jira/browse/SPARK-22806

[jira] [Updated] (SPARK-22806) Window Aggregate functions: unexpected result at ordered partition

2017-12-15 Thread Attila Zsolt Piros (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22806?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Attila Zsolt Piros updated SPARK-22806: --- Description: I got different results for aggregate functions (even for sum and

[jira] [Assigned] (SPARK-22799) Bucketizer should throw exception if single- and multi-column params are both set

2017-12-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22799?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22799: Assignee: Apache Spark > Bucketizer should throw exception if single- and multi-column

[jira] [Assigned] (SPARK-22799) Bucketizer should throw exception if single- and multi-column params are both set

2017-12-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22799?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22799: Assignee: (was: Apache Spark) > Bucketizer should throw exception if single- and

[jira] [Commented] (SPARK-22799) Bucketizer should throw exception if single- and multi-column params are both set

2017-12-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22799?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16292972#comment-16292972 ] Apache Spark commented on SPARK-22799: -- User 'mgaido91' has created a pull request for this issue:

[jira] [Resolved] (SPARK-22800) Add a SSB query suite

2017-12-15 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22800?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li resolved SPARK-22800. - Resolution: Fixed Assignee: Takeshi Yamamuro > Add a SSB query suite > - > >

[jira] [Updated] (SPARK-22800) Add a SSB query suite

2017-12-15 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22800?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li updated SPARK-22800: Fix Version/s: 2.3.0 > Add a SSB query suite > - > > Key: SPARK-22800

[jira] [Commented] (SPARK-22362) Add unit test for Window Aggregate Functions

2017-12-15 Thread Attila Zsolt Piros (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22362?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16292902#comment-16292902 ] Attila Zsolt Piros commented on SPARK-22362: I am working on this subtask. > Add unit test

[jira] [Assigned] (SPARK-22805) Use aliases for StorageLevel in event logs

2017-12-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22805?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22805: Assignee: (was: Apache Spark) > Use aliases for StorageLevel in event logs >

[jira] [Commented] (SPARK-22805) Use aliases for StorageLevel in event logs

2017-12-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22805?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16292876#comment-16292876 ] Apache Spark commented on SPARK-22805: -- User 'superbobry' has created a pull request for this issue:

[jira] [Assigned] (SPARK-22805) Use aliases for StorageLevel in event logs

2017-12-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22805?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22805: Assignee: Apache Spark > Use aliases for StorageLevel in event logs >

[jira] [Commented] (SPARK-22683) DynamicAllocation wastes resources by allocating containers that will barely be used

2017-12-15 Thread Thomas Graves (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22683?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16292855#comment-16292855 ] Thomas Graves commented on SPARK-22683: --- Thanks for the clarification, a few of those I misread and

[jira] [Comment Edited] (SPARK-22805) Use aliases for StorageLevel in event logs

2017-12-15 Thread Sergei Lebedev (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22805?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16292848#comment-16292848 ] Sergei Lebedev edited comment on SPARK-22805 at 12/15/17 5:25 PM: --

[jira] [Commented] (SPARK-22805) Use aliases for StorageLevel in event logs

2017-12-15 Thread Sergei Lebedev (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22805?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16292848#comment-16292848 ] Sergei Lebedev commented on SPARK-22805: Here're results for a single application with 6K

[jira] [Commented] (SPARK-22805) Use aliases for StorageLevel in event logs

2017-12-15 Thread Sergei Lebedev (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22805?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16292758#comment-16292758 ] Sergei Lebedev commented on SPARK-22805: I have a patch which preserves backward compatibility.

[jira] [Comment Edited] (SPARK-22805) Use aliases for StorageLevel in event logs

2017-12-15 Thread Sergei Lebedev (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22805?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16292758#comment-16292758 ] Sergei Lebedev edited comment on SPARK-22805 at 12/15/17 4:28 PM: -- I

[jira] [Commented] (SPARK-22805) Use aliases for StorageLevel in event logs

2017-12-15 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22805?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16292742#comment-16292742 ] Sean Owen commented on SPARK-22805: --- The current format is somewhat more flexible but yes it's verbose.

[jira] [Created] (SPARK-22805) Use aliases for StorageLevel in event logs

2017-12-15 Thread Sergei Lebedev (JIRA)
Sergei Lebedev created SPARK-22805: -- Summary: Use aliases for StorageLevel in event logs Key: SPARK-22805 URL: https://issues.apache.org/jira/browse/SPARK-22805 Project: Spark Issue Type:

[jira] [Updated] (SPARK-22804) Using a window function inside of an aggregation causes StackOverflowError

2017-12-15 Thread Sandor Murakozi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22804?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sandor Murakozi updated SPARK-22804: Priority: Minor (was: Major) > Using a window function inside of an aggregation causes

[jira] [Commented] (SPARK-22804) Using a window function inside of an aggregation causes StackOverflowError

2017-12-15 Thread Sandor Murakozi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22804?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16292732#comment-16292732 ] Sandor Murakozi commented on SPARK-22804: - Indeed, it looks pretty similar. I will check if it's

[jira] [Updated] (SPARK-22804) Using a window function inside of an aggregation causes StackOverflowError

2017-12-15 Thread Sandor Murakozi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22804?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sandor Murakozi updated SPARK-22804: Description: {code} import org.apache.spark.sql.expressions.Window val df = Seq(("a", 1),

[jira] [Commented] (SPARK-22804) Using a window function inside of an aggregation causes StackOverflowError

2017-12-15 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22804?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16292698#comment-16292698 ] Sean Owen commented on SPARK-22804: --- Same as SPARK-21896? > Using a window function inside of an

[jira] [Updated] (SPARK-22804) Using a window function inside of an aggregation causes StackOverflowError

2017-12-15 Thread Sandor Murakozi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22804?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sandor Murakozi updated SPARK-22804: Description: {code} import org.apache.spark.sql.expressions.Window val df = Seq(("a", 1),

[jira] [Updated] (SPARK-22804) Using a window function inside of an aggregation causes StackOverflowError

2017-12-15 Thread Sandor Murakozi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22804?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sandor Murakozi updated SPARK-22804: Description: {code} import org.apache.spark.sql.expressions.Window val df = Seq(("a", 1),

[jira] [Updated] (SPARK-22804) Using a window function inside of an aggregation causes StackOverflowError

2017-12-15 Thread Sandor Murakozi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22804?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sandor Murakozi updated SPARK-22804: Description: {code} import org.apache.spark.sql.expressions.Window val df = Seq(("a", 1),

[jira] [Created] (SPARK-22804) Using a window function inside of an aggregation causes StackOverflowError

2017-12-15 Thread Sandor Murakozi (JIRA)
Sandor Murakozi created SPARK-22804: --- Summary: Using a window function inside of an aggregation causes StackOverflowError Key: SPARK-22804 URL: https://issues.apache.org/jira/browse/SPARK-22804

[jira] [Commented] (SPARK-22794) Spark Job failed, but the state is succeeded in Yarn Web

2017-12-15 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22794?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16292675#comment-16292675 ] Sean Owen commented on SPARK-22794: --- This looks like a duplicate of lots of things like SPARK-22708,

[jira] [Commented] (SPARK-22465) Cogroup of two disproportionate RDDs could lead into 2G limit BUG

2017-12-15 Thread Thomas Graves (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22465?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16292608#comment-16292608 ] Thomas Graves commented on SPARK-22465: --- Yes I think that makes sense. > Cogroup of two

[jira] [Commented] (SPARK-22765) Create a new executor allocation scheme based on that of MR

2017-12-15 Thread Thomas Graves (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22765?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16292594#comment-16292594 ] Thomas Graves commented on SPARK-22765: --- I'm not sure how mr style and 4-core executor go together.

[jira] [Commented] (SPARK-18294) Implement commit protocol to support `mapred` package's committer

2017-12-15 Thread Steve Loughran (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18294?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16292595#comment-16292595 ] Steve Loughran commented on SPARK-18294: Following up on this, one question: Why support the

[jira] [Commented] (SPARK-22465) Cogroup of two disproportionate RDDs could lead into 2G limit BUG

2017-12-15 Thread Sujith Jay Nair (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22465?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16292581#comment-16292581 ] Sujith Jay Nair commented on SPARK-22465: - Would something along the lines of 'add a

[jira] [Commented] (SPARK-22465) Cogroup of two disproportionate RDDs could lead into 2G limit BUG

2017-12-15 Thread Thomas Graves (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22465?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16292569#comment-16292569 ] Thomas Graves commented on SPARK-22465: --- I don't have time at the moment to work on this so if you

[jira] [Commented] (SPARK-22799) Bucketizer should throw exception if single- and multi-column params are both set

2017-12-15 Thread Marco Gaido (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22799?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16292540#comment-16292540 ] Marco Gaido commented on SPARK-22799: - May I work on this? > Bucketizer should throw exception if

[jira] [Resolved] (SPARK-22803) Connection Refused Error is happening sometime.

2017-12-15 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22803?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-22803. --- Resolution: Invalid Please don't put questions on JIRA. Use the mailing list. > Connection Refused

[jira] [Resolved] (SPARK-22792) PySpark UDF registering issue

2017-12-15 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22792?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-22792. --- Resolution: Invalid For JIRAs, you'd need to narrow this down to a clearly-described and narrow

[jira] [Created] (SPARK-22803) Connection Refused Error is happening sometime.

2017-12-15 Thread Annamalai Venugopal (JIRA)
Annamalai Venugopal created SPARK-22803: --- Summary: Connection Refused Error is happening sometime. Key: SPARK-22803 URL: https://issues.apache.org/jira/browse/SPARK-22803 Project: Spark

[jira] [Closed] (SPARK-22802) Regarding max tax size

2017-12-15 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22802?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen closed SPARK-22802. - > Regarding max tax size > -- > > Key: SPARK-22802 >

[jira] [Resolved] (SPARK-22802) Regarding max tax size

2017-12-15 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22802?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-22802. --- Resolution: Invalid Don't reopen this. > Regarding max tax size > -- > >

[jira] [Commented] (SPARK-22802) Regarding max tax size

2017-12-15 Thread Annamalai Venugopal (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22802?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16292527#comment-16292527 ] Annamalai Venugopal commented on SPARK-22802: - I am doing a project with spark with the

[jira] [Reopened] (SPARK-22802) Regarding max tax size

2017-12-15 Thread Annamalai Venugopal (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22802?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Annamalai Venugopal reopened SPARK-22802: - I am doing a project with spark with the source code as: import sys import os from

[jira] [Resolved] (SPARK-22802) Regarding max tax size

2017-12-15 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22802?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-22802. --- Resolution: Invalid It's not clear what you're even asking, but it should go to the mailing list >

[jira] [Assigned] (SPARK-22801) Allow FeatureHasher to specify numeric columns to treat as categorical

2017-12-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22801?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22801: Assignee: Nick Pentreath (was: Apache Spark) > Allow FeatureHasher to specify numeric

[jira] [Commented] (SPARK-22801) Allow FeatureHasher to specify numeric columns to treat as categorical

2017-12-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22801?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16292467#comment-16292467 ] Apache Spark commented on SPARK-22801: -- User 'MLnick' has created a pull request for this issue:

[jira] [Assigned] (SPARK-22801) Allow FeatureHasher to specify numeric columns to treat as categorical

2017-12-15 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22801?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22801: Assignee: Apache Spark (was: Nick Pentreath) > Allow FeatureHasher to specify numeric

[jira] [Created] (SPARK-22802) Regarding max tax size

2017-12-15 Thread Annamalai Venugopal (JIRA)
Annamalai Venugopal created SPARK-22802: --- Summary: Regarding max tax size Key: SPARK-22802 URL: https://issues.apache.org/jira/browse/SPARK-22802 Project: Spark Issue Type: Question

[jira] [Commented] (SPARK-22792) PySpark UDF registering issue

2017-12-15 Thread Annamalai Venugopal (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22792?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16292378#comment-16292378 ] Annamalai Venugopal commented on SPARK-22792: - def hypernym_generation(token_array_a):

[jira] [Updated] (SPARK-22799) Bucketizer should throw exception if single- and multi-column params are both set

2017-12-15 Thread Nick Pentreath (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22799?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nick Pentreath updated SPARK-22799: --- Description: See the related discussion:
