[jira] [Commented] (SPARK-21274) Implement EXCEPT ALL and INTERSECT ALL

2018-04-26 Thread Dilip Biswal (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21274?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16455734#comment-16455734 ] Dilip Biswal commented on SPARK-21274: -- [~maropu] Thanks. Yeah.. i will have two separate PRs. >

[jira] [Assigned] (SPARK-24109) Remove class SnappyOutputStreamWrapper

2018-04-26 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24109?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-24109: Assignee: (was: Apache Spark) > Remove class SnappyOutputStreamWrapper >

[jira] [Commented] (SPARK-24109) Remove class SnappyOutputStreamWrapper

2018-04-26 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24109?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16455731#comment-16455731 ] Apache Spark commented on SPARK-24109: -- User 'manbuyun' has created a pull request for this issue:

[jira] [Assigned] (SPARK-24109) Remove class SnappyOutputStreamWrapper

2018-04-26 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24109?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-24109: Assignee: Apache Spark > Remove class SnappyOutputStreamWrapper >

[jira] [Created] (SPARK-24109) Remove class SnappyOutputStreamWrapper

2018-04-26 Thread wangjinhai (JIRA)
wangjinhai created SPARK-24109: -- Summary: Remove class SnappyOutputStreamWrapper Key: SPARK-24109 URL: https://issues.apache.org/jira/browse/SPARK-24109 Project: Spark Issue Type: Improvement

[jira] [Commented] (SPARK-21274) Implement EXCEPT ALL and INTERSECT ALL

2018-04-26 Thread Takeshi Yamamuro (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21274?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16455716#comment-16455716 ] Takeshi Yamamuro commented on SPARK-21274: -- ok, thanks! IMO it'd be better to make separate two

[jira] [Resolved] (SPARK-24108) ChunkedByteBuffer.writeFully method has not reset the limit value

2018-04-26 Thread wangjinhai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24108?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] wangjinhai resolved SPARK-24108. Resolution: Won't Do > ChunkedByteBuffer.writeFully method has not reset the limit value >

[jira] [Assigned] (SPARK-24107) ChunkedByteBuffer.writeFully method has not reset the limit value

2018-04-26 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24107?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-24107: Assignee: Apache Spark > ChunkedByteBuffer.writeFully method has not reset the limit

[jira] [Assigned] (SPARK-24107) ChunkedByteBuffer.writeFully method has not reset the limit value

2018-04-26 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24107?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-24107: Assignee: (was: Apache Spark) > ChunkedByteBuffer.writeFully method has not reset the

[jira] [Commented] (SPARK-24107) ChunkedByteBuffer.writeFully method has not reset the limit value

2018-04-26 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24107?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16455700#comment-16455700 ] Apache Spark commented on SPARK-24107: -- User 'manbuyun' has created a pull request for this issue:

[jira] [Commented] (SPARK-21274) Implement EXCEPT ALL and INTERSECT ALL

2018-04-26 Thread Dilip Biswal (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21274?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16455697#comment-16455697 ] Dilip Biswal commented on SPARK-21274: -- [~maropu] I am currently testing the code. I will open the

[jira] [Created] (SPARK-24108) ChunkedByteBuffer.writeFully method has not reset the limit value

2018-04-26 Thread wangjinhai (JIRA)
wangjinhai created SPARK-24108: -- Summary: ChunkedByteBuffer.writeFully method has not reset the limit value Key: SPARK-24108 URL: https://issues.apache.org/jira/browse/SPARK-24108 Project: Spark

[jira] [Created] (SPARK-24107) ChunkedByteBuffer.writeFully method has not reset the limit value

2018-04-26 Thread wangjinhai (JIRA)
wangjinhai created SPARK-24107: -- Summary: ChunkedByteBuffer.writeFully method has not reset the limit value Key: SPARK-24107 URL: https://issues.apache.org/jira/browse/SPARK-24107 Project: Spark

[jira] [Resolved] (SPARK-23355) convertMetastore should not ignore table properties

2018-04-26 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23355?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan resolved SPARK-23355. - Resolution: Fixed Fix Version/s: 2.4.0 Issue resolved by pull request 20522

[jira] [Assigned] (SPARK-23355) convertMetastore should not ignore table properties

2018-04-26 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23355?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan reassigned SPARK-23355: --- Assignee: Dongjoon Hyun > convertMetastore should not ignore table properties >

[jira] [Updated] (SPARK-24100) Add the CompressionCodec to the saveAsTextFiles interface.

2018-04-26 Thread caijie (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24100?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] caijie updated SPARK-24100: --- Component/s: (was: DStreams) > Add the CompressionCodec to the saveAsTextFiles interface. >

[jira] [Commented] (SPARK-23830) Spark on YARN in cluster deploy mode fail with NullPointerException when a Spark application is a Scala class not object

2018-04-26 Thread Saisai Shao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23830?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16455621#comment-16455621 ] Saisai Shao commented on SPARK-23830: - I agree [~emaynard]. > Spark on YARN in cluster deploy mode

[jira] [Resolved] (SPARK-24099) java.io.CharConversionException: Invalid UTF-32 character prevents me from querying my data in JSON

2018-04-26 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24099?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24099. -- Resolution: Duplicate It will be fixed in SPARK-23723 soon. >

[jira] [Comment Edited] (SPARK-24068) CSV schema inferring doesn't work for compressed files

2018-04-26 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24068?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16455611#comment-16455611 ] Hyukjin Kwon edited comment on SPARK-24068 at 4/27/18 12:51 AM: I roughly

[jira] [Commented] (SPARK-24068) CSV schema inferring doesn't work for compressed files

2018-04-26 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24068?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16455611#comment-16455611 ] Hyukjin Kwon commented on SPARK-24068: -- I roughly assume the fix will be small, similar or the same?

[jira] [Commented] (SPARK-23925) High-order function: repeat(element, count) → array

2018-04-26 Thread Florent Pepin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23925?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16455506#comment-16455506 ] Florent Pepin commented on SPARK-23925: --- Hey, sorry I didn't realise the complexity of this one,

[jira] [Assigned] (SPARK-24083) Diagnostics message for uncaught exceptions should include the stacktrace

2018-04-26 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24083?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin reassigned SPARK-24083: -- Assignee: zhoukang > Diagnostics message for uncaught exceptions should include the

[jira] [Resolved] (SPARK-24083) Diagnostics message for uncaught exceptions should include the stacktrace

2018-04-26 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24083?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin resolved SPARK-24083. Resolution: Fixed Fix Version/s: 2.4.0 Issue resolved by pull request 21151

[jira] [Commented] (SPARK-21274) Implement EXCEPT ALL and INTERSECT ALL

2018-04-26 Thread Takeshi Yamamuro (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21274?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16455470#comment-16455470 ] Takeshi Yamamuro commented on SPARK-21274: -- Looks great to me. I checked the queries above

[jira] [Assigned] (SPARK-24085) Scalar subquery error

2018-04-26 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24085?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-24085: Assignee: (was: Apache Spark) > Scalar subquery error > - > >

[jira] [Assigned] (SPARK-24085) Scalar subquery error

2018-04-26 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24085?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-24085: Assignee: Apache Spark > Scalar subquery error > - > >

[jira] [Commented] (SPARK-24085) Scalar subquery error

2018-04-26 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24085?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16455468#comment-16455468 ] Apache Spark commented on SPARK-24085: -- User 'dilipbiswal' has created a pull request for this

[jira] [Resolved] (SPARK-24044) Explicitly print out skipped tests from unittest module

2018-04-26 Thread Bryan Cutler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24044?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Bryan Cutler resolved SPARK-24044. -- Resolution: Fixed Fix Version/s: 2.4.0 Issue resolved by pull request 21107

[jira] [Assigned] (SPARK-24044) Explicitly print out skipped tests from unittest module

2018-04-26 Thread Bryan Cutler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24044?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Bryan Cutler reassigned SPARK-24044: Assignee: Hyukjin Kwon > Explicitly print out skipped tests from unittest module >

[jira] [Assigned] (SPARK-23856) Spark jdbc setQueryTimeout option

2018-04-26 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23856?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-23856: Assignee: (was: Apache Spark) > Spark jdbc setQueryTimeout option >

[jira] [Commented] (SPARK-23856) Spark jdbc setQueryTimeout option

2018-04-26 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23856?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16455342#comment-16455342 ] Apache Spark commented on SPARK-23856: -- User 'maropu' has created a pull request for this issue:

[jira] [Assigned] (SPARK-23856) Spark jdbc setQueryTimeout option

2018-04-26 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23856?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-23856: Assignee: Apache Spark > Spark jdbc setQueryTimeout option >

[jira] [Commented] (SPARK-23580) Interpreted mode fallback should be implemented for all expressions & projections

2018-04-26 Thread Bruce Robbins (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23580?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16454918#comment-16454918 ] Bruce Robbins commented on SPARK-23580: --- Should SortPrefix also get this treatment? > Interpreted

[jira] [Resolved] (SPARK-24057) put the real data type in the AssertionError message

2018-04-26 Thread Bryan Cutler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24057?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Bryan Cutler resolved SPARK-24057. -- Resolution: Fixed Fix Version/s: 2.4.0 Issue resolved by pull request 21159

[jira] [Assigned] (SPARK-24057) put the real data type in the AssertionError message

2018-04-26 Thread Bryan Cutler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24057?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Bryan Cutler reassigned SPARK-24057: Assignee: Huaxin Gao > put the real data type in the AssertionError message >

[jira] [Commented] (SPARK-24105) Spark 2.3.0 on kubernetes

2018-04-26 Thread Anirudh Ramanathan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24105?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16454847#comment-16454847 ] Anirudh Ramanathan commented on SPARK-24105: > To avoid this deadlock, its required to

[jira] [Updated] (SPARK-24106) Spark Structure Streaming with RF model taking long time in processing probability for each mini batch

2018-04-26 Thread Tamilselvan Veeramani (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24106?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tamilselvan Veeramani updated SPARK-24106: -- Target Version/s: 2.3.0, 2.2.1 (was: 2.2.1, 2.3.0) Issue Type:

[jira] [Commented] (SPARK-24036) Stateful operators in continuous processing

2018-04-26 Thread Jose Torres (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24036?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16454788#comment-16454788 ] Jose Torres commented on SPARK-24036: -

[jira] [Updated] (SPARK-24106) Spark Structure Streaming with RF model taking long time in processing probability for each mini batch

2018-04-26 Thread Tamilselvan Veeramani (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24106?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tamilselvan Veeramani updated SPARK-24106: -- Target Version/s: 2.3.0, 2.2.1 (was: 2.2.1, 2.3.0) Component/s:

[jira] [Commented] (SPARK-24106) Spark Structure Streaming with RF model taking long time in processing probability for each mini batch

2018-04-26 Thread Tamilselvan Veeramani (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24106?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16454774#comment-16454774 ] Tamilselvan Veeramani commented on SPARK-24106: --- I have the code change ready and tested in

[jira] [Created] (SPARK-24106) Spark Structure Streaming with RF model taking long time in processing probability for each mini batch

2018-04-26 Thread Tamilselvan Veeramani (JIRA)
Tamilselvan Veeramani created SPARK-24106: - Summary: Spark Structure Streaming with RF model taking long time in processing probability for each mini batch Key: SPARK-24106 URL:

[jira] [Commented] (SPARK-24068) CSV schema inferring doesn't work for compressed files

2018-04-26 Thread Maxim Gekk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24068?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16454761#comment-16454761 ] Maxim Gekk commented on SPARK-24068: The same issue exists in JSON datasource. [~hyukjin.kwon] Do we

[jira] [Commented] (SPARK-23120) Add PMML pipeline export support to PySpark

2018-04-26 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23120?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16454716#comment-16454716 ] Apache Spark commented on SPARK-23120: -- User 'holdenk' has created a pull request for this issue:

[jira] [Assigned] (SPARK-23120) Add PMML pipeline export support to PySpark

2018-04-26 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23120?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-23120: Assignee: holdenk (was: Apache Spark) > Add PMML pipeline export support to PySpark >

[jira] [Assigned] (SPARK-23120) Add PMML pipeline export support to PySpark

2018-04-26 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23120?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-23120: Assignee: Apache Spark (was: holdenk) > Add PMML pipeline export support to PySpark >

[jira] [Comment Edited] (SPARK-24085) Scalar subquery error

2018-04-26 Thread Dilip Biswal (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24085?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16454707#comment-16454707 ] Dilip Biswal edited comment on SPARK-24085 at 4/26/18 7:09 PM: --- Working on

[jira] [Commented] (SPARK-24085) Scalar subquery error

2018-04-26 Thread Dilip Biswal (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24085?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16454707#comment-16454707 ] Dilip Biswal commented on SPARK-24085: -- Working on a fix for this. > Scalar subquery error >

[jira] [Updated] (SPARK-24105) Spark 2.3.0 on kubernetes

2018-04-26 Thread Lenin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24105?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lenin updated SPARK-24105: -- Description: Right now its only possible to define node selector configurations 

[jira] [Assigned] (SPARK-24104) SQLAppStatusListener overwrites metrics onDriverAccumUpdates instead of updating them

2018-04-26 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24104?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-24104: Assignee: (was: Apache Spark) > SQLAppStatusListener overwrites metrics

[jira] [Assigned] (SPARK-24104) SQLAppStatusListener overwrites metrics onDriverAccumUpdates instead of updating them

2018-04-26 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24104?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-24104: Assignee: Apache Spark > SQLAppStatusListener overwrites metrics onDriverAccumUpdates

[jira] [Commented] (SPARK-24104) SQLAppStatusListener overwrites metrics onDriverAccumUpdates instead of updating them

2018-04-26 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24104?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16454630#comment-16454630 ] Apache Spark commented on SPARK-24104: -- User 'juliuszsompolski' has created a pull request for this

[jira] [Created] (SPARK-24105) Spark 2.3.0 on kubernetes

2018-04-26 Thread Lenin (JIRA)
Lenin created SPARK-24105: - Summary: Spark 2.3.0 on kubernetes Key: SPARK-24105 URL: https://issues.apache.org/jira/browse/SPARK-24105 Project: Spark Issue Type: Improvement Components:

[jira] [Commented] (SPARK-23962) Flaky tests from SQLMetricsTestUtils.currentExecutionIds

2018-04-26 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23962?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16454594#comment-16454594 ] Dongjoon Hyun commented on SPARK-23962: --- [~irashid], [~cloud_fan], Please check the build status on

[jira] [Resolved] (SPARK-23842) accessing java from PySpark lambda functions

2018-04-26 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23842?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] holdenk resolved SPARK-23842. - Resolution: Won't Fix Not supported by the current design, alternatives do exist though. > accessing

[jira] [Commented] (SPARK-23842) accessing java from PySpark lambda functions

2018-04-26 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23842?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16454585#comment-16454585 ] holdenk commented on SPARK-23842: - So the py4j gateway only exists on the driver program, on the worker

[jira] [Created] (SPARK-24104) SQLAppStatusListener overwrites metrics onDriverAccumUpdates instead of updating them

2018-04-26 Thread Juliusz Sompolski (JIRA)
Juliusz Sompolski created SPARK-24104: - Summary: SQLAppStatusListener overwrites metrics onDriverAccumUpdates instead of updating them Key: SPARK-24104 URL: https://issues.apache.org/jira/browse/SPARK-24104

[jira] [Comment Edited] (SPARK-23925) High-order function: repeat(element, count) → array

2018-04-26 Thread Marek Novotny (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23925?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16454526#comment-16454526 ] Marek Novotny edited comment on SPARK-23925 at 4/26/18 4:47 PM:

[jira] [Commented] (SPARK-23925) High-order function: repeat(element, count) → array

2018-04-26 Thread Marek Novotny (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23925?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16454526#comment-16454526 ] Marek Novotny commented on SPARK-23925: --- @pepinoflo Any joy? I can take this one or help if you

[jira] [Commented] (SPARK-22732) Add DataSourceV2 streaming APIs

2018-04-26 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22732?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16454501#comment-16454501 ] Apache Spark commented on SPARK-22732: -- User 'wangyum' has created a pull request for this issue:

[jira] [Assigned] (SPARK-23715) from_utc_timestamp returns incorrect results for some UTC date/time values

2018-04-26 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23715?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-23715: Assignee: Apache Spark > from_utc_timestamp returns incorrect results for some UTC

[jira] [Commented] (SPARK-24096) create table as select not using hive.default.fileformat

2018-04-26 Thread Yuming Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24096?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16454463#comment-16454463 ] Yuming Wang commented on SPARK-24096: - Another related PR: https://github.com/apache/spark/pull/14430

[jira] [Commented] (SPARK-23715) from_utc_timestamp returns incorrect results for some UTC date/time values

2018-04-26 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23715?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16454462#comment-16454462 ] Apache Spark commented on SPARK-23715: -- User 'cloud-fan' has created a pull request for this issue:

[jira] [Assigned] (SPARK-23715) from_utc_timestamp returns incorrect results for some UTC date/time values

2018-04-26 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23715?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-23715: Assignee: (was: Apache Spark) > from_utc_timestamp returns incorrect results for some

[jira] [Resolved] (SPARK-24096) create table as select not using hive.default.fileformat

2018-04-26 Thread Yuming Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24096?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang resolved SPARK-24096. - Resolution: Duplicate > create table as select not using hive.default.fileformat >

[jira] [Commented] (SPARK-20087) Include accumulators / taskMetrics when sending TaskKilled to onTaskEnd listeners

2018-04-26 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20087?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16454454#comment-16454454 ] Apache Spark commented on SPARK-20087: -- User 'advancedxy' has created a pull request for this issue:

[jira] [Commented] (SPARK-24101) MulticlassClassificationEvaluator should use sample weight data

2018-04-26 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24101?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16454445#comment-16454445 ] Apache Spark commented on SPARK-24101: -- User 'imatiach-msft' has created a pull request for this

[jira] [Assigned] (SPARK-24101) MulticlassClassificationEvaluator should use sample weight data

2018-04-26 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24101?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-24101: Assignee: (was: Apache Spark) > MulticlassClassificationEvaluator should use sample

[jira] [Assigned] (SPARK-24101) MulticlassClassificationEvaluator should use sample weight data

2018-04-26 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24101?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-24101: Assignee: Apache Spark > MulticlassClassificationEvaluator should use sample weight data

[jira] [Updated] (SPARK-24101) MulticlassClassificationEvaluator should use sample weight data

2018-04-26 Thread Ilya Matiach (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24101?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ilya Matiach updated SPARK-24101: - Description: The LogisticRegression and LinearRegression models support training with a weight

[jira] [Updated] (SPARK-24102) RegressionEvaluator should use sample weight data

2018-04-26 Thread Ilya Matiach (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24102?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ilya Matiach updated SPARK-24102: - Description: The LogisticRegression and LinearRegression models support training with a weight

[jira] [Updated] (SPARK-24103) BinaryClassificationEvaluator should use sample weight data

2018-04-26 Thread Ilya Matiach (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24103?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ilya Matiach updated SPARK-24103: - Description: The LogisticRegression and LinearRegression models support training with a weight

[jira] [Commented] (SPARK-18693) BinaryClassificationEvaluator, RegressionEvaluator, and MulticlassClassificationEvaluator should use sample weight data

2018-04-26 Thread Ilya Matiach (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18693?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16454435#comment-16454435 ] Ilya Matiach commented on SPARK-18693: -- [~josephkb] sure, I've added 3 JIRAs for tracking and have

[jira] [Created] (SPARK-24103) BinaryClassificationEvaluator should use sample weight data

2018-04-26 Thread Ilya Matiach (JIRA)
Ilya Matiach created SPARK-24103: Summary: BinaryClassificationEvaluator should use sample weight data Key: SPARK-24103 URL: https://issues.apache.org/jira/browse/SPARK-24103 Project: Spark

[jira] [Updated] (SPARK-24102) RegressionEvaluator should use sample weight data

2018-04-26 Thread Ilya Matiach (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24102?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ilya Matiach updated SPARK-24102: - Issue Type: Improvement (was: Bug) > RegressionEvaluator should use sample weight data >

[jira] [Updated] (SPARK-24101) MulticlassClassificationEvaluator should use sample weight data

2018-04-26 Thread Ilya Matiach (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24101?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ilya Matiach updated SPARK-24101: - Issue Type: Improvement (was: Bug) > MulticlassClassificationEvaluator should use sample weight

[jira] [Created] (SPARK-24102) RegressionEvaluator should use sample weight data

2018-04-26 Thread Ilya Matiach (JIRA)
Ilya Matiach created SPARK-24102: Summary: RegressionEvaluator should use sample weight data Key: SPARK-24102 URL: https://issues.apache.org/jira/browse/SPARK-24102 Project: Spark Issue

[jira] [Created] (SPARK-24101) MulticlassClassificationEvaluator should use sample weight data

2018-04-26 Thread Ilya Matiach (JIRA)
Ilya Matiach created SPARK-24101: Summary: MulticlassClassificationEvaluator should use sample weight data Key: SPARK-24101 URL: https://issues.apache.org/jira/browse/SPARK-24101 Project: Spark

[jira] [Commented] (SPARK-23933) High-order function: map(array, array) → map<K,V>

2018-04-26 Thread Alex Wajda (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23933?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16454393#comment-16454393 ] Alex Wajda commented on SPARK-23933: Oh, now I got it, thanks. I overlooked the case when the keys

[jira] [Commented] (SPARK-23830) Spark on YARN in cluster deploy mode fail with NullPointerException when a Spark application is a Scala class not object

2018-04-26 Thread Eric Maynard (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23830?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16454365#comment-16454365 ] Eric Maynard commented on SPARK-23830: -- [~jerryshao] I agree we should not support using a

[jira] [Assigned] (SPARK-23830) Spark on YARN in cluster deploy mode fail with NullPointerException when a Spark application is a Scala class not object

2018-04-26 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23830?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-23830: Assignee: (was: Apache Spark) > Spark on YARN in cluster deploy mode fail with

[jira] [Assigned] (SPARK-23830) Spark on YARN in cluster deploy mode fail with NullPointerException when a Spark application is a Scala class not object

2018-04-26 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23830?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-23830: Assignee: Apache Spark > Spark on YARN in cluster deploy mode fail with

[jira] [Commented] (SPARK-23830) Spark on YARN in cluster deploy mode fail with NullPointerException when a Spark application is a Scala class not object

2018-04-26 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23830?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16454358#comment-16454358 ] Apache Spark commented on SPARK-23830: -- User 'eric-maynard' has created a pull request for this

[jira] [Commented] (SPARK-13446) Spark need to support reading data from Hive 2.0.0 metastore

2018-04-26 Thread Tavis Barr (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13446?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16454278#comment-16454278 ] Tavis Barr commented on SPARK-13446: Enjoy!   This is with Spark 2.3.0 and Hive 2.2.0   scala>

[jira] [Comment Edited] (SPARK-23715) from_utc_timestamp returns incorrect results for some UTC date/time values

2018-04-26 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23715?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16454221#comment-16454221 ] Wenchen Fan edited comment on SPARK-23715 at 4/26/18 1:53 PM: -- It seems the

[jira] [Commented] (SPARK-23715) from_utc_timestamp returns incorrect results for some UTC date/time values

2018-04-26 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23715?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16454221#comment-16454221 ] Wenchen Fan commented on SPARK-23715: - It seems the `from_utc_timestamp` doesn't make a lot of sense

[jira] [Commented] (SPARK-21661) SparkSQL can't merge load table from Hadoop

2018-04-26 Thread Li Yuanjian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21661?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16454070#comment-16454070 ] Li Yuanjian commented on SPARK-21661: - Got it. > SparkSQL can't merge load table from Hadoop >

[jira] [Commented] (SPARK-23151) Provide a distribution of Spark with Hadoop 3.0

2018-04-26 Thread Steve Loughran (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23151?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16453991#comment-16453991 ] Steve Loughran commented on SPARK-23151: well, there's "have everything work on Hadoop 3" and

[jira] [Commented] (SPARK-24100) Add the CompressionCodec to the saveAsTextFiles interface.

2018-04-26 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24100?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16453952#comment-16453952 ] Apache Spark commented on SPARK-24100: -- User 'WzRaCai' has created a pull request for this issue:

[jira] [Assigned] (SPARK-24100) Add the CompressionCodec to the saveAsTextFiles interface.

2018-04-26 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24100?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-24100: Assignee: (was: Apache Spark) > Add the CompressionCodec to the saveAsTextFiles

[jira] [Assigned] (SPARK-24100) Add the CompressionCodec to the saveAsTextFiles interface.

2018-04-26 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24100?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-24100: Assignee: Apache Spark > Add the CompressionCodec to the saveAsTextFiles interface. >

[jira] [Created] (SPARK-24100) Add the CompressionCodec to the saveAsTextFiles interface.

2018-04-26 Thread caijie (JIRA)
caijie created SPARK-24100: -- Summary: Add the CompressionCodec to the saveAsTextFiles interface. Key: SPARK-24100 URL: https://issues.apache.org/jira/browse/SPARK-24100 Project: Spark Issue Type:

[jira] [Commented] (SPARK-23856) Spark jdbc setQueryTimeout option

2018-04-26 Thread Takeshi Yamamuro (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23856?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16453948#comment-16453948 ] Takeshi Yamamuro commented on SPARK-23856: -- [~dmitrymikhailov] You don't have time to make a pr?

[jira] [Commented] (SPARK-23929) pandas_udf schema mapped by position and not by name

2018-04-26 Thread Tr3wory (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23929?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16453880#comment-16453880 ] Tr3wory commented on SPARK-23929: - Yes, the documentation is a must, even for 2.3 if possible (it's a new

[jira] [Commented] (SPARK-4781) Column values become all NULL after doing ALTER TABLE CHANGE for renaming column names (Parquet external table in HiveContext)

2018-04-26 Thread Peter Simon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4781?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16453861#comment-16453861 ] Peter Simon commented on SPARK-4781: As commented under SPARK-11748,  Possible workaround can be:

[jira] [Commented] (SPARK-11748) Result is null after alter column name of table stored as Parquet

2018-04-26 Thread Peter Simon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11748?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16453858#comment-16453858 ] Peter Simon commented on SPARK-11748: - Possible workaround can be: {code:java} scala> spark.sql("set

[jira] [Commented] (SPARK-11334) numRunningTasks can't be less than 0, or it will affect executor allocation

2018-04-26 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11334?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16453838#comment-16453838 ] Apache Spark commented on SPARK-11334: -- User 'sadhen' has created a pull request for this issue:

[jira] [Commented] (SPARK-13446) Spark need to support reading data from Hive 2.0.0 metastore

2018-04-26 Thread Steve Loughran (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13446?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16453825#comment-16453825 ] Steve Loughran commented on SPARK-13446: [~Tavis]: can you paste in the stack you see? > Spark

[jira] [Commented] (SPARK-23885) trying to spark submit 2.3.0 on minikube

2018-04-26 Thread anant pukale (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23885?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16453803#comment-16453803 ] anant pukale commented on SPARK-23885: -- HI Anirudh, Kindly suggest any work around .   Thanks

[jira] [Created] (SPARK-24099) java.io.CharConversionException: Invalid UTF-32 character prevents me from querying my data in JSON

2018-04-26 Thread ABHISHEK KUMAR GUPTA (JIRA)
ABHISHEK KUMAR GUPTA created SPARK-24099: Summary: java.io.CharConversionException: Invalid UTF-32 character prevents me from querying my data in JSON Key: SPARK-24099 URL:

[jira] [Commented] (SPARK-24098) ScriptTransformationExec should wait process exiting before output iterator finish

2018-04-26 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24098?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16453716#comment-16453716 ] Apache Spark commented on SPARK-24098: -- User 'liutang123' has created a pull request for this issue:

  1   2   >