[jira] [Created] (SPARK-24035) SQL syntax for Pivot

2018-04-20 Thread Xiao Li (JIRA)
Xiao Li created SPARK-24035: --- Summary: SQL syntax for Pivot Key: SPARK-24035 URL: https://issues.apache.org/jira/browse/SPARK-24035 Project: Spark Issue Type: Improvement Components: SQL

[jira] [Reopened] (SPARK-23775) Flaky test: DataFrameRangeSuite

2018-04-20 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23775?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin reopened SPARK-23775: Assignee: (was: Gabor Somogyi) > Flaky test: DataFrameRangeSuite >

[jira] [Commented] (SPARK-24033) LAG Window function broken in Spark 2.3

2018-04-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24033?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16446090#comment-16446090 ] Apache Spark commented on SPARK-24033: -- User 'gatorsmile' has created a pull request for this issue:

[jira] [Assigned] (SPARK-24033) LAG Window function broken in Spark 2.3

2018-04-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24033?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-24033: Assignee: Apache Spark (was: Xiao Li) > LAG Window function broken in Spark 2.3 >

[jira] [Assigned] (SPARK-24033) LAG Window function broken in Spark 2.3

2018-04-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24033?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-24033: Assignee: Xiao Li (was: Apache Spark) > LAG Window function broken in Spark 2.3 >

[jira] [Commented] (SPARK-22371) dag-scheduler-event-loop thread stopped with error Attempted to access garbage collected accumulator 5605982

2018-04-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22371?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16446084#comment-16446084 ] Apache Spark commented on SPARK-22371: -- User 'artemrd' has created a pull request for this issue:

[jira] [Assigned] (SPARK-22371) dag-scheduler-event-loop thread stopped with error Attempted to access garbage collected accumulator 5605982

2018-04-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22371?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22371: Assignee: Apache Spark > dag-scheduler-event-loop thread stopped with error Attempted to

[jira] [Assigned] (SPARK-22371) dag-scheduler-event-loop thread stopped with error Attempted to access garbage collected accumulator 5605982

2018-04-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22371?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22371: Assignee: (was: Apache Spark) > dag-scheduler-event-loop thread stopped with error

[jira] [Created] (SPARK-24038) refactor continuous write exec to its own class

2018-04-20 Thread Jose Torres (JIRA)
Jose Torres created SPARK-24038: --- Summary: refactor continuous write exec to its own class Key: SPARK-24038 URL: https://issues.apache.org/jira/browse/SPARK-24038 Project: Spark Issue Type:

[jira] [Commented] (SPARK-10399) Off Heap Memory Access for non-JVM libraries (C++)

2018-04-20 Thread Kazuaki Ishizaki (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10399?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16446166#comment-16446166 ] Kazuaki Ishizaki commented on SPARK-10399: -- https://issues.apache.org/jira/browse/SPARK-23879 is

[jira] [Assigned] (SPARK-24033) LAG Window function broken in Spark 2.3

2018-04-20 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24033?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li reassigned SPARK-24033: --- Assignee: Xiao Li > LAG Window function broken in Spark 2.3 >

[jira] [Created] (SPARK-24039) remove restarting iterators hack

2018-04-20 Thread Jose Torres (JIRA)
Jose Torres created SPARK-24039: --- Summary: remove restarting iterators hack Key: SPARK-24039 URL: https://issues.apache.org/jira/browse/SPARK-24039 Project: Spark Issue Type: Sub-task

[jira] [Assigned] (SPARK-24038) refactor continuous write exec to its own class

2018-04-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24038?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-24038: Assignee: Apache Spark > refactor continuous write exec to its own class >

[jira] [Assigned] (SPARK-24038) refactor continuous write exec to its own class

2018-04-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24038?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-24038: Assignee: (was: Apache Spark) > refactor continuous write exec to its own class >

[jira] [Commented] (SPARK-24038) refactor continuous write exec to its own class

2018-04-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24038?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16446161#comment-16446161 ] Apache Spark commented on SPARK-24038: -- User 'jose-torres' has created a pull request for this

[jira] [Commented] (SPARK-10399) Off Heap Memory Access for non-JVM libraries (C++)

2018-04-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10399?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16446176#comment-16446176 ] Apache Spark commented on SPARK-10399: -- User 'kiszk' has created a pull request for this issue:

[jira] [Commented] (SPARK-23879) Introduce MemoryBlock API instead of Platform API with Object

2018-04-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23879?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16446177#comment-16446177 ] Apache Spark commented on SPARK-23879: -- User 'kiszk' has created a pull request for this issue:

[jira] [Created] (SPARK-24041) add flag to remove whitelist of continuous processing operators

2018-04-20 Thread Jose Torres (JIRA)
Jose Torres created SPARK-24041: --- Summary: add flag to remove whitelist of continuous processing operators Key: SPARK-24041 URL: https://issues.apache.org/jira/browse/SPARK-24041 Project: Spark

[jira] [Created] (SPARK-24037) stateful operators

2018-04-20 Thread Jose Torres (JIRA)
Jose Torres created SPARK-24037: --- Summary: stateful operators Key: SPARK-24037 URL: https://issues.apache.org/jira/browse/SPARK-24037 Project: Spark Issue Type: Sub-task Components:

[jira] [Commented] (SPARK-22371) dag-scheduler-event-loop thread stopped with error Attempted to access garbage collected accumulator 5605982

2018-04-20 Thread Artem Rudoy (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22371?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16445999#comment-16445999 ] Artem Rudoy commented on SPARK-22371: - Do we really need to throw an exception from

[jira] [Comment Edited] (SPARK-22371) dag-scheduler-event-loop thread stopped with error Attempted to access garbage collected accumulator 5605982

2018-04-20 Thread Artem Rudoy (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22371?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16445999#comment-16445999 ] Artem Rudoy edited comment on SPARK-22371 at 4/20/18 4:37 PM: -- Do we really

[jira] [Assigned] (SPARK-23775) Flaky test: DataFrameRangeSuite

2018-04-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23775?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-23775: Assignee: Apache Spark > Flaky test: DataFrameRangeSuite >

[jira] [Updated] (SPARK-23775) Flaky test: DataFrameRangeSuite

2018-04-20 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23775?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin updated SPARK-23775: --- Fix Version/s: (was: 2.3.1) (was: 2.4.0) > Flaky test:

[jira] [Assigned] (SPARK-23775) Flaky test: DataFrameRangeSuite

2018-04-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23775?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-23775: Assignee: (was: Apache Spark) > Flaky test: DataFrameRangeSuite >

[jira] [Commented] (SPARK-24035) SQL syntax for Pivot

2018-04-20 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24035?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16446058#comment-16446058 ] Xiao Li commented on SPARK-24035: - Also cc [~simeons] > SQL syntax for Pivot >

[jira] [Created] (SPARK-24036) Stateful operators in continuous processing

2018-04-20 Thread Jose Torres (JIRA)
Jose Torres created SPARK-24036: --- Summary: Stateful operators in continuous processing Key: SPARK-24036 URL: https://issues.apache.org/jira/browse/SPARK-24036 Project: Spark Issue Type:

[jira] [Created] (SPARK-24040) support single partition aggregates

2018-04-20 Thread Jose Torres (JIRA)
Jose Torres created SPARK-24040: --- Summary: support single partition aggregates Key: SPARK-24040 URL: https://issues.apache.org/jira/browse/SPARK-24040 Project: Spark Issue Type: Sub-task

[jira] [Updated] (SPARK-17916) CSV data source treats empty string as null no matter what nullValue option is

2018-04-20 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17916?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li updated SPARK-17916: Target Version/s: 2.4.0 > CSV data source treats empty string as null no matter what nullValue option is >

[jira] [Assigned] (SPARK-23325) DataSourceV2 readers should always produce InternalRow.

2018-04-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23325?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-23325: Assignee: (was: Apache Spark) > DataSourceV2 readers should always produce

[jira] [Assigned] (SPARK-23325) DataSourceV2 readers should always produce InternalRow.

2018-04-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23325?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-23325: Assignee: Apache Spark > DataSourceV2 readers should always produce InternalRow. >

[jira] [Commented] (SPARK-23325) DataSourceV2 readers should always produce InternalRow.

2018-04-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23325?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16446313#comment-16446313 ] Apache Spark commented on SPARK-23325: -- User 'rdblue' has created a pull request for this issue:

[jira] [Commented] (SPARK-23206) Additional Memory Tuning Metrics

2018-04-20 Thread Edwina Lu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23206?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16446247#comment-16446247 ] Edwina Lu commented on SPARK-23206: --- The design discussion is scheduled for Monday 4/23 PDT at 11am

[jira] [Updated] (SPARK-24019) AnalysisException for Window function expression to compute derivative

2018-04-20 Thread Li Jin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24019?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Li Jin updated SPARK-24019: --- Component/s: (was: Spark Core) SQL > AnalysisException for Window function expression

[jira] [Assigned] (SPARK-19826) spark.ml Python API for PIC

2018-04-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19826?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19826: Assignee: Apache Spark > spark.ml Python API for PIC >

[jira] [Commented] (SPARK-19826) spark.ml Python API for PIC

2018-04-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19826?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16446626#comment-16446626 ] Apache Spark commented on SPARK-19826: -- User 'huaxingao' has created a pull request for this issue:

[jira] [Assigned] (SPARK-19826) spark.ml Python API for PIC

2018-04-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19826?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19826: Assignee: (was: Apache Spark) > spark.ml Python API for PIC >

[jira] [Created] (SPARK-24031) the method of postTaskEnd should write once in handleTaskCompletion

2018-04-20 Thread Yu Wang (JIRA)
Yu Wang created SPARK-24031: --- Summary: the method of postTaskEnd should write once in handleTaskCompletion Key: SPARK-24031 URL: https://issues.apache.org/jira/browse/SPARK-24031 Project: Spark

[jira] [Updated] (SPARK-24031) the method of postTaskEnd should write once in handleTaskCompletion

2018-04-20 Thread Yu Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24031?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yu Wang updated SPARK-24031: Attachment: 24031_master_1.patch > the method of postTaskEnd should write once in handleTaskCompletion >

[jira] [Resolved] (SPARK-24031) the method of postTaskEnd should write once in handleTaskCompletion

2018-04-20 Thread Yu Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24031?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yu Wang resolved SPARK-24031. - Resolution: Fixed > the method of postTaskEnd should write once in handleTaskCompletion >

[jira] [Reopened] (SPARK-24031) the method of postTaskEnd should write once in handleTaskCompletion

2018-04-20 Thread Yu Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24031?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yu Wang reopened SPARK-24031: - > the method of postTaskEnd should write once in handleTaskCompletion >

[jira] [Updated] (SPARK-24031) the method of postTaskEnd should write once in handleTaskCompletion

2018-04-20 Thread Yu Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24031?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yu Wang updated SPARK-24031: Target Version/s: (was: 3.0.0) Fix Version/s: (was: 3.0.0) > the method of postTaskEnd should

[jira] [Commented] (SPARK-23588) Add interpreted execution for CatalystToExternalMap expression

2018-04-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23588?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16445382#comment-16445382 ] Apache Spark commented on SPARK-23588: -- User 'maropu' has created a pull request for this issue:

[jira] [Issue Comment Deleted] (SPARK-23705) dataframe.groupBy() may inadvertently receive sequence of non-distinct strings

2018-04-20 Thread Yu Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23705?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yu Wang updated SPARK-23705: Comment: was deleted (was: [~khoatrantan2000] Could you assign this patch to me?) > dataframe.groupBy()

[jira] [Updated] (SPARK-23705) dataframe.groupBy() may inadvertently receive sequence of non-distinct strings

2018-04-20 Thread Yu Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23705?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yu Wang updated SPARK-23705: Attachment: (was: SPARK-23705.patch) > dataframe.groupBy() may inadvertently receive sequence of

[jira] [Commented] (SPARK-24031) the method of postTaskEnd should write once in handleTaskCompletion

2018-04-20 Thread Yu Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24031?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16445381#comment-16445381 ] Yu Wang commented on SPARK-24031: - @[~srowen] [~joshrosen] please have a look. > the method of

[jira] [Updated] (SPARK-24032) ElasticSearch update fails on nested field

2018-04-20 Thread Cristina Luengo (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24032?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cristina Luengo updated SPARK-24032: Description: I'm trying to update a nested field on ElasticSearch using a scripted update.

[jira] [Created] (SPARK-24032) ElasticSearch updata fails on nested field

2018-04-20 Thread Cristina Luengo (JIRA)
Cristina Luengo created SPARK-24032: --- Summary: ElasticSearch updata fails on nested field Key: SPARK-24032 URL: https://issues.apache.org/jira/browse/SPARK-24032 Project: Spark Issue Type:

[jira] [Updated] (SPARK-24032) ElasticSearch update fails on nested field

2018-04-20 Thread Cristina Luengo (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24032?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cristina Luengo updated SPARK-24032: Summary: ElasticSearch update fails on nested field (was: ElasticSearch updata fails on

[jira] [Commented] (SPARK-24025) Join of bucketed and non-bucketed tables can give two exchanges and sorts for non-bucketed side

2018-04-20 Thread Jacek Laskowski (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24025?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16445512#comment-16445512 ] Jacek Laskowski commented on SPARK-24025: - I was about to close this as a duplicate, but

[jira] [Commented] (SPARK-24032) ElasticSearch update fails on nested field

2018-04-20 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24032?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16445510#comment-16445510 ] Hyukjin Kwon commented on SPARK-24032: -- BTW, please avoid setting Critical+, which is usually

[jira] [Commented] (SPARK-24032) ElasticSearch update fails on nested field

2018-04-20 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24032?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16445507#comment-16445507 ] Hyukjin Kwon commented on SPARK-24032: -- Can you post the full stack trace? I doubt if it's a Spark

[jira] [Commented] (SPARK-24025) Join of bucketed and non-bucketed tables can give two exchanges and sorts for non-bucketed side

2018-04-20 Thread Jacek Laskowski (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24025?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16445520#comment-16445520 ] Jacek Laskowski commented on SPARK-24025: - it seems related or duplicated > Join of bucketed and

[jira] [Updated] (SPARK-24032) ElasticSearch update fails on nested field

2018-04-20 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24032?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon updated SPARK-24032: - Priority: Major (was: Critical) > ElasticSearch update fails on nested field >

[jira] [Commented] (SPARK-13136) Data exchange (shuffle, broadcast) should only be handled by the exchange operator

2018-04-20 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13136?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16445428#comment-16445428 ] Apache Spark commented on SPARK-13136: -- User 'seancxmao' has created a pull request for this issue:

[jira] [Resolved] (SPARK-24032) ElasticSearch update fails on nested field

2018-04-20 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24032?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24032. -- Resolution: Invalid I'd not open an issue unless it's clear. Let me leave this resolved for

[jira] [Commented] (SPARK-24032) ElasticSearch update fails on nested field

2018-04-20 Thread Cristina Luengo (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24032?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16445514#comment-16445514 ] Cristina Luengo commented on SPARK-24032: - Ok thanks!! Sorry, I didn't really know what to assign

[jira] [Commented] (SPARK-24032) ElasticSearch update fails on nested field

2018-04-20 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24032?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16445519#comment-16445519 ] Hyukjin Kwon commented on SPARK-24032: -- Yea, doesn't that look like a problem from elasticsearch

[jira] [Commented] (SPARK-24032) ElasticSearch update fails on nested field

2018-04-20 Thread Cristina Luengo (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24032?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16445522#comment-16445522 ] Cristina Luengo commented on SPARK-24032: - Thanks for the very fast reply! I wasn't sure whether

[jira] [Commented] (SPARK-16854) mapWithState Support for Python

2018-04-20 Thread Fatma Ali (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16854?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16445503#comment-16445503 ] Fatma Ali commented on SPARK-16854: --- +1 > mapWithState Support for Python >

[jira] [Commented] (SPARK-24025) Join of bucketed and non-bucketed tables can give two exchanges and sorts for non-bucketed side

2018-04-20 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24025?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16445516#comment-16445516 ] Hyukjin Kwon commented on SPARK-24025: -- I haven't taken a close look yet but it should be good to

[jira] [Updated] (SPARK-22823) Race Condition when reading Broadcast shuffle input. Failed to get broadcast piece

2018-04-20 Thread Dmitrii Bundin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22823?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dmitrii Bundin updated SPARK-22823: --- Affects Version/s: 2.3.0 > Race Condition when reading Broadcast shuffle input. Failed to

[jira] [Created] (SPARK-24033) LAG Window function broken in Spark 2.3

2018-04-20 Thread Emlyn Corrin (JIRA)
Emlyn Corrin created SPARK-24033: Summary: LAG Window function broken in Spark 2.3 Key: SPARK-24033 URL: https://issues.apache.org/jira/browse/SPARK-24033 Project: Spark Issue Type: Bug

[jira] [Updated] (SPARK-24031) the method of postTaskEnd should write once in handleTaskCompletion

2018-04-20 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24031?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-24031: -- Affects Version/s: (was: 3.0.0) 2.3.0 Priority: Minor (was:

[jira] [Resolved] (SPARK-23595) Add interpreted execution for ValidateExternalType expression

2018-04-20 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23595?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Herman van Hovell resolved SPARK-23595. --- Resolution: Fixed Assignee: Takeshi Yamamuro Fix Version/s: 2.4.0 >

[jira] [Updated] (SPARK-24030) SparkSQL percentile_approx function is too slow for over 1,060,000 records.

2018-04-20 Thread Seok-Joon,Yun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24030?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Seok-Joon,Yun updated SPARK-24030: -- Attachment: screenshot_2018-04-20 23.15.02.png > SparkSQL percentile_approx function is too

[jira] [Updated] (SPARK-24034) StopIteration in pyspark mapper results in partial results

2018-04-20 Thread Emilio Dorigatti (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24034?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Emilio Dorigatti updated SPARK-24034: - Description: Consider the following code {noformat} def mapper(xx): if xx % 2 == 0:

[jira] [Updated] (SPARK-24034) StopIteration in pyspark mapper gives partial results

2018-04-20 Thread Emilio Dorigatti (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24034?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Emilio Dorigatti updated SPARK-24034: - Summary: StopIteration in pyspark mapper gives partial results (was: StopIteration in

[jira] [Updated] (SPARK-24034) StopIteration in pyspark mapper gives partial results

2018-04-20 Thread Emilio Dorigatti (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24034?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Emilio Dorigatti updated SPARK-24034: - Description: Consider the following code {noformat} def mapper(xx): if xx % 2 == 0:

[jira] [Created] (SPARK-24034) StopIteration in pyspark mapper results in partial results

2018-04-20 Thread Emilio Dorigatti (JIRA)
Emilio Dorigatti created SPARK-24034: Summary: StopIteration in pyspark mapper results in partial results Key: SPARK-24034 URL: https://issues.apache.org/jira/browse/SPARK-24034 Project: Spark
