[jira] [Assigned] (SPARK-23116) SparkR 2.3 QA: Update user guide for new features & APIs

2018-01-16 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23116?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley reassigned SPARK-23116: - Assignee: (was: Felix Cheung) > SparkR 2.3 QA: Update user guide for new

[jira] [Updated] (SPARK-23116) CLONE - SparkR 2.2 QA: Update user guide for new features & APIs

2018-01-16 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23116?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-23116: -- Fix Version/s: (was: 2.2.0) > CLONE - SparkR 2.2 QA: Update user guide for new

[jira] [Updated] (SPARK-23115) SparkR 2.3 QA: New R APIs and API docs

2018-01-16 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23115?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-23115: -- Target Version/s: (was: 2.2.0) > SparkR 2.3 QA: New R APIs and API docs >

[jira] [Updated] (SPARK-23115) SparkR 2.3 QA: New R APIs and API docs

2018-01-16 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23115?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-23115: -- Fix Version/s: (was: 2.2.0) > SparkR 2.3 QA: New R APIs and API docs >

[jira] [Updated] (SPARK-23115) SparkR 2.3 QA: New R APIs and API docs

2018-01-16 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23115?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-23115: -- Summary: SparkR 2.3 QA: New R APIs and API docs (was: CLONE - SparkR 2.2 QA: New R

[jira] [Updated] (SPARK-23105) Spark MLlib, GraphX 2.3 QA umbrella

2018-01-16 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23105?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-23105: -- Fix Version/s: (was: 2.2.0) > Spark MLlib, GraphX 2.3 QA umbrella >

[jira] [Updated] (SPARK-23105) Spark MLlib, GraphX 2.3 QA umbrella

2018-01-16 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23105?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-23105: -- Target Version/s: 2.3.0 (was: 2.2.0) > Spark MLlib, GraphX 2.3 QA umbrella >

[jira] [Updated] (SPARK-23114) Spark R 2.3 QA umbrella

2018-01-16 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23114?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-23114: -- Target Version/s: 2.3.0 (was: 2.2.0) > Spark R 2.3 QA umbrella >

[jira] [Updated] (SPARK-23105) Spark MLlib, GraphX 2.3 QA umbrella

2018-01-16 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23105?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-23105: -- Description: This JIRA lists tasks for the next Spark release's QA period for MLlib

[jira] [Updated] (SPARK-23114) Spark R 2.3 QA umbrella

2018-01-16 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23114?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-23114: -- Fix Version/s: (was: 2.2.0) > Spark R 2.3 QA umbrella > --- >

[jira] [Created] (SPARK-23117) CLONE - SparkR 2.2 QA: Check for new R APIs requiring example code

2018-01-16 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-23117: - Summary: CLONE - SparkR 2.2 QA: Check for new R APIs requiring example code Key: SPARK-23117 URL: https://issues.apache.org/jira/browse/SPARK-23117

[jira] [Created] (SPARK-23118) CLONE - SparkR 2.2 QA: Programming guide, migration guide, vignettes updates

2018-01-16 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-23118: - Summary: CLONE - SparkR 2.2 QA: Programming guide, migration guide, vignettes updates Key: SPARK-23118 URL: https://issues.apache.org/jira/browse/SPARK-23118

[jira] [Created] (SPARK-23107) CLONE - ML, Graph 2.2 QA: API: New Scala APIs, docs

2018-01-16 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-23107: - Summary: CLONE - ML, Graph 2.2 QA: API: New Scala APIs, docs Key: SPARK-23107 URL: https://issues.apache.org/jira/browse/SPARK-23107 Project: Spark

[jira] [Created] (SPARK-23116) CLONE - SparkR 2.2 QA: Update user guide for new features & APIs

2018-01-16 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-23116: - Summary: CLONE - SparkR 2.2 QA: Update user guide for new features & APIs Key: SPARK-23116 URL: https://issues.apache.org/jira/browse/SPARK-23116 Project:

[jira] [Created] (SPARK-23108) CLONE - ML, Graph 2.2 QA: API: Experimental, DeveloperApi, final, sealed audit

2018-01-16 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-23108: - Summary: CLONE - ML, Graph 2.2 QA: API: Experimental, DeveloperApi, final, sealed audit Key: SPARK-23108 URL: https://issues.apache.org/jira/browse/SPARK-23108

[jira] [Created] (SPARK-23105) Spark MLlib, GraphX 2.3 QA umbrella

2018-01-16 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-23105: - Summary: Spark MLlib, GraphX 2.3 QA umbrella Key: SPARK-23105 URL: https://issues.apache.org/jira/browse/SPARK-23105 Project: Spark Issue Type:

[jira] [Created] (SPARK-23115) CLONE - SparkR 2.2 QA: New R APIs and API docs

2018-01-16 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-23115: - Summary: CLONE - SparkR 2.2 QA: New R APIs and API docs Key: SPARK-23115 URL: https://issues.apache.org/jira/browse/SPARK-23115 Project: Spark

[jira] [Created] (SPARK-23111) CLONE - ML, Graph 2.2 QA: Update user guide for new features & APIs

2018-01-16 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-23111: - Summary: CLONE - ML, Graph 2.2 QA: Update user guide for new features & APIs Key: SPARK-23111 URL: https://issues.apache.org/jira/browse/SPARK-23111

[jira] [Created] (SPARK-23110) CLONE - ML 2.2 QA: API: Java compatibility, docs

2018-01-16 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-23110: - Summary: CLONE - ML 2.2 QA: API: Java compatibility, docs Key: SPARK-23110 URL: https://issues.apache.org/jira/browse/SPARK-23110 Project: Spark

[jira] [Created] (SPARK-23106) CLONE - ML, Graph 2.2 QA: API: Binary incompatible changes

2018-01-16 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-23106: - Summary: CLONE - ML, Graph 2.2 QA: API: Binary incompatible changes Key: SPARK-23106 URL: https://issues.apache.org/jira/browse/SPARK-23106 Project: Spark

[jira] [Created] (SPARK-23114) Spark R 2.3 QA umbrella

2018-01-16 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-23114: - Summary: Spark R 2.3 QA umbrella Key: SPARK-23114 URL: https://issues.apache.org/jira/browse/SPARK-23114 Project: Spark Issue Type: Umbrella

[jira] [Created] (SPARK-23113) CLONE - Update MLlib, GraphX websites for 2.2

2018-01-16 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-23113: - Summary: CLONE - Update MLlib, GraphX websites for 2.2 Key: SPARK-23113 URL: https://issues.apache.org/jira/browse/SPARK-23113 Project: Spark

[jira] [Created] (SPARK-23109) CLONE - ML 2.2 QA: API: Python API coverage

2018-01-16 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-23109: - Summary: CLONE - ML 2.2 QA: API: Python API coverage Key: SPARK-23109 URL: https://issues.apache.org/jira/browse/SPARK-23109 Project: Spark Issue

[jira] [Created] (SPARK-23112) CLONE - ML, Graph 2.2 QA: Programming guide update and migration guide

2018-01-16 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-23112: - Summary: CLONE - ML, Graph 2.2 QA: Programming guide update and migration guide Key: SPARK-23112 URL: https://issues.apache.org/jira/browse/SPARK-23112

[jira] [Resolved] (SPARK-23031) Merge script should allow arbitrary assignees

2018-01-16 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23031?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin resolved SPARK-23031. Resolution: Fixed Assignee: Imran Rashid Fix Version/s: 2.4.0 Fixed with

[jira] [Resolved] (SPARK-23044) merge script has bug when assigning jiras to non-contributors

2018-01-16 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23044?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin resolved SPARK-23044. Resolution: Fixed Fix Version/s: 2.4.0 Issue resolved by pull request 20236

[jira] [Assigned] (SPARK-23044) merge script has bug when assigning jiras to non-contributors

2018-01-16 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23044?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin reassigned SPARK-23044: -- Assignee: Imran Rashid > merge script has bug when assigning jiras to

[jira] [Comment Edited] (SPARK-21274) Implement EXCEPT ALL and INTERSECT ALL

2018-01-16 Thread Takeshi Yamamuro (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21274?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16328028#comment-16328028 ] Takeshi Yamamuro edited comment on SPARK-21274 at 1/17/18 12:05 AM:

[jira] [Commented] (SPARK-21274) Implement EXCEPT ALL and INTERSECT ALL

2018-01-16 Thread Takeshi Yamamuro (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21274?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16328028#comment-16328028 ] Takeshi Yamamuro commented on SPARK-21274: -- yea, I tried though, I couldn't find a rewriting

[jira] [Created] (SPARK-23104) Document that kubernetes is still "experimental"

2018-01-16 Thread Marcelo Vanzin (JIRA)
Marcelo Vanzin created SPARK-23104: -- Summary: Document that kubernetes is still "experimental" Key: SPARK-23104 URL: https://issues.apache.org/jira/browse/SPARK-23104 Project: Spark Issue

[jira] [Commented] (SPARK-23103) LevelDB store not iterating correctly when indexed value has negative value

2018-01-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23103?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16327960#comment-16327960 ] Apache Spark commented on SPARK-23103: -- User 'vanzin' has created a pull request for this issue:

[jira] [Assigned] (SPARK-23103) LevelDB store not iterating correctly when indexed value has negative value

2018-01-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23103?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-23103: Assignee: (was: Apache Spark) > LevelDB store not iterating correctly when indexed

[jira] [Assigned] (SPARK-23103) LevelDB store not iterating correctly when indexed value has negative value

2018-01-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23103?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-23103: Assignee: Apache Spark > LevelDB store not iterating correctly when indexed value has

[jira] [Created] (SPARK-23103) LevelDB store not iterating correctly when indexed value has negative value

2018-01-16 Thread Marcelo Vanzin (JIRA)
Marcelo Vanzin created SPARK-23103: -- Summary: LevelDB store not iterating correctly when indexed value has negative value Key: SPARK-23103 URL: https://issues.apache.org/jira/browse/SPARK-23103

[jira] [Commented] (SPARK-22923) Non-equi join(theta join) should use sort merge join

2018-01-16 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22923?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16327881#comment-16327881 ] Herman van Hovell commented on SPARK-22923: --- You cannot use a shuffling join for such problems.

[jira] [Created] (SPARK-23102) Migrate kafka sink

2018-01-16 Thread Jose Torres (JIRA)
Jose Torres created SPARK-23102: --- Summary: Migrate kafka sink Key: SPARK-23102 URL: https://issues.apache.org/jira/browse/SPARK-23102 Project: Spark Issue Type: Sub-task Components:

[jira] [Created] (SPARK-23101) Migrate unit test sinks

2018-01-16 Thread Jose Torres (JIRA)
Jose Torres created SPARK-23101: --- Summary: Migrate unit test sinks Key: SPARK-23101 URL: https://issues.apache.org/jira/browse/SPARK-23101 Project: Spark Issue Type: Sub-task

[jira] [Assigned] (SPARK-23095) Decorrelation of scalar subquery fails with java.util.NoSuchElementException.

2018-01-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23095?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-23095: Assignee: (was: Apache Spark) > Decorrelation of scalar subquery fails with

[jira] [Commented] (SPARK-23095) Decorrelation of scalar subquery fails with java.util.NoSuchElementException.

2018-01-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23095?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16327845#comment-16327845 ] Apache Spark commented on SPARK-23095: -- User 'dilipbiswal' has created a pull request for this

[jira] [Assigned] (SPARK-23095) Decorrelation of scalar subquery fails with java.util.NoSuchElementException.

2018-01-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23095?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-23095: Assignee: Apache Spark > Decorrelation of scalar subquery fails with

[jira] [Created] (SPARK-23100) Migrate unit test sources

2018-01-16 Thread Jose Torres (JIRA)
Jose Torres created SPARK-23100: --- Summary: Migrate unit test sources Key: SPARK-23100 URL: https://issues.apache.org/jira/browse/SPARK-23100 Project: Spark Issue Type: Sub-task

[jira] [Created] (SPARK-23099) Migrate foreach sink

2018-01-16 Thread Jose Torres (JIRA)
Jose Torres created SPARK-23099: --- Summary: Migrate foreach sink Key: SPARK-23099 URL: https://issues.apache.org/jira/browse/SPARK-23099 Project: Spark Issue Type: Sub-task

[jira] [Created] (SPARK-23098) Migrate kafka source

2018-01-16 Thread Jose Torres (JIRA)
Jose Torres created SPARK-23098: --- Summary: Migrate kafka source Key: SPARK-23098 URL: https://issues.apache.org/jira/browse/SPARK-23098 Project: Spark Issue Type: Sub-task

[jira] [Created] (SPARK-23096) Migrate rate source to v2

2018-01-16 Thread Jose Torres (JIRA)
Jose Torres created SPARK-23096: --- Summary: Migrate rate source to v2 Key: SPARK-23096 URL: https://issues.apache.org/jira/browse/SPARK-23096 Project: Spark Issue Type: Sub-task

[jira] [Created] (SPARK-23097) Migrate text socket source

2018-01-16 Thread Jose Torres (JIRA)
Jose Torres created SPARK-23097: --- Summary: Migrate text socket source Key: SPARK-23097 URL: https://issues.apache.org/jira/browse/SPARK-23097 Project: Spark Issue Type: Sub-task

[jira] [Updated] (SPARK-23095) Decorrelation of scalar subquery fails with java.util.NoSuchElementException.

2018-01-16 Thread Dilip Biswal (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23095?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dilip Biswal updated SPARK-23095: - Description: The following SQL involving scalar correlated query returns a map exception.

[jira] [Updated] (SPARK-23095) Decorrelation of scalar subquery fails with java.util.NoSuchElementException.

2018-01-16 Thread Dilip Biswal (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23095?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dilip Biswal updated SPARK-23095: - Description: The following SQL involving scalar correlated query returns a map exception.

[jira] [Created] (SPARK-23095) Decorrelation of scalar subquery fails with java.util.NoSuchElementException.

2018-01-16 Thread Dilip Biswal (JIRA)
Dilip Biswal created SPARK-23095: Summary: Decorrelation of scalar subquery fails with java.util.NoSuchElementException. Key: SPARK-23095 URL: https://issues.apache.org/jira/browse/SPARK-23095

[jira] [Updated] (SPARK-21996) Streaming ignores files with spaces in the file names

2018-01-16 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21996?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shixiong Zhu updated SPARK-21996: - Component/s: (was: SQL) Structured Streaming > Streaming ignores files with

[jira] [Assigned] (SPARK-23037) RFormula should not use deprecated OneHotEncoder and should include VectorSizeHint in pipeline

2018-01-16 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23037?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley reassigned SPARK-23037: - Assignee: Bago Amirbekian > RFormula should not use deprecated OneHotEncoder

[jira] [Resolved] (SPARK-23037) RFormula should not use deprecated OneHotEncoder and should include VectorSizeHint in pipeline

2018-01-16 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23037?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley resolved SPARK-23037. --- Resolution: Fixed Fix Version/s: 2.3.0 > RFormula should not use deprecated

[jira] [Resolved] (SPARK-23045) Have RFormula use OneHoEncoderEstimator

2018-01-16 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23045?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley resolved SPARK-23045. --- Resolution: Fixed Fix Version/s: 2.3.0 Resolved by

[jira] [Created] (SPARK-23094) Json Readers choose wrong encoding when bad records are present and fail

2018-01-16 Thread Burak Yavuz (JIRA)
Burak Yavuz created SPARK-23094: --- Summary: Json Readers choose wrong encoding when bad records are present and fail Key: SPARK-23094 URL: https://issues.apache.org/jira/browse/SPARK-23094 Project:

[jira] [Assigned] (SPARK-23093) don't modify run id

2018-01-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23093?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-23093: Assignee: (was: Apache Spark) > don't modify run id > --- > >

[jira] [Assigned] (SPARK-23093) don't modify run id

2018-01-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23093?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-23093: Assignee: Apache Spark > don't modify run id > --- > >

[jira] [Commented] (SPARK-23093) don't modify run id

2018-01-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23093?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16327776#comment-16327776 ] Apache Spark commented on SPARK-23093: -- User 'jose-torres' has created a pull request for this

[jira] [Assigned] (SPARK-23045) Have RFormula use OneHoEncoderEstimator

2018-01-16 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23045?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley reassigned SPARK-23045: - Assignee: Bago Amirbekian > Have RFormula use OneHoEncoderEstimator >

[jira] [Updated] (SPARK-23033) disable task-level retry for continuous execution

2018-01-16 Thread Jose Torres (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23033?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jose Torres updated SPARK-23033: Target Version/s: 2.3.0 > disable task-level retry for continuous execution >

[jira] [Created] (SPARK-23093) don't modify run id

2018-01-16 Thread Jose Torres (JIRA)
Jose Torres created SPARK-23093: --- Summary: don't modify run id Key: SPARK-23093 URL: https://issues.apache.org/jira/browse/SPARK-23093 Project: Spark Issue Type: Sub-task Components:

[jira] [Assigned] (SPARK-23089) "Unable to create operation log session directory" when parent directory not present

2018-01-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23089?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-23089: Assignee: Apache Spark > "Unable to create operation log session directory" when parent

[jira] [Commented] (SPARK-23089) "Unable to create operation log session directory" when parent directory not present

2018-01-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23089?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16327674#comment-16327674 ] Apache Spark commented on SPARK-23089: -- User 'mgaido91' has created a pull request for this issue:

[jira] [Assigned] (SPARK-23089) "Unable to create operation log session directory" when parent directory not present

2018-01-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23089?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-23089: Assignee: (was: Apache Spark) > "Unable to create operation log session directory"

[jira] [Resolved] (SPARK-16139) Audit tests for leaked threads

2018-01-16 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16139?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin resolved SPARK-16139. Resolution: Fixed Fix Version/s: 2.4.0 Issue resolved by pull request 19893

[jira] [Assigned] (SPARK-16139) Audit tests for leaked threads

2018-01-16 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16139?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin reassigned SPARK-16139: -- Assignee: Gabor Somogyi > Audit tests for leaked threads >

[jira] [Commented] (SPARK-22232) Row objects in pyspark created using the `Row(**kwars)` syntax do not get serialized/deserialized properly

2018-01-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22232?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16327661#comment-16327661 ] Apache Spark commented on SPARK-22232: -- User 'BryanCutler' has created a pull request for this

[jira] [Assigned] (SPARK-22232) Row objects in pyspark created using the `Row(**kwars)` syntax do not get serialized/deserialized properly

2018-01-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22232?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22232: Assignee: (was: Apache Spark) > Row objects in pyspark created using the

[jira] [Assigned] (SPARK-22232) Row objects in pyspark created using the `Row(**kwars)` syntax do not get serialized/deserialized properly

2018-01-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22232?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22232: Assignee: Apache Spark > Row objects in pyspark created using the `Row(**kwars)` syntax

[jira] [Updated] (SPARK-23091) Incorrect unit test for approxQuantile

2018-01-16 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23091?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-23091: -- Environment: Agree, go ahead with a PR. Priority: Minor (was: Major) Component/s: Tests >

[jira] [Updated] (SPARK-23091) Incorrect unit test for approxQuantile

2018-01-16 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23091?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-23091: -- Environment: (was: Agree, go ahead with a PR.) Agree, go ahead with a PR. > Incorrect unit test

[jira] [Commented] (SPARK-23050) Structured Streaming with S3 file source duplicates data because of eventual consistency.

2018-01-16 Thread Steve Loughran (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23050?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16327649#comment-16327649 ] Steve Loughran commented on SPARK-23050: {quote} Is there an API to detect S3 like file systems?

[jira] [Commented] (SPARK-23092) Migrate MemoryStream to DataSource V2

2018-01-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23092?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16327637#comment-16327637 ] Apache Spark commented on SPARK-23092: -- User 'brkyvz' has created a pull request for this issue:

[jira] [Assigned] (SPARK-23092) Migrate MemoryStream to DataSource V2

2018-01-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23092?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-23092: Assignee: Apache Spark > Migrate MemoryStream to DataSource V2 >

[jira] [Assigned] (SPARK-23092) Migrate MemoryStream to DataSource V2

2018-01-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23092?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-23092: Assignee: (was: Apache Spark) > Migrate MemoryStream to DataSource V2 >

[jira] [Created] (SPARK-23092) Migrate MemoryStream to DataSource V2

2018-01-16 Thread Burak Yavuz (JIRA)
Burak Yavuz created SPARK-23092: --- Summary: Migrate MemoryStream to DataSource V2 Key: SPARK-23092 URL: https://issues.apache.org/jira/browse/SPARK-23092 Project: Spark Issue Type: Task

[jira] [Commented] (SPARK-23050) Structured Streaming with S3 file source duplicates data because of eventual consistency.

2018-01-16 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23050?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16327632#comment-16327632 ] Shixiong Zhu commented on SPARK-23050: -- [~ste...@apache.org] Yeah, that's a good improvement for S3.

[jira] [Created] (SPARK-23091) Incorrect unit test for approxQuantile

2018-01-16 Thread Kuang Chen (JIRA)
Kuang Chen created SPARK-23091: -- Summary: Incorrect unit test for approxQuantile Key: SPARK-23091 URL: https://issues.apache.org/jira/browse/SPARK-23091 Project: Spark Issue Type: Improvement

[jira] [Assigned] (SPARK-12297) Add work-around for Parquet/Hive int96 timestamp bug.

2018-01-16 Thread Imran Rashid (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12297?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Imran Rashid reassigned SPARK-12297: Assignee: Imran Rashid (was: Marcelo Vanzin) > Add work-around for Parquet/Hive int96

[jira] [Assigned] (SPARK-12297) Add work-around for Parquet/Hive int96 timestamp bug.

2018-01-16 Thread Imran Rashid (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12297?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Imran Rashid reassigned SPARK-12297: Assignee: Marcelo Vanzin (was: Imran Rashid) > Add work-around for Parquet/Hive int96

[jira] [Assigned] (SPARK-12297) Add work-around for Parquet/Hive int96 timestamp bug.

2018-01-16 Thread Imran Rashid (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12297?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Imran Rashid reassigned SPARK-12297: Assignee: Imran Rashid (was: Marcelo Vanzin) > Add work-around for Parquet/Hive int96

[jira] [Assigned] (SPARK-12297) Add work-around for Parquet/Hive int96 timestamp bug.

2018-01-16 Thread Imran Rashid (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12297?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Imran Rashid reassigned SPARK-12297: Assignee: Marcelo Vanzin (was: Imran Rashid) > Add work-around for Parquet/Hive int96

[jira] [Assigned] (SPARK-12297) Add work-around for Parquet/Hive int96 timestamp bug.

2018-01-16 Thread Imran Rashid (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12297?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Imran Rashid reassigned SPARK-12297: Assignee: Imran Rashid (was: Marcelo Vanzin) > Add work-around for Parquet/Hive int96

[jira] [Assigned] (SPARK-12297) Add work-around for Parquet/Hive int96 timestamp bug.

2018-01-16 Thread Imran Rashid (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12297?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Imran Rashid reassigned SPARK-12297: Assignee: Marcelo Vanzin (was: Imran Rashid) > Add work-around for Parquet/Hive int96

[jira] [Commented] (SPARK-16534) Kafka 0.10 Python support

2018-01-16 Thread JIRA
[ https://issues.apache.org/jira/browse/SPARK-16534?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16327553#comment-16327553 ] Maciej Bryński commented on SPARK-16534: [~rxin] I tested this patch with 2.2.1 and everything

[jira] [Commented] (SPARK-23081) Add colRegex API to PySpark

2018-01-16 Thread Huaxin Gao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23081?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16327472#comment-16327472 ] Huaxin Gao commented on SPARK-23081: Hi Sean, are you going to work on this? If not, may I work on

[jira] [Commented] (SPARK-23084) Add unboundedPreceding(), unboundedFollowing() and currentRow() to PySpark

2018-01-16 Thread Huaxin Gao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23084?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16327473#comment-16327473 ] Huaxin Gao commented on SPARK-23084: Hi Sean, are you going to work on this? If not, may I work on

[jira] [Commented] (SPARK-21274) Implement EXCEPT ALL and INTERSECT ALL

2018-01-16 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21274?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16327453#comment-16327453 ] Reynold Xin commented on SPARK-21274: - Can't we rewrite this as two aggregates and a join?   >

[jira] [Commented] (SPARK-23079) Fix query constraints propagation with aliases

2018-01-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23079?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16327398#comment-16327398 ] Apache Spark commented on SPARK-23079: -- User 'gengliangwang' has created a pull request for this

[jira] [Updated] (SPARK-23016) Spark UI access and documentation

2018-01-16 Thread Anirudh Ramanathan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23016?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Anirudh Ramanathan updated SPARK-23016: --- Priority: Minor (was: Major) > Spark UI access and documentation >

[jira] [Assigned] (SPARK-23090) polish ColumnVector

2018-01-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23090?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-23090: Assignee: Apache Spark (was: Wenchen Fan) > polish ColumnVector > --- >

[jira] [Assigned] (SPARK-23090) polish ColumnVector

2018-01-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23090?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-23090: Assignee: Wenchen Fan (was: Apache Spark) > polish ColumnVector > --- >

[jira] [Commented] (SPARK-23090) polish ColumnVector

2018-01-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23090?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16327356#comment-16327356 ] Apache Spark commented on SPARK-23090: -- User 'cloud-fan' has created a pull request for this issue:

[jira] [Created] (SPARK-23090) polish ColumnVector

2018-01-16 Thread Wenchen Fan (JIRA)
Wenchen Fan created SPARK-23090: --- Summary: polish ColumnVector Key: SPARK-23090 URL: https://issues.apache.org/jira/browse/SPARK-23090 Project: Spark Issue Type: Improvement

[jira] [Updated] (SPARK-23089) "Unable to create operation log session directory" when parent directory not present

2018-01-16 Thread Sean Roberts (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23089?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Roberts updated SPARK-23089: - Environment: /usr/hdp/2.6.3.0-235/spark2/jars/spark-hive-thriftserver_2.11-2.2.0.2.6.3.0-235.jar

[jira] [Updated] (SPARK-23089) "Unable to create operation log session directory" when parent directory not present

2018-01-16 Thread Sean Roberts (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23089?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Roberts updated SPARK-23089: - Description: When creating a session directory, Thrift should create the parent directory

[jira] [Updated] (SPARK-23089) "Unable to create operation log session directory" when parent directory not present

2018-01-16 Thread Sean Roberts (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23089?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Roberts updated SPARK-23089: - Environment: (was: When creating a session directory, Thrift should create the parent

[jira] [Updated] (SPARK-23089) "Unable to create operation log session directory" when parent directory not present

2018-01-16 Thread Sean Roberts (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23089?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Roberts updated SPARK-23089: - Description: When creating a session directory, Thrift should create the parent directory

[jira] [Created] (SPARK-23089) "Unable to create operation log session directory" when parent directory not present

2018-01-16 Thread Sean Roberts (JIRA)
Sean Roberts created SPARK-23089: Summary: "Unable to create operation log session directory" when parent directory not present Key: SPARK-23089 URL: https://issues.apache.org/jira/browse/SPARK-23089

[jira] [Updated] (SPARK-23011) Support alternative function form with group aggregate pandas UDF

2018-01-16 Thread Li Jin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23011?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Li Jin updated SPARK-23011: --- Description: The current semantics of groupby apply is that the output schema of groupby apply is the same

[jira] [Updated] (SPARK-23011) Support alternative function form with group aggregate pandas UDF

2018-01-16 Thread Li Jin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23011?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Li Jin updated SPARK-23011: --- Description: The current semantics of groupby apply is that the output schema of groupby apply is the same

[jira] [Updated] (SPARK-23011) Support alternative function form with group aggregate pandas UDF

2018-01-16 Thread Li Jin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23011?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Li Jin updated SPARK-23011: --- Description: The current semantics of groupby apply is that the output schema of groupby apply is the same

<    1   2   3   >