[jira] [Resolved] (SPARK-23045) Have RFormula use OneHoEncoderEstimator

2018-01-16 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23045?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley resolved SPARK-23045. --- Resolution: Fixed Fix Version/s: 2.3.0 Resolved by

[jira] [Assigned] (SPARK-23037) RFormula should not use deprecated OneHotEncoder and should include VectorSizeHint in pipeline

2018-01-16 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23037?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley reassigned SPARK-23037: - Assignee: Bago Amirbekian > RFormula should not use deprecated OneHotEncoder

[jira] [Resolved] (SPARK-23037) RFormula should not use deprecated OneHotEncoder and should include VectorSizeHint in pipeline

2018-01-16 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23037?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley resolved SPARK-23037. --- Resolution: Fixed Fix Version/s: 2.3.0 > RFormula should not use deprecated

[jira] [Created] (SPARK-23098) Migrate kafka source

2018-01-16 Thread Jose Torres (JIRA)
Jose Torres created SPARK-23098: --- Summary: Migrate kafka source Key: SPARK-23098 URL: https://issues.apache.org/jira/browse/SPARK-23098 Project: Spark Issue Type: Sub-task

[jira] [Updated] (SPARK-23033) disable task-level retry for continuous execution

2018-01-16 Thread Jose Torres (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23033?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jose Torres updated SPARK-23033: Target Version/s: 2.3.0 > disable task-level retry for continuous execution >

[jira] [Created] (SPARK-23093) don't modify run id

2018-01-16 Thread Jose Torres (JIRA)
Jose Torres created SPARK-23093: --- Summary: don't modify run id Key: SPARK-23093 URL: https://issues.apache.org/jira/browse/SPARK-23093 Project: Spark Issue Type: Sub-task Components:

[jira] [Updated] (SPARK-23095) Decorrelation of scalar subquery fails with java.util.NoSuchElementException.

2018-01-16 Thread Dilip Biswal (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23095?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dilip Biswal updated SPARK-23095: - Description: The following SQL involving scalar correlated query returns a map exception.

[jira] [Assigned] (SPARK-23095) Decorrelation of scalar subquery fails with java.util.NoSuchElementException.

2018-01-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23095?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-23095: Assignee: (was: Apache Spark) > Decorrelation of scalar subquery fails with

[jira] [Created] (SPARK-23101) Migrate unit test sinks

2018-01-16 Thread Jose Torres (JIRA)
Jose Torres created SPARK-23101: --- Summary: Migrate unit test sinks Key: SPARK-23101 URL: https://issues.apache.org/jira/browse/SPARK-23101 Project: Spark Issue Type: Sub-task

[jira] [Commented] (SPARK-23103) LevelDB store not iterating correctly when indexed value has negative value

2018-01-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23103?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16327960#comment-16327960 ] Apache Spark commented on SPARK-23103: -- User 'vanzin' has created a pull request for this issue:

[jira] [Assigned] (SPARK-23103) LevelDB store not iterating correctly when indexed value has negative value

2018-01-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23103?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-23103: Assignee: Apache Spark > LevelDB store not iterating correctly when indexed value has

[jira] [Assigned] (SPARK-23103) LevelDB store not iterating correctly when indexed value has negative value

2018-01-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23103?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-23103: Assignee: (was: Apache Spark) > LevelDB store not iterating correctly when indexed

[jira] [Created] (SPARK-23104) Document that kubernetes is still "experimental"

2018-01-16 Thread Marcelo Vanzin (JIRA)
Marcelo Vanzin created SPARK-23104: -- Summary: Document that kubernetes is still "experimental" Key: SPARK-23104 URL: https://issues.apache.org/jira/browse/SPARK-23104 Project: Spark Issue

[jira] [Assigned] (SPARK-23044) merge script has bug when assigning jiras to non-contributors

2018-01-16 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23044?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin reassigned SPARK-23044: -- Assignee: Imran Rashid > merge script has bug when assigning jiras to

[jira] [Resolved] (SPARK-23044) merge script has bug when assigning jiras to non-contributors

2018-01-16 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23044?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin resolved SPARK-23044. Resolution: Fixed Fix Version/s: 2.4.0 Issue resolved by pull request 20236

[jira] [Assigned] (SPARK-23045) Have RFormula use OneHoEncoderEstimator

2018-01-16 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23045?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley reassigned SPARK-23045: - Assignee: Bago Amirbekian > Have RFormula use OneHoEncoderEstimator >

[jira] [Created] (SPARK-23097) Migrate text socket source

2018-01-16 Thread Jose Torres (JIRA)
Jose Torres created SPARK-23097: --- Summary: Migrate text socket source Key: SPARK-23097 URL: https://issues.apache.org/jira/browse/SPARK-23097 Project: Spark Issue Type: Sub-task

[jira] [Created] (SPARK-23096) Migrate rate source to v2

2018-01-16 Thread Jose Torres (JIRA)
Jose Torres created SPARK-23096: --- Summary: Migrate rate source to v2 Key: SPARK-23096 URL: https://issues.apache.org/jira/browse/SPARK-23096 Project: Spark Issue Type: Sub-task

[jira] [Commented] (SPARK-22923) Non-equi join(theta join) should use sort merge join

2018-01-16 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22923?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16327881#comment-16327881 ] Herman van Hovell commented on SPARK-22923: --- You cannot use a shuffling join for such problems.

[jira] [Created] (SPARK-23103) LevelDB store not iterating correctly when indexed value has negative value

2018-01-16 Thread Marcelo Vanzin (JIRA)
Marcelo Vanzin created SPARK-23103: -- Summary: LevelDB store not iterating correctly when indexed value has negative value Key: SPARK-23103 URL: https://issues.apache.org/jira/browse/SPARK-23103

[jira] [Commented] (SPARK-21274) Implement EXCEPT ALL and INTERSECT ALL

2018-01-16 Thread Takeshi Yamamuro (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21274?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16328028#comment-16328028 ] Takeshi Yamamuro commented on SPARK-21274: -- yea, I tried though, I couldn't find a rewriting

[jira] [Comment Edited] (SPARK-21274) Implement EXCEPT ALL and INTERSECT ALL

2018-01-16 Thread Takeshi Yamamuro (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21274?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16328028#comment-16328028 ] Takeshi Yamamuro edited comment on SPARK-21274 at 1/17/18 12:05 AM:

[jira] [Commented] (SPARK-23095) Decorrelation of scalar subquery fails with java.util.NoSuchElementException.

2018-01-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23095?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16327845#comment-16327845 ] Apache Spark commented on SPARK-23095: -- User 'dilipbiswal' has created a pull request for this

[jira] [Assigned] (SPARK-23095) Decorrelation of scalar subquery fails with java.util.NoSuchElementException.

2018-01-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23095?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-23095: Assignee: Apache Spark > Decorrelation of scalar subquery fails with

[jira] [Updated] (SPARK-21996) Streaming ignores files with spaces in the file names

2018-01-16 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21996?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shixiong Zhu updated SPARK-21996: - Component/s: (was: SQL) Structured Streaming > Streaming ignores files with

[jira] [Created] (SPARK-23099) Migrate foreach sink

2018-01-16 Thread Jose Torres (JIRA)
Jose Torres created SPARK-23099: --- Summary: Migrate foreach sink Key: SPARK-23099 URL: https://issues.apache.org/jira/browse/SPARK-23099 Project: Spark Issue Type: Sub-task

[jira] [Created] (SPARK-23102) Migrate kafka sink

2018-01-16 Thread Jose Torres (JIRA)
Jose Torres created SPARK-23102: --- Summary: Migrate kafka sink Key: SPARK-23102 URL: https://issues.apache.org/jira/browse/SPARK-23102 Project: Spark Issue Type: Sub-task Components:

[jira] [Commented] (SPARK-23093) don't modify run id

2018-01-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23093?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16327776#comment-16327776 ] Apache Spark commented on SPARK-23093: -- User 'jose-torres' has created a pull request for this

[jira] [Created] (SPARK-23095) Decorrelation of scalar subquery fails with java.util.NoSuchElementException.

2018-01-16 Thread Dilip Biswal (JIRA)
Dilip Biswal created SPARK-23095: Summary: Decorrelation of scalar subquery fails with java.util.NoSuchElementException. Key: SPARK-23095 URL: https://issues.apache.org/jira/browse/SPARK-23095

[jira] [Updated] (SPARK-23095) Decorrelation of scalar subquery fails with java.util.NoSuchElementException.

2018-01-16 Thread Dilip Biswal (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23095?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dilip Biswal updated SPARK-23095: - Description: The following SQL involving scalar correlated query returns a map exception.

[jira] [Commented] (SPARK-23060) RDD's apply function

2018-01-16 Thread Nick Pentreath (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23060?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16326866#comment-16326866 ] Nick Pentreath commented on SPARK-23060: I agree I don't see enough of a compelling case for

[jira] [Updated] (SPARK-23086) Spark SQL cannot support high concurrency for lock in HiveMetastoreCatalog

2018-01-16 Thread pin_zhang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23086?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] pin_zhang updated SPARK-23086: -- Description: * Hive metastore is mysql * Set hive.server2.thrift.max.worker.threads=500 create

[jira] [Commented] (SPARK-22991) High read latency with spark streaming 2.2.1 and kafka 0.10.0.1

2018-01-16 Thread zhaoshijie (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22991?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16327028#comment-16327028 ] zhaoshijie commented on SPARK-22991: [~kiranjapannavar] I encounter something like this ,I dont not

[jira] [Commented] (SPARK-22457) Tables are supposed to be MANAGED only taking into account whether a path is provided

2018-01-16 Thread Jacek Laskowski (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22457?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16326959#comment-16326959 ] Jacek Laskowski commented on SPARK-22457: -  That should be fairly easy to fix _iff_ we want to

[jira] [Created] (SPARK-23086) Spark SQL cannot support high concurrency for lock in HiveMetastoreCatalog

2018-01-16 Thread pin_zhang (JIRA)
pin_zhang created SPARK-23086: - Summary: Spark SQL cannot support high concurrency for lock in HiveMetastoreCatalog Key: SPARK-23086 URL: https://issues.apache.org/jira/browse/SPARK-23086 Project: Spark

[jira] [Resolved] (SPARK-22978) Register Vectorized UDFs for SQL Statement

2018-01-16 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22978?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-22978. -- Resolution: Fixed Fix Version/s: 2.3.0 Issue resolved by pull request 20171

[jira] [Comment Edited] (SPARK-22923) Non-equi join(theta join) should use sort merge join

2018-01-16 Thread Marco Gaido (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22923?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16325074#comment-16325074 ] Marco Gaido edited comment on SPARK-22923 at 1/16/18 10:47 AM: --- I dob't

[jira] [Commented] (SPARK-23090) polish ColumnVector

2018-01-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23090?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16327356#comment-16327356 ] Apache Spark commented on SPARK-23090: -- User 'cloud-fan' has created a pull request for this issue:

[jira] [Assigned] (SPARK-23090) polish ColumnVector

2018-01-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23090?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-23090: Assignee: Wenchen Fan (was: Apache Spark) > polish ColumnVector > --- >

[jira] [Assigned] (SPARK-23090) polish ColumnVector

2018-01-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23090?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-23090: Assignee: Apache Spark (was: Wenchen Fan) > polish ColumnVector > --- >

[jira] [Updated] (SPARK-23016) Spark UI access and documentation

2018-01-16 Thread Anirudh Ramanathan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23016?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Anirudh Ramanathan updated SPARK-23016: --- Priority: Minor (was: Major) > Spark UI access and documentation >

[jira] [Created] (SPARK-23090) polish ColumnVector

2018-01-16 Thread Wenchen Fan (JIRA)
Wenchen Fan created SPARK-23090: --- Summary: polish ColumnVector Key: SPARK-23090 URL: https://issues.apache.org/jira/browse/SPARK-23090 Project: Spark Issue Type: Improvement

[jira] [Commented] (SPARK-23079) Fix query constraints propagation with aliases

2018-01-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23079?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16327398#comment-16327398 ] Apache Spark commented on SPARK-23079: -- User 'gengliangwang' has created a pull request for this

[jira] [Assigned] (SPARK-14948) Exception when joining DataFrames derived form the same DataFrame

2018-01-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14948?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14948: Assignee: Apache Spark > Exception when joining DataFrames derived form the same

[jira] [Assigned] (SPARK-14948) Exception when joining DataFrames derived form the same DataFrame

2018-01-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14948?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-14948: Assignee: (was: Apache Spark) > Exception when joining DataFrames derived form the

[jira] [Commented] (SPARK-14948) Exception when joining DataFrames derived form the same DataFrame

2018-01-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14948?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16327139#comment-16327139 ] Apache Spark commented on SPARK-14948: -- User 'cloud-fan' has created a pull request for this issue:

[jira] [Updated] (SPARK-23011) Support alternative function form with group aggregate pandas UDF

2018-01-16 Thread Li Jin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23011?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Li Jin updated SPARK-23011: --- Description: The current semantics of groupby apply is that the output schema of groupby apply is the same

[jira] [Assigned] (SPARK-22392) columnar reader interface

2018-01-16 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22392?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan reassigned SPARK-22392: --- Assignee: Wenchen Fan > columnar reader interface > -- > >

[jira] [Resolved] (SPARK-22392) columnar reader interface

2018-01-16 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22392?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan resolved SPARK-22392. - Resolution: Fixed Fix Version/s: 2.3.0 Issue resolved by pull request 20153

[jira] [Updated] (SPARK-23011) Support alternative function form with group aggregate pandas UDF

2018-01-16 Thread Li Jin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23011?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Li Jin updated SPARK-23011: --- Summary: Support alternative function form with group aggregate pandas UDF (was: Prepend missing grouping

[jira] [Updated] (SPARK-23011) Support alternative function form with group aggregate pandas UDF

2018-01-16 Thread Li Jin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23011?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Li Jin updated SPARK-23011: --- Description: The current semantics of groupby apply is that the output schema of groupby apply is the same

[jira] [Updated] (SPARK-23011) Support alternative function form with group aggregate pandas UDF

2018-01-16 Thread Li Jin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23011?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Li Jin updated SPARK-23011: --- Description: The current semantics of groupby apply is that the output schema of groupby apply is the same

[jira] [Commented] (SPARK-21274) Implement EXCEPT ALL and INTERSECT ALL

2018-01-16 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21274?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16327453#comment-16327453 ] Reynold Xin commented on SPARK-21274: - Can't we rewrite this as two aggregates and a join?   >

[jira] [Commented] (SPARK-23081) Add colRegex API to PySpark

2018-01-16 Thread Huaxin Gao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23081?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16327472#comment-16327472 ] Huaxin Gao commented on SPARK-23081: Hi Sean, are you going to work on this? If not, may I work on

[jira] [Commented] (SPARK-23084) Add unboundedPreceding(), unboundedFollowing() and currentRow() to PySpark

2018-01-16 Thread Huaxin Gao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23084?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16327473#comment-16327473 ] Huaxin Gao commented on SPARK-23084: Hi Sean, are you going to work on this? If not, may I work on

[jira] [Updated] (SPARK-23089) "Unable to create operation log session directory" when parent directory not present

2018-01-16 Thread Sean Roberts (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23089?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Roberts updated SPARK-23089: - Environment: /usr/hdp/2.6.3.0-235/spark2/jars/spark-hive-thriftserver_2.11-2.2.0.2.6.3.0-235.jar

[jira] [Updated] (SPARK-23089) "Unable to create operation log session directory" when parent directory not present

2018-01-16 Thread Sean Roberts (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23089?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Roberts updated SPARK-23089: - Environment: (was: When creating a session directory, Thrift should create the parent

[jira] [Updated] (SPARK-23089) "Unable to create operation log session directory" when parent directory not present

2018-01-16 Thread Sean Roberts (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23089?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Roberts updated SPARK-23089: - Description: When creating a session directory, Thrift should create the parent directory

[jira] [Updated] (SPARK-23089) "Unable to create operation log session directory" when parent directory not present

2018-01-16 Thread Sean Roberts (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23089?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Roberts updated SPARK-23089: - Description: When creating a session directory, Thrift should create the parent directory

[jira] [Created] (SPARK-23089) "Unable to create operation log session directory" when parent directory not present

2018-01-16 Thread Sean Roberts (JIRA)
Sean Roberts created SPARK-23089: Summary: "Unable to create operation log session directory" when parent directory not present Key: SPARK-23089 URL: https://issues.apache.org/jira/browse/SPARK-23089

[jira] [Updated] (SPARK-23116) SparkR 2.3 QA: Update user guide for new features & APIs

2018-01-16 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23116?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-23116: -- Summary: SparkR 2.3 QA: Update user guide for new features & APIs (was: CLONE -

[jira] [Assigned] (SPARK-23115) SparkR 2.3 QA: New R APIs and API docs

2018-01-16 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23115?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley reassigned SPARK-23115: - Assignee: (was: Joseph K. Bradley) > SparkR 2.3 QA: New R APIs and API docs

[jira] [Updated] (SPARK-23118) SparkR 2.3 QA: Programming guide, migration guide, vignettes updates

2018-01-16 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23118?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-23118: -- Target Version/s: (was: 2.2.0) > SparkR 2.3 QA: Programming guide, migration guide,

[jira] [Updated] (SPARK-23118) SparkR 2.3 QA: Programming guide, migration guide, vignettes updates

2018-01-16 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23118?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-23118: -- Fix Version/s: (was: 2.2.0) > SparkR 2.3 QA: Programming guide, migration guide,

[jira] [Updated] (SPARK-23117) SparkR 2.3 QA: Check for new R APIs requiring example code

2018-01-16 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23117?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-23117: -- Fix Version/s: (was: 2.2.0) > SparkR 2.3 QA: Check for new R APIs requiring

[jira] [Updated] (SPARK-23117) SparkR 2.3 QA: Check for new R APIs requiring example code

2018-01-16 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23117?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-23117: -- Summary: SparkR 2.3 QA: Check for new R APIs requiring example code (was: CLONE -

[jira] [Updated] (SPARK-23118) SparkR 2.3 QA: Programming guide, migration guide, vignettes updates

2018-01-16 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23118?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-23118: -- Summary: SparkR 2.3 QA: Programming guide, migration guide, vignettes updates (was:

[jira] [Assigned] (SPARK-23118) SparkR 2.3 QA: Programming guide, migration guide, vignettes updates

2018-01-16 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23118?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley reassigned SPARK-23118: - Assignee: (was: Felix Cheung) > SparkR 2.3 QA: Programming guide, migration

[jira] [Updated] (SPARK-23116) CLONE - SparkR 2.2 QA: Update user guide for new features & APIs

2018-01-16 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23116?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-23116: -- Target Version/s: (was: 2.2.0) > CLONE - SparkR 2.2 QA: Update user guide for new

[jira] [Updated] (SPARK-23117) SparkR 2.3 QA: Check for new R APIs requiring example code

2018-01-16 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23117?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-23117: -- Target Version/s: (was: 2.2.0) > SparkR 2.3 QA: Check for new R APIs requiring

[jira] [Assigned] (SPARK-23117) SparkR 2.3 QA: Check for new R APIs requiring example code

2018-01-16 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23117?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley reassigned SPARK-23117: - Assignee: (was: Felix Cheung) > SparkR 2.3 QA: Check for new R APIs

[jira] [Assigned] (SPARK-23116) SparkR 2.3 QA: Update user guide for new features & APIs

2018-01-16 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23116?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley reassigned SPARK-23116: - Assignee: (was: Felix Cheung) > SparkR 2.3 QA: Update user guide for new

[jira] [Assigned] (SPARK-23093) don't modify run id

2018-01-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23093?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-23093: Assignee: Apache Spark > don't modify run id > --- > >

[jira] [Assigned] (SPARK-23093) don't modify run id

2018-01-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23093?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-23093: Assignee: (was: Apache Spark) > don't modify run id > --- > >

[jira] [Created] (SPARK-23094) Json Readers choose wrong encoding when bad records are present and fail

2018-01-16 Thread Burak Yavuz (JIRA)
Burak Yavuz created SPARK-23094: --- Summary: Json Readers choose wrong encoding when bad records are present and fail Key: SPARK-23094 URL: https://issues.apache.org/jira/browse/SPARK-23094 Project:

[jira] [Created] (SPARK-23100) Migrate unit test sources

2018-01-16 Thread Jose Torres (JIRA)
Jose Torres created SPARK-23100: --- Summary: Migrate unit test sources Key: SPARK-23100 URL: https://issues.apache.org/jira/browse/SPARK-23100 Project: Spark Issue Type: Sub-task

[jira] [Updated] (SPARK-23106) ML, Graph 2.3 QA: API: Binary incompatible changes

2018-01-16 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23106?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-23106: -- Fix Version/s: (was: 2.2.0) > ML, Graph 2.3 QA: API: Binary incompatible changes >

[jira] [Updated] (SPARK-23107) ML, Graph 2.3 QA: API: New Scala APIs, docs

2018-01-16 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23107?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-23107: -- Fix Version/s: (was: 2.2.0) > ML, Graph 2.3 QA: API: New Scala APIs, docs >

[jira] [Assigned] (SPARK-23108) ML, Graph 2.3 QA: API: Experimental, DeveloperApi, final, sealed audit

2018-01-16 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23108?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley reassigned SPARK-23108: - Assignee: (was: yuhao yang) > ML, Graph 2.3 QA: API: Experimental,

[jira] [Assigned] (SPARK-23107) CLONE - ML, Graph 2.2 QA: API: New Scala APIs, docs

2018-01-16 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23107?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley reassigned SPARK-23107: - Assignee: (was: Yanbo Liang) > CLONE - ML, Graph 2.2 QA: API: New Scala

[jira] [Updated] (SPARK-23108) ML, Graph 2.3 QA: API: Experimental, DeveloperApi, final, sealed audit

2018-01-16 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23108?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-23108: -- Summary: ML, Graph 2.3 QA: API: Experimental, DeveloperApi, final, sealed audit (was:

[jira] [Updated] (SPARK-23118) SparkR 2.3 QA: Programming guide, migration guide, vignettes updates

2018-01-16 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23118?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-23118: -- Description: Before the release, we need to update the SparkR Programming Guide, its

[jira] [Assigned] (SPARK-23114) Spark R 2.3 QA umbrella

2018-01-16 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23114?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley reassigned SPARK-23114: - Assignee: (was: Felix Cheung) > Spark R 2.3 QA umbrella >

[jira] [Commented] (SPARK-23114) Spark R 2.3 QA umbrella

2018-01-16 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23114?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16328078#comment-16328078 ] Joseph K. Bradley commented on SPARK-23114: --- [~felixcheung] Are you interested in shepherding

[jira] [Assigned] (SPARK-22735) Add VectorSizeHint to ML features documentation

2018-01-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22735?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22735: Assignee: (was: Apache Spark) > Add VectorSizeHint to ML features documentation >

[jira] [Commented] (SPARK-22735) Add VectorSizeHint to ML features documentation

2018-01-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22735?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16328136#comment-16328136 ] Apache Spark commented on SPARK-22735: -- User 'MrBago' has created a pull request for this issue:

[jira] [Assigned] (SPARK-22735) Add VectorSizeHint to ML features documentation

2018-01-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22735?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22735: Assignee: Apache Spark > Add VectorSizeHint to ML features documentation >

[jira] [Created] (SPARK-23119) Fix API annotation in DataSource V2 for streaming

2018-01-16 Thread Tathagata Das (JIRA)
Tathagata Das created SPARK-23119: - Summary: Fix API annotation in DataSource V2 for streaming Key: SPARK-23119 URL: https://issues.apache.org/jira/browse/SPARK-23119 Project: Spark Issue

[jira] [Resolved] (SPARK-22908) add basic continuous kafka source

2018-01-16 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22908?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das resolved SPARK-22908. --- Resolution: Fixed Fix Version/s: 2.3.0 3.0.0 Issue resolved by

[jira] [Created] (SPARK-23120) Add PMML pipeline export support to PySpark

2018-01-16 Thread holdenk (JIRA)
holdenk created SPARK-23120: --- Summary: Add PMML pipeline export support to PySpark Key: SPARK-23120 URL: https://issues.apache.org/jira/browse/SPARK-23120 Project: Spark Issue Type: Sub-task

[jira] [Updated] (SPARK-23122) Deprecate register* for UDFs in SQLContext and Catalog in PySpark

2018-01-16 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23122?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon updated SPARK-23122: - Environment: (was: Seems we allow many other ways to register UDFs in SQL statements. Some

[jira] [Updated] (SPARK-23122) Deprecate register* for UDFs in SQLContext and Catalog in PySpark

2018-01-16 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23122?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon updated SPARK-23122: - Description: Seems we allow many other ways to register UDFs in SQL statements. Some are in

[jira] [Created] (SPARK-23122) Deprecate register* for UDFs in SQLContext and Catalog in PySpark

2018-01-16 Thread Hyukjin Kwon (JIRA)
Hyukjin Kwon created SPARK-23122: Summary: Deprecate register* for UDFs in SQLContext and Catalog in PySpark Key: SPARK-23122 URL: https://issues.apache.org/jira/browse/SPARK-23122 Project: Spark

[jira] [Commented] (SPARK-23105) Spark MLlib, GraphX 2.3 QA umbrella

2018-01-16 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23105?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16328081#comment-16328081 ] Joseph K. Bradley commented on SPARK-23105: --- Sorry this is a bit late (after the branch cut),

[jira] [Updated] (SPARK-23095) Decorrelation of scalar subquery fails with java.util.NoSuchElementException.

2018-01-16 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23095?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li updated SPARK-23095: Affects Version/s: 2.2.2 > Decorrelation of scalar subquery fails with java.util.NoSuchElementException. >

[jira] [Resolved] (SPARK-23095) Decorrelation of scalar subquery fails with java.util.NoSuchElementException.

2018-01-16 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23095?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li resolved SPARK-23095. - Resolution: Fixed Assignee: Dilip Biswal Fix Version/s: 2.3.0 > Decorrelation of scalar

[jira] [Updated] (SPARK-23121) When the Spark Streaming app is running for a period of time, the page is incorrectly reported when accessing '/ jobs /' or '/ jobs / job /? Id = 13' and ui can not be a

2018-01-16 Thread guoxiaolongzte (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23121?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] guoxiaolongzte updated SPARK-23121: --- Attachment: 2.png 1.png > When the Spark Streaming app is running for a

[jira] [Commented] (SPARK-23122) Deprecate register* for UDFs in SQLContext and Catalog in PySpark

2018-01-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23122?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16328250#comment-16328250 ] Apache Spark commented on SPARK-23122: -- User 'HyukjinKwon' has created a pull request for this

[jira] [Assigned] (SPARK-23122) Deprecate register* for UDFs in SQLContext and Catalog in PySpark

2018-01-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23122?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-23122: Assignee: (was: Apache Spark) > Deprecate register* for UDFs in SQLContext and

[jira] [Assigned] (SPARK-23122) Deprecate register* for UDFs in SQLContext and Catalog in PySpark

2018-01-16 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23122?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-23122: Assignee: Apache Spark > Deprecate register* for UDFs in SQLContext and Catalog in
