[jira] [Created] (SPARK-29379) SHOW FUNCTIONS don't show '!=', '<>' , 'between', 'case'

2019-10-07 Thread angerszhu (Jira)
angerszhu created SPARK-29379: - Summary: SHOW FUNCTIONS don't show '!=', '<>' , 'between', 'case' Key: SPARK-29379 URL: https://issues.apache.org/jira/browse/SPARK-29379 Project: Spark Issue

[jira] [Commented] (SPARK-24640) size(null) returns null

2019-10-07 Thread Maxim Gekk (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24640?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16946503#comment-16946503 ] Maxim Gekk commented on SPARK-24640: As far as I remember we planed to remove

[jira] [Resolved] (SPARK-25008) Add memory mode info to showMemoryUsage in TaskMemoryManager

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-25008?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-25008. -- Resolution: Incomplete > Add memory mode info to showMemoryUsage in TaskMemoryManager >

[jira] [Resolved] (SPARK-25230) Upper behavior incorrect for string contains "ß"

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-25230?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-25230. -- Resolution: Incomplete > Upper behavior incorrect for string contains "ß" >

[jira] [Resolved] (SPARK-24074) Maven package resolver downloads javadoc instead of jar

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24074?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24074. -- Resolution: Incomplete > Maven package resolver downloads javadoc instead of jar >

[jira] [Resolved] (SPARK-25165) Cannot parse Hive Struct

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-25165?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-25165. -- Resolution: Incomplete > Cannot parse Hive Struct > > >

[jira] [Resolved] (SPARK-22132) Document the Dispatcher REST API

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-22132?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-22132. -- Resolution: Incomplete > Document the Dispatcher REST API >

[jira] [Resolved] (SPARK-24838) Support uncorrelated IN/EXISTS subqueries for more operators

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24838?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24838. -- Resolution: Incomplete > Support uncorrelated IN/EXISTS subqueries for more operators >

[jira] [Resolved] (SPARK-24524) Improve aggregateMetrics: less memory usage and loops

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24524?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24524. -- Resolution: Incomplete > Improve aggregateMetrics: less memory usage and loops >

[jira] [Resolved] (SPARK-10413) ML models should support prediction on single instances

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-10413?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-10413. -- Resolution: Incomplete > ML models should support prediction on single instances >

[jira] [Resolved] (SPARK-16203) regexp_extract to return an ArrayType(StringType())

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-16203?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-16203. -- Resolution: Incomplete > regexp_extract to return an ArrayType(StringType()) >

[jira] [Resolved] (SPARK-23790) proxy-user failed connecting to a kerberos configured metastore

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-23790?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-23790. -- Resolution: Incomplete > proxy-user failed connecting to a kerberos configured metastore >

[jira] [Resolved] (SPARK-19609) Broadcast joins should pushdown join constraints as Filter to the larger relation

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-19609?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-19609. -- Resolution: Incomplete > Broadcast joins should pushdown join constraints as Filter to the

[jira] [Resolved] (SPARK-5556) Latent Dirichlet Allocation (LDA) using Gibbs sampler

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-5556?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-5556. - Resolution: Incomplete > Latent Dirichlet Allocation (LDA) using Gibbs sampler >

[jira] [Resolved] (SPARK-23498) Accuracy problem in comparison with string and integer

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-23498?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-23498. -- Resolution: Incomplete > Accuracy problem in comparison with string and integer >

[jira] [Resolved] (SPARK-24118) Support lineSep format independent from encoding

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24118?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24118. -- Resolution: Incomplete > Support lineSep format independent from encoding >

[jira] [Resolved] (SPARK-21016) Improve code fault tolerance for converting string to number

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-21016?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-21016. -- Resolution: Incomplete > Improve code fault tolerance for converting string to number >

[jira] [Resolved] (SPARK-24221) Retry spark app submission to k8 in KubernetesClientApplication

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24221?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24221. -- Resolution: Incomplete > Retry spark app submission to k8 in KubernetesClientApplication >

[jira] [Resolved] (SPARK-12014) Spark SQL query containing semicolon is broken in Beeline (related to HIVE-11100)

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-12014?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-12014. -- Resolution: Incomplete > Spark SQL query containing semicolon is broken in Beeline (related

[jira] [Resolved] (SPARK-24051) Incorrect results for certain queries using Java and Python APIs on Spark 2.3.0

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24051?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24051. -- Resolution: Incomplete > Incorrect results for certain queries using Java and Python APIs on

[jira] [Resolved] (SPARK-21122) Address starvation issues when dynamic allocation is enabled

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-21122?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-21122. -- Resolution: Incomplete > Address starvation issues when dynamic allocation is enabled >

[jira] [Resolved] (SPARK-20624) Add better handling for node shutdown

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-20624?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-20624. -- Resolution: Incomplete > Add better handling for node shutdown >

[jira] [Resolved] (SPARK-22963) Make failure recovery global and automatic for continuous processing.

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-22963?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-22963. -- Resolution: Incomplete > Make failure recovery global and automatic for continuous

[jira] [Resolved] (SPARK-24733) Dataframe saved to parquet can have different metadata then the resulting parquet file

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24733?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24733. -- Resolution: Incomplete > Dataframe saved to parquet can have different metadata then the

[jira] [Resolved] (SPARK-21812) PySpark ML Models should not depend transfering params from Java

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-21812?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-21812. -- Resolution: Incomplete > PySpark ML Models should not depend transfering params from Java >

[jira] [Resolved] (SPARK-24845) spark distribution generate exception while locally worked correctly

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24845?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24845. -- Resolution: Incomplete > spark distribution generate exception while locally worked correctly

[jira] [Resolved] (SPARK-22780) make insert commands have real children to fix UI issues

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-22780?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-22780. -- Resolution: Incomplete > make insert commands have real children to fix UI issues >

[jira] [Resolved] (SPARK-24650) GroupingSet

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24650?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24650. -- Resolution: Incomplete > GroupingSet > --- > > Key: SPARK-24650 >

[jira] [Resolved] (SPARK-24623) Hadoop - Spark Cluster - Python XGBoost - Not working in distributed mode

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24623?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24623. -- Resolution: Incomplete > Hadoop - Spark Cluster - Python XGBoost - Not working in distributed

[jira] [Resolved] (SPARK-24265) lintr checks not failing PR build

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24265?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24265. -- Resolution: Incomplete > lintr checks not failing PR build >

[jira] [Resolved] (SPARK-19498) Discussion: Making MLlib APIs extensible for 3rd party libraries

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-19498?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-19498. -- Resolution: Incomplete > Discussion: Making MLlib APIs extensible for 3rd party libraries >

[jira] [Resolved] (SPARK-8696) Streaming API for Online LDA

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-8696?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-8696. - Resolution: Incomplete > Streaming API for Online LDA > > >

[jira] [Resolved] (SPARK-8767) Abstractions for InputColParam, OutputColParam

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-8767?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-8767. - Resolution: Incomplete > Abstractions for InputColParam, OutputColParam >

[jira] [Resolved] (SPARK-24016) Yarn does not update node blacklist in static allocation

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24016?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24016. -- Resolution: Incomplete > Yarn does not update node blacklist in static allocation >

[jira] [Resolved] (SPARK-25232) Support Full-Text Search in Spark SQL

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-25232?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-25232. -- Resolution: Incomplete > Support Full-Text Search in Spark SQL >

[jira] [Resolved] (SPARK-24106) Spark Structure Streaming with RF model taking long time in processing probability for each mini batch

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24106?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24106. -- Resolution: Incomplete > Spark Structure Streaming with RF model taking long time in

[jira] [Resolved] (SPARK-24568) Code refactoring for DataType equalsXXX methods

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24568?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24568. -- Resolution: Incomplete > Code refactoring for DataType equalsXXX methods >

[jira] [Resolved] (SPARK-24862) Spark Encoder is not consistent to scala case class semantic for multiple argument lists

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24862?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24862. -- Resolution: Incomplete > Spark Encoder is not consistent to scala case class semantic for

[jira] [Resolved] (SPARK-24461) Snapshot Cache

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24461?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24461. -- Resolution: Incomplete > Snapshot Cache > -- > > Key: SPARK-24461

[jira] [Resolved] (SPARK-20782) Dataset's isCached operator

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-20782?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-20782. -- Resolution: Incomplete > Dataset's isCached operator > --- > >

[jira] [Resolved] (SPARK-23068) Jekyll doc build error does not fail build

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-23068?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-23068. -- Resolution: Incomplete > Jekyll doc build error does not fail build >

[jira] [Resolved] (SPARK-20885) JDBC predicate pushdown uses hardcoded date format

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-20885?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-20885. -- Resolution: Incomplete > JDBC predicate pushdown uses hardcoded date format >

[jira] [Resolved] (SPARK-24597) Spark ML Pipeline Should support non-linear models => DAGPipeline

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24597?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24597. -- Resolution: Incomplete > Spark ML Pipeline Should support non-linear models => DAGPipeline >

[jira] [Resolved] (SPARK-17602) PySpark - Performance Optimization Large Size of Broadcast Variable

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-17602?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-17602. -- Resolution: Incomplete > PySpark - Performance Optimization Large Size of Broadcast Variable

[jira] [Resolved] (SPARK-20007) Make SparkR apply() functions robust to workers that return empty data.frame

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-20007?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-20007. -- Resolution: Incomplete > Make SparkR apply() functions robust to workers that return empty

[jira] [Resolved] (SPARK-17129) Support statistics collection and cardinality estimation for partitioned tables

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-17129?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-17129. -- Resolution: Incomplete > Support statistics collection and cardinality estimation for

[jira] [Resolved] (SPARK-22054) Allow release managers to inject their keys

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-22054?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-22054. -- Resolution: Incomplete > Allow release managers to inject their keys >

[jira] [Resolved] (SPARK-23632) sparkR.session() error with spark packages - JVM is not ready after 10 seconds

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-23632?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-23632. -- Resolution: Incomplete > sparkR.session() error with spark packages - JVM is not ready after

[jira] [Resolved] (SPARK-23452) Extend test coverage to all ORC readers

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-23452?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-23452. -- Resolution: Incomplete > Extend test coverage to all ORC readers >

[jira] [Resolved] (SPARK-22658) SPIP: TeansorFlowOnSpark as a Scalable Deep Learning Lib of Apache Spark

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-22658?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-22658. -- Resolution: Incomplete > SPIP: TeansorFlowOnSpark as a Scalable Deep Learning Lib of Apache

[jira] [Resolved] (SPARK-15694) Implement ScriptTransformation in sql/core

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-15694?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-15694. -- Resolution: Incomplete > Implement ScriptTransformation in sql/core >

[jira] [Resolved] (SPARK-21166) Automated ML persistence

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-21166?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-21166. -- Resolution: Incomplete > Automated ML persistence > > >

[jira] [Resolved] (SPARK-23777) Missing DAG arrows between stages

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-23777?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-23777. -- Resolution: Incomplete > Missing DAG arrows between stages >

[jira] [Resolved] (SPARK-23227) Add user guide entry for collecting sub models for cross-validation classes

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-23227?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-23227. -- Resolution: Incomplete > Add user guide entry for collecting sub models for cross-validation

[jira] [Resolved] (SPARK-24406) Exposing custom spark scala ml transformers in pyspark

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24406?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24406. -- Resolution: Incomplete > Exposing custom spark scala ml transformers in pyspark >

[jira] [Resolved] (SPARK-25340) Pushes down Sample beneath deterministic Project

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-25340?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-25340. -- Resolution: Incomplete > Pushes down Sample beneath deterministic Project >

[jira] [Resolved] (SPARK-23744) Memory leak in ReadableChannelFileRegion

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-23744?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-23744. -- Resolution: Incomplete > Memory leak in ReadableChannelFileRegion >

[jira] [Resolved] (SPARK-23073) Fix incorrect R doc page header for generated sql functions

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-23073?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-23073. -- Resolution: Incomplete > Fix incorrect R doc page header for generated sql functions >

[jira] [Resolved] (SPARK-16707) TransportClientFactory.createClient may throw NPE

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-16707?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-16707. -- Resolution: Incomplete > TransportClientFactory.createClient may throw NPE >

[jira] [Resolved] (SPARK-25244) [Python] Setting `spark.sql.session.timeZone` only partially respected

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-25244?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-25244. -- Resolution: Incomplete > [Python] Setting `spark.sql.session.timeZone` only partially

[jira] [Resolved] (SPARK-23983) Disable X-Frame-Options from Spark UI response headers if explicitly configured

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-23983?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-23983. -- Resolution: Incomplete > Disable X-Frame-Options from Spark UI response headers if explicitly

[jira] [Resolved] (SPARK-12449) Pushing down arbitrary logical plans to data sources

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-12449?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-12449. -- Resolution: Incomplete > Pushing down arbitrary logical plans to data sources >

[jira] [Resolved] (SPARK-21972) Allow users to control input data persistence in ML Estimators via a handlePersistence ml.Param

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-21972?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-21972. -- Resolution: Incomplete > Allow users to control input data persistence in ML Estimators via a

[jira] [Resolved] (SPARK-20598) Iterative checkpoints do not get removed from HDFS

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-20598?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-20598. -- Resolution: Incomplete > Iterative checkpoints do not get removed from HDFS >

[jira] [Resolved] (SPARK-24260) Support for multi-statement SQL in SparkSession.sql API

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24260?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24260. -- Resolution: Incomplete > Support for multi-statement SQL in SparkSession.sql API >

[jira] [Resolved] (SPARK-21707) Improvement a special case for non-deterministic filters in optimizer

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-21707?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-21707. -- Resolution: Incomplete > Improvement a special case for non-deterministic filters in

[jira] [Resolved] (SPARK-24731) java.io.IOException: s3n://bucketname: 400 : Bad Request

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24731?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24731. -- Resolution: Incomplete > java.io.IOException: s3n://bucketname: 400 : Bad Request >

[jira] [Resolved] (SPARK-21405) Add LBFGS solver for GeneralizedLinearRegression

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-21405?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-21405. -- Resolution: Incomplete > Add LBFGS solver for GeneralizedLinearRegression >

[jira] [Resolved] (SPARK-22868) 64KB JVM bytecode limit problem with aggregation

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-22868?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-22868. -- Resolution: Incomplete > 64KB JVM bytecode limit problem with aggregation >

[jira] [Resolved] (SPARK-24447) Pyspark RowMatrix.columnSimilarities() loses spark context

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24447?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24447. -- Resolution: Incomplete > Pyspark RowMatrix.columnSimilarities() loses spark context >

[jira] [Resolved] (SPARK-23612) Specify formats for individual DateType and TimestampType columns in schemas

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-23612?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-23612. -- Resolution: Incomplete > Specify formats for individual DateType and TimestampType columns in

[jira] [Resolved] (SPARK-23442) Reading from partitioned and bucketed table uses only bucketSpec.numBuckets partitions in all cases

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-23442?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-23442. -- Resolution: Incomplete > Reading from partitioned and bucketed table uses only

[jira] [Resolved] (SPARK-21730) Consider officially dropping PyPy pre-2.5 support

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-21730?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-21730. -- Resolution: Incomplete > Consider officially dropping PyPy pre-2.5 support >

[jira] [Resolved] (SPARK-23485) Kubernetes should support node blacklist

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-23485?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-23485. -- Resolution: Incomplete > Kubernetes should support node blacklist >

[jira] [Resolved] (SPARK-21443) Very long planning duration for queries with lots of operations

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-21443?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-21443. -- Resolution: Incomplete > Very long planning duration for queries with lots of operations >

[jira] [Resolved] (SPARK-24946) PySpark - Allow np.Arrays and pd.Series in df.approxQuantile

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24946?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24946. -- Resolution: Incomplete > PySpark - Allow np.Arrays and pd.Series in df.approxQuantile >

[jira] [Resolved] (SPARK-23833) Incorrect primitive type check for input arguments of udf

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-23833?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-23833. -- Resolution: Incomplete > Incorrect primitive type check for input arguments of udf >

[jira] [Resolved] (SPARK-25215) Make PipelineModel public

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-25215?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-25215. -- Resolution: Incomplete > Make PipelineModel public > - > >

[jira] [Resolved] (SPARK-23322) Launcher handles can miss application updates if application finishes too quickly

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-23322?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-23322. -- Resolution: Incomplete > Launcher handles can miss application updates if application

[jira] [Resolved] (SPARK-24560) Fix some getTimeAsMs as getTimeAsSeconds

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24560?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24560. -- Resolution: Incomplete > Fix some getTimeAsMs as getTimeAsSeconds >

[jira] [Resolved] (SPARK-22202) Release tgz content differences for python and R

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-22202?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-22202. -- Resolution: Incomplete > Release tgz content differences for python and R >

[jira] [Resolved] (SPARK-20153) Support Multiple aws credentials in order to access multiple Hive on S3 table in spark application

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-20153?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-20153. -- Resolution: Incomplete > Support Multiple aws credentials in order to access multiple Hive on

[jira] [Resolved] (SPARK-24456) Spark submit - server environment variables are overwritten by client environment variables

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24456?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24456. -- Resolution: Incomplete > Spark submit - server environment variables are overwritten by

[jira] [Resolved] (SPARK-17877) Can not checkpoint connectedComponents resulting graph

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-17877?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-17877. -- Resolution: Incomplete > Can not checkpoint connectedComponents resulting graph >

[jira] [Resolved] (SPARK-23298) distinct.count on Dataset/DataFrame yields non-deterministic results

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-23298?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-23298. -- Resolution: Incomplete > distinct.count on Dataset/DataFrame yields non-deterministic results

[jira] [Resolved] (SPARK-25059) Exception while executing an action on DataFrame that read Json

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-25059?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-25059. -- Resolution: Incomplete > Exception while executing an action on DataFrame that read Json >

[jira] [Resolved] (SPARK-22105) Dataframe has poor performance when computing on many columns with codegen

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-22105?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-22105. -- Resolution: Incomplete > Dataframe has poor performance when computing on many columns with

[jira] [Resolved] (SPARK-22823) Race Condition when reading Broadcast shuffle input. Failed to get broadcast piece

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-22823?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-22823. -- Resolution: Incomplete > Race Condition when reading Broadcast shuffle input. Failed to get

[jira] [Resolved] (SPARK-22911) Migrate structured streaming sources to new DataSourceV2 APIs

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-22911?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-22911. -- Resolution: Incomplete > Migrate structured streaming sources to new DataSourceV2 APIs >

[jira] [Resolved] (SPARK-24608) report number of iteration/progress for ML training

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24608?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24608. -- Resolution: Incomplete > report number of iteration/progress for ML training >

[jira] [Resolved] (SPARK-24640) size(null) returns null

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24640?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24640. -- Resolution: Incomplete > size(null) returns null > > >

[jira] [Resolved] (SPARK-24095) Spark Streaming performance drastically drops when when saving dataframes with withColumn

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24095?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24095. -- Resolution: Incomplete > Spark Streaming performance drastically drops when when saving

[jira] [Resolved] (SPARK-23563) make the size fo cache in CodeGenerator configable

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-23563?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-23563. -- Resolution: Incomplete > make the size fo cache in CodeGenerator configable >

[jira] [Resolved] (SPARK-24342) Large Task prior scheduling to Reduce overall execution time

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24342?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24342. -- Resolution: Incomplete > Large Task prior scheduling to Reduce overall execution time >

[jira] [Resolved] (SPARK-24457) Performance improvement while converting stringToTimestamp in DateTimeUtils

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24457?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24457. -- Resolution: Incomplete > Performance improvement while converting stringToTimestamp in

[jira] [Resolved] (SPARK-20074) Make buffer size in unsafe external sorter configurable

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-20074?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-20074. -- Resolution: Incomplete > Make buffer size in unsafe external sorter configurable >

[jira] [Resolved] (SPARK-25480) Dynamic partitioning + saveAsTable with multiple partition columns create empty directory

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-25480?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-25480. -- Resolution: Incomplete > Dynamic partitioning + saveAsTable with multiple partition columns

[jira] [Resolved] (SPARK-23994) Add Host To Blacklist If Shuffle Cannot Complete

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-23994?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-23994. -- Resolution: Incomplete > Add Host To Blacklist If Shuffle Cannot Complete >

[jira] [Resolved] (SPARK-24474) Cores are left idle when there are a lot of tasks to run

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24474?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24474. -- Resolution: Incomplete > Cores are left idle when there are a lot of tasks to run >

[jira] [Resolved] (SPARK-25125) Spark SQL percentile_approx takes longer than Hive version for large datasets

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-25125?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-25125. -- Resolution: Incomplete > Spark SQL percentile_approx takes longer than Hive version for large

  1   2   3   4   5   6   >