[jira] [Resolved] (SPARK-23537) Logistic Regression without standardization

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-23537?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-23537. -- Resolution: Incomplete > Logistic Regression without standardization >

[jira] [Resolved] (SPARK-25020) Unable to Perform Graceful Shutdown in Spark Streaming with Hadoop 2.8

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-25020?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-25020. -- Resolution: Incomplete > Unable to Perform Graceful Shutdown in Spark Streaming with Hadoop

[jira] [Resolved] (SPARK-24357) createDataFrame in Python infers large integers as long type and then fails silently when converting them

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24357?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24357. -- Resolution: Incomplete > createDataFrame in Python infers large integers as long type and

[jira] [Resolved] (SPARK-15867) Use bucket files for TABLESAMPLE BUCKET

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-15867?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-15867. -- Resolution: Incomplete > Use bucket files for TABLESAMPLE BUCKET >

[jira] [Resolved] (SPARK-23996) Implement the optimal KLL algorithms for quantiles in streams

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-23996?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-23996. -- Resolution: Incomplete > Implement the optimal KLL algorithms for quantiles in streams >

[jira] [Resolved] (SPARK-23368) Avoid unnecessary Exchange or Sort after projection

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-23368?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-23368. -- Resolution: Incomplete > Avoid unnecessary Exchange or Sort after projection >

[jira] [Resolved] (SPARK-23730) Save and expose "in bag" tracking for random forest model

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-23730?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-23730. -- Resolution: Incomplete > Save and expose "in bag" tracking for random forest model >

[jira] [Resolved] (SPARK-24473) It is no need to clip the predictive value by maxValue and minValue when computing gradient on SVDplusplus model

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24473?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24473. -- Resolution: Incomplete > It is no need to clip the predictive value by maxValue and minValue

[jira] [Resolved] (SPARK-5572) LDA improvement listing

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-5572?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-5572. - Resolution: Incomplete > LDA improvement listing >

[jira] [Resolved] (SPARK-22415) lint-r fails if lint-r.R installs any new packages

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-22415?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-22415. -- Resolution: Incomplete > lint-r fails if lint-r.R installs any new packages >

[jira] [Resolved] (SPARK-23987) Unused mailing lists

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-23987?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-23987. -- Resolution: Incomplete > Unused mailing lists >

[jira] [Resolved] (SPARK-24974) Spark put all file's paths into SharedInMemoryCache even for unused partitions.

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24974?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24974. -- Resolution: Incomplete > Spark put all file's paths into SharedInMemoryCache even for unused

[jira] [Resolved] (SPARK-24955) spark continuing to execute on a task despite not reading all data from a downed machine

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24955?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24955. -- Resolution: Incomplete > spark continuing to execute on a task despite not reading all data

[jira] [Resolved] (SPARK-24550) Add support for Kubernetes specific metrics

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24550?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24550. -- Resolution: Incomplete > Add support for Kubernetes specific metrics >

[jira] [Resolved] (SPARK-25585) Allow users to specify scale of result in Decimal arithmetic

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-25585?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-25585. -- Resolution: Incomplete > Allow users to specify scale of result in Decimal arithmetic >

[jira] [Resolved] (SPARK-21406) Add logLikelihood to GLR families

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-21406?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-21406. -- Resolution: Incomplete > Add logLikelihood to GLR families >

[jira] [Resolved] (SPARK-12878) Dataframe fails with nested User Defined Types

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-12878?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-12878. -- Resolution: Incomplete > Dataframe fails with nested User Defined Types >

[jira] [Resolved] (SPARK-9636) Treat $SPARK_HOME as write-only

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-9636?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-9636. - Resolution: Incomplete > Treat $SPARK_HOME as write-only >

[jira] [Resolved] (SPARK-8614) Row order preservation for operations on MLlib IndexedRowMatrix

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-8614?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-8614. - Resolution: Incomplete > Row order preservation for operations on MLlib IndexedRowMatrix >

[jira] [Resolved] (SPARK-24585) Adding ability to audit file system before and after test to ensure all files are cleaned up.

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24585?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24585. -- Resolution: Incomplete > Adding ability to audit file system before and after test to ensure

[jira] [Resolved] (SPARK-18245) Improving support for bucketed table

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-18245?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-18245. -- Resolution: Incomplete > Improving support for bucketed table >

[jira] [Resolved] (SPARK-24081) Spark SQL drops the table while writing into table in "overwrite" mode.

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24081?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24081. -- Resolution: Incomplete > Spark SQL drops the table while writing into table in "overwrite"

[jira] [Resolved] (SPARK-15690) Fast single-node (single-process) in-memory shuffle

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-15690?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-15690. -- Resolution: Incomplete > Fast single-node (single-process) in-memory shuffle >

[jira] [Resolved] (SPARK-24745) Map function does not keep rdd name

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24745?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24745. -- Resolution: Incomplete > Map function does not keep rdd name >

[jira] [Resolved] (SPARK-22731) Add a test for ROWID type to OracleIntegrationSuite

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-22731?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-22731. -- Resolution: Incomplete > Add a test for ROWID type to OracleIntegrationSuite >

[jira] [Resolved] (SPARK-24964) Please add OWASP Dependency Check to all comonent builds(pom.xml)

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24964?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24964. -- Resolution: Incomplete > Please add OWASP Dependency Check to all comonent builds(pom.xml) >

[jira] [Resolved] (SPARK-24122) Allow automatic driver restarts on K8s

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24122?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24122. -- Resolution: Incomplete > Allow automatic driver restarts on K8s >

[jira] [Resolved] (SPARK-25311) `SPARK_LOCAL_HOSTNAME` unsupport IPV6 when do host checking

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-25311?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-25311. -- Resolution: Incomplete > `SPARK_LOCAL_HOSTNAME` unsupport IPV6 when do host checking >

[jira] [Resolved] (SPARK-24910) Spark Bloom Filter Closure Serialization improvement for very high volume of Data

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24910?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24910. -- Resolution: Incomplete > Spark Bloom Filter Closure Serialization improvement for very high

[jira] [Resolved] (SPARK-22114) The condition of OnlineLDAOptimizer convergence should be configurable

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-22114?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-22114. -- Resolution: Incomplete > The condition of OnlineLDAOptimizer convergence should be

[jira] [Resolved] (SPARK-25103) CompletionIterator may delay GC of completed resources

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-25103?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-25103. -- Resolution: Incomplete > CompletionIterator may delay GC of completed resources >

[jira] [Resolved] (SPARK-23686) Make better usage of org.apache.spark.ml.util.Instrumentation

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-23686?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-23686. -- Resolution: Incomplete > Make better usage of org.apache.spark.ml.util.Instrumentation >

[jira] [Resolved] (SPARK-15573) Backwards-compatible persistence for spark.ml

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-15573?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-15573. -- Resolution: Incomplete > Backwards-compatible persistence for spark.ml >

[jira] [Resolved] (SPARK-20732) Copy cache data when node is being shut down

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-20732?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-20732. -- Resolution: Incomplete > Copy cache data when node is being shut down >

[jira] [Resolved] (SPARK-24163) Support "ANY" or sub-query for Pivot "IN" clause

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24163?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24163. -- Resolution: Incomplete > Support "ANY" or sub-query for Pivot "IN" clause >

[jira] [Resolved] (SPARK-20744) Predicates with multiple columns do not work

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-20744?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-20744. -- Resolution: Incomplete > Predicates with multiple columns do not work >

[jira] [Resolved] (SPARK-24100) Add the CompressionCodec to the saveAsTextFiles interface.

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24100?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24100. -- Resolution: Incomplete > Add the CompressionCodec to the saveAsTextFiles interface. >

[jira] [Resolved] (SPARK-24750) HiveCaseSensitiveInferenceMode with INFER_AND_SAVE will show WRITE permission denied even if select table operation

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24750?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24750. -- Resolution: Incomplete > HiveCaseSensitiveInferenceMode with INFER_AND_SAVE will show WRITE

[jira] [Resolved] (SPARK-25217) Error thrown when creating BlockMatrix

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-25217?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-25217. -- Resolution: Incomplete > Error thrown when creating BlockMatrix >

[jira] [Resolved] (SPARK-22359) Improve the test coverage of window functions

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-22359?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-22359. -- Resolution: Incomplete > Improve the test coverage of window functions >

[jira] [Resolved] (SPARK-18600) BZ2 CRC read error needs better reporting

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-18600?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-18600. -- Resolution: Incomplete > BZ2 CRC read error needs better reporting >

[jira] [Resolved] (SPARK-21962) Distributed Tracing in Spark

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-21962?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-21962. -- Resolution: Incomplete > Distributed Tracing in Spark >

[jira] [Resolved] (SPARK-20443) The blockSize of MLLIB ALS should be setting by the User

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-20443?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-20443. -- Resolution: Incomplete > The blockSize of MLLIB ALS should be setting by the User >

[jira] [Resolved] (SPARK-24607) Distribute by rand() can lead to data inconsistency

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24607?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24607. -- Resolution: Incomplete > Distribute by rand() can lead to data inconsistency >

[jira] [Resolved] (SPARK-25107) Spark 2.2.0 Upgrade Issue : Throwing TreeNodeException: makeCopy, tree: CatalogRelation Errors

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-25107?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-25107. -- Resolution: Incomplete > Spark 2.2.0 Upgrade Issue : Throwing TreeNodeException: makeCopy,

[jira] [Resolved] (SPARK-23797) SparkSQL performance on small TPCDS tables is very low when compared to Drill or Presto

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-23797?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-23797. -- Resolution: Incomplete > SparkSQL performance on small TPCDS tables is very low when compared

[jira] [Resolved] (SPARK-8582) Optimize checkpointing to avoid computing an RDD twice

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-8582?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-8582. - Resolution: Incomplete > Optimize checkpointing to avoid computing an RDD twice >

[jira] [Resolved] (SPARK-21076) R dapply doesn't return array or raw columns when array have different length

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-21076?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-21076. -- Resolution: Incomplete > R dapply doesn't return array or raw columns when array have

[jira] [Resolved] (SPARK-16418) DataFrame.filter fails if it references a window function

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-16418?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-16418. -- Resolution: Incomplete > DataFrame.filter fails if it references a window function >

[jira] [Resolved] (SPARK-22565) Session-based windowing

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-22565?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-22565. -- Resolution: Incomplete > Session-based windowing >

[jira] [Resolved] (SPARK-24969) SQL: to_date function can't parse date strings in different locales.

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24969?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24969. -- Resolution: Incomplete > SQL: to_date function can't parse date strings in different locales.

[jira] [Resolved] (SPARK-15516) Schema merging in driver fails for parquet when merging LongType and IntegerType

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-15516?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-15516. -- Resolution: Incomplete > Schema merging in driver fails for parquet when merging LongType and

[jira] [Resolved] (SPARK-19241) remove hive generated table properties if they are not useful in Spark

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-19241?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-19241. -- Resolution: Incomplete > remove hive generated table properties if they are not useful in

[jira] [Resolved] (SPARK-21885) HiveMetastoreCatalog.InferIfNeeded too slow when caseSensitiveInference enabled

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-21885?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-21885. -- Resolution: Incomplete > HiveMetastoreCatalog.InferIfNeeded too slow when

[jira] [Resolved] (SPARK-23664) Add interface to collect query result through file iterator

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-23664?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-23664. -- Resolution: Incomplete > Add interface to collect query result through file iterator >

[jira] [Resolved] (SPARK-25428) Support plain Kerberos Authentication with Spark

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-25428?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-25428. -- Resolution: Incomplete > Support plain Kerberos Authentication with Spark >

[jira] [Resolved] (SPARK-24764) Add ServiceLoader implementation for SparkHadoopUtil

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24764?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24764. -- Resolution: Incomplete > Add ServiceLoader implementation for SparkHadoopUtil >

[jira] [Resolved] (SPARK-18822) Support ML Pipeline in SparkR

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-18822?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-18822. -- Resolution: Incomplete > Support ML Pipeline in SparkR >

[jira] [Resolved] (SPARK-24617) Spark driver not requesting another executor once original executor exits due to 'lost worker'

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24617?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24617. -- Resolution: Incomplete > Spark driver not requesting another executor once original executor

[jira] [Resolved] (SPARK-24240) Add a config to control whether InMemoryFileIndex should update cache when refresh.

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24240?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24240. -- Resolution: Incomplete > Add a config to control whether InMemoryFileIndex should update

[jira] [Resolved] (SPARK-15691) Refactor and improve Hive support

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-15691?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-15691. -- Resolution: Incomplete > Refactor and improve Hive support >

[jira] [Resolved] (SPARK-24450) Error: Exception in thread "main" java.lang.NoSuchMethodError: org.apache.curator.utils.PathUtils.validatePath(Ljava/lang/String;)Ljava/lang/String;

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24450?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24450. -- Resolution: Incomplete > Error: Exception in thread "main" java.lang.NoSuchMethodError: >

[jira] [Resolved] (SPARK-25030) SparkSubmit.doSubmit will not return result if the mainClass submitted creates a Timer()

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-25030?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-25030. -- Resolution: Incomplete > SparkSubmit.doSubmit will not return result if the mainClass

[jira] [Resolved] (SPARK-23292) python tests related to pandas are skipped with python 2

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-23292?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-23292. -- Resolution: Incomplete > python tests related to pandas are skipped with python 2 >

[jira] [Resolved] (SPARK-14604) Modify design of ML model summaries

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-14604?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-14604. -- Resolution: Incomplete > Modify design of ML model summaries >

[jira] [Resolved] (SPARK-23837) Create table as select gives exception if the spark generated alias name contains comma

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-23837?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-23837. -- Resolution: Incomplete > Create table as select gives exception if the spark generated alias

[jira] [Resolved] (SPARK-24837) Add kafka as spark metrics sink

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24837?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24837. -- Resolution: Incomplete > Add kafka as spark metrics sink >

[jira] [Resolved] (SPARK-24273) Failure while using .checkpoint method to private S3 store via S3A connector

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24273?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24273. -- Resolution: Incomplete > Failure while using .checkpoint method to private S3 store via S3A

[jira] [Resolved] (SPARK-25351) Handle Pandas category type when converting from Python with Arrow

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-25351?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-25351. -- Resolution: Incomplete > Handle Pandas category type when converting from Python with Arrow >

[jira] [Resolved] (SPARK-23669) Executors fetch jars and name the jars with md5 prefix

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-23669?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-23669. -- Resolution: Incomplete > Executors fetch jars and name the jars with md5 prefix >

[jira] [Resolved] (SPARK-25219) KMeans Clustering - Text Data - Results are incorrect

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-25219?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-25219. -- Resolution: Incomplete > KMeans Clustering - Text Data - Results are incorrect >

[jira] [Resolved] (SPARK-25180) Spark standalone failure in Utils.doFetchFile() if nslookup of local hostname fails

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-25180?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-25180. -- Resolution: Incomplete > Spark standalone failure in Utils.doFetchFile() if nslookup of local

[jira] [Resolved] (SPARK-12126) JDBC datasource processes filters only commonly pushed down.

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-12126?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-12126. -- Resolution: Incomplete > JDBC datasource processes filters only commonly pushed down. >

[jira] [Resolved] (SPARK-20629) Copy shuffle data when nodes are being shut down

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-20629?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-20629. -- Resolution: Incomplete > Copy shuffle data when nodes are being shut down >

[jira] [Resolved] (SPARK-24293) Serialized shuffle supports mapSideCombine

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24293?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24293. -- Resolution: Incomplete > Serialized shuffle supports mapSideCombine >

[jira] [Resolved] (SPARK-23740) Add FPGrowth Param for filtering out very common items

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-23740?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-23740. -- Resolution: Incomplete > Add FPGrowth Param for filtering out very common items >

[jira] [Resolved] (SPARK-15041) adding mode strategy for ml.feature.Imputer for categorical features

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-15041?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-15041. -- Resolution: Incomplete > adding mode strategy for ml.feature.Imputer for categorical features

[jira] [Resolved] (SPARK-23858) Need to apply pyarrow adjustments to complex types with DateType/TimestampType

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-23858?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-23858. -- Resolution: Incomplete > Need to apply pyarrow adjustments to complex types with >

[jira] [Resolved] (SPARK-23237) Add UI / endpoint for threaddumps for executors with active tasks

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-23237?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-23237. -- Resolution: Incomplete > Add UI / endpoint for threaddumps for executors with active tasks >

[jira] [Resolved] (SPARK-15777) Catalog federation

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-15777?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-15777. -- Resolution: Incomplete > Catalog federation >

[jira] [Resolved] (SPARK-21040) On executor/worker decommission consider speculatively re-launching current tasks

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-21040?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-21040. -- Resolution: Incomplete > On executor/worker decommission consider speculatively re-launching

[jira] [Resolved] (SPARK-24905) Spark 2.3 Internal URL env variable

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24905?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24905. -- Resolution: Incomplete > Spark 2.3 Internal URL env variable >

[jira] [Resolved] (SPARK-24494) Give users possibility to skip own classes in SparkContext.getCallSite()

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24494?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24494. -- Resolution: Incomplete > Give users possibility to skip own classes in

[jira] [Resolved] (SPARK-23796) There's no API to change state RDD's name

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-23796?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-23796. -- Resolution: Incomplete > There's no API to change state RDD's name >

[jira] [Resolved] (SPARK-22728) Unify artifact access for (mesos, standalone and yarn) when HDFS is available

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-22728?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-22728. -- Resolution: Incomplete > Unify artifact access for (mesos, standalone and yarn) when HDFS is

[jira] [Resolved] (SPARK-24748) Support for reporting custom metrics via Streaming Query Progress

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24748?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24748. -- Resolution: Incomplete > Support for reporting custom metrics via Streaming Query Progress >

[jira] [Resolved] (SPARK-22918) sbt test (spark - local) fail after upgrading to 2.2.1 with: java.security.AccessControlException: access denied org.apache.derby.security.SystemPermission( "engine", "

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-22918?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-22918. -- Resolution: Incomplete > sbt test (spark - local) fail after upgrading to 2.2.1 with: >

[jira] [Resolved] (SPARK-23181) Add compatibility tests for SHS serialized data / disk format

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-23181?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-23181. -- Resolution: Incomplete > Add compatibility tests for SHS serialized data / disk format >

[jira] [Resolved] (SPARK-21084) Improvements to dynamic allocation for notebook use cases

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-21084?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-21084. -- Resolution: Incomplete > Improvements to dynamic allocation for notebook use cases >

[jira] [Resolved] (SPARK-23631) Add summary to RandomForestClassificationModel

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-23631?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-23631. -- Resolution: Incomplete > Add summary to RandomForestClassificationModel >

[jira] [Resolved] (SPARK-24939) Support YARN Shared Cache in Spark

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24939?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24939. -- Resolution: Incomplete > Support YARN Shared Cache in Spark >

[jira] [Resolved] (SPARK-24049) Add a feature to not start speculative tasks when average task duration is less than a configurable absolute number

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24049?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24049. -- Resolution: Incomplete > Add a feature to not start speculative tasks when average task

[jira] [Resolved] (SPARK-24689) java.io.NotSerializableException: org.apache.spark.mllib.clustering.DistributedLDAModel

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24689?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24689. -- Resolution: Incomplete > java.io.NotSerializableException: >

[jira] [Resolved] (SPARK-24527) select column alias should support quotation marks

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24527?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24527. -- Resolution: Incomplete > select column alias should support quotation marks >

[jira] [Resolved] (SPARK-20869) Master should clear failed apps when worker down

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-20869?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-20869. -- Resolution: Incomplete > Master should clear failed apps when worker down >

[jira] [Resolved] (SPARK-25022) Add spark.executor.pyspark.memory support to Mesos

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-25022?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-25022. -- Resolution: Incomplete > Add spark.executor.pyspark.memory support to Mesos >

[jira] [Resolved] (SPARK-24835) col function ignores drop

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24835?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24835. -- Resolution: Incomplete > col function ignores drop >

[jira] [Resolved] (SPARK-23607) Use HDFS extended attributes to store application summary to improve the Spark History Server performance

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-23607?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-23607. -- Resolution: Incomplete > Use HDFS extended attributes to store application summary to improve

[jira] [Resolved] (SPARK-25057) Unable to start spark on master URL

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-25057?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-25057. -- Resolution: Incomplete > Unable to start spark on master URL >

[jira] [Resolved] (SPARK-24832) Improve inputMetrics's bytesRead update for ColumnarBatch

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24832?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24832. -- Resolution: Incomplete > Improve inputMetrics's bytesRead update for ColumnarBatch >
