[jira] [Resolved] (SPARK-24830) Problem with logging on Glassfish

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24830?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24830. -- Resolution: Incomplete > Problem with logging on Glassfish >

[jira] [Resolved] (SPARK-7206) Gaussian Mixture Model (GMM) improvements

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-7206?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-7206. - Resolution: Incomplete > Gaussian Mixture Model (GMM) improvements >

[jira] [Resolved] (SPARK-9775) Query Mesos for number of CPUs to set default parallelism

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-9775?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-9775. - Resolution: Incomplete > Query Mesos for number of CPUs to set default parallelism >

[jira] [Resolved] (SPARK-24210) incorrect handling of boolean expressions when using column in expressions in pyspark.sql.DataFrame filter function

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24210?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24210. -- Resolution: Incomplete > incorrect handling of boolean expressions when using column in

[jira] [Resolved] (SPARK-24442) Add configuration parameter to adjust the numbers of records and the characters per row before truncation when a user runs.show()

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24442?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24442. -- Resolution: Incomplete > Add configuration parameter to adjust the numbers of records and the

[jira] [Resolved] (SPARK-24440) When use constant as column we may get wrong answer versus impala

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24440?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24440. -- Resolution: Incomplete > When use constant as column we may get wrong answer versus impala >

[jira] [Resolved] (SPARK-22887) ML test for StructuredStreaming: spark.ml.fpm

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-22887?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-22887. -- Resolution: Incomplete > ML test for StructuredStreaming: spark.ml.fpm >

[jira] [Resolved] (SPARK-24616) Need to retreive free memory on command prompt on DSE cluster

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24616?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24616. -- Resolution: Incomplete > Need to retreive free memory on command prompt on DSE cluster >

[jira] [Resolved] (SPARK-22723) Add support for other data types and add mode info to ImageSchema

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-22723?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-22723. -- Resolution: Incomplete > Add support for other data types and add mode info to ImageSchema >

[jira] [Resolved] (SPARK-24512) SparkSQL ThriftServer port (ie 10015) supports TLSv1.0

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24512?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24512. -- Resolution: Incomplete > SparkSQL ThriftServer port (ie 10015) supports TLSv1.0 >

[jira] [Resolved] (SPARK-17570) Avoid Hash and Exchange in Sort Merge join if bucketing factor is multiple for tables

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-17570?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-17570. -- Resolution: Incomplete > Avoid Hash and Exchange in Sort Merge join if bucketing factor is

[jira] [Resolved] (SPARK-24756) Incorrect Statistics

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24756?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24756. -- Resolution: Incomplete > Incorrect Statistics > > > Key:

[jira] [Resolved] (SPARK-23074) Dataframe-ified zipwithindex

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-23074?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-23074. -- Resolution: Incomplete > Dataframe-ified zipwithindex > > >

[jira] [Resolved] (SPARK-24922) Iterative rdd union + reduceByKey operations on small dataset leads to "No space left on device" error on account of lot of shuffle spill.

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24922?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24922. -- Resolution: Incomplete > Iterative rdd union + reduceByKey operations on small dataset leads

[jira] [Resolved] (SPARK-25537) spark.pyspark.driver.python when set in code doesnt work

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-25537?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-25537. -- Resolution: Incomplete > spark.pyspark.driver.python when set in code doesnt work >

[jira] [Resolved] (SPARK-23058) Show create table can't show non printable field delim

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-23058?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-23058. -- Resolution: Incomplete > Show create table can't show non printable field delim >

[jira] [Resolved] (SPARK-22005) CrossValidator, TrainValidationSplit dump sub models to disk when fitting: Python API

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-22005?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-22005. -- Resolution: Incomplete > CrossValidator, TrainValidationSplit dump sub models to disk when

[jira] [Resolved] (SPARK-16483) Unifying struct fields and columns

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-16483?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-16483. -- Resolution: Incomplete > Unifying struct fields and columns >

[jira] [Resolved] (SPARK-24358) createDataFrame in Python 3 should be able to infer bytes type as Binary type

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24358?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24358. -- Resolution: Incomplete > createDataFrame in Python 3 should be able to infer bytes type as

[jira] [Resolved] (SPARK-22461) Move Spark ML model summaries into a dedicated package

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-22461?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-22461. -- Resolution: Incomplete > Move Spark ML model summaries into a dedicated package >

[jira] [Resolved] (SPARK-23255) Add user guide and examples for DataFrame image reading functions

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-23255?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-23255. -- Resolution: Incomplete > Add user guide and examples for DataFrame image reading functions >

[jira] [Resolved] (SPARK-24354) Adding support for quoteMode in Spark's build in CSV DataFrameWriter

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24354?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24354. -- Resolution: Incomplete > Adding support for quoteMode in Spark's build in CSV DataFrameWriter

[jira] [Resolved] (SPARK-23981) ShuffleBlockFetcherIterator - Spamming Logs

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-23981?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-23981. -- Resolution: Incomplete > ShuffleBlockFetcherIterator - Spamming Logs >

[jira] [Resolved] (SPARK-20819) Enhance ColumnVector to keep UnsafeArrayData for other types

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-20819?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-20819. -- Resolution: Incomplete > Enhance ColumnVector to keep UnsafeArrayData for other types >

[jira] [Resolved] (SPARK-16534) Kafka 0.10 Python support

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-16534?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-16534. -- Resolution: Incomplete > Kafka 0.10 Python support > - > >

[jira] [Resolved] (SPARK-24463) Add catalyst rule to reorder TypedFilters separated by Filters to reduce serde operations

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24463?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24463. -- Resolution: Incomplete > Add catalyst rule to reorder TypedFilters separated by Filters to

[jira] [Resolved] (SPARK-18492) GeneratedIterator grows beyond 64 KB

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-18492?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-18492. -- Resolution: Incomplete > GeneratedIterator grows beyond 64 KB >

[jira] [Resolved] (SPARK-24180) Using another dynamodb endpoint for kinesis

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24180?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24180. -- Resolution: Incomplete > Using another dynamodb endpoint for kinesis >

[jira] [Resolved] (SPARK-23571) Delete auxiliary Kubernetes resources upon application completion

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-23571?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-23571. -- Resolution: Incomplete > Delete auxiliary Kubernetes resources upon application completion >

[jira] [Resolved] (SPARK-25024) Update mesos documentation to be clear about security supported

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-25024?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-25024. -- Resolution: Incomplete > Update mesos documentation to be clear about security supported >

[jira] [Resolved] (SPARK-23704) PySpark access of individual trees in random forest is slow

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-23704?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-23704. -- Resolution: Incomplete > PySpark access of individual trees in random forest is slow >

[jira] [Resolved] (SPARK-3723) DecisionTree, RandomForest: Add more instrumentation

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-3723?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-3723. - Resolution: Incomplete > DecisionTree, RandomForest: Add more instrumentation >

[jira] [Resolved] (SPARK-24410) Missing optimization for Union on bucketed tables

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24410?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24410. -- Resolution: Incomplete > Missing optimization for Union on bucketed tables >

[jira] [Resolved] (SPARK-24587) RDD.takeOrdered uses reduce, pulling all partition data to the driver

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24587?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24587. -- Resolution: Incomplete > RDD.takeOrdered uses reduce, pulling all partition data to the

[jira] [Resolved] (SPARK-25032) Create table is failing, after dropping the database . It is not falling back to default database

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-25032?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-25032. -- Resolution: Incomplete > Create table is failing, after dropping the database . It is not

[jira] [Resolved] (SPARK-24986) OOM in BufferHolder during writes to a stream

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24986?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24986. -- Resolution: Incomplete > OOM in BufferHolder during writes to a stream >

[jira] [Resolved] (SPARK-22204) Explain output for SQL with commands shows no optimization

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-22204?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-22204. -- Resolution: Incomplete > Explain output for SQL with commands shows no optimization >

[jira] [Resolved] (SPARK-24202) Separate SQLContext dependency from SparkSession.implicits

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24202?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24202. -- Resolution: Incomplete > Separate SQLContext dependency from SparkSession.implicits >

[jira] [Resolved] (SPARK-24604) upgrade to spark 2.3.0 makes MPC model training slower

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24604?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24604. -- Resolution: Incomplete > upgrade to spark 2.3.0 makes MPC model training slower >

[jira] [Resolved] (SPARK-24200) Read subdirectories with out asterisks

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24200?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24200. -- Resolution: Incomplete > Read subdirectories with out asterisks >

[jira] [Resolved] (SPARK-15882) Discuss distributed linear algebra in spark.ml package

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-15882?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-15882. -- Resolution: Incomplete > Discuss distributed linear algebra in spark.ml package >

[jira] [Resolved] (SPARK-23002) SparkUI inconsistent driver hostname compare with other executors

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-23002?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-23002. -- Resolution: Incomplete > SparkUI inconsistent driver hostname compare with other executors >

[jira] [Resolved] (SPARK-24784) Retraining (each document as separate file) creates OOME

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24784?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24784. -- Resolution: Incomplete > Retraining (each document as separate file) creates OOME >

[jira] [Resolved] (SPARK-23536) Update each Data frame row with a random value

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-23536?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-23536. -- Resolution: Incomplete > Update each Data frame row with a random value >

[jira] [Resolved] (SPARK-23221) Fix KafkaContinuousSourceStressForDontFailOnDataLossSuite to run with enough cores

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-23221?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-23221. -- Resolution: Incomplete > Fix KafkaContinuousSourceStressForDontFailOnDataLossSuite to run

[jira] [Resolved] (SPARK-23258) Should not split Arrow record batches based on row count

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-23258?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-23258. -- Resolution: Incomplete > Should not split Arrow record batches based on row count >

[jira] [Resolved] (SPARK-22055) Port release scripts

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-22055?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-22055. -- Resolution: Incomplete > Port release scripts > > > Key:

[jira] [Resolved] (SPARK-25397) SparkSession.conf fails when given default value with Python 3

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-25397?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-25397. -- Resolution: Incomplete > SparkSession.conf fails when given default value with Python 3 >

[jira] [Resolved] (SPARK-20592) Alter table concatenate is not working as expected.

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-20592?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-20592. -- Resolution: Incomplete > Alter table concatenate is not working as expected. >

[jira] [Resolved] (SPARK-22943) OneHotEncoder supports manual specification of categorySizes

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-22943?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-22943. -- Resolution: Incomplete > OneHotEncoder supports manual specification of categorySizes >

[jira] [Resolved] (SPARK-25109) spark python should retry reading another datanode if the first one fails to connect

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-25109?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-25109. -- Resolution: Incomplete > spark python should retry reading another datanode if the first one

[jira] [Resolved] (SPARK-22954) ANALYZE TABLE fails with NoSuchTableException for temporary tables (but should have reported "not supported on views")

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-22954?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-22954. -- Resolution: Incomplete > ANALYZE TABLE fails with NoSuchTableException for temporary tables

[jira] [Resolved] (SPARK-24390) confusion of columns in projection after WITH ROLLUP

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24390?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24390. -- Resolution: Incomplete > confusion of columns in projection after WITH ROLLUP >

[jira] [Resolved] (SPARK-21302) history server WebUI show HTTP ERROR 500

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-21302?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-21302. -- Resolution: Incomplete > history server WebUI show HTTP ERROR 500 >

[jira] [Resolved] (SPARK-24144) monotonically_increasing_id on streaming dataFrames

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24144?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24144. -- Resolution: Incomplete > monotonically_increasing_id on streaming dataFrames >

[jira] [Resolved] (SPARK-23655) Add support for type aclitem (PostgresDialect)

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-23655?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-23655. -- Resolution: Incomplete > Add support for type aclitem (PostgresDialect) >

[jira] [Resolved] (SPARK-21624) Optimize communication cost of RF/GBT/DT

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-21624?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-21624. -- Resolution: Incomplete > Optimize communication cost of RF/GBT/DT >

[jira] [Resolved] (SPARK-24059) When blacklist disable always hash to a bad local directory may cause job failure

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24059?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24059. -- Resolution: Incomplete > When blacklist disable always hash to a bad local directory may

[jira] [Resolved] (SPARK-24728) org.apache.spark.repl.ExecutorClassLoader with cache

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24728?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24728. -- Resolution: Incomplete > org.apache.spark.repl.ExecutorClassLoader with cache >

[jira] [Resolved] (SPARK-11136) Warm-start support for ML estimator

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-11136?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-11136. -- Resolution: Incomplete > Warm-start support for ML estimator >

[jira] [Resolved] (SPARK-24394) Nodes in decision tree sometimes have negative impurity values

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24394?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24394. -- Resolution: Incomplete > Nodes in decision tree sometimes have negative impurity values >

[jira] [Resolved] (SPARK-23745) Remove the directories of the “hive.downloaded.resources.dir” when HiveThriftServer2 stopped

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-23745?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-23745. -- Resolution: Incomplete > Remove the directories of the “hive.downloaded.resources.dir” when

[jira] [Resolved] (SPARK-25329) Support passing Kerberos configuration information

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-25329?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-25329. -- Resolution: Incomplete > Support passing Kerberos configuration information >

[jira] [Resolved] (SPARK-24469) Support collations in Spark SQL

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24469?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24469. -- Resolution: Incomplete > Support collations in Spark SQL > --- >

[jira] [Resolved] (SPARK-23650) Slow SparkR udf (dapply)

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-23650?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-23650. -- Resolution: Incomplete > Slow SparkR udf (dapply) > > >

[jira] [Resolved] (SPARK-24269) Infer nullability rather than declaring all columns as nullable

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24269?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24269. -- Resolution: Incomplete > Infer nullability rather than declaring all columns as nullable >

[jira] [Resolved] (SPARK-19903) Watermark metadata is lost when using resolved attributes

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-19903?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-19903. -- Resolution: Incomplete > Watermark metadata is lost when using resolved attributes >

[jira] [Resolved] (SPARK-24405) parameter for python worker timeout

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24405?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24405. -- Resolution: Incomplete > parameter for python worker timeout >

[jira] [Resolved] (SPARK-17694) convert DataFrame to DataSet should check columns match

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-17694?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-17694. -- Resolution: Incomplete > convert DataFrame to DataSet should check columns match >

[jira] [Resolved] (SPARK-14585) Provide accessor methods for Pipeline stages

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-14585?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-14585. -- Resolution: Incomplete > Provide accessor methods for Pipeline stages >

[jira] [Resolved] (SPARK-25377) spark sql dataframe cache is invalid

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-25377?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-25377. -- Resolution: Incomplete > spark sql dataframe cache is invalid >

[jira] [Resolved] (SPARK-25361) Support for Kinesis Client Library 2.0

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-25361?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-25361. -- Resolution: Incomplete > Support for Kinesis Client Library 2.0 >

[jira] [Resolved] (SPARK-24904) Join with broadcasted dataframe causes shuffle of redundant data

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24904?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24904. -- Resolution: Incomplete > Join with broadcasted dataframe causes shuffle of redundant data >

[jira] [Resolved] (SPARK-24382) Spark Structured Streaming aggregation on old timestamp data

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24382?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24382. -- Resolution: Incomplete > Spark Structured Streaming aggregation on old timestamp data >

[jira] [Resolved] (SPARK-24651) Add ability to write null values while writing JSON

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24651?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24651. -- Resolution: Incomplete > Add ability to write null values while writing JSON >

[jira] [Resolved] (SPARK-23531) When explain, plan's output should include attribute type info

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-23531?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-23531. -- Resolution: Incomplete > When explain, plan's output should include attribute type info >

[jira] [Resolved] (SPARK-5158) Allow for keytab-based HDFS security in Standalone mode

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-5158?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-5158. - Resolution: Incomplete > Allow for keytab-based HDFS security in Standalone mode >

[jira] [Resolved] (SPARK-24189) Spark Strcutured Streaming not working with the Kafka Transactions

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24189?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24189. -- Resolution: Incomplete > Spark Strcutured Streaming not working with the Kafka Transactions >

[jira] [Resolved] (SPARK-25171) After restart, StreamingContext is replaying the last successful micro-batch right before the stop

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-25171?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-25171. -- Resolution: Incomplete > After restart, StreamingContext is replaying the last successful

[jira] [Resolved] (SPARK-24280) Speed up indexing of files in object stores by using listFiles(path, recursive=true)

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24280?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24280. -- Resolution: Incomplete > Speed up indexing of files in object stores by using listFiles(path,

[jira] [Resolved] (SPARK-22869) 64KB JVM bytecode limit problem with filter

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-22869?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-22869. -- Resolution: Incomplete > 64KB JVM bytecode limit problem with filter >

[jira] [Resolved] (SPARK-23954) Converting spark dataframe containing int64 fields to R dataframes leads to impredictable errors.

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-23954?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-23954. -- Resolution: Incomplete > Converting spark dataframe containing int64 fields to R dataframes

[jira] [Resolved] (SPARK-23337) withWatermark raises an exception on struct objects

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-23337?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-23337. -- Resolution: Incomplete > withWatermark raises an exception on struct objects >

[jira] [Resolved] (SPARK-22004) CrossValidator, TrainValidationSplit dump sub models to disk when fitting: Scala API

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-22004?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-22004. -- Resolution: Incomplete > CrossValidator, TrainValidationSplit dump sub models to disk when

[jira] [Resolved] (SPARK-24503) Implement SparkSQL authorization plugin in Apache Ranger

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24503?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24503. -- Resolution: Incomplete > Implement SparkSQL authorization plugin in Apache Ranger >

[jira] [Resolved] (SPARK-9120) Add multivariate regression (or prediction) interface

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-9120?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-9120. - Resolution: Incomplete > Add multivariate regression (or prediction) interface >

[jira] [Resolved] (SPARK-20174) Analyzer gives mysterious AnalysisException when posexplode used in withColumn

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-20174?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-20174. -- Resolution: Incomplete > Analyzer gives mysterious AnalysisException when posexplode used in

[jira] [Resolved] (SPARK-23575) ERROR RetryingHMSHandler:159 - AlreadyExistsException(message:

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-23575?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-23575. -- Resolution: Incomplete > ERROR RetryingHMSHandler:159 - AlreadyExistsException(message: >

[jira] [Resolved] (SPARK-25065) Driver and executors pick the wrong logging configuration file.

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-25065?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-25065. -- Resolution: Incomplete > Driver and executors pick the wrong logging configuration file. >

[jira] [Resolved] (SPARK-24827) Some memory waste in History Server by strings in AccumulableInfo objects

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24827?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24827. -- Resolution: Incomplete > Some memory waste in History Server by strings in AccumulableInfo

[jira] [Resolved] (SPARK-23832) Adding possibility to set timestamp into KafkaRowWriter

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-23832?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-23832. -- Resolution: Incomplete > Adding possibility to set timestamp into KafkaRowWriter >

[jira] [Resolved] (SPARK-10817) ML abstraction umbrella

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-10817?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-10817. -- Resolution: Incomplete > ML abstraction umbrella > --- > >

[jira] [Resolved] (SPARK-18649) sc.textFile(my_file).collect() raises socket.timeout on large files

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-18649?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-18649. -- Resolution: Incomplete > sc.textFile(my_file).collect() raises socket.timeout on large files

[jira] [Resolved] (SPARK-18082) Locality Sensitive Hashing (LSH) - SignRandomProjection

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-18082?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-18082. -- Resolution: Incomplete > Locality Sensitive Hashing (LSH) - SignRandomProjection >

[jira] [Resolved] (SPARK-24362) SUM function precision issue

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24362?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24362. -- Resolution: Incomplete > SUM function precision issue > > >

[jira] [Resolved] (SPARK-22035) the value of statistical logicalPlan.stats.sizeInBytes which is not expected

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-22035?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-22035. -- Resolution: Incomplete > the value of statistical logicalPlan.stats.sizeInBytes which is not

[jira] [Resolved] (SPARK-9140) Replace TimeTracker by Stopwatch

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-9140?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-9140. - Resolution: Incomplete > Replace TimeTracker by Stopwatch > > >

[jira] [Resolved] (SPARK-22600) Fix 64kb limit for deeply nested expressions under wholestage codegen

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-22600?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-22600. -- Resolution: Incomplete > Fix 64kb limit for deeply nested expressions under wholestage

[jira] [Resolved] (SPARK-24258) SPIP: Improve PySpark support for ML Matrix and Vector types

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24258?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24258. -- Resolution: Incomplete > SPIP: Improve PySpark support for ML Matrix and Vector types >

[jira] [Resolved] (SPARK-25198) org.apache.spark.sql.catalyst.parser.ParseException: DataType json is not supported.

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-25198?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-25198. -- Resolution: Incomplete > org.apache.spark.sql.catalyst.parser.ParseException: DataType json

<    1   2   3   4   5   6   >