[jira] [Resolved] (SPARK-20915) lpad/rpad with empty pad string different from MySQL

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-20915?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-20915. -- Resolution: Incomplete > lpad/rpad with empty pad string different from MySQL >

[jira] [Resolved] (SPARK-19680) Offsets out of range with no configured reset policy for partitions

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-19680?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-19680. -- Resolution: Incomplete > Offsets out of range with no configured reset policy for partitions

[jira] [Resolved] (SPARK-23056) parse_url regression when switched to using java.net.URI instead of java.net.URL

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-23056?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-23056. -- Resolution: Incomplete > parse_url regression when switched to using java.net.URI instead of

[jira] [Resolved] (SPARK-22748) Error in query: grouping_id() can only be used with GroupingSets/Cube/Rollup;

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-22748?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-22748. -- Resolution: Incomplete > Error in query: grouping_id() can only be used with

[jira] [Resolved] (SPARK-25263) Add scheduler integration test for SPARK-24909

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-25263?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-25263. -- Resolution: Incomplete > Add scheduler integration test for SPARK-24909 >

[jira] [Resolved] (SPARK-24842) self-join query fails on different letter case for same field

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24842?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24842. -- Resolution: Incomplete > self-join query fails on different letter case for same field >

[jira] [Resolved] (SPARK-22440) Add Calinski-Harabasz index to ClusteringEvaluator

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-22440?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-22440. -- Resolution: Incomplete > Add Calinski-Harabasz index to ClusteringEvaluator >

[jira] [Resolved] (SPARK-23369) HiveClientSuites fails with unresolved dependency

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-23369?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-23369. -- Resolution: Incomplete > HiveClientSuites fails with unresolved dependency >

[jira] [Resolved] (SPARK-16217) Support SELECT INTO statement

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-16217?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-16217. -- Resolution: Incomplete > Support SELECT INTO statement >

[jira] [Resolved] (SPARK-20891) Reduce duplicate code in typedaggregators.scala

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-20891?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-20891. -- Resolution: Incomplete > Reduce duplicate code in typedaggregators.scala >

[jira] [Resolved] (SPARK-24998) spark-sql will scan the same table repeatedly when doing multi-insert

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24998?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24998. -- Resolution: Incomplete > spark-sql will scan the same table repeatedly when doing

[jira] [Resolved] (SPARK-13998) HashingTF should extend UnaryTransformer

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-13998?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-13998. -- Resolution: Incomplete > HashingTF should extend UnaryTransformer >

[jira] [Resolved] (SPARK-22402) Allow fetcher URIs to be downloaded to specific locations relative to Mesos Sandbox

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-22402?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-22402. -- Resolution: Incomplete > Allow fetcher URIs to be downloaded to specific locations relative

[jira] [Resolved] (SPARK-24264) [Structured Streaming] Remove 'mergeSchema' option from Parquet source configuration

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24264?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24264. -- Resolution: Incomplete > [Structured Streaming] Remove 'mergeSchema' option from Parquet

[jira] [Resolved] (SPARK-24554) Add MapType Support for Arrow in PySpark

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24554?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24554. -- Resolution: Incomplete > Add MapType Support for Arrow in PySpark >

[jira] [Resolved] (SPARK-24011) Cache rdd's immediate parent ShuffleDependencies to accelerate getShuffleDependencies()

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24011?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24011. -- Resolution: Incomplete > Cache rdd's immediate parent ShuffleDependencies to accelerate >

[jira] [Resolved] (SPARK-7424) spark.ml classification, regression abstractions should add metadata to output column

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-7424?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-7424. - Resolution: Incomplete > spark.ml classification, regression abstractions should add metadata to

[jira] [Resolved] (SPARK-24084) Add job group id for query through spark-sql

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24084?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24084. -- Resolution: Incomplete > Add job group id for query through spark-sql >

[jira] [Resolved] (SPARK-23742) Filter out redundant AssociationRules

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-23742?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-23742. -- Resolution: Incomplete > Filter out redundant AssociationRules >

[jira] [Resolved] (SPARK-24298) PCAModel Memory in Pipeline

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24298?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24298. -- Resolution: Incomplete > PCAModel Memory in Pipeline >

[jira] [Resolved] (SPARK-25242) Suggestion to make sql config setting fluent

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-25242?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-25242. -- Resolution: Incomplete > Suggestion to make sql config setting fluent >

[jira] [Resolved] (SPARK-24088) only HadoopRDD leverage HDFS Cache as preferred location

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24088?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24088. -- Resolution: Incomplete > only HadoopRDD leverage HDFS Cache as preferred location >

[jira] [Resolved] (SPARK-19536) Improve capability to merge SQL data types

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-19536?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-19536. -- Resolution: Incomplete > Improve capability to merge SQL data types >

[jira] [Resolved] (SPARK-23560) Group by on struct field can add extra shuffle

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-23560?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-23560. -- Resolution: Incomplete > Group by on struct field can add extra shuffle >

[jira] [Resolved] (SPARK-24483) enableHiveSupport doesn't work with Spark 2.3 on EMR

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24483?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24483. -- Resolution: Incomplete > enableHiveSupport doesn't work with Spark 2.3 on EMR >

[jira] [Resolved] (SPARK-23705) dataframe.groupBy() may inadvertently receive sequence of non-distinct strings

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-23705?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-23705. -- Resolution: Incomplete > dataframe.groupBy() may inadvertently receive sequence of

[jira] [Resolved] (SPARK-21068) SparkR error message when passed an R object rather than Java object could be more informative

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-21068?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-21068. -- Resolution: Incomplete > SparkR error message when passed an R object rather than Java object

[jira] [Resolved] (SPARK-24449) ApplicationMaster reporter thread failure counter is not effective

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24449?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24449. -- Resolution: Incomplete > ApplicationMaster reporter thread failure counter is not effective >

[jira] [Resolved] (SPARK-22657) Hadoop fs implementation classes are not loaded if they are part of the app jar or other jar when --packages flag is used

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-22657?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-22657. -- Resolution: Incomplete > Hadoop fs implementation classes are not loaded if they are part of

[jira] [Resolved] (SPARK-23681) Switch OrcFileFormat to newer hadoop.mapreduce output classes

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-23681?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-23681. -- Resolution: Incomplete > Switch OrcFileFormat to newer hadoop.mapreduce output classes >

[jira] [Resolved] (SPARK-5362) Gradient and Optimizer to support generic output (instead of label) and data batches

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-5362?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-5362. - Resolution: Incomplete > Gradient and Optimizer to support generic output (instead of label) and

[jira] [Resolved] (SPARK-24162) Support aliased literal values for Pivot "IN" clause

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24162?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24162. -- Resolution: Incomplete > Support aliased literal values for Pivot "IN" clause >

[jira] [Resolved] (SPARK-22632) Fix the behavior of timestamp values for R's DataFrame to respect session timezone

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-22632?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-22632. -- Resolution: Incomplete > Fix the behavior of timestamp values for R's DataFrame to respect

[jira] [Resolved] (SPARK-23171) Reduce the time costs of the rule runs that do not change the plans

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-23171?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-23171. -- Resolution: Incomplete > Reduce the time costs of the rule runs that do not change the plans

[jira] [Resolved] (SPARK-24631) Cannot up cast column from bigint to smallint as it may truncate

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24631?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24631. -- Resolution: Incomplete > Cannot up cast column from bigint to smallint as it may truncate >

[jira] [Resolved] (SPARK-24218) Allow Configuration of DynamoDbEndpointUrl in KinesisReceiver

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24218?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24218. -- Resolution: Incomplete > Allow Configuration of DynamoDbEndpointUrl in KinesisReceiver >

[jira] [Resolved] (SPARK-25593) JDBC write Impala, `truncate` true option in Overwrite mode for JDBC DataFrameWriter is dropping and creating the table instead of truncating.

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-25593?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-25593. -- Resolution: Incomplete > JDBC write Impala, `truncate` true option in Overwrite mode for JDBC

[jira] [Resolved] (SPARK-25316) Spark error - ERROR ContextCleaner: Error cleaning broadcast 22, Exception thrown in awaitResult:

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-25316?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-25316. -- Resolution: Incomplete > Spark error - ERROR ContextCleaner: Error cleaning broadcast 22,

[jira] [Resolved] (SPARK-23673) PySpark dayofweek does not conform with ISO 8601

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-23673?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-23673. -- Resolution: Incomplete > PySpark dayofweek does not conform with ISO 8601 >

[jira] [Resolved] (SPARK-21722) Enable timezone-aware timestamp type when creating Pandas DataFrame.

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-21722?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-21722. -- Resolution: Incomplete > Enable timezone-aware timestamp type when creating Pandas DataFrame.

[jira] [Resolved] (SPARK-23968) allow reading JSON that is composed of pure maps

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-23968?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-23968. -- Resolution: Incomplete > allow reading JSON that is composed of pure maps >

[jira] [Resolved] (SPARK-22926) Respect table-level conf compression codec `Compression` in multiple scenarios

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-22926?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-22926. -- Resolution: Incomplete > Respect table-level conf compression codec `Compression` in multiple

[jira] [Resolved] (SPARK-21389) ALS recommendForAll optimization uses Native BLAS

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-21389?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-21389. -- Resolution: Incomplete > ALS recommendForAll optimization uses Native BLAS >

[jira] [Resolved] (SPARK-24735) Improve exception when mixing up pandas_udf types

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24735?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24735. -- Resolution: Incomplete > Improve exception when mixing up pandas_udf types >

[jira] [Resolved] (SPARK-8799) OneVsRestModel should extend ClassificationModel

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-8799?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-8799. - Resolution: Incomplete > OneVsRestModel should extend ClassificationModel >

[jira] [Resolved] (SPARK-23952) remove type parameter in DataReaderFactory

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-23952?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-23952. -- Resolution: Incomplete > remove type parameter in DataReaderFactory >

[jira] [Resolved] (SPARK-23879) Introduce MemoryBlock API instead of Platform API with Object

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-23879?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-23879. -- Resolution: Incomplete > Introduce MemoryBlock API instead of Platform API with Object >

[jira] [Resolved] (SPARK-23995) initial job has not accept any resources and executor keep exit

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-23995?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-23995. -- Resolution: Incomplete > initial job has not accept any resources and executor keep exit >

[jira] [Resolved] (SPARK-2620) case class cannot be used as key for reduce

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-2620?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-2620. - Resolution: Incomplete > case class cannot be used as key for reduce >

[jira] [Resolved] (SPARK-23543) Automatic Module creation fails in Java 9

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-23543?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-23543. -- Resolution: Incomplete > Automatic Module creation fails in Java 9 >

[jira] [Resolved] (SPARK-22743) Consolidate logic for handling spark.driver.memoryOverhead and spark.executor.memoryOverhead

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-22743?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-22743. -- Resolution: Incomplete > Consolidate logic for handling spark.driver.memoryOverhead and >

[jira] [Resolved] (SPARK-22741) Add global aggregate for typed aggregation

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-22741?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-22741. -- Resolution: Incomplete > Add global aggregate for typed aggregation >

[jira] [Resolved] (SPARK-23982) NoSuchMethodException: There is no startCredentialUpdater method in the object YarnSparkHadoopUtil

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-23982?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-23982. -- Resolution: Incomplete > NoSuchMethodException: There is no startCredentialUpdater method in

[jira] [Resolved] (SPARK-23350) [SS]Exception when stopping continuous processing application

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-23350?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-23350. -- Resolution: Incomplete > [SS]Exception when stopping continuous processing application >

[jira] [Resolved] (SPARK-25070) BlockFetchingListener#onBlockFetchSuccess throw "java.util.NoSuchElementException: key not found: shuffle_8_68_113" on ShuffleBlockFetcherIterator caused stage hang lo

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-25070?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-25070. -- Resolution: Incomplete > BlockFetchingListener#onBlockFetchSuccess throw >

[jira] [Resolved] (SPARK-24618) Allow ability to consume driver memory on worker hosts not master (option for clustermode to wait for returncode?)

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24618?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24618. -- Resolution: Incomplete > Allow ability to consume driver memory on worker hosts not master

[jira] [Resolved] (SPARK-22907) MetadataFetchFailedException broadcast is already present

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-22907?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-22907. -- Resolution: Incomplete > MetadataFetchFailedException broadcast is already present >

[jira] [Resolved] (SPARK-17248) Add native Scala enum support to Dataset Encoders

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-17248?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-17248. -- Resolution: Incomplete > Add native Scala enum support to Dataset Encoders >

[jira] [Resolved] (SPARK-25367) The column attributes obtained by Spark sql are inconsistent with hive

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-25367?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-25367. -- Resolution: Incomplete > The column attributes obtained by Spark sql are inconsistent with

[jira] [Resolved] (SPARK-24841) Memory leak in converting spark dataframe to pandas dataframe

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24841?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24841. -- Resolution: Incomplete > Memory leak in converting spark dataframe to pandas dataframe >

[jira] [Resolved] (SPARK-24164) Support column list as the pivot column in Pivot

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24164?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24164. -- Resolution: Incomplete > Support column list as the pivot column in Pivot >

[jira] [Resolved] (SPARK-17181) [Spark2.0 web ui]The status of the certain jobs is still displayed as running even if all the stages of this job have already finished

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-17181?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-17181. -- Resolution: Incomplete > [Spark2.0 web ui]The status of the certain jobs is still displayed

[jira] [Resolved] (SPARK-24844) spark REST API need to add ipFilter

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24844?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24844. -- Resolution: Incomplete > spark REST API need to add ipFilter >

[jira] [Resolved] (SPARK-24826) Self-Join not working in Apache Spark 2.2.2

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24826?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24826. -- Resolution: Incomplete > Self-Join not working in Apache Spark 2.2.2 >

[jira] [Resolved] (SPARK-10795) FileNotFoundException while deploying pyspark job on cluster

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-10795?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-10795. -- Resolution: Incomplete > FileNotFoundException while deploying pyspark job on cluster >

[jira] [Resolved] (SPARK-14834) Force adding doc for new api in pyspark with @since annotation

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-14834?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-14834. -- Resolution: Incomplete > Force adding doc for new api in pyspark with @since annotation >

[jira] [Resolved] (SPARK-13346) Using DataFrames iteratively leads to slow query planning

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-13346?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-13346. -- Resolution: Incomplete > Using DataFrames iteratively leads to slow query planning >

[jira] [Resolved] (SPARK-22925) ml model persistence creates a lot of small files

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-22925?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-22925. -- Resolution: Incomplete > ml model persistence creates a lot of small files >

[jira] [Resolved] (SPARK-24430) CREATE VIEW with UNION statement: Failed to recognize predicate 'UNION'.

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24430?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24430. -- Resolution: Incomplete > CREATE VIEW with UNION statement: Failed to recognize predicate

[jira] [Resolved] (SPARK-17265) EdgeRDD Difference throws an exception

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-17265?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-17265. -- Resolution: Incomplete > EdgeRDD Difference throws an exception >

[jira] [Resolved] (SPARK-24693) Row order preservation for operations on MLlib IndexedRowMatrix

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24693?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24693. -- Resolution: Incomplete > Row order preservation for operations on MLlib IndexedRowMatrix >

[jira] [Resolved] (SPARK-25193) insert overwrite doesn't throw exception when drop old data fails

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-25193?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-25193. -- Resolution: Incomplete > insert overwrite doesn't throw exception when drop old data fails >

[jira] [Resolved] (SPARK-24866) Artifactual ROC scores when scaling up Random Forest classifier

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24866?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24866. -- Resolution: Incomplete > Artifactual ROC scores when scaling up Random Forest classifier >

[jira] [Resolved] (SPARK-22391) add `MetadataCreationSupport` trait to separate data and metadata handling at write path

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-22391?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-22391. -- Resolution: Incomplete > add `MetadataCreationSupport` trait to separate data and metadata

[jira] [Resolved] (SPARK-3727) Trees and ensembles: More prediction functionality

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-3727?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-3727. - Resolution: Incomplete > Trees and ensembles: More prediction functionality >

[jira] [Resolved] (SPARK-23210) Introduce the concept of default value to schema

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-23210?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-23210. -- Resolution: Incomplete > Introduce the concept of default value to schema >

[jira] [Resolved] (SPARK-20691) Difference between Storage Memory as seen internally and in web UI

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-20691?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-20691. -- Resolution: Incomplete > Difference between Storage Memory as seen internally and in web UI >

[jira] [Resolved] (SPARK-23689) Spark 2.3.0/2.2.1 Some changes cause org.apache.spark.sql.catalyst.errors.package$TreeNodeException:

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-23689?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-23689. -- Resolution: Incomplete > Spark 2.3.0/2.2.1 Some changes cause >

[jira] [Resolved] (SPARK-24448) File not found on the address SparkFiles.get returns on standalone cluster

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24448?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24448. -- Resolution: Incomplete > File not found on the address SparkFiles.get returns on standalone

[jira] [Resolved] (SPARK-24426) Unexpected combination of cache and join on DataFrame

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24426?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24426. -- Resolution: Incomplete > Unexpected combination of cache and join on DataFrame >

[jira] [Resolved] (SPARK-24306) Sort a Dataset with a lambda (like RDD.sortBy)

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24306?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24306. -- Resolution: Incomplete > Sort a Dataset with a lambda (like RDD.sortBy) >

[jira] [Resolved] (SPARK-25459) Add viewOriginalText back to CatalogTable

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-25459?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-25459. -- Resolution: Incomplete > Add viewOriginalText back to CatalogTable >

[jira] [Resolved] (SPARK-13127) Upgrade Parquet to 1.9 (Fixes parquet sorting)

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-13127?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-13127. -- Resolution: Incomplete > Upgrade Parquet to 1.9 (Fixes parquet sorting) >

[jira] [Resolved] (SPARK-21927) scalastyle 1.0.0 generates SBT warnings

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-21927?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-21927. -- Resolution: Incomplete > scalastyle 1.0.0 generates SBT warnings >

[jira] [Resolved] (SPARK-24431) wrong areaUnderPR calculation in BinaryClassificationEvaluator

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24431?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24431. -- Resolution: Incomplete > wrong areaUnderPR calculation in BinaryClassificationEvaluator >

[jira] [Resolved] (SPARK-23236) Make it easier to find the rest API, especially in local mode

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-23236?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-23236. -- Resolution: Incomplete > Make it easier to find the rest API, especially in local mode >

[jira] [Resolved] (SPARK-24208) Cannot resolve column in self join after applying Pandas UDF

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24208?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24208. -- Resolution: Incomplete > Cannot resolve column in self join after applying Pandas UDF >

[jira] [Resolved] (SPARK-22245) dataframe should always put partition columns at the end

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-22245?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-22245. -- Resolution: Incomplete > dataframe should always put partition columns at the end >

[jira] [Resolved] (SPARK-21940) Support timezone for timestamps in SparkR

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-21940?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-21940. -- Resolution: Incomplete > Support timezone for timestamps in SparkR >

[jira] [Resolved] (SPARK-21353) add checkValue in spark.internal.config about how to correctly set configurations

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-21353?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-21353. -- Resolution: Incomplete > add checkValue in spark.internal.config about how to correctly set

[jira] [Resolved] (SPARK-4285) Transpose RDD[Vector] to column store for ML

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-4285?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-4285. - Resolution: Incomplete > Transpose RDD[Vector] to column store for ML >

[jira] [Resolved] (SPARK-20295) when spark.sql.adaptive.enabled is enabled, have conflict with Exchange Resue

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-20295?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-20295. -- Resolution: Incomplete > when spark.sql.adaptive.enabled is enabled, have conflict with

[jira] [Resolved] (SPARK-20618) Support Custom Partitioners in PySpark

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-20618?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-20618. -- Resolution: Incomplete > Support Custom Partitioners in PySpark >

[jira] [Resolved] (SPARK-22964) don't allow task restarts for continuous processing

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-22964?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-22964. -- Resolution: Incomplete > don't allow task restarts for continuous processing >

[jira] [Resolved] (SPARK-24266) Spark client terminates while driver is still running

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24266?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24266. -- Resolution: Incomplete > Spark client terminates while driver is still running >

[jira] [Resolved] (SPARK-24729) Spark - stackoverflow error - org.apache.spark.sql.catalyst.plans.QueryPlan

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24729?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24729. -- Resolution: Incomplete > Spark - stackoverflow error -

[jira] [Resolved] (SPARK-21536) Remove the workaroud to allow dots in field names in R's createDataFame

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-21536?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-21536. -- Resolution: Incomplete > Remove the workaroud to allow dots in field names in R's

[jira] [Resolved] (SPARK-23839) consider bucket join in cost-based JoinReorder rule

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-23839?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-23839. -- Resolution: Incomplete > consider bucket join in cost-based JoinReorder rule >

[jira] [Resolved] (SPARK-24656) SparkML Transformers and Estimators with multiple columns

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24656?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24656. -- Resolution: Incomplete > SparkML Transformers and Estimators with multiple columns >

[jira] [Resolved] (SPARK-24301) Add Instrumentation test coverage

2019-10-07 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-24301?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-24301. -- Resolution: Incomplete > Add Instrumentation test coverage >
