[jira] [Assigned] (SPARK-22692) Reduce the number of generated mutable states

2018-01-30 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22692?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li reassigned SPARK-22692: --- Assignee: Marco Gaido > Reduce the number of generated mutable states >

[jira] [Updated] (SPARK-22939) Support Spark UDF in registerFunction

2018-01-30 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22939?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li updated SPARK-22939: Issue Type: Improvement (was: Bug) > Support Spark UDF in registerFunction >

[jira] [Updated] (SPARK-22961) Constant columns no longer picked as constraints in 2.3

2018-01-30 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22961?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li updated SPARK-22961: Issue Type: Bug (was: Improvement) > Constant columns no longer picked as constraints in 2.3 >

[jira] [Updated] (SPARK-16060) Vectorized Orc reader

2018-01-30 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16060?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li updated SPARK-16060: Labels: release-notes releasenotes (was: release-notes) > Vectorized Orc reader >

[jira] [Updated] (SPARK-22510) Exceptions caused by 64KB JVM bytecode or 64K constant pool entry limit

2018-01-30 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22510?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li updated SPARK-22510: Labels: releasenotes (was: ) > Exceptions caused by 64KB JVM bytecode or 64K constant pool entry limit >

[jira] [Assigned] (SPARK-22510) Exceptions caused by 64KB JVM bytecode or 64K constant pool entry limit

2018-01-30 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22510?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li reassigned SPARK-22510: --- Assignee: Kazuaki Ishizaki > Exceptions caused by 64KB JVM bytecode or 64K constant pool entry

[jira] [Updated] (SPARK-22510) Exceptions caused by 64KB JVM bytecode or 64K constant pool entry limit

2018-01-30 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22510?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li updated SPARK-22510: Fix Version/s: (was: 2.3.0) > Exceptions caused by 64KB JVM bytecode or 64K constant pool entry limit

[jira] [Updated] (SPARK-22510) Exceptions caused by 64KB JVM bytecode or 64K constant pool entry limit

2018-01-30 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22510?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li updated SPARK-22510: Fix Version/s: 2.3.0 > Exceptions caused by 64KB JVM bytecode or 64K constant pool entry limit >

[jira] [Updated] (SPARK-20392) Slow performance when calling fit on ML pipeline for dataset with many columns but few rows

2018-01-30 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20392?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li updated SPARK-20392: Component/s: SQL > Slow performance when calling fit on ML pipeline for dataset with many > columns but

[jira] [Updated] (SPARK-20682) Add new ORCFileFormat based on Apache ORC

2018-01-30 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20682?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li updated SPARK-20682: Labels: releasenotes (was: ) > Add new ORCFileFormat based on Apache ORC >

[jira] [Updated] (SPARK-23219) Rename ReadTask to DataReaderFactory

2018-01-30 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23219?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li updated SPARK-23219: Parent Issue: SPARK-15689 (was: SPARK-22386) > Rename ReadTask to DataReaderFactory >

[jira] [Assigned] (SPARK-23280) add map type support to ColumnVector

2018-01-30 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23280?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-23280: Assignee: Wenchen Fan (was: Apache Spark) > add map type support to ColumnVector >

[jira] [Commented] (SPARK-23280) add map type support to ColumnVector

2018-01-30 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23280?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16346372#comment-16346372 ] Apache Spark commented on SPARK-23280: -- User 'cloud-fan' has created a pull request for this issue:

[jira] [Assigned] (SPARK-23280) add map type support to ColumnVector

2018-01-30 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23280?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-23280: Assignee: Apache Spark (was: Wenchen Fan) > add map type support to ColumnVector >

[jira] [Updated] (SPARK-22400) rename some APIs and classes to make their meaning clearer

2018-01-30 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22400?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li updated SPARK-22400: Parent Issue: SPARK-15689 (was: SPARK-22386) > rename some APIs and classes to make their meaning clearer

[jira] [Updated] (SPARK-22452) DataSourceV2Options should have getInt, getBoolean, etc.

2018-01-30 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22452?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li updated SPARK-22452: Parent Issue: SPARK-15689 (was: SPARK-22386) > DataSourceV2Options should have getInt, getBoolean, etc. >

[jira] [Updated] (SPARK-22392) columnar reader interface

2018-01-30 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22392?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li updated SPARK-22392: Parent Issue: SPARK-15689 (was: SPARK-22386) > columnar reader interface >

[jira] [Updated] (SPARK-22389) partitioning reporting

2018-01-30 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22389?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li updated SPARK-22389: Fix Version/s: (was: 2.3.1) 2.3.0 > partitioning reporting >

[jira] [Updated] (SPARK-22389) partitioning reporting

2018-01-30 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22389?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li updated SPARK-22389: Parent Issue: SPARK-15689 (was: SPARK-22386) > partitioning reporting >

[jira] [Updated] (SPARK-22387) propagate session configs to data source read/write options

2018-01-30 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22387?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li updated SPARK-22387: Parent Issue: SPARK-15689 (was: SPARK-22386) > propagate session configs to data source read/write

[jira] [Updated] (SPARK-22386) Data Source V2 improvements

2018-01-30 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22386?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li updated SPARK-22386: Labels: releasenotes (was: ) > Data Source V2 improvements >

[jira] [Created] (SPARK-23280) add map type support to ColumnVector

2018-01-30 Thread Wenchen Fan (JIRA)
Wenchen Fan created SPARK-23280: --- Summary: add map type support to ColumnVector Key: SPARK-23280 URL: https://issues.apache.org/jira/browse/SPARK-23280 Project: Spark Issue Type: Sub-task

[jira] [Updated] (SPARK-23260) remove V2 from the class name of data source reader/writer

2018-01-30 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23260?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li updated SPARK-23260: Issue Type: Sub-task (was: Improvement) Parent: SPARK-15689 > remove V2 from the class name of

[jira] [Updated] (SPARK-23262) mix-in interface should extend the interface it aimed to mix in

2018-01-30 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23262?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li updated SPARK-23262: Issue Type: Sub-task (was: Bug) Parent: SPARK-15689 > mix-in interface should extend the

[jira] [Updated] (SPARK-20960) make ColumnVector public

2018-01-30 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20960?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li updated SPARK-20960: Labels: releasenotes (was: ) > make ColumnVector public >

[jira] [Resolved] (SPARK-22969) aggregateByKey with aggregator compression

2018-01-30 Thread zhengruifeng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22969?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengruifeng resolved SPARK-22969. -- Resolution: Not A Problem > aggregateByKey with aggregator compression >

[jira] [Resolved] (SPARK-23272) add calendar interval type support to ColumnVector

2018-01-30 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23272?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan resolved SPARK-23272. - Resolution: Fixed Fix Version/s: 2.3.0 Issue resolved by pull request 20438

[jira] [Commented] (SPARK-22969) aggregateByKey with aggregator compression

2018-01-30 Thread zhengruifeng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22969?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16346359#comment-16346359 ] zhengruifeng commented on SPARK-22969: -- [~srowen] Mailing list is a better place to discuss.

[jira] [Updated] (SPARK-22971) OneVsRestModel should use temporary RawPredictionCol

2018-01-30 Thread zhengruifeng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22971?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengruifeng updated SPARK-22971: - Affects Version/s: (was: 2.3.0) 2.4.0 > OneVsRestModel should use

[jira] [Assigned] (SPARK-23040) BlockStoreShuffleReader's return Iterator isn't interruptible if aggregator or ordering is specified

2018-01-30 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23040?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-23040: Assignee: (was: Apache Spark) > BlockStoreShuffleReader's return Iterator isn't

[jira] [Assigned] (SPARK-23040) BlockStoreShuffleReader's return Iterator isn't interruptible if aggregator or ordering is specified

2018-01-30 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23040?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-23040: Assignee: Apache Spark > BlockStoreShuffleReader's return Iterator isn't interruptible if

[jira] [Commented] (SPARK-23040) BlockStoreShuffleReader's return Iterator isn't interruptible if aggregator or ordering is specified

2018-01-30 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23040?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16346325#comment-16346325 ] Apache Spark commented on SPARK-23040: -- User 'advancedxy' has created a pull request for this issue:

[jira] [Resolved] (SPARK-23279) Avoid triggering distributed job for Console sink

2018-01-30 Thread Saisai Shao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23279?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Saisai Shao resolved SPARK-23279. - Resolution: Fixed Assignee: Saisai Shao Fix Version/s: 2.3.0 > Avoid triggering

[jira] [Commented] (SPARK-23279) Avoid triggering distributed job for Console sink

2018-01-30 Thread Saisai Shao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23279?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16346301#comment-16346301 ] Saisai Shao commented on SPARK-23279: - Issue resolved by pull request 20447

[jira] [Updated] (SPARK-23202) Break down DataSourceV2Writer.commit into two phase

2018-01-30 Thread Sameer Agarwal (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23202?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sameer Agarwal updated SPARK-23202: --- Target Version/s: 2.3.0 > Break down DataSourceV2Writer.commit into two phase >

[jira] [Updated] (SPARK-23202) Break down DataSourceV2Writer.commit into two phase

2018-01-30 Thread Gengliang Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23202?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gengliang Wang updated SPARK-23202: --- Affects Version/s: (was: 2.2.1) 2.3.0 > Break down

[jira] [Updated] (SPARK-23203) DataSourceV2 should use immutable trees.

2018-01-30 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23203?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li updated SPARK-23203: Priority: Blocker (was: Major) > DataSourceV2 should use immutable trees. >

[jira] [Commented] (SPARK-23203) DataSourceV2 should use immutable trees.

2018-01-30 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23203?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16346253#comment-16346253 ] Apache Spark commented on SPARK-23203: -- User 'cloud-fan' has created a pull request for this issue:

[jira] [Comment Edited] (SPARK-23251) ClassNotFoundException: scala.Any when there's a missing implicit Map encoder

2018-01-30 Thread Bruce Robbins (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23251?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16346246#comment-16346246 ] Bruce Robbins edited comment on SPARK-23251 at 1/31/18 4:35 AM: [~srowen] 

[jira] [Commented] (SPARK-23251) ClassNotFoundException: scala.Any when there's a missing implicit Map encoder

2018-01-30 Thread Bruce Robbins (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23251?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16346246#comment-16346246 ] Bruce Robbins commented on SPARK-23251: --- [~srowen] This also occurs with compiled apps submitted

[jira] [Resolved] (SPARK-23274) ReplaceExceptWithFilter fails on dataframes filtered on same column

2018-01-30 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23274?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li resolved SPARK-23274. - Resolution: Fixed Fix Version/s: 2.3.0 > ReplaceExceptWithFilter fails on dataframes filtered on

[jira] [Commented] (SPARK-18016) Code Generation: Constant Pool Past Limit for Wide/Nested Dataset

2018-01-30 Thread Gaurav Garg (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18016?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16346200#comment-16346200 ] Gaurav Garg commented on SPARK-18016: - Thanks [~kiszk] for helping me out. I have attached the logs

[jira] [Updated] (SPARK-18016) Code Generation: Constant Pool Past Limit for Wide/Nested Dataset

2018-01-30 Thread Gaurav Garg (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18016?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gaurav Garg updated SPARK-18016: Attachment: 910825_9.zip > Code Generation: Constant Pool Past Limit for Wide/Nested Dataset >

[jira] [Commented] (SPARK-23279) Avoid triggering distributed job for Console sink

2018-01-30 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23279?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16346158#comment-16346158 ] Apache Spark commented on SPARK-23279: -- User 'jerryshao' has created a pull request for this issue:

[jira] [Assigned] (SPARK-23279) Avoid triggering distributed job for Console sink

2018-01-30 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23279?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-23279: Assignee: (was: Apache Spark) > Avoid triggering distributed job for Console sink >

[jira] [Assigned] (SPARK-23279) Avoid triggering distributed job for Console sink

2018-01-30 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23279?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-23279: Assignee: Apache Spark > Avoid triggering distributed job for Console sink >

[jira] [Resolved] (SPARK-23277) Spark ALS : param coldStartStrategy does not exist.

2018-01-30 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23277?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-23277. --- Resolution: Invalid > Spark ALS : param coldStartStrategy does not exist. >

[jira] [Created] (SPARK-23279) Avoid triggering distributed job for Console sink

2018-01-30 Thread Saisai Shao (JIRA)
Saisai Shao created SPARK-23279: --- Summary: Avoid triggering distributed job for Console sink Key: SPARK-23279 URL: https://issues.apache.org/jira/browse/SPARK-23279 Project: Spark Issue Type:

[jira] [Commented] (SPARK-23273) Spark Dataset withColumn - schema column order isn't the same as case class paramether order

2018-01-30 Thread Liang-Chi Hsieh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23273?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16346148#comment-16346148 ] Liang-Chi Hsieh commented on SPARK-23273: - The {{name}} column will be added after {{age}} in

[jira] [Commented] (SPARK-23254) Add user guide entry for DataFrame multivariate summary

2018-01-30 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23254?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16346118#comment-16346118 ] Apache Spark commented on SPARK-23254: -- User 'WeichenXu123' has created a pull request for this

[jira] [Assigned] (SPARK-23254) Add user guide entry for DataFrame multivariate summary

2018-01-30 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23254?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-23254: Assignee: Apache Spark > Add user guide entry for DataFrame multivariate summary >

[jira] [Assigned] (SPARK-23254) Add user guide entry for DataFrame multivariate summary

2018-01-30 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23254?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-23254: Assignee: (was: Apache Spark) > Add user guide entry for DataFrame multivariate

[jira] [Commented] (SPARK-23092) Migrate MemoryStream to DataSource V2

2018-01-30 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23092?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16346110#comment-16346110 ] Apache Spark commented on SPARK-23092: -- User 'tdas' has created a pull request for this issue:

[jira] [Resolved] (SPARK-23276) Enable UDT tests in (Hive)OrcHadoopFsRelationSuite

2018-01-30 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23276?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li resolved SPARK-23276. - Resolution: Fixed Assignee: Dongjoon Hyun Fix Version/s: 2.3.0 > Enable UDT tests in

[jira] [Updated] (SPARK-23261) Rename Pandas UDFs

2018-01-30 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23261?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li updated SPARK-23261: Description: Rename the public APIs of pandas udfs from  - PANDAS SCALAR UDF -> SCALAR PANDAS UDF -

[jira] [Updated] (SPARK-23261) Rename Pandas UDFs

2018-01-30 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23261?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li updated SPARK-23261: Fix Version/s: (was: 2.4.0) 2.3.0 > Rename Pandas UDFs >

[jira] [Updated] (SPARK-23202) Break down DataSourceV2Writer.commit into two phase

2018-01-30 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23202?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan updated SPARK-23202: Priority: Blocker (was: Major) > Break down DataSourceV2Writer.commit into two phase >

[jira] [Commented] (SPARK-23251) ClassNotFoundException: scala.Any when there's a missing implicit Map encoder

2018-01-30 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23251?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16346075#comment-16346075 ] Sean Owen commented on SPARK-23251: --- Hm. I don't know if this is related to Encoders and the mechanism

[jira] [Assigned] (SPARK-23274) ReplaceExceptWithFilter fails on dataframes filtered on same column

2018-01-30 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23274?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-23274: Assignee: Xiao Li (was: Apache Spark) > ReplaceExceptWithFilter fails on dataframes

[jira] [Commented] (SPARK-23274) ReplaceExceptWithFilter fails on dataframes filtered on same column

2018-01-30 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23274?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16346071#comment-16346071 ] Apache Spark commented on SPARK-23274: -- User 'gatorsmile' has created a pull request for this issue:

[jira] [Assigned] (SPARK-23274) ReplaceExceptWithFilter fails on dataframes filtered on same column

2018-01-30 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23274?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-23274: Assignee: Apache Spark (was: Xiao Li) > ReplaceExceptWithFilter fails on dataframes

[jira] [Commented] (SPARK-23251) ClassNotFoundException: scala.Any when there's a missing implicit Map encoder

2018-01-30 Thread Bruce Robbins (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23251?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16346050#comment-16346050 ] Bruce Robbins commented on SPARK-23251: --- I commented out the following line in 

[jira] [Commented] (SPARK-18057) Update structured streaming kafka from 10.0.1 to 10.2.0

2018-01-30 Thread John Cheng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18057?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16345963#comment-16345963 ] John Cheng commented on SPARK-18057: Apache Kafka is now at version 1.0. For people who want to use

[jira] [Commented] (SPARK-23157) withColumn fails for a column that is a result of mapped DataSet

2018-01-30 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23157?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16345949#comment-16345949 ] Apache Spark commented on SPARK-23157: -- User 'henryr' has created a pull request for this issue:

[jira] [Resolved] (SPARK-23275) hive/tests have been failing when run locally on the laptop (Mac) with OOM

2018-01-30 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23275?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li resolved SPARK-23275. - Resolution: Fixed Assignee: Dilip Biswal Fix Version/s: 2.3.0 > hive/tests have been

[jira] [Commented] (SPARK-23236) Make it easier to find the rest API, especially in local mode

2018-01-30 Thread Alex Bozarth (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23236?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16345827#comment-16345827 ] Alex Bozarth commented on SPARK-23236: -- For #1, a REST API endpoint shouldn't return html, but we

[jira] [Commented] (SPARK-23237) Add UI / endpoint for threaddumps for executors with active tasks

2018-01-30 Thread Alex Bozarth (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23237?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16345818#comment-16345818 ] Alex Bozarth commented on SPARK-23237: -- I would rather keep it to an api endpoint, but what I'm

[jira] [Updated] (SPARK-23274) ReplaceExceptWithFilter fails on dataframes filtered on same column

2018-01-30 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23274?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-23274: --- Labels: (was: correctness) > ReplaceExceptWithFilter fails on dataframes filtered on same column >

[jira] [Resolved] (SPARK-23278) Spark ALS : param coldStartStrategy does not exist.

2018-01-30 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23278?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-23278. --- Resolution: Duplicate You opened this twice, so I closed it. Please don't reopen JIRAs. Your

[jira] [Closed] (SPARK-23278) Spark ALS : param coldStartStrategy does not exist.

2018-01-30 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23278?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen closed SPARK-23278. - > Spark ALS : param coldStartStrategy does not exist. >

[jira] [Reopened] (SPARK-23278) Spark ALS : param coldStartStrategy does not exist.

2018-01-30 Thread Surya Prakash Reddy (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23278?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Surya Prakash Reddy reopened SPARK-23278: - > Spark ALS : param coldStartStrategy does not exist. >

[jira] [Resolved] (SPARK-23278) Spark ALS : param coldStartStrategy does not exist.

2018-01-30 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23278?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-23278. --- Resolution: Duplicate > Spark ALS : param coldStartStrategy does not exist. >

[jira] [Assigned] (SPARK-23265) Update multi-column error handling logic in QuantileDiscretizer

2018-01-30 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23265?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-23265: Assignee: (was: Apache Spark) > Update multi-column error handling logic in

[jira] [Assigned] (SPARK-23265) Update multi-column error handling logic in QuantileDiscretizer

2018-01-30 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23265?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-23265: Assignee: Apache Spark > Update multi-column error handling logic in QuantileDiscretizer

[jira] [Commented] (SPARK-23265) Update multi-column error handling logic in QuantileDiscretizer

2018-01-30 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23265?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16345723#comment-16345723 ] Apache Spark commented on SPARK-23265: -- User 'huaxingao' has created a pull request for this issue:

[jira] [Updated] (SPARK-23278) Spark ALS : param coldStartStrategy does not exist.

2018-01-30 Thread Surya Prakash Reddy (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23278?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Surya Prakash Reddy updated SPARK-23278: Description: An error occurred while calling o105.getParam. :

[jira] [Created] (SPARK-23278) Spark ALS : param coldStartStrategy does not exist.

2018-01-30 Thread Surya Prakash Reddy (JIRA)
Surya Prakash Reddy created SPARK-23278: --- Summary: Spark ALS : param coldStartStrategy does not exist. Key: SPARK-23278 URL: https://issues.apache.org/jira/browse/SPARK-23278 Project: Spark

[jira] [Created] (SPARK-23277) Spark ALS : param coldStartStrategy does not exist.

2018-01-30 Thread Surya Prakash Reddy (JIRA)
Surya Prakash Reddy created SPARK-23277: --- Summary: Spark ALS : param coldStartStrategy does not exist. Key: SPARK-23277 URL: https://issues.apache.org/jira/browse/SPARK-23277 Project: Spark

[jira] [Commented] (SPARK-23275) hive/tests have been failing when run locally on the laptop (Mac) with OOM

2018-01-30 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23275?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16345686#comment-16345686 ] Apache Spark commented on SPARK-23275: -- User 'dilipbiswal' has created a pull request for this

[jira] [Assigned] (SPARK-23275) hive/tests have been failing when run locally on the laptop (Mac) with OOM

2018-01-30 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23275?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-23275: Assignee: (was: Apache Spark) > hive/tests have been failing when run locally on the

[jira] [Assigned] (SPARK-23275) hive/tests have been failing when run locally on the laptop (Mac) with OOM

2018-01-30 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23275?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-23275: Assignee: Apache Spark > hive/tests have been failing when run locally on the laptop

[jira] [Updated] (SPARK-23274) ReplaceExceptWithFilter fails on dataframes filtered on same column

2018-01-30 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23274?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-23274: --- Labels: correctness (was: ) > ReplaceExceptWithFilter fails on dataframes filtered on same column >

[jira] [Assigned] (SPARK-23276) Enable UDT tests in (Hive)OrcHadoopFsRelationSuite

2018-01-30 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23276?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-23276: Assignee: (was: Apache Spark) > Enable UDT tests in (Hive)OrcHadoopFsRelationSuite >

[jira] [Assigned] (SPARK-23276) Enable UDT tests in (Hive)OrcHadoopFsRelationSuite

2018-01-30 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23276?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-23276: Assignee: Apache Spark > Enable UDT tests in (Hive)OrcHadoopFsRelationSuite >

[jira] [Commented] (SPARK-23276) Enable UDT tests in (Hive)OrcHadoopFsRelationSuite

2018-01-30 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23276?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16345679#comment-16345679 ] Apache Spark commented on SPARK-23276: -- User 'dongjoon-hyun' has created a pull request for this

[jira] [Updated] (SPARK-23276) Enable UDT tests in (Hive)OrcHadoopFsRelationSuite

2018-01-30 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23276?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun updated SPARK-23276: -- Component/s: Tests > Enable UDT tests in (Hive)OrcHadoopFsRelationSuite >

[jira] [Updated] (SPARK-23276) Enable UDT tests in (Hive)OrcHadoopFsRelationSuite

2018-01-30 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23276?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun updated SPARK-23276: -- Description: Like Parquet, ORC test suite should enable UDT tests. > Enable UDT tests in

[jira] [Created] (SPARK-23276) Enable UDT tests in (Hive)OrcHadoopFsRelationSuite

2018-01-30 Thread Dongjoon Hyun (JIRA)
Dongjoon Hyun created SPARK-23276: - Summary: Enable UDT tests in (Hive)OrcHadoopFsRelationSuite Key: SPARK-23276 URL: https://issues.apache.org/jira/browse/SPARK-23276 Project: Spark Issue

[jira] [Resolved] (SPARK-23267) Increase spark.sql.codegen.hugeMethodLimit to 65535

2018-01-30 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23267?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li resolved SPARK-23267. - Resolution: Fixed Fix Version/s: 2.3.0 > Increase spark.sql.codegen.hugeMethodLimit to 65535 >

[jira] [Created] (SPARK-23275) hive/tests have been failing when run locally on the laptop (Mac) with OOM

2018-01-30 Thread Dilip Biswal (JIRA)
Dilip Biswal created SPARK-23275: Summary: hive/tests have been failing when run locally on the laptop (Mac) with OOM Key: SPARK-23275 URL: https://issues.apache.org/jira/browse/SPARK-23275 Project:

[jira] [Commented] (SPARK-23254) Add user guide entry for DataFrame multivariate summary

2018-01-30 Thread Weichen Xu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23254?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16345651#comment-16345651 ] Weichen Xu commented on SPARK-23254: I will work on this. Thanks! > Add user guide entry for

[jira] [Commented] (SPARK-23261) Rename Pandas UDFs

2018-01-30 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23261?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16345641#comment-16345641 ] Apache Spark commented on SPARK-23261: -- User 'gatorsmile' has created a pull request for this issue:

[jira] [Commented] (SPARK-18085) SPIP: Better History Server scalability for many / large applications

2018-01-30 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18085?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16345615#comment-16345615 ] Marcelo Vanzin commented on SPARK-18085: If you want the short and dirty description: these

[jira] [Assigned] (SPARK-23020) Re-enable Flaky Test: org.apache.spark.launcher.SparkLauncherSuite.testInProcessLauncher

2018-01-30 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23020?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-23020: Assignee: Marcelo Vanzin (was: Apache Spark) > Re-enable Flaky Test: >

[jira] [Assigned] (SPARK-23020) Re-enable Flaky Test: org.apache.spark.launcher.SparkLauncherSuite.testInProcessLauncher

2018-01-30 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23020?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-23020: Assignee: Apache Spark (was: Marcelo Vanzin) > Re-enable Flaky Test: >

[jira] [Commented] (SPARK-12394) Support writing out pre-hash-partitioned data and exploit that in join optimizations to avoid shuffle (i.e. bucketing in Hive)

2018-01-30 Thread Thomas Bünger (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-12394?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16345598#comment-16345598 ] Thomas Bünger commented on SPARK-12394: --- Any news on this issue? Is it really fixed? I also can't

[jira] [Commented] (SPARK-23265) Update multi-column error handling logic in QuantileDiscretizer

2018-01-30 Thread Huaxin Gao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23265?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16345585#comment-16345585 ] Huaxin Gao commented on SPARK-23265: I am working on it. Will submit a PR today.  > Update

[jira] [Comment Edited] (SPARK-18016) Code Generation: Constant Pool Past Limit for Wide/Nested Dataset

2018-01-30 Thread Kazuaki Ishizaki (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18016?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16345551#comment-16345551 ] Kazuaki Ishizaki edited comment on SPARK-18016 at 1/30/18 6:27 PM: ---

[jira] [Updated] (SPARK-23274) ReplaceExceptWithFilter fails on dataframes filtered on same column

2018-01-30 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23274?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li updated SPARK-23274: Target Version/s: 2.3.0 > ReplaceExceptWithFilter fails on dataframes filtered on same column >

[jira] [Commented] (SPARK-23274) ReplaceExceptWithFilter fails on dataframes filtered on same column

2018-01-30 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23274?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16345554#comment-16345554 ] Xiao Li commented on SPARK-23274: - Since this is a regression, I will try to fix it ASAP >
