[jira] [Commented] (SPARK-23551) Exclude `hadoop-mapreduce-client-core` dependency from `orc-mapreduce`

2018-04-13 Thread Sergey Serebryakov (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23551?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16438208#comment-16438208 ] Sergey Serebryakov commented on SPARK-23551: Thank you [~dongjoon]! This was also affecting

[jira] [Commented] (SPARK-23904) Big execution plan cause OOM

2018-04-13 Thread Liang-Chi Hsieh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23904?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16438189#comment-16438189 ] Liang-Chi Hsieh commented on SPARK-23904: - If you don't need UI, can you try to set

[jira] [Commented] (SPARK-23970) pyspark - simple filter/select doesn't use all tasks when coalesce is set

2018-04-13 Thread Matthew Anthony (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23970?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16438188#comment-16438188 ] Matthew Anthony commented on SPARK-23970: - Understood, but this is horribly inefficient. In the

[jira] [Assigned] (SPARK-23979) MultiAlias should not be a CodegenFallback

2018-04-13 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23979?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan reassigned SPARK-23979: --- Assignee: Liang-Chi Hsieh > MultiAlias should not be a CodegenFallback >

[jira] [Resolved] (SPARK-23979) MultiAlias should not be a CodegenFallback

2018-04-13 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23979?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan resolved SPARK-23979. - Resolution: Fixed Fix Version/s: 2.4.0 Issue resolved by pull request 21065

[jira] [Commented] (SPARK-23970) pyspark - simple filter/select doesn't use all tasks when coalesce is set

2018-04-13 Thread Liang-Chi Hsieh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23970?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16438132#comment-16438132 ] Liang-Chi Hsieh commented on SPARK-23970: - I think the document of {{coalesce}} might answer

[jira] [Assigned] (SPARK-21962) Distributed Tracing in Spark

2018-04-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21962?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-21962: Assignee: (was: Apache Spark) > Distributed Tracing in Spark >

[jira] [Commented] (SPARK-21962) Distributed Tracing in Spark

2018-04-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21962?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16438124#comment-16438124 ] Apache Spark commented on SPARK-21962: -- User 'devaraj-kavali' has created a pull request for this

[jira] [Assigned] (SPARK-21962) Distributed Tracing in Spark

2018-04-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21962?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-21962: Assignee: Apache Spark > Distributed Tracing in Spark > > >

[jira] [Resolved] (SPARK-23966) Refactoring all checkpoint file writing logic in a common interface

2018-04-13 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23966?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das resolved SPARK-23966. --- Resolution: Fixed Fix Version/s: 3.0.0 Issue resolved by pull request 21048

[jira] [Commented] (SPARK-23030) Decrease memory consumption with toPandas() collection using Arrow

2018-04-13 Thread Li Jin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23030?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16438001#comment-16438001 ] Li Jin commented on SPARK-23030: Hey [~bryanc], did you by any chance have some progress on this? I guess

[jira] [Commented] (SPARK-23936) High-order function: map_concat(map1<K, V>, map2<K, V>, ..., mapN<K, V>) → map<K,V>

2018-04-13 Thread Bruce Robbins (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23936?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16437985#comment-16437985 ] Bruce Robbins commented on SPARK-23936: --- I will have a WIP pull request tonight or tomorrow

[jira] [Created] (SPARK-23981) ShuffleBlockFetcherIterator - Spamming Logs

2018-04-13 Thread BELUGA BEHR (JIRA)
BELUGA BEHR created SPARK-23981: --- Summary: ShuffleBlockFetcherIterator - Spamming Logs Key: SPARK-23981 URL: https://issues.apache.org/jira/browse/SPARK-23981 Project: Spark Issue Type:

[jira] [Updated] (SPARK-23963) Queries on text-based Hive tables grow disproportionately slower as the number of columns increase

2018-04-13 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23963?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li updated SPARK-23963: Issue Type: Improvement (was: Bug) > Queries on text-based Hive tables grow disproportionately slower as

[jira] [Assigned] (SPARK-23972) Upgrade to Parquet 1.10

2018-04-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23972?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-23972: Assignee: (was: Apache Spark) > Upgrade to Parquet 1.10 > --- > >

[jira] [Commented] (SPARK-23972) Upgrade to Parquet 1.10

2018-04-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23972?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16437913#comment-16437913 ] Apache Spark commented on SPARK-23972: -- User 'rdblue' has created a pull request for this issue:

[jira] [Assigned] (SPARK-23972) Upgrade to Parquet 1.10

2018-04-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23972?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-23972: Assignee: Apache Spark > Upgrade to Parquet 1.10 > --- > >

[jira] [Resolved] (SPARK-23963) Queries on text-based Hive tables grow disproportionately slower as the number of columns increase

2018-04-13 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23963?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li resolved SPARK-23963. - Resolution: Fixed Assignee: Bruce Robbins Fix Version/s: 2.4.0 > Queries on text-based

[jira] [Assigned] (SPARK-23920) High-order function: array_remove(x, element) → array

2018-04-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23920?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-23920: Assignee: (was: Apache Spark) > High-order function: array_remove(x, element) → array

[jira] [Commented] (SPARK-23920) High-order function: array_remove(x, element) → array

2018-04-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23920?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16437714#comment-16437714 ] Apache Spark commented on SPARK-23920: -- User 'huaxingao' has created a pull request for this issue:

[jira] [Assigned] (SPARK-23920) High-order function: array_remove(x, element) → array

2018-04-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23920?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-23920: Assignee: Apache Spark > High-order function: array_remove(x, element) → array >

[jira] [Commented] (SPARK-23161) Add missing APIs to Python GBTClassifier

2018-04-13 Thread Huaxin Gao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23161?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16437698#comment-16437698 ] Huaxin Gao commented on SPARK-23161: I will work on this. Thanks! > Add missing APIs to Python

[jira] [Assigned] (SPARK-23375) Optimizer should remove unneeded Sort

2018-04-13 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23375?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan reassigned SPARK-23375: --- Assignee: Marco Gaido > Optimizer should remove unneeded Sort >

[jira] [Resolved] (SPARK-23375) Optimizer should remove unneeded Sort

2018-04-13 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23375?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan resolved SPARK-23375. - Resolution: Fixed Fix Version/s: 2.4.0 Issue resolved by pull request 20560

[jira] [Resolved] (SPARK-23896) Improve PartitioningAwareFileIndex

2018-04-13 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23896?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan resolved SPARK-23896. - Resolution: Fixed Fix Version/s: 2.4.0 Issue resolved by pull request 21004

[jira] [Assigned] (SPARK-23896) Improve PartitioningAwareFileIndex

2018-04-13 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23896?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan reassigned SPARK-23896: --- Assignee: Gengliang Wang > Improve PartitioningAwareFileIndex >

[jira] [Commented] (SPARK-23959) UnresolvedException with DataSet created from Seq.empty since Spark 2.3.0

2018-04-13 Thread Marco Gaido (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23959?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16437512#comment-16437512 ] Marco Gaido commented on SPARK-23959: - I am not able to reproduce on current master. This must have

[jira] [Commented] (SPARK-23928) High-order function: shuffle(x) → array

2018-04-13 Thread H Lu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23928?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16437472#comment-16437472 ] H Lu commented on SPARK-23928: -- Yes, I am working on it. I will submit a pull request later. Thanks,

[jira] [Resolved] (SPARK-22839) Refactor Kubernetes code for configuring driver/executor pods to use consistent and cleaner abstraction

2018-04-13 Thread Anirudh Ramanathan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22839?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Anirudh Ramanathan resolved SPARK-22839. Resolution: Fixed Fix Version/s: 3.0.0 Issue resolved by pull request 20910

[jira] [Assigned] (SPARK-22839) Refactor Kubernetes code for configuring driver/executor pods to use consistent and cleaner abstraction

2018-04-13 Thread Anirudh Ramanathan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22839?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Anirudh Ramanathan reassigned SPARK-22839: -- Assignee: Matt Cheah > Refactor Kubernetes code for configuring

[jira] [Commented] (SPARK-23901) Data Masking Functions

2018-04-13 Thread Marco Gaido (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23901?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16437401#comment-16437401 ] Marco Gaido commented on SPARK-23901: - Actually I am facing some issues in the implementation. I have

[jira] [Assigned] (SPARK-16630) Blacklist a node if executors won't launch on it.

2018-04-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16630?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-16630: Assignee: Apache Spark > Blacklist a node if executors won't launch on it. >

[jira] [Commented] (SPARK-16630) Blacklist a node if executors won't launch on it.

2018-04-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16630?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16437381#comment-16437381 ] Apache Spark commented on SPARK-16630: -- User 'attilapiros' has created a pull request for this

[jira] [Assigned] (SPARK-16630) Blacklist a node if executors won't launch on it.

2018-04-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16630?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-16630: Assignee: (was: Apache Spark) > Blacklist a node if executors won't launch on it. >

[jira] [Commented] (SPARK-23927) High-order function: sequence

2018-04-13 Thread Alex Wajda (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23927?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16437376#comment-16437376 ] Alex Wajda commented on SPARK-23927: For the dates and timestamps I assume that {{step}} type can be

[jira] [Assigned] (SPARK-23980) Resilient Spark driver on Kubernetes

2018-04-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23980?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-23980: Assignee: Apache Spark > Resilient Spark driver on Kubernetes >

[jira] [Commented] (SPARK-23980) Resilient Spark driver on Kubernetes

2018-04-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23980?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16437355#comment-16437355 ] Apache Spark commented on SPARK-23980: -- User 'baluchicken' has created a pull request for this

[jira] [Assigned] (SPARK-23980) Resilient Spark driver on Kubernetes

2018-04-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23980?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-23980: Assignee: (was: Apache Spark) > Resilient Spark driver on Kubernetes >

[jira] [Created] (SPARK-23980) Resilient Spark driver on Kubernetes

2018-04-13 Thread Sebastian Toader (JIRA)
Sebastian Toader created SPARK-23980: Summary: Resilient Spark driver on Kubernetes Key: SPARK-23980 URL: https://issues.apache.org/jira/browse/SPARK-23980 Project: Spark Issue Type:

[jira] [Commented] (SPARK-23855) Performing a Join after a CrossJoin can lead to data corruption

2018-04-13 Thread Erik Selin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23855?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16437327#comment-16437327 ] Erik Selin commented on SPARK-23855: +1, from our investigations it looks like we've also hit this

[jira] [Assigned] (SPARK-23977) Add commit protocol binding to Hadoop 3.1 PathOutputCommitter mechanism

2018-04-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23977?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-23977: Assignee: Apache Spark > Add commit protocol binding to Hadoop 3.1 PathOutputCommitter

[jira] [Commented] (SPARK-23977) Add commit protocol binding to Hadoop 3.1 PathOutputCommitter mechanism

2018-04-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23977?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16437288#comment-16437288 ] Apache Spark commented on SPARK-23977: -- User 'steveloughran' has created a pull request for this

[jira] [Assigned] (SPARK-23977) Add commit protocol binding to Hadoop 3.1 PathOutputCommitter mechanism

2018-04-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23977?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-23977: Assignee: (was: Apache Spark) > Add commit protocol binding to Hadoop 3.1

[jira] [Commented] (SPARK-23979) MultiAlias should not be a CodegenFallback

2018-04-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23979?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16437280#comment-16437280 ] Apache Spark commented on SPARK-23979: -- User 'viirya' has created a pull request for this issue:

[jira] [Assigned] (SPARK-23979) MultiAlias should not be a CodegenFallback

2018-04-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23979?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-23979: Assignee: Apache Spark > MultiAlias should not be a CodegenFallback >

[jira] [Assigned] (SPARK-23979) MultiAlias should not be a CodegenFallback

2018-04-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23979?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-23979: Assignee: (was: Apache Spark) > MultiAlias should not be a CodegenFallback >

[jira] [Created] (SPARK-23979) MultiAlias should not be a CodegenFallback

2018-04-13 Thread Liang-Chi Hsieh (JIRA)
Liang-Chi Hsieh created SPARK-23979: --- Summary: MultiAlias should not be a CodegenFallback Key: SPARK-23979 URL: https://issues.apache.org/jira/browse/SPARK-23979 Project: Spark Issue Type:

[jira] [Updated] (SPARK-23977) Add commit protocol binding to Hadoop 3.1 PathOutputCommitter mechanism

2018-04-13 Thread Steve Loughran (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23977?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Steve Loughran updated SPARK-23977: --- Summary: Add commit protocol binding to Hadoop 3.1 PathOutputCommitter mechanism (was: Add

[jira] [Updated] (SPARK-23978) Kryo much slower when mllib jar not on classpath

2018-04-13 Thread Richard Wilkinson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23978?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Richard Wilkinson updated SPARK-23978: -- Priority: Minor (was: Major) Description: Spark 2.3 added a bunch of

[jira] [Created] (SPARK-23978) Kryo much slower when mllib jar not on classpath

2018-04-13 Thread Richard Wilkinson (JIRA)
Richard Wilkinson created SPARK-23978: - Summary: Kryo much slower when mllib jar not on classpath Key: SPARK-23978 URL: https://issues.apache.org/jira/browse/SPARK-23978 Project: Spark

[jira] [Updated] (SPARK-23978) Kryo much slower when mllib jar not on classpath

2018-04-13 Thread Richard Wilkinson (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23978?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Richard Wilkinson updated SPARK-23978: -- Attachment: kryo_stats.png > Kryo much slower when mllib jar not on classpath >

[jira] [Created] (SPARK-23977) Add committer binding to Hadoop 3.1 PathOutputCommitter Mechanism

2018-04-13 Thread Steve Loughran (JIRA)
Steve Loughran created SPARK-23977: -- Summary: Add committer binding to Hadoop 3.1 PathOutputCommitter Mechanism Key: SPARK-23977 URL: https://issues.apache.org/jira/browse/SPARK-23977 Project: Spark

[jira] [Updated] (SPARK-22198) Java incompatibility when extending UnaryTransformer or Transformer

2018-04-13 Thread Akos Tomasits (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22198?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Akos Tomasits updated SPARK-22198: -- Description: It is not possible to create proper Java custom Transformer by extending

[jira] [Updated] (SPARK-22198) Java incompatibility when extending UnaryTransformer or Transformer

2018-04-13 Thread Akos Tomasits (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22198?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Akos Tomasits updated SPARK-22198: -- Summary: Java incompatibility when extending UnaryTransformer or Transformer (was: Java

[jira] [Commented] (SPARK-23976) UTF8String.concat() or ByteArray.concat() may allocate shorter structure.

2018-04-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23976?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16437234#comment-16437234 ] Apache Spark commented on SPARK-23976: -- User 'kiszk' has created a pull request for this issue:

[jira] [Assigned] (SPARK-23976) UTF8String.concat() or ByteArray.concat() may allocate shorter structure.

2018-04-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23976?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-23976: Assignee: Apache Spark > UTF8String.concat() or ByteArray.concat() may allocate shorter

[jira] [Assigned] (SPARK-23976) UTF8String.concat() or ByteArray.concat() may allocate shorter structure.

2018-04-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23976?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-23976: Assignee: (was: Apache Spark) > UTF8String.concat() or ByteArray.concat() may

[jira] [Created] (SPARK-23976) UTF8String.concat() or ByteArray.concat() may allocate shorter structure.

2018-04-13 Thread Kazuaki Ishizaki (JIRA)
Kazuaki Ishizaki created SPARK-23976: Summary: UTF8String.concat() or ByteArray.concat() may allocate shorter structure. Key: SPARK-23976 URL: https://issues.apache.org/jira/browse/SPARK-23976

[jira] [Commented] (SPARK-23710) Upgrade Hive to 2.3.2

2018-04-13 Thread Darek (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23710?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16437224#comment-16437224 ] Darek commented on SPARK-23710: --- Spark is on Hive 1.2 and there's no appetite in the community to merge [PR

[jira] [Commented] (SPARK-23966) Refactoring all checkpoint file writing logic in a common interface

2018-04-13 Thread Steve Loughran (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23966?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16437201#comment-16437201 ] Steve Loughran commented on SPARK-23966: w.r.t FileContext.rename vs FileSystem.rename(), they

[jira] [Commented] (SPARK-23901) Data Masking Functions

2018-04-13 Thread Marco Gaido (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23901?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16437155#comment-16437155 ] Marco Gaido commented on SPARK-23901: - I am working on this. I will create a single PR, then we can

[jira] [Commented] (SPARK-23886) update query.status

2018-04-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23886?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16437142#comment-16437142 ] Apache Spark commented on SPARK-23886: -- User 'efimpoberezkin' has created a pull request for this

[jira] [Assigned] (SPARK-23886) update query.status

2018-04-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23886?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-23886: Assignee: Apache Spark > update query.status > --- > >

[jira] [Assigned] (SPARK-23886) update query.status

2018-04-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23886?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-23886: Assignee: (was: Apache Spark) > update query.status > --- > >

[jira] [Commented] (SPARK-23928) High-order function: shuffle(x) → array

2018-04-13 Thread Liang-Chi Hsieh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23928?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16437102#comment-16437102 ] Liang-Chi Hsieh commented on SPARK-23928: - Hi [~hzlu], So will you take this one? > High-order

[jira] [Commented] (SPARK-23933) High-order function: map(array, array) → map<K,V>

2018-04-13 Thread Kazuaki Ishizaki (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23933?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16436976#comment-16436976 ] Kazuaki Ishizaki commented on SPARK-23933: -- [~smilegator] [~ueshin] Could you favor us? SparkSQL

[jira] [Comment Edited] (SPARK-23891) Debian based Dockerfile

2018-04-13 Thread Sercan Karaoglu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23891?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16436942#comment-16436942 ] Sercan Karaoglu edited comment on SPARK-23891 at 4/13/18 7:24 AM: --

[jira] [Comment Edited] (SPARK-23891) Debian based Dockerfile

2018-04-13 Thread Sercan Karaoglu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23891?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16436942#comment-16436942 ] Sercan Karaoglu edited comment on SPARK-23891 at 4/13/18 7:23 AM: --

[jira] [Commented] (SPARK-23891) Debian based Dockerfile

2018-04-13 Thread Sercan Karaoglu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23891?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16436942#comment-16436942 ] Sercan Karaoglu commented on SPARK-23891: - Debian- and CentOS-based Linux distros are the most

[jira] [Resolved] (SPARK-23905) Add UDF weekday

2018-04-13 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23905?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li resolved SPARK-23905. - Resolution: Fixed Assignee: yucai Fix Version/s: 2.4.0 > Add UDF weekday >

[jira] [Commented] (SPARK-23914) High-order function: array_union(x, y) → array

2018-04-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23914?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16436910#comment-16436910 ] Apache Spark commented on SPARK-23914: -- User 'kiszk' has created a pull request for this issue:

[jira] [Assigned] (SPARK-23914) High-order function: array_union(x, y) → array

2018-04-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23914?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-23914: Assignee: Apache Spark > High-order function: array_union(x, y) → array >

[jira] [Assigned] (SPARK-23914) High-order function: array_union(x, y) → array

2018-04-13 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23914?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-23914: Assignee: (was: Apache Spark) > High-order function: array_union(x, y) → array >

[jira] [Assigned] (SPARK-23815) Spark writer dynamic partition overwrite mode fails to write output on multi level partition

2018-04-13 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23815?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan reassigned SPARK-23815: --- Assignee: Fangshi Li > Spark writer dynamic partition overwrite mode fails to write output