[jira] [Comment Edited] (SPARK-24763) Remove redundant key data from value in streaming aggregation

2018-07-08 Thread Jungtaek Lim (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24763?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16536539#comment-16536539 ] Jungtaek Lim edited comment on SPARK-24763 at 7/9/18 5:19 AM: -- > Spark

[jira] [Comment Edited] (SPARK-24763) Remove redundant key data from value in streaming aggregation

2018-07-08 Thread Jungtaek Lim (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24763?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16536539#comment-16536539 ] Jungtaek Lim edited comment on SPARK-24763 at 7/9/18 5:18 AM: -- > Spark

[jira] [Commented] (SPARK-24760) Pandas UDF does not handle NaN correctly

2018-07-08 Thread Mortada Mehyar (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24760?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16536540#comment-16536540 ] Mortada Mehyar commented on SPARK-24760: cc [~icexelloss] [~hyukjin.kwon] > Pandas UDF does not

[jira] [Commented] (SPARK-24666) Word2Vec generate infinity vectors when numIterations are large

2018-07-08 Thread Liang-Chi Hsieh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24666?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16536538#comment-16536538 ] Liang-Chi Hsieh commented on SPARK-24666: - Is it possible you can provide an example dataset and

[jira] [Commented] (SPARK-24763) Remove redundant key data from value in streaming aggregation

2018-07-08 Thread Jungtaek Lim (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24763?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16536539#comment-16536539 ] Jungtaek Lim commented on SPARK-24763: -- > Spark version * 2.4.0-SNAPSHOT * commit: 

[jira] [Created] (SPARK-24763) Remove redundant key data from value in streaming aggregation

2018-07-08 Thread Jungtaek Lim (JIRA)
Jungtaek Lim created SPARK-24763: Summary: Remove redundant key data from value in streaming aggregation Key: SPARK-24763 URL: https://issues.apache.org/jira/browse/SPARK-24763 Project: Spark

[jira] [Updated] (SPARK-24576) Upgrade Apache ORC to 1.5.2

2018-07-08 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24576?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun updated SPARK-24576: -- Description: This issue aims to upgrade Apache ORC library from 1.4.4 to 1.5.1 in order to

[jira] [Updated] (SPARK-24576) Upgrade Apache ORC to 1.5.2

2018-07-08 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24576?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun updated SPARK-24576: -- Description: This issue aims to upgrade Apache ORC library from 1.4.4 to 1.5.1 in order to

[jira] [Updated] (SPARK-24576) Upgrade Apache ORC to 1.5.2

2018-07-08 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24576?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun updated SPARK-24576: -- Summary: Upgrade Apache ORC to 1.5.2 (was: Upgrade Apache ORC to 1.5.1) > Upgrade Apache

[jira] [Assigned] (SPARK-24762) Aggregator should be able to use Option of Product encoder

2018-07-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24762?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-24762: Assignee: (was: Apache Spark) > Aggregator should be able to use Option of Product

[jira] [Assigned] (SPARK-24762) Aggregator should be able to use Option of Product encoder

2018-07-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24762?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-24762: Assignee: Apache Spark > Aggregator should be able to use Option of Product encoder >

[jira] [Commented] (SPARK-24752) date_format provides incorrect year after a timezone conversation changes the year on a timestamp

2018-07-08 Thread Takeshi Yamamuro (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24752?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16536519#comment-16536519 ] Takeshi Yamamuro commented on SPARK-24752: -- I closed this cuz this is expected. > date_format

[jira] [Commented] (SPARK-24762) Aggregator should be able to use Option of Product encoder

2018-07-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24762?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16536518#comment-16536518 ] Apache Spark commented on SPARK-24762: -- User 'viirya' has created a pull request for this issue:

[jira] [Resolved] (SPARK-24752) date_format provides incorrect year after a timezone conversation changes the year on a timestamp

2018-07-08 Thread Takeshi Yamamuro (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24752?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Takeshi Yamamuro resolved SPARK-24752. -- Resolution: Not A Problem > date_format provides incorrect year after a timezone

[jira] [Commented] (SPARK-24752) date_format provides incorrect year after a timezone conversation changes the year on a timestamp

2018-07-08 Thread Takeshi Yamamuro (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24752?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16536516#comment-16536516 ] Takeshi Yamamuro commented on SPARK-24752: -- oh, I see...that's it. > date_format provides

[jira] [Created] (SPARK-24762) Aggregator should be able to use Option of Product encoder

2018-07-08 Thread Liang-Chi Hsieh (JIRA)
Liang-Chi Hsieh created SPARK-24762: --- Summary: Aggregator should be able to use Option of Product encoder Key: SPARK-24762 URL: https://issues.apache.org/jira/browse/SPARK-24762 Project: Spark

[jira] [Commented] (SPARK-24752) date_format provides incorrect year after a timezone conversation changes the year on a timestamp

2018-07-08 Thread Takuya Ueshin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24752?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16536502#comment-16536502 ] Takuya Ueshin commented on SPARK-24752: --- Maybe we should use {{"-MM-dd"}} for the format

[jira] [Commented] (SPARK-24753) bad backslah parsing in SQL statements

2018-07-08 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24753?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16536495#comment-16536495 ] Hyukjin Kwon commented on SPARK-24753: -- gentle ping [~mathieulongtin] > bad backslah parsing in

[jira] [Updated] (SPARK-24752) date_format provides incorrect year after a timezone conversation changes the year on a timestamp

2018-07-08 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24752?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon updated SPARK-24752: - Affects Version/s: 2.4.0 > date_format provides incorrect year after a timezone conversation

[jira] [Resolved] (SPARK-24646) Support wildcard '*' for to spark.yarn.dist.forceDownloadSchemes

2018-07-08 Thread Saisai Shao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24646?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Saisai Shao resolved SPARK-24646. - Resolution: Fixed Fix Version/s: 2.4.0 Issue resolved by pull request 21633

[jira] [Assigned] (SPARK-24646) Support wildcard '*' for to spark.yarn.dist.forceDownloadSchemes

2018-07-08 Thread Saisai Shao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24646?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Saisai Shao reassigned SPARK-24646: --- Assignee: Saisai Shao > Support wildcard '*' for to spark.yarn.dist.forceDownloadSchemes >

[jira] [Comment Edited] (SPARK-13343) speculative tasks that didn't commit shouldn't be marked as success

2018-07-08 Thread Hieu Tri Huynh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13343?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16536409#comment-16536409 ] Hieu Tri Huynh edited comment on SPARK-13343 at 7/8/18 9:04 PM: While

[jira] [Comment Edited] (SPARK-13343) speculative tasks that didn't commit shouldn't be marked as success

2018-07-08 Thread Hieu Tri Huynh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13343?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16536409#comment-16536409 ] Hieu Tri Huynh edited comment on SPARK-13343 at 7/8/18 9:02 PM: While

[jira] [Updated] (SPARK-13343) speculative tasks that didn't commit shouldn't be marked as success

2018-07-08 Thread Hieu Tri Huynh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13343?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hieu Tri Huynh updated SPARK-13343: --- Attachment: image.png > speculative tasks that didn't commit shouldn't be marked as success

[jira] [Commented] (SPARK-13343) speculative tasks that didn't commit shouldn't be marked as success

2018-07-08 Thread Hieu Tri Huynh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13343?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16536409#comment-16536409 ] Hieu Tri Huynh commented on SPARK-13343: While working on this issue, I noticed another problem

[jira] [Updated] (SPARK-13343) speculative tasks that didn't commit shouldn't be marked as success

2018-07-08 Thread Hieu Tri Huynh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13343?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hieu Tri Huynh updated SPARK-13343: --- Attachment: image.png > speculative tasks that didn't commit shouldn't be marked as success

[jira] [Updated] (SPARK-13343) speculative tasks that didn't commit shouldn't be marked as success

2018-07-08 Thread Hieu Tri Huynh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13343?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hieu Tri Huynh updated SPARK-13343: --- Attachment: Screen Shot 2018-07-08 at 3.49.52 PM.png > speculative tasks that didn't commit

[jira] [Commented] (SPARK-24761) Check modifiability of config parameters

2018-07-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24761?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16536391#comment-16536391 ] Apache Spark commented on SPARK-24761: -- User 'MaxGekk' has created a pull request for this issue:

[jira] [Assigned] (SPARK-24761) Check modifiability of config parameters

2018-07-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24761?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-24761: Assignee: Apache Spark > Check modifiability of config parameters >

[jira] [Assigned] (SPARK-24761) Check modifiability of config parameters

2018-07-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24761?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-24761: Assignee: (was: Apache Spark) > Check modifiability of config parameters >

[jira] [Created] (SPARK-24761) Check modifiability of config parameters

2018-07-08 Thread Maxim Gekk (JIRA)
Maxim Gekk created SPARK-24761: -- Summary: Check modifiability of config parameters Key: SPARK-24761 URL: https://issues.apache.org/jira/browse/SPARK-24761 Project: Spark Issue Type: Improvement

[jira] [Assigned] (SPARK-24755) Executor loss can cause task to not be resubmitted

2018-07-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24755?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-24755: Assignee: Apache Spark > Executor loss can cause task to not be resubmitted >

[jira] [Assigned] (SPARK-24755) Executor loss can cause task to not be resubmitted

2018-07-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24755?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-24755: Assignee: (was: Apache Spark) > Executor loss can cause task to not be resubmitted >

[jira] [Commented] (SPARK-24755) Executor loss can cause task to not be resubmitted

2018-07-08 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24755?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16536371#comment-16536371 ] Apache Spark commented on SPARK-24755: -- User 'hthuynh2' has created a pull request for this issue:

[jira] [Commented] (SPARK-24582) Design: Barrier execution mode

2018-07-08 Thread Jiang Xingbo (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24582?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16536331#comment-16536331 ] Jiang Xingbo commented on SPARK-24582: -- Design doc: 

[jira] [Commented] (SPARK-24755) Executor loss can cause task to not be resubmitted

2018-07-08 Thread Li Yuanjian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24755?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16536107#comment-16536107 ] Li Yuanjian commented on SPARK-24755: - No problem, thanks [~hthuynh2]. Thanks [~mridulm80] for

[jira] [Commented] (SPARK-24755) Executor loss can cause task to not be resubmitted

2018-07-08 Thread Mridul Muralidharan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24755?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16536016#comment-16536016 ] Mridul Muralidharan commented on SPARK-24755: - Go for it - thanks [~hthuynh2] ! > Executor

[jira] [Updated] (SPARK-24528) Missing optimization for Aggregations/Windowing on a bucketed table

2018-07-08 Thread Ohad Raviv (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24528?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ohad Raviv updated SPARK-24528: --- Description: https://issues.apache.org/jira/browse/SPARK-24528#Closely related to SPARK-24410,