[jira] [Commented] (SPARK-21461) Spark Streaming crashes if CSV file has no read permissions

2017-07-18 Thread Dan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21461?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16092665#comment-16092665 ] Dan commented on SPARK-21461: - How could this lead to an unexpected consequence? If Spark fai

[jira] [Resolved] (SPARK-20065) Empty output files created for aggregation query in append mode

2017-07-18 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20065?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan resolved SPARK-20065. - Resolution: Fixed Assignee: Li Yuanjian Fix Version/s: 2.3.0 > Empty output files

[jira] [Assigned] (SPARK-21435) Empty files should be skipped while write to file

2017-07-18 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21435?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan reassigned SPARK-21435: --- Assignee: Li Yuanjian > Empty files should be skipped while write to file >

[jira] [Resolved] (SPARK-21435) Empty files should be skipped while write to file

2017-07-18 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21435?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan resolved SPARK-21435. - Resolution: Fixed Fix Version/s: 2.3.0 > Empty files should be skipped while write to file

[jira] [Commented] (SPARK-21177) df.saveAsTable slows down linearly, with number of appends

2017-07-18 Thread Liang-Chi Hsieh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21177?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16092643#comment-16092643 ] Liang-Chi Hsieh commented on SPARK-21177: - I can't reproduce the reported issue w

[jira] [Comment Edited] (SPARK-20065) Empty output files created for aggregation query in append mode

2017-07-18 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20065?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16092557#comment-16092557 ] Hyukjin Kwon edited comment on SPARK-20065 at 7/19/17 6:06 AM:

[jira] [Reopened] (SPARK-21435) Empty files should be skipped while write to file

2017-07-18 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21435?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan reopened SPARK-21435: - > Empty files should be skipped while write to file > ---

[jira] [Commented] (SPARK-21316) Dataset Union output is not consistent with the column sequence

2017-07-18 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21316?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16092607#comment-16092607 ] Dongjoon Hyun commented on SPARK-21316: --- +1 > Dataset Union output is not consiste

[jira] [Commented] (SPARK-21461) Spark Streaming crashes if CSV file has no read permissions

2017-07-18 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21461?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16092600#comment-16092600 ] Hyukjin Kwon commented on SPARK-21461: -- No, they are different. Malformed CSV/JSON a

[jira] [Created] (SPARK-21466) com.cloudant.spark throws an error in python notebook

2017-07-18 Thread Smruthi Rajmohan (JIRA)
Smruthi Rajmohan created SPARK-21466: Summary: com.cloudant.spark throws an error in python notebook Key: SPARK-21466 URL: https://issues.apache.org/jira/browse/SPARK-21466 Project: Spark

[jira] [Commented] (SPARK-21316) Dataset Union output is not consistent with the column sequence

2017-07-18 Thread Liang-Chi Hsieh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21316?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16092596#comment-16092596 ] Liang-Chi Hsieh commented on SPARK-21316: - As unionByName was merged, I think it

[jira] [Commented] (SPARK-21437) Java Keyword cannot be used in table schema

2017-07-18 Thread Liang-Chi Hsieh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21437?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16092589#comment-16092589 ] Liang-Chi Hsieh commented on SPARK-21437: - For reference, the PR adds this check:

[jira] [Commented] (SPARK-21439) Cannot use Spark with Python ABCmeta (exception from cloudpickle)

2017-07-18 Thread Liang-Chi Hsieh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21439?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16092586#comment-16092586 ] Liang-Chi Hsieh commented on SPARK-21439: - If there's no pr submitted and no one

[jira] [Updated] (SPARK-21441) Incorrect Codegen in SortMergeJoinExec results failures in some cases

2017-07-18 Thread Liang-Chi Hsieh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21441?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Liang-Chi Hsieh updated SPARK-21441: Priority: Major (was: Critical) > Incorrect Codegen in SortMergeJoinExec results failures

[jira] [Commented] (SPARK-21441) Incorrect Codegen in SortMergeJoinExec results failures in some cases

2017-07-18 Thread Liang-Chi Hsieh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21441?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16092583#comment-16092583 ] Liang-Chi Hsieh commented on SPARK-21441: - Btw, I think the priority of this issu

[jira] [Commented] (SPARK-21461) Spark Streaming crashes if CSV file has no read permissions

2017-07-18 Thread Dan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21461?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16092567#comment-16092567 ] Dan commented on SPARK-21461: - I disagree this is a user error - a streaming application shou

[jira] [Commented] (SPARK-21435) Empty files should be skipped while write to file

2017-07-18 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21435?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16092559#comment-16092559 ] Hyukjin Kwon commented on SPARK-21435: -- (cc [~cloud_fan], I don't mind reopening thi

[jira] [Commented] (SPARK-20065) Empty output files created for aggregation query in append mode

2017-07-18 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20065?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16092557#comment-16092557 ] Hyukjin Kwon commented on SPARK-20065: -- I can confirm that PR (almost) fixes this is

[jira] [Commented] (SPARK-20065) Empty output files created for aggregation query in append mode

2017-07-18 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20065?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16092503#comment-16092503 ] Hyukjin Kwon commented on SPARK-20065: -- Yes, I am quite sure that fixes it. Will be

[jira] [Updated] (SPARK-21465) array('L') support might lead to overflow error

2017-07-18 Thread Xiang Gao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21465?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiang Gao updated SPARK-21465: -- Description: For now, the behavior of different types of {{array.array}} support in pyspark is not cle

[jira] [Created] (SPARK-21465) array('L') support might lead to overflow error

2017-07-18 Thread Xiang Gao (JIRA)
Xiang Gao created SPARK-21465: - Summary: array('L') support might lead to overflow error Key: SPARK-21465 URL: https://issues.apache.org/jira/browse/SPARK-21465 Project: Spark Issue Type: Bug

[jira] [Commented] (SPARK-20065) Empty output files created for aggregation query in append mode

2017-07-18 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20065?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16092488#comment-16092488 ] Wenchen Fan commented on SPARK-20065: - Does https://github.com/apache/spark/pull/1865

[jira] [Assigned] (SPARK-21464) Minimize deprecation warnings caused by ProcessingTime class

2017-07-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21464?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-21464: Assignee: Tathagata Das (was: Apache Spark) > Minimize deprecation warnings caused by Pro

[jira] [Commented] (SPARK-21464) Minimize deprecation warnings caused by ProcessingTime class

2017-07-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21464?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16092456#comment-16092456 ] Apache Spark commented on SPARK-21464: -- User 'tdas' has created a pull request for t

[jira] [Assigned] (SPARK-21464) Minimize deprecation warnings caused by ProcessingTime class

2017-07-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21464?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-21464: Assignee: Apache Spark (was: Tathagata Das) > Minimize deprecation warnings caused by Pro

[jira] [Created] (SPARK-21464) Minimize deprecation warnings caused by ProcessingTime class

2017-07-18 Thread Tathagata Das (JIRA)
Tathagata Das created SPARK-21464: - Summary: Minimize deprecation warnings caused by ProcessingTime class Key: SPARK-21464 URL: https://issues.apache.org/jira/browse/SPARK-21464 Project: Spark

[jira] [Updated] (SPARK-21333) joinWith documents and analysis allow invalid join types

2017-07-18 Thread Corey Woodfield (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21333?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Corey Woodfield updated SPARK-21333: Affects Version/s: 2.2.0 > joinWith documents and analysis allow invalid join types > -

[jira] [Resolved] (SPARK-21457) ExternalCatalog.listPartitions should correctly handle partition values with dot

2017-07-18 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21457?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li resolved SPARK-21457. - Resolution: Fixed Fix Version/s: 2.3.0 2.2.1 > ExternalCatalog.listPartitions s

[jira] [Commented] (SPARK-21273) Decouple stats propagation from logical plan

2017-07-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21273?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16092373#comment-16092373 ] Apache Spark commented on SPARK-21273: -- User 'gatorsmile' has created a pull request

[jira] [Resolved] (SPARK-21462) Add batchId to the json of StreamingQueryProgress

2017-07-18 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21462?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das resolved SPARK-21462. --- Resolution: Fixed Fix Version/s: 2.3.0 Issue resolved by pull request 18675 [https://g

[jira] [Commented] (SPARK-21439) Cannot use Spark with Python ABCmeta (exception from cloudpickle)

2017-07-18 Thread Dimuthu Wickramanayake (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21439?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16092195#comment-16092195 ] Dimuthu Wickramanayake commented on SPARK-21439: Can i start work on this

[jira] [Assigned] (SPARK-21463) Output of StructuredStreaming tables don't respect user specified schema when reading back the table

2017-07-18 Thread Burak Yavuz (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21463?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Burak Yavuz reassigned SPARK-21463: --- Assignee: Burak Yavuz > Output of StructuredStreaming tables don't respect user specified sc

[jira] [Assigned] (SPARK-21462) Add batchId to the json of StreamingQueryProgress

2017-07-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21462?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-21462: Assignee: Tathagata Das (was: Apache Spark) > Add batchId to the json of StreamingQueryPr

[jira] [Assigned] (SPARK-21463) Output of StructuredStreaming tables don't respect user specified schema when reading back the table

2017-07-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21463?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-21463: Assignee: Apache Spark > Output of StructuredStreaming tables don't respect user specified

[jira] [Commented] (SPARK-21462) Add batchId to the json of StreamingQueryProgress

2017-07-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21462?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16092157#comment-16092157 ] Apache Spark commented on SPARK-21462: -- User 'tdas' has created a pull request for t

[jira] [Assigned] (SPARK-21462) Add batchId to the json of StreamingQueryProgress

2017-07-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21462?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-21462: Assignee: Apache Spark (was: Tathagata Das) > Add batchId to the json of StreamingQueryPr

[jira] [Commented] (SPARK-21463) Output of StructuredStreaming tables don't respect user specified schema when reading back the table

2017-07-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21463?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16092156#comment-16092156 ] Apache Spark commented on SPARK-21463: -- User 'brkyvz' has created a pull request for

[jira] [Assigned] (SPARK-21463) Output of StructuredStreaming tables don't respect user specified schema when reading back the table

2017-07-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21463?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-21463: Assignee: (was: Apache Spark) > Output of StructuredStreaming tables don't respect use

[jira] [Resolved] (SPARK-21408) Default RPC dispatcher thread pool size too large for small executors

2017-07-18 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21408?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin resolved SPARK-21408. Resolution: Fixed Assignee: Marcelo Vanzin Fix Version/s: 2.3.0 > Default R

[jira] [Updated] (SPARK-21458) Tear down the framework when failover_timeout > 0 (Mesos cluster mode)

2017-07-18 Thread Susan X. Huynh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21458?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Susan X. Huynh updated SPARK-21458: --- Description: When the driver failover_timeout was always set to zero, we relied on the Mesos

[jira] [Created] (SPARK-21463) Output of StructuredStreaming tables don't respect user specified schema when reading back the table

2017-07-18 Thread Burak Yavuz (JIRA)
Burak Yavuz created SPARK-21463: --- Summary: Output of StructuredStreaming tables don't respect user specified schema when reading back the table Key: SPARK-21463 URL: https://issues.apache.org/jira/browse/SPARK-21463

[jira] [Created] (SPARK-21462) Add batchId to the json of StreamingQueryProgress

2017-07-18 Thread Tathagata Das (JIRA)
Tathagata Das created SPARK-21462: - Summary: Add batchId to the json of StreamingQueryProgress Key: SPARK-21462 URL: https://issues.apache.org/jira/browse/SPARK-21462 Project: Spark Issue Typ

[jira] [Comment Edited] (SPARK-21378) Spark Poll timeout when specific offsets are passed

2017-07-18 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21378?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16092088#comment-16092088 ] Shixiong Zhu edited comment on SPARK-21378 at 7/18/17 8:01 PM:

[jira] [Commented] (SPARK-21378) Spark Poll timeout when specific offsets are passed

2017-07-18 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21378?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16092088#comment-16092088 ] Shixiong Zhu commented on SPARK-21378: -- The data must already be in Kafka when execu

[jira] [Comment Edited] (SPARK-21378) Spark Poll timeout when specific offsets are passed

2017-07-18 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21378?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16092088#comment-16092088 ] Shixiong Zhu edited comment on SPARK-21378 at 7/18/17 8:00 PM:

[jira] [Comment Edited] (SPARK-21425) LongAccumulator, DoubleAccumulator not threadsafe

2017-07-18 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21425?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16092064#comment-16092064 ] Shixiong Zhu edited comment on SPARK-21425 at 7/18/17 7:41 PM:

[jira] [Commented] (SPARK-21425) LongAccumulator, DoubleAccumulator not threadsafe

2017-07-18 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21425?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16092064#comment-16092064 ] Shixiong Zhu commented on SPARK-21425: -- [~srowen] 1. Long/DoubleAccumulator assumes

[jira] [Commented] (SPARK-21460) Spark dynamic allocation breaks when ListenerBus event queue runs full

2017-07-18 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21460?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16092034#comment-16092034 ] Shixiong Zhu commented on SPARK-21460: -- Make sense. Reopened it. > Spark dynamic al

[jira] [Reopened] (SPARK-21460) Spark dynamic allocation breaks when ListenerBus event queue runs full

2017-07-18 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21460?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shixiong Zhu reopened SPARK-21460: -- > Spark dynamic allocation breaks when ListenerBus event queue runs full >

[jira] [Updated] (SPARK-21411) Failed to get new HDFS delegation tokens in AMCredentialRenewer

2017-07-18 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21411?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin updated SPARK-21411: --- Affects Version/s: (was: 2.2.0) 2.3.0 > Failed to get new HDFS del

[jira] [Resolved] (SPARK-21411) Failed to get new HDFS delegation tokens in AMCredentialRenewer

2017-07-18 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21411?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin resolved SPARK-21411. Resolution: Fixed Assignee: Saisai Shao Fix Version/s: 2.3.0 > Failed to ge

[jira] [Commented] (SPARK-21460) Spark dynamic allocation breaks when ListenerBus event queue runs full

2017-07-18 Thread Ruslan Dautkhanov (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21460?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16091972#comment-16091972 ] Ruslan Dautkhanov commented on SPARK-21460: --- [~zsxwing] can we keep this jira o

[jira] [Assigned] (SPARK-21456) Make the driver failover_timeout configurable (Mesos cluster mode)

2017-07-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21456?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-21456: Assignee: (was: Apache Spark) > Make the driver failover_timeout configurable (Mesos c

[jira] [Assigned] (SPARK-21456) Make the driver failover_timeout configurable (Mesos cluster mode)

2017-07-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21456?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-21456: Assignee: Apache Spark > Make the driver failover_timeout configurable (Mesos cluster mode

[jira] [Commented] (SPARK-21456) Make the driver failover_timeout configurable (Mesos cluster mode)

2017-07-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21456?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16091967#comment-16091967 ] Apache Spark commented on SPARK-21456: -- User 'susanxhuynh' has created a pull reques

[jira] [Commented] (SPARK-21460) Spark dynamic allocation breaks when ListenerBus event queue runs full

2017-07-18 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21460?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16091964#comment-16091964 ] Shixiong Zhu commented on SPARK-21460: -- [~Tagar] Right. SPARK-18838 probably will cr

[jira] [Commented] (SPARK-21460) Spark dynamic allocation breaks when ListenerBus event queue runs full

2017-07-18 Thread Ruslan Dautkhanov (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21460?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16091943#comment-16091943 ] Ruslan Dautkhanov commented on SPARK-21460: --- [~zsxwing], according to [~tgraves

[jira] [Resolved] (SPARK-21461) Spark Streaming crashes if CSV file has no read permissions

2017-07-18 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21461?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shixiong Zhu resolved SPARK-21461. -- Resolution: Won't Fix This is a user error rather than a Spark bug. Ignoring such errors will h

[jira] [Updated] (SPARK-21458) Tear down the framework when failover_timeout > 0 (Mesos cluster mode)

2017-07-18 Thread Susan X. Huynh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21458?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Susan X. Huynh updated SPARK-21458: --- Description: When the driver failover_timeout was always set to zero, we relied on the Mesos

[jira] [Updated] (SPARK-21458) Tear down the framework when failover_timeout > 0 (Mesos cluster mode)

2017-07-18 Thread Susan X. Huynh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21458?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Susan X. Huynh updated SPARK-21458: --- Description: When the driver failover_timeout was always set to zero, we relied on the Mesos

[jira] [Resolved] (SPARK-21460) Spark dynamic allocation breaks when ListenerBus event queue runs full

2017-07-18 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21460?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shixiong Zhu resolved SPARK-21460. -- Resolution: Duplicate This will be addressed in SPARK-18838 > Spark dynamic allocation breaks

[jira] [Created] (SPARK-21461) Spark Streaming crashes if CSV file has no read permissions

2017-07-18 Thread Dan (JIRA)
Dan created SPARK-21461: --- Summary: Spark Streaming crashes if CSV file has no read permissions Key: SPARK-21461 URL: https://issues.apache.org/jira/browse/SPARK-21461 Project: Spark Issue Type: Bug

[jira] [Updated] (SPARK-21459) Some aggregation functions change the case of nested field names

2017-07-18 Thread David Allsopp (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21459?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] David Allsopp updated SPARK-21459: -- Description: When working with DataFrames with nested schemas, the behavior of the aggregation

[jira] [Comment Edited] (SPARK-21392) Unable to infer schema when loading large Parquet file

2017-07-18 Thread Stuart Reynolds (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21392?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16090159#comment-16090159 ] Stuart Reynolds edited comment on SPARK-21392 at 7/18/17 5:30 PM: -

[jira] [Commented] (SPARK-21392) Unable to infer schema when loading large Parquet file

2017-07-18 Thread Stuart Reynolds (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21392?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16091854#comment-16091854 ] Stuart Reynolds commented on SPARK-21392: - Okie dokey: http://apache-spark-user-

[jira] [Assigned] (SPARK-21447) Spark history server fails to render compressed inprogress history file in some cases.

2017-07-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21447?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-21447: Assignee: (was: Apache Spark) > Spark history server fails to render compressed inprog

[jira] [Commented] (SPARK-21447) Spark history server fails to render compressed inprogress history file in some cases.

2017-07-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21447?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16091834#comment-16091834 ] Apache Spark commented on SPARK-21447: -- User 'ericvandenbergfb' has created a pull r

[jira] [Assigned] (SPARK-21447) Spark history server fails to render compressed inprogress history file in some cases.

2017-07-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21447?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-21447: Assignee: Apache Spark > Spark history server fails to render compressed inprogress histor

[jira] [Commented] (SPARK-15703) Make ListenerBus event queue size configurable

2017-07-18 Thread Ruslan Dautkhanov (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15703?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16091821#comment-16091821 ] Ruslan Dautkhanov commented on SPARK-15703: --- [~tgraves], filed SPARK-21460. Tha

[jira] [Created] (SPARK-21460) Spark dynamic allocation breaks when ListenerBus event queue runs full

2017-07-18 Thread Ruslan Dautkhanov (JIRA)
Ruslan Dautkhanov created SPARK-21460: - Summary: Spark dynamic allocation breaks when ListenerBus event queue runs full Key: SPARK-21460 URL: https://issues.apache.org/jira/browse/SPARK-21460 Proj

[jira] [Resolved] (SPARK-15526) Shade JPMML

2017-07-18 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15526?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin resolved SPARK-15526. Resolution: Fixed Assignee: Sean Owen Fix Version/s: 2.3.0 > Shade JPMML >

[jira] [Created] (SPARK-21459) Some aggregation functions change the case of nested field names

2017-07-18 Thread David Allsopp (JIRA)
David Allsopp created SPARK-21459: - Summary: Some aggregation functions change the case of nested field names Key: SPARK-21459 URL: https://issues.apache.org/jira/browse/SPARK-21459 Project: Spark

[jira] [Commented] (SPARK-21457) ExternalCatalog.listPartitions should correctly handle partition values with dot

2017-07-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21457?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16091684#comment-16091684 ] Apache Spark commented on SPARK-21457: -- User 'cloud-fan' has created a pull request

[jira] [Assigned] (SPARK-21457) ExternalCatalog.listPartitions should correctly handle partition values with dot

2017-07-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21457?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-21457: Assignee: Wenchen Fan (was: Apache Spark) > ExternalCatalog.listPartitions should correct

[jira] [Assigned] (SPARK-21457) ExternalCatalog.listPartitions should correctly handle partition values with dot

2017-07-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21457?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-21457: Assignee: Apache Spark (was: Wenchen Fan) > ExternalCatalog.listPartitions should correct

[jira] [Commented] (SPARK-21419) Support Mesos failover_timeout in driver (Mesos cluster mode)

2017-07-18 Thread Susan X. Huynh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21419?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16091657#comment-16091657 ] Susan X. Huynh commented on SPARK-21419: I split this into two sub-tasks: (1) mak

[jira] [Created] (SPARK-21458) Tear down the framework when failover_timeout > 0 (Mesos cluster mode)

2017-07-18 Thread Susan X. Huynh (JIRA)
Susan X. Huynh created SPARK-21458: -- Summary: Tear down the framework when failover_timeout > 0 (Mesos cluster mode) Key: SPARK-21458 URL: https://issues.apache.org/jira/browse/SPARK-21458 Project: S

[jira] [Created] (SPARK-21457) ExternalCatalog.listPartitions should correctly handle partition values with dot

2017-07-18 Thread Wenchen Fan (JIRA)
Wenchen Fan created SPARK-21457: --- Summary: ExternalCatalog.listPartitions should correctly handle partition values with dot Key: SPARK-21457 URL: https://issues.apache.org/jira/browse/SPARK-21457 Projec

[jira] [Created] (SPARK-21456) Make the driver failover_timeout configurable (Mesos cluster mode)

2017-07-18 Thread Susan X. Huynh (JIRA)
Susan X. Huynh created SPARK-21456: -- Summary: Make the driver failover_timeout configurable (Mesos cluster mode) Key: SPARK-21456 URL: https://issues.apache.org/jira/browse/SPARK-21456 Project: Spark

[jira] [Updated] (SPARK-21455) RpcFailure should be call on RpcResponseCallback.onFailure

2017-07-18 Thread Xianyang Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21455?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xianyang Liu updated SPARK-21455: - Description: Currently, when there is a `RpcFailure` need be sent back to client, we call `RpcCa

[jira] [Updated] (SPARK-21455) RpcFailure should be call on RpcResponseCallback.onFailure

2017-07-18 Thread Xianyang Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21455?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xianyang Liu updated SPARK-21455: - Description: Currently, when there is a `RpcFailure` need be sent back to client, we call `RpcCa

[jira] [Updated] (SPARK-21455) RpcFailure should be call on RpcResponseCallback.onFailure

2017-07-18 Thread Xianyang Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21455?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xianyang Liu updated SPARK-21455: - Description: Currently, when there is a `RpcFailure` need be sent back to client, we call `RpcCa

[jira] [Updated] (SPARK-21455) RpcFailure should be call on RpcResponseCallback.onFailure

2017-07-18 Thread Xianyang Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21455?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xianyang Liu updated SPARK-21455: - Description: Currently, when there is a `RpcFailure` need be sent back to client, we call `RpcCa

[jira] [Updated] (SPARK-21455) RpcFailure should be call on RpcResponseCallback.onFailure

2017-07-18 Thread Xianyang Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21455?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xianyang Liu updated SPARK-21455: - Description: Currently, when there is a `RpcFailure` need be sent back to client, we call `RpcCa

[jira] [Updated] (SPARK-21455) RpcFailure should be call on RpcResponseCallback.onFailure

2017-07-18 Thread Xianyang Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21455?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xianyang Liu updated SPARK-21455: - Description: Currently, when there is a `RpcFailure` need be sent back to client, we call `RpcCa

[jira] [Updated] (SPARK-21455) RpcFailure should be call on RpcResponseCallback.onFailure

2017-07-18 Thread Xianyang Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21455?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xianyang Liu updated SPARK-21455: - Description: Currently, when there is a `RpcFailure` need be sent back to client, we call `RpcCa

[jira] [Assigned] (SPARK-21455) RpcFailure should be call on RpcResponseCallback.onFailure

2017-07-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21455?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-21455: Assignee: Apache Spark > RpcFailure should be call on RpcResponseCallback.onFailure >

[jira] [Commented] (SPARK-15703) Make ListenerBus event queue size configurable

2017-07-18 Thread Thomas Graves (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15703?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16091531#comment-16091531 ] Thomas Graves commented on SPARK-15703: --- It should not be breaking dynamic allocati

[jira] [Commented] (SPARK-21455) RpcFailure should be call on RpcResponseCallback.onFailure

2017-07-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21455?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16091529#comment-16091529 ] Apache Spark commented on SPARK-21455: -- User 'ConeyLiu' has created a pull request f

[jira] [Assigned] (SPARK-21455) RpcFailure should be call on RpcResponseCallback.onFailure

2017-07-18 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21455?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-21455: Assignee: (was: Apache Spark) > RpcFailure should be call on RpcResponseCallback.onFai

[jira] [Updated] (SPARK-21420) Support array type code 'q' and 'Q'

2017-07-18 Thread Xiang Gao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21420?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiang Gao updated SPARK-21420: -- Description: Python's array type {{q}} and {{Q}} are currently not supported by {{net.razorvine.pickle

[jira] [Updated] (SPARK-21420) Support array type code 'q' and 'Q'

2017-07-18 Thread Xiang Gao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21420?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiang Gao updated SPARK-21420: -- Description: Python's array type {{q}} and {{Q}} are currently not supported by {{net.razorvine.pickle

[jira] [Created] (SPARK-21455) RpcFailure should be call on RpcResponseCallback.onFailure

2017-07-18 Thread Xianyang Liu (JIRA)
Xianyang Liu created SPARK-21455: Summary: RpcFailure should be call on RpcResponseCallback.onFailure Key: SPARK-21455 URL: https://issues.apache.org/jira/browse/SPARK-21455 Project: Spark Is

[jira] [Commented] (SPARK-20383) SparkSQL unsupports to create function with the keyword 'OR REPLACE' and 'IF NOT EXISTS'

2017-07-18 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20383?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16091524#comment-16091524 ] Wenchen Fan commented on SPARK-20383: - This seems like an improvement for the existin

[jira] [Commented] (SPARK-21454) Decimal up cast to higher scale fails while reading parquet to Dataset

2017-07-18 Thread Feng Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21454?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16091462#comment-16091462 ] Feng Zhu commented on SPARK-21454: -- I don't think it is an issue. The condition for Dec

[jira] [Commented] (SPARK-20383) SparkSQL unsupports to create function with the keyword 'OR REPLACE' and 'IF NOT EXISTS'

2017-07-18 Thread Xiaochen Ouyang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20383?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16091454#comment-16091454 ] Xiaochen Ouyang commented on SPARK-20383: - [~cloud_fan] Hello, whether the type o

[jira] [Commented] (SPARK-18226) SparkR displaying vector columns in incorrect way

2017-07-18 Thread Kirti (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18226?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16091382#comment-16091382 ] Kirti commented on SPARK-18226: --- Is there any way around to handle this issue in SparkR? >

[jira] [Commented] (SPARK-21453) Streaming kafka source (structured spark)

2017-07-18 Thread Pablo Panero (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21453?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16091353#comment-16091353 ] Pablo Panero commented on SPARK-21453: -- [~srowen] could you point me to the mailing

[jira] [Updated] (SPARK-21454) Decimal up cast to higher scale fails while reading parquet to Dataset

2017-07-18 Thread Karim Wadie (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21454?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Karim Wadie updated SPARK-21454: Description: Given a parquet file with a decimal (38,4) field. One can read it into a dataframe bu

[jira] [Created] (SPARK-21454) Decimal up cast to higher scale fails while reading parquet to Dataset

2017-07-18 Thread Karim Wadie (JIRA)
Karim Wadie created SPARK-21454: --- Summary: Decimal up cast to higher scale fails while reading parquet to Dataset Key: SPARK-21454 URL: https://issues.apache.org/jira/browse/SPARK-21454 Project: Spark

  1   2   >