[jira] [Comment Edited] (SPARK-19293) Spark 2.1.x unstable with spark.speculation=true

2017-06-21 Thread coneyliu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19293?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16057127#comment-16057127 ] coneyliu edited comment on SPARK-19293 at 6/21/17 8:01 AM: --- Have you tried the

[jira] [Commented] (SPARK-19293) Spark 2.1.x unstable with spark.speculation=true

2017-06-21 Thread coneyliu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19293?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16057127#comment-16057127 ] coneyliu commented on SPARK-19293: -- Have you tried the latest code? The exceptions you give are all

[jira] [Commented] (SPARK-19293) Spark 2.1.x unstable with spark.speculation=true

2017-06-21 Thread Damian Momot (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19293?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16057091#comment-16057091 ] Damian Momot commented on SPARK-19293: -- Yep, Some tasks are marked as "killed" but some become

[jira] [Commented] (SPARK-21159) Cluster mode, driver throws connection refused exception submitted by SparkLauncher

2017-06-21 Thread niefei (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21159?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16057117#comment-16057117 ] niefei commented on SPARK-21159: thank you for your reply. it should use launcher's IP address to connect

[jira] [Commented] (SPARK-17129) Support statistics collection and cardinality estimation for partitioned tables

2017-06-21 Thread Zhenhua Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17129?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16057033#comment-16057033 ] Zhenhua Wang commented on SPARK-17129: -- [~mbasmanova] Sure, that'll be great~ > Support statistics

[jira] [Commented] (SPARK-19293) Spark 2.1.x unstable with spark.speculation=true

2017-06-21 Thread Damian Momot (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19293?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16057157#comment-16057157 ] Damian Momot commented on SPARK-19293: -- I'll try to build from 2.2 branch and test today > Spark

[jira] [Commented] (SPARK-21144) Unexpected results when the data schema and partition schema have the duplicate columns

2017-06-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21144?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16057312#comment-16057312 ] Apache Spark commented on SPARK-21144: -- User 'maropu' has created a pull request for this issue:

[jira] [Updated] (SPARK-20466) HadoopRDD#addLocalConfiguration throws NPE

2017-06-21 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20466?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-20466: -- Priority: Minor (was: Major) Hm, I think the question is how the JobConf is ever null here. I think

[jira] [Commented] (SPARK-18484) case class datasets - ability to specify decimal precision and scale

2017-06-21 Thread Arkadiusz Bicz (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18484?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16057410#comment-16057410 ] Arkadiusz Bicz commented on SPARK-18484: Usage of DecimalType should be avoided with this

[jira] [Created] (SPARK-21160) Filtering rows with "not equal" operator yields unexpected result with null rows

2017-06-21 Thread Edoardo Vivo (JIRA)
Edoardo Vivo created SPARK-21160: Summary: Filtering rows with "not equal" operator yields unexpected result with null rows Key: SPARK-21160 URL: https://issues.apache.org/jira/browse/SPARK-21160

[jira] [Commented] (SPARK-21137) Spark cannot read many small files (wholeTextFiles)

2017-06-21 Thread Mikael Valot (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21137?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16057510#comment-16057510 ] Mikael Valot commented on SPARK-21137: -- this is a very common issue, I do not understand why this is

[jira] [Created] (SPARK-21161) SparkContext stopped when execute a query on Solr

2017-06-21 Thread Jian Wu (JIRA)
Jian Wu created SPARK-21161: --- Summary: SparkContext stopped when execute a query on Solr Key: SPARK-21161 URL: https://issues.apache.org/jira/browse/SPARK-21161 Project: Spark Issue Type: Bug

[jira] [Assigned] (SPARK-21161) SparkContext stopped when execute a query on Solr

2017-06-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21161?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-21161: Assignee: Apache Spark > SparkContext stopped when execute a query on Solr >

[jira] [Commented] (SPARK-21161) SparkContext stopped when execute a query on Solr

2017-06-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21161?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16057530#comment-16057530 ] Apache Spark commented on SPARK-21161: -- User 'janplus' has created a pull request for this issue:

[jira] [Assigned] (SPARK-21161) SparkContext stopped when execute a query on Solr

2017-06-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21161?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-21161: Assignee: (was: Apache Spark) > SparkContext stopped when execute a query on Solr >

[jira] [Commented] (SPARK-21093) Multiple gapply execution occasionally failed in SparkR

2017-06-21 Thread Nick Pentreath (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21093?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16057550#comment-16057550 ] Nick Pentreath commented on SPARK-21093: Just adding the info from test failure report from the

[jira] [Commented] (SPARK-21164) Remove isTableSample from Sample

2017-06-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21164?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16057923#comment-16057923 ] Apache Spark commented on SPARK-21164: -- User 'gatorsmile' has created a pull request for this issue:

[jira] [Created] (SPARK-21165) Fail to write into partitioned hive table due to attribute reference not working with cast on partition column

2017-06-21 Thread Imran Rashid (JIRA)
Imran Rashid created SPARK-21165: Summary: Fail to write into partitioned hive table due to attribute reference not working with cast on partition column Key: SPARK-21165 URL:

[jira] [Commented] (SPARK-21165) Fail to write into partitioned hive table due to attribute reference not working with cast on partition column

2017-06-21 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21165?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16058440#comment-16058440 ] Xiao Li commented on SPARK-21165: - Unable to reproduce it in the current master branch. Will try to use

[jira] [Commented] (SPARK-21165) Fail to write into partitioned hive table due to attribute reference not working with cast on partition column

2017-06-21 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21165?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16058490#comment-16058490 ] Xiao Li commented on SPARK-21165: - 2.2 branch failed with the same error. > Fail to write into

[jira] [Updated] (SPARK-20114) spark.ml parity for sequential pattern mining - PrefixSpan

2017-06-21 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20114?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-20114: -- Issue Type: Sub-task (was: New Feature) Parent: SPARK-14501 > spark.ml parity

[jira] [Created] (SPARK-21166) Automated ML persistence

2017-06-21 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-21166: - Summary: Automated ML persistence Key: SPARK-21166 URL: https://issues.apache.org/jira/browse/SPARK-21166 Project: Spark Issue Type: New Feature

[jira] [Created] (SPARK-21167) Path is not decoded correctly when reading output of FileSink

2017-06-21 Thread Shixiong Zhu (JIRA)
Shixiong Zhu created SPARK-21167: Summary: Path is not decoded correctly when reading output of FileSink Key: SPARK-21167 URL: https://issues.apache.org/jira/browse/SPARK-21167 Project: Spark

[jira] [Resolved] (SPARK-20830) PySpark wrappers for explode_outer and posexplode_outer

2017-06-21 Thread Takuya Ueshin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20830?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Takuya Ueshin resolved SPARK-20830. --- Resolution: Fixed Fix Version/s: 2.3.0 Issue resolved by pull request 18049

[jira] [Assigned] (SPARK-20830) PySpark wrappers for explode_outer and posexplode_outer

2017-06-21 Thread Takuya Ueshin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20830?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Takuya Ueshin reassigned SPARK-20830: - Assignee: Maciej Szymkiewicz > PySpark wrappers for explode_outer and posexplode_outer

[jira] [Assigned] (SPARK-21167) Path is not decoded correctly when reading output of FileSink

2017-06-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21167?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-21167: Assignee: Shixiong Zhu (was: Apache Spark) > Path is not decoded correctly when reading

[jira] [Assigned] (SPARK-21167) Path is not decoded correctly when reading output of FileSink

2017-06-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21167?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-21167: Assignee: Apache Spark (was: Shixiong Zhu) > Path is not decoded correctly when reading

[jira] [Commented] (SPARK-21167) Path is not decoded correctly when reading output of FileSink

2017-06-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21167?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16058486#comment-16058486 ] Apache Spark commented on SPARK-21167: -- User 'zsxwing' has created a pull request for this issue:

[jira] [Updated] (SPARK-21147) the schema of socket/rate source can not be set.

2017-06-21 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21147?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shixiong Zhu updated SPARK-21147: - Summary: the schema of socket/rate source can not be set. (was: the schema of socket source can

[jira] [Resolved] (SPARK-21125) PySpark context missing function to set Job Description.

2017-06-21 Thread Takuya Ueshin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21125?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Takuya Ueshin resolved SPARK-21125. --- Resolution: Fixed Fix Version/s: 2.3.0 Issue resolved by pull request 18332

[jira] [Updated] (SPARK-21147) the schema of socket/rate source can not be set.

2017-06-21 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21147?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shixiong Zhu updated SPARK-21147: - Affects Version/s: 2.2.0 > the schema of socket/rate source can not be set. >

[jira] [Assigned] (SPARK-21125) PySpark context missing function to set Job Description.

2017-06-21 Thread Takuya Ueshin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21125?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Takuya Ueshin reassigned SPARK-21125: - Assignee: Shane Jarvie > PySpark context missing function to set Job Description. >

[jira] [Resolved] (SPARK-21147) the schema of socket/rate source can not be set.

2017-06-21 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21147?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shixiong Zhu resolved SPARK-21147. -- Resolution: Fixed Assignee: Hyukjin Kwon Fix Version/s: 2.3.0 > the schema of

[jira] [Updated] (SPARK-21165) Fail to write into partitioned hive table due to attribute reference not working with cast on partition column

2017-06-21 Thread Imran Rashid (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21165?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Imran Rashid updated SPARK-21165: - Description: A simple "insert into ... select" involving partitioned hive tables fails. Here's

[jira] [Resolved] (SPARK-20917) SparkR supports string encoding consistent with R

2017-06-21 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20917?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Felix Cheung resolved SPARK-20917. -- Resolution: Fixed Assignee: Wayne Zhang Fix Version/s: 2.3.0

[jira] [Created] (SPARK-21164) Remove isTableSample from Sample

2017-06-21 Thread Xiao Li (JIRA)
Xiao Li created SPARK-21164: --- Summary: Remove isTableSample from Sample Key: SPARK-21164 URL: https://issues.apache.org/jira/browse/SPARK-21164 Project: Spark Issue Type: Improvement

[jira] [Assigned] (SPARK-16019) Eliminate unexpected delay during spark on yarn job launch

2017-06-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16019?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-16019: Assignee: Apache Spark > Eliminate unexpected delay during spark on yarn job launch >

[jira] [Assigned] (SPARK-16019) Eliminate unexpected delay during spark on yarn job launch

2017-06-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16019?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-16019: Assignee: (was: Apache Spark) > Eliminate unexpected delay during spark on yarn job

[jira] [Commented] (SPARK-16019) Eliminate unexpected delay during spark on yarn job launch

2017-06-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16019?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16058116#comment-16058116 ] Apache Spark commented on SPARK-16019: -- User 'vanzin' has created a pull request for this issue:

[jira] [Assigned] (SPARK-21164) Remove isTableSample from Sample

2017-06-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21164?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-21164: Assignee: Xiao Li (was: Apache Spark) > Remove isTableSample from Sample >

[jira] [Assigned] (SPARK-21164) Remove isTableSample from Sample

2017-06-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21164?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-21164: Assignee: Apache Spark (was: Xiao Li) > Remove isTableSample from Sample >

[jira] [Created] (SPARK-21168) KafkaRDD should always set kafka clientId.

2017-06-21 Thread Xingxing Di (JIRA)
Xingxing Di created SPARK-21168: --- Summary: KafkaRDD should always set kafka clientId. Key: SPARK-21168 URL: https://issues.apache.org/jira/browse/SPARK-21168 Project: Spark Issue Type: Bug

[jira] [Assigned] (SPARK-21149) Add job description API for R

2017-06-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21149?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-21149: Assignee: Apache Spark > Add job description API for R > - >

[jira] [Commented] (SPARK-21149) Add job description API for R

2017-06-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21149?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16058599#comment-16058599 ] Apache Spark commented on SPARK-21149: -- User 'HyukjinKwon' has created a pull request for this

[jira] [Assigned] (SPARK-21149) Add job description API for R

2017-06-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21149?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-21149: Assignee: (was: Apache Spark) > Add job description API for R >

[jira] [Updated] (SPARK-21168) KafkaRDD should always set kafka clientId.

2017-06-21 Thread Xingxing Di (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21168?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xingxing Di updated SPARK-21168: External issue URL: https://github.com/apache/spark/pull/18383 > KafkaRDD should always set kafka

[jira] [Created] (SPARK-21169) Spark HA: Jobs state is in WAITING status after reconnecting to standby master

2017-06-21 Thread Srinivasarao Daruna (JIRA)
Srinivasarao Daruna created SPARK-21169: --- Summary: Spark HA: Jobs state is in WAITING status after reconnecting to standby master Key: SPARK-21169 URL: https://issues.apache.org/jira/browse/SPARK-21169

[jira] [Updated] (SPARK-18016) Code Generation: Constant Pool Past Limit for Wide/Nested Dataset

2017-06-21 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18016?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan updated SPARK-18016: Fix Version/s: (was: 2.3.0) 2.2.0 2.1.2 > Code

[jira] [Commented] (SPARK-21158) SparkSQL function SparkSession.Catalog.ListTables() does not handle spark setting for case-sensitivity

2017-06-21 Thread Kathryn McClintic (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21158?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16058577#comment-16058577 ] Kathryn McClintic commented on SPARK-21158: --- I'm fine with that from my perspective. >

[jira] [Commented] (SPARK-19141) VectorAssembler metadata causing memory issues

2017-06-21 Thread Mayur Bhole (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19141?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16058713#comment-16058713 ] Mayur Bhole commented on SPARK-19141: - Is there any possible work around for this issue? >

[jira] [Updated] (SPARK-21171) Speculate task scheduling block dirve handle normal task when a job task number more than one hundred thousand

2017-06-21 Thread wangminfeng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21171?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] wangminfeng updated SPARK-21171: Description: If a job have more then one hundred thousand tasks and spark.speculation is true,

[jira] [Commented] (SPARK-21167) Path is not decoded correctly when reading output of FileSink

2017-06-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21167?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16058678#comment-16058678 ] Apache Spark commented on SPARK-21167: -- User 'dijingran' has created a pull request for this issue:

[jira] [Resolved] (SPARK-20906) Constrained Logistic Regression for SparkR

2017-06-21 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20906?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Felix Cheung resolved SPARK-20906. -- Resolution: Fixed Assignee: Miao Wang Fix Version/s: 2.3.0 Target

[jira] [Resolved] (SPARK-21160) Filtering rows with "not equal" operator yields unexpected result with null rows

2017-06-21 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21160?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-21160. -- Resolution: Not A Bug There is null-safe equality comparison ``` scala> Seq(Some(1), Some(2),

[jira] [Comment Edited] (SPARK-21160) Filtering rows with "not equal" operator yields unexpected result with null rows

2017-06-21 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21160?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16058537#comment-16058537 ] Hyukjin Kwon edited comment on SPARK-21160 at 6/22/17 12:40 AM: There is

[jira] [Assigned] (SPARK-21170) Utils.tryWithSafeFinallyAndFailureCallbacks throws IllegalArgumentException: Self-suppression not permitted

2017-06-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21170?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-21170: Assignee: (was: Apache Spark) > Utils.tryWithSafeFinallyAndFailureCallbacks throws

[jira] [Commented] (SPARK-21170) Utils.tryWithSafeFinallyAndFailureCallbacks throws IllegalArgumentException: Self-suppression not permitted

2017-06-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21170?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16058775#comment-16058775 ] Apache Spark commented on SPARK-21170: -- User 'devaraj-kavali' has created a pull request for this

[jira] [Assigned] (SPARK-21170) Utils.tryWithSafeFinallyAndFailureCallbacks throws IllegalArgumentException: Self-suppression not permitted

2017-06-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21170?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-21170: Assignee: Apache Spark > Utils.tryWithSafeFinallyAndFailureCallbacks throws

[jira] [Created] (SPARK-21171) Speculate task scheduling block dirve handle normal task when a job task number more than one hundred thousand

2017-06-21 Thread wangminfeng (JIRA)
wangminfeng created SPARK-21171: --- Summary: Speculate task scheduling block dirve handle normal task when a job task number more than one hundred thousand Key: SPARK-21171 URL:

[jira] [Issue Comment Deleted] (SPARK-21155) Add (? running tasks) into Spark UI progress

2017-06-21 Thread Eric Vandenberg (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21155?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Eric Vandenberg updated SPARK-21155: Comment: was deleted (was: Before ) > Add (? running tasks) into Spark UI progress >

[jira] [Commented] (SPARK-20338) Spaces in spark.eventLog.dir are not correctly handled

2017-06-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20338?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16058666#comment-16058666 ] Apache Spark commented on SPARK-20338: -- User 'zuotingbing' has created a pull request for this

[jira] [Updated] (SPARK-21168) KafkaRDD should always set kafka clientId.

2017-06-21 Thread Xingxing Di (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21168?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xingxing Di updated SPARK-21168: External issue URL: (was: https://github.com/apache/spark/pull/18383) > KafkaRDD should always

[jira] [Assigned] (SPARK-21165) Fail to write into partitioned hive table due to attribute reference not working with cast on partition column

2017-06-21 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21165?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li reassigned SPARK-21165: --- Assignee: Xiao Li > Fail to write into partitioned hive table due to attribute reference not >

[jira] [Commented] (SPARK-19341) Bucketing support for Structured Streaming

2017-06-21 Thread Fei Shao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19341?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16058590#comment-16058590 ] Fei Shao commented on SPARK-19341: -- @gagan taneia Would you like add more info about this issue please?

[jira] [Created] (SPARK-21170) Utils.tryWithSafeFinallyAndFailureCallbacks throws IllegalArgumentException: Self-suppression not permitted

2017-06-21 Thread Devaraj K (JIRA)
Devaraj K created SPARK-21170: - Summary: Utils.tryWithSafeFinallyAndFailureCallbacks throws IllegalArgumentException: Self-suppression not permitted Key: SPARK-21170 URL:

[jira] [Commented] (SPARK-21137) Spark cannot read many small files (wholeTextFiles)

2017-06-21 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21137?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16057565#comment-16057565 ] Sean Owen commented on SPARK-21137: --- [~leakimav] -- there still isn't detail here about why this is a

[jira] [Commented] (SPARK-21160) Filtering rows with "not equal" operator yields unexpected result with null rows

2017-06-21 Thread Takeshi Yamamuro (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21160?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16057629#comment-16057629 ] Takeshi Yamamuro commented on SPARK-21160: -- you better google it though, this is because NULL is

[jira] [Created] (SPARK-21163) DataFrame.toPandas should respect the data type

2017-06-21 Thread Wenchen Fan (JIRA)
Wenchen Fan created SPARK-21163: --- Summary: DataFrame.toPandas should respect the data type Key: SPARK-21163 URL: https://issues.apache.org/jira/browse/SPARK-21163 Project: Spark Issue Type:

[jira] [Commented] (SPARK-21160) Filtering rows with "not equal" operator yields unexpected result with null rows

2017-06-21 Thread Edoardo Vivo (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21160?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16057722#comment-16057722 ] Edoardo Vivo commented on SPARK-21160: -- Thank you for your answer. I noticed the same happens in

[jira] [Assigned] (SPARK-21163) DataFrame.toPandas should respect the data type

2017-06-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21163?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-21163: Assignee: Wenchen Fan (was: Apache Spark) > DataFrame.toPandas should respect the data

[jira] [Commented] (SPARK-21163) DataFrame.toPandas should respect the data type

2017-06-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21163?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16057729#comment-16057729 ] Apache Spark commented on SPARK-21163: -- User 'cloud-fan' has created a pull request for this issue:

[jira] [Assigned] (SPARK-21163) DataFrame.toPandas should respect the data type

2017-06-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21163?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-21163: Assignee: Apache Spark (was: Wenchen Fan) > DataFrame.toPandas should respect the data

[jira] [Commented] (SPARK-21161) SparkContext stopped when execute a query on Solr

2017-06-21 Thread Jian Wu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21161?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16057753#comment-16057753 ] Jian Wu commented on SPARK-21161: - I'll fix this bug in the `spark-solr` project. Thx for comment. >

[jira] [Commented] (SPARK-21159) Cluster mode, driver throws connection refused exception submitted by SparkLauncher

2017-06-21 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21159?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16057767#comment-16057767 ] Marcelo Vanzin commented on SPARK-21159: No, that should not be it. That's not how the launcher

[jira] [Commented] (SPARK-21160) Filtering rows with "not equal" operator yields unexpected result with null rows

2017-06-21 Thread Takeshi Yamamuro (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21160?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16057587#comment-16057587 ] Takeshi Yamamuro commented on SPARK-21160: -- This is an expected behaviour. Probably, you want to

[jira] [Commented] (SPARK-21137) Spark cannot read many small files (wholeTextFiles)

2017-06-21 Thread sam (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21137?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16057592#comment-16057592 ] sam commented on SPARK-21137: - [~srowen] I thought I already made a point about that? Please can you tell me

[jira] [Commented] (SPARK-21160) Filtering rows with "not equal" operator yields unexpected result with null rows

2017-06-21 Thread Takeshi Yamamuro (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21160?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16057601#comment-16057601 ] Takeshi Yamamuro commented on SPARK-21160: -- BTW, anybody knows why `a` is nullable in this case?

[jira] [Resolved] (SPARK-21082) Consider Executor's memory usage when scheduling task

2017-06-21 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21082?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-21082. --- Resolution: Won't Fix > Consider Executor's memory usage when scheduling task >

[jira] [Commented] (SPARK-21161) SparkContext stopped when execute a query on Solr

2017-06-21 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21161?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16057564#comment-16057564 ] Sean Owen commented on SPARK-21161: --- Yes, that's not a valid host/port. No, you can't just ignore that

[jira] [Commented] (SPARK-21160) Filtering rows with "not equal" operator yields unexpected result with null rows

2017-06-21 Thread Edoardo Vivo (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21160?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16057614#comment-16057614 ] Edoardo Vivo commented on SPARK-21160: -- Sorry for the stupid question, but may I ask WHY this is the

[jira] [Created] (SPARK-21162) Cannot count rows in an empty Hive table stored as parquet when spark.sql.parquet.cacheMetadata is set to false

2017-06-21 Thread Tom Ogle (JIRA)
Tom Ogle created SPARK-21162: Summary: Cannot count rows in an empty Hive table stored as parquet when spark.sql.parquet.cacheMetadata is set to false Key: SPARK-21162 URL:

[jira] [Commented] (SPARK-18016) Code Generation: Constant Pool Past Limit for Wide/Nested Dataset

2017-06-21 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18016?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16057657#comment-16057657 ] Apache Spark commented on SPARK-18016: -- User 'bdrillard' has created a pull request for this issue:

[jira] [Resolved] (SPARK-20640) Make rpc timeout and retry for shuffle registration configurable

2017-06-21 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20640?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan resolved SPARK-20640. - Resolution: Fixed Fix Version/s: 2.3.0 Issue resolved by pull request 18092

[jira] [Assigned] (SPARK-20640) Make rpc timeout and retry for shuffle registration configurable

2017-06-21 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20640?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan reassigned SPARK-20640: --- Assignee: Li Yichao > Make rpc timeout and retry for shuffle registration configurable >

[jira] [Commented] (SPARK-21137) Spark cannot read many small files (wholeTextFiles)

2017-06-21 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21137?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16057605#comment-16057605 ] Sean Owen commented on SPARK-21137: --- (This is not a common use case) What change are you proposing in

[jira] [Commented] (SPARK-21161) SparkContext stopped when execute a query on Solr

2017-06-21 Thread Jian Wu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21161?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16057855#comment-16057855 ] Jian Wu commented on SPARK-21161: - For others who come up with the same issue, please check

[jira] [Commented] (SPARK-10878) Race condition when resolving Maven coordinates via Ivy

2017-06-21 Thread Todd Morrison (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10878?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16057797#comment-16057797 ] Todd Morrison commented on SPARK-10878: --- Any chance we can move the priority of this issue up?

[jira] [Resolved] (SPARK-17851) Make sure all test sqls in catalyst pass checkAnalysis

2017-06-21 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17851?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li resolved SPARK-17851. - Resolution: Fixed Assignee: Jiang Xingbo Fix Version/s: 2.3.0 > Make sure all test sqls