[jira] [Updated] (SPARK-27232) Ignore file locality in InMemoryFileIndex if spark.locality.wait is set to

2019-03-21 Thread EdisonWang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27232?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] EdisonWang updated SPARK-27232: --- Summary: Ignore file locality in InMemoryFileIndex if spark.locality.wait is set to (was: Skip to

[jira] [Updated] (SPARK-27232) Ignore file locality in InMemoryFileIndex if spark.locality.wait is set to

2019-03-21 Thread EdisonWang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27232?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] EdisonWang updated SPARK-27232: --- Description: `InMemoryFileIndex` needs to request file block location information in order to do

[jira] [Assigned] (SPARK-23710) Upgrade the built-in Hive to 2.3.4 for hadoop-3.1

2019-03-21 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23710?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon reassigned SPARK-23710: Assignee: Yuming Wang > Upgrade the built-in Hive to 2.3.4 for hadoop-3.1 >

[jira] [Resolved] (SPARK-27228) Spark long delay on close, possible problem with killing executors

2019-03-21 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27228?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-27228. -- Resolution: Incomplete > Spark long delay on close, possible problem with killing executors >

[jira] [Commented] (SPARK-27228) Spark long delay on close, possible problem with killing executors

2019-03-21 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27228?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16798683#comment-16798683 ] Hyukjin Kwon commented on SPARK-27228: -- Please just don't copy and paste the log without any

[jira] [Commented] (SPARK-27229) GroupBy Placement in Intersect Distinct

2019-03-21 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27229?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16798681#comment-16798681 ] Hyukjin Kwon commented on SPARK-27229: -- Please avoid setting Critical+, which is usually reserved for

[jira] [Updated] (SPARK-27229) GroupBy Placement in Intersect Distinct

2019-03-21 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27229?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon updated SPARK-27229: - Priority: Major (was: Critical) > GroupBy Placement in Intersect Distinct >

[jira] [Commented] (SPARK-27232) Skip to get file block location if locality is ignored

2019-03-21 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27232?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16798679#comment-16798679 ] Hyukjin Kwon commented on SPARK-27232: -- Please fill the JIRA description. > Skip to get file block

[jira] [Commented] (SPARK-27234) Continuous Streaming does not support python UDFs

2019-03-21 Thread Jungtaek Lim (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27234?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16798677#comment-16798677 ] Jungtaek Lim commented on SPARK-27234: -- Could you please provide a reproducer for Apache vanilla

[jira] [Updated] (SPARK-27238) In the same APP, maybe some hive parquet tables can't use the built-in Parquet reader and writer

2019-03-21 Thread liuxian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27238?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] liuxian updated SPARK-27238: Description: In the same APP, TableA and TableB are both hive parquet tables, but TableA can't use the

[jira] [Created] (SPARK-27238) In the same APP, maybe some hive parquet tables can't use the built-in Parquet reader and writer

2019-03-21 Thread liuxian (JIRA)
liuxian created SPARK-27238: --- Summary: In the same APP, maybe some hive parquet tables can't use the built-in Parquet reader and writer Key: SPARK-27238 URL: https://issues.apache.org/jira/browse/SPARK-27238

[jira] [Comment Edited] (SPARK-27169) number of active tasks is negative on executors page

2019-03-21 Thread acupple (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27169?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16797070#comment-16797070 ] acupple edited comment on SPARK-27169 at 3/22/19 3:12 AM: -- Thanks for your

[jira] [Commented] (SPARK-27234) Continuous Streaming does not support python UDFs

2019-03-21 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27234?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16798654#comment-16798654 ] Hyukjin Kwon commented on SPARK-27234: -- What's {{makeReply}} and output from the codes? >

[jira] [Resolved] (SPARK-27236) Refactor log-appender pattern in tests

2019-03-21 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27236?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li resolved SPARK-27236. - Resolution: Fixed Assignee: Maryann Xue Fix Version/s: 3.0.0 > Refactor log-appender

[jira] [Commented] (SPARK-27218) spark-sql-kafka-0-10 startingOffset=earliest not working as expected with streaming

2019-03-21 Thread Genmao Yu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27218?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16798635#comment-16798635 ] Genmao Yu commented on SPARK-27218: --- Could you please test it on master branch? >

[jira] [Assigned] (SPARK-26946) Identifiers for multi-catalog Spark

2019-03-21 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26946?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan reassigned SPARK-26946: --- Assignee: John Zhuge > Identifiers for multi-catalog Spark >

[jira] [Resolved] (SPARK-26946) Identifiers for multi-catalog Spark

2019-03-21 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26946?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan resolved SPARK-26946. - Resolution: Fixed Fix Version/s: 3.0.0 Issue resolved by pull request 23848

[jira] [Commented] (SPARK-27170) Better error message for syntax error with extraneous comma in the SQL parser

2019-03-21 Thread Dilip Biswal (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27170?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16798583#comment-16798583 ] Dilip Biswal commented on SPARK-27170: -- Thank you [~maropu] [~hyukjin.kwon] > Better error message

[jira] [Updated] (SPARK-27237) Introduce State schema validation among query restart

2019-03-21 Thread Jungtaek Lim (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27237?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jungtaek Lim updated SPARK-27237: - Description: Even though Spark structured streaming guide page clearly documents that "Any

[jira] [Commented] (SPARK-27237) Introduce State schema validation among query restart

2019-03-21 Thread Jungtaek Lim (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27237?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16798465#comment-16798465 ] Jungtaek Lim commented on SPARK-27237: -- Working on this. > Introduce State schema validation among

[jira] [Created] (SPARK-27237) Introduce State schema validation among query restart

2019-03-21 Thread Jungtaek Lim (JIRA)
Jungtaek Lim created SPARK-27237: Summary: Introduce State schema validation among query restart Key: SPARK-27237 URL: https://issues.apache.org/jira/browse/SPARK-27237 Project: Spark Issue

[jira] [Created] (SPARK-27236) Refactor log-appender pattern in tests

2019-03-21 Thread Maryann Xue (JIRA)
Maryann Xue created SPARK-27236: --- Summary: Refactor log-appender pattern in tests Key: SPARK-27236 URL: https://issues.apache.org/jira/browse/SPARK-27236 Project: Spark Issue Type: Improvement

[jira] [Resolved] (SPARK-27222) Support Instant and LocalDate in Literal.apply

2019-03-21 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27222?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan resolved SPARK-27222. - Resolution: Fixed Fix Version/s: 3.0.0 Issue resolved by pull request 24161

[jira] [Assigned] (SPARK-27222) Support Instant and LocalDate in Literal.apply

2019-03-21 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27222?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan reassigned SPARK-27222: --- Assignee: Maxim Gekk > Support Instant and LocalDate in Literal.apply >

[jira] [Created] (SPARK-27235) Remove the Dead code in HashedRelation.scala file from sql core module

2019-03-21 Thread Shivu Sondur (JIRA)
Shivu Sondur created SPARK-27235: Summary: Remove the Dead code in HashedRelation.scala file from sql core module Key: SPARK-27235 URL: https://issues.apache.org/jira/browse/SPARK-27235 Project:

[jira] [Commented] (SPARK-27235) Remove the Dead code in HashedRelation.scala file from sql core module

2019-03-21 Thread Shivu Sondur (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27235?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16798361#comment-16798361 ] Shivu Sondur commented on SPARK-27235: -- I will work on this > Remove the Dead code in

[jira] [Commented] (SPARK-27233) Schema of ArrayType change after saveAsTable and read

2019-03-21 Thread Sandeep Katta (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27233?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16798306#comment-16798306 ] Sandeep Katta commented on SPARK-27233: --- I will analyze this issue, if required will raise PR for

[jira] [Commented] (SPARK-27177) Update jenkins locale to en_US.UTF-8

2019-03-21 Thread shane knapp (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27177?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16798307#comment-16798307 ] shane knapp commented on SPARK-27177: - done for the PRB. will get to the SBT builds later today or

[jira] [Assigned] (SPARK-27177) Update jenkins locale to en_US.UTF-8

2019-03-21 Thread shane knapp (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27177?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] shane knapp reassigned SPARK-27177: --- Assignee: shane knapp > Update jenkins locale to en_US.UTF-8 >

[jira] [Created] (SPARK-27234) Continuous Streaming does not support python UDFs

2019-03-21 Thread Mark Hamilton (JIRA)
Mark Hamilton created SPARK-27234: - Summary: Continuous Streaming does not support python UDFs Key: SPARK-27234 URL: https://issues.apache.org/jira/browse/SPARK-27234 Project: Spark Issue

[jira] [Updated] (SPARK-27233) Schema of ArrayType change after saveAsTable and read

2019-03-21 Thread Kritsada Limpawatkul (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27233?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kritsada Limpawatkul updated SPARK-27233: - Priority: Major (was: Minor) > Schema of ArrayType change after saveAsTable

[jira] [Updated] (SPARK-27233) Schema of ArrayType change after saveAsTable and read

2019-03-21 Thread Kritsada Limpawatkul (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27233?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kritsada Limpawatkul updated SPARK-27233: - Description: This is code for reproducing. {code:java} val testTable =

[jira] [Created] (SPARK-27233) Schema of ArrayType change after saveAsTable and read

2019-03-21 Thread Kritsada Limpawatkul (JIRA)
Kritsada Limpawatkul created SPARK-27233: Summary: Schema of ArrayType change after saveAsTable and read Key: SPARK-27233 URL: https://issues.apache.org/jira/browse/SPARK-27233 Project: Spark

[jira] [Resolved] (SPARK-25196) Extends the analyze column command for cached tables

2019-03-21 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25196?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun resolved SPARK-25196. --- Resolution: Fixed Assignee: Takeshi Yamamuro Fix Version/s: 3.0.0 This is

[jira] [Updated] (SPARK-25196) Extends the analyze column command for cached tables

2019-03-21 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25196?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun updated SPARK-25196: -- Priority: Major (was: Minor) > Extends the analyze column command for cached tables >

[jira] [Updated] (SPARK-25196) Extends the analyze column command for cached tables

2019-03-21 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-25196?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun updated SPARK-25196: -- Summary: Extends the analyze column command for cached tables (was: Analyze column

[jira] [Created] (SPARK-27232) Skip to get file block location if locality is ignored

2019-03-21 Thread EdisonWang (JIRA)
EdisonWang created SPARK-27232: -- Summary: Skip to get file block location if locality is ignored Key: SPARK-27232 URL: https://issues.apache.org/jira/browse/SPARK-27232 Project: Spark Issue

[jira] [Updated] (SPARK-27218) spark-sql-kafka-0-10 startingOffset=earliest not working as expected with streaming

2019-03-21 Thread Emanuele Sabellico (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27218?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Emanuele Sabellico updated SPARK-27218: --- Description: Hi, I'm trying to stream a kafka topic with 

[jira] [Commented] (SPARK-27228) Spark long delay on close, possible problem with killing executors

2019-03-21 Thread Lukas Waldmann (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27228?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16798209#comment-16798209 ] Lukas Waldmann commented on SPARK-27228: Startup parameters: spark-submit --conf

[jira] [Updated] (SPARK-27228) Spark long delay on close, possible problem with killing executors

2019-03-21 Thread Lukas Waldmann (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27228?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lukas Waldmann updated SPARK-27228: --- Attachment: log.html > Spark long delay on close, possible problem with killing executors >

[jira] [Commented] (SPARK-27228) Spark long delay on close, possible problem with killing executors

2019-03-21 Thread Lukas Waldmann (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27228?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16798207#comment-16798207 ] Lukas Waldmann commented on SPARK-27228: log file added > Spark long delay on close, possible

[jira] [Closed] (SPARK-27230) After creating a table in Hive 2.3.4, calling HiveContext from pyspark cannot use the table data and reports an error

2019-03-21 Thread Vitamin_C (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27230?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vitamin_C closed SPARK-27230. - > After creating a table in Hive 2.3.4, calling HiveContext from pyspark cannot use the table data and reports an error >

[jira] [Resolved] (SPARK-27230) After creating a table in Hive 2.3.4, calling HiveContext from pyspark cannot use the table data and reports an error

2019-03-21 Thread Vitamin_C (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27230?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vitamin_C resolved SPARK-27230. --- Resolution: Fixed > After creating a table in Hive 2.3.4, calling HiveContext from pyspark cannot use the table data and reports an error >

[jira] [Commented] (SPARK-27230) After creating a table in Hive 2.3.4, calling HiveContext from pyspark cannot use the table data and reports an error

2019-03-21 Thread Vitamin_C (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27230?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16798204#comment-16798204 ] Vitamin_C commented on SPARK-27230: ---

[jira] [Commented] (SPARK-27230) After creating a table in Hive 2.3.4, calling HiveContext from pyspark cannot use the table data and reports an error

2019-03-21 Thread Sandeep Katta (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27230?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16798201#comment-16798201 ] Sandeep Katta commented on SPARK-27230: --- just click on resolve issue and close it as invalid >

[jira] [Commented] (SPARK-27230) After creating a table in Hive 2.3.4, calling HiveContext from pyspark cannot use the table data and reports an error

2019-03-21 Thread Vitamin_C (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27230?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16798198#comment-16798198 ] Vitamin_C commented on SPARK-27230: --- This is my first question, I want to know how to cancel this

[jira] [Updated] (SPARK-27218) spark-sql-kafka-0-10 startingOffset=earliest not working as expected with streaming

2019-03-21 Thread Emanuele Sabellico (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27218?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Emanuele Sabellico updated SPARK-27218: --- Description: Hi, I'm trying to stream a kafka topic with 

[jira] [Commented] (SPARK-27230) After creating a table in Hive 2.3.4, calling HiveContext from pyspark cannot use the table data and reports an error

2019-03-21 Thread Vitamin_C (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27230?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16798195#comment-16798195 ] Vitamin_C commented on SPARK-27230: ---

[jira] [Commented] (SPARK-27230) After creating a table in Hive 2.3.4, calling HiveContext from pyspark cannot use the table data and reports an error

2019-03-21 Thread Sandeep Katta (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27230?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16798187#comment-16798187 ] Sandeep Katta commented on SPARK-27230: --- can you please explain the problem in english ? >

[jira] [Commented] (SPARK-27228) Spark long delay on close, possible problem with killing executors

2019-03-21 Thread Sandeep Katta (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27228?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16798185#comment-16798185 ] Sandeep Katta commented on SPARK-27228: --- can you upload the log file? > Spark long delay on

[jira] [Commented] (SPARK-27204) First time Loading application page from History Server is taking time when event log size is huge

2019-03-21 Thread shahid (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27204?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16798178#comment-16798178 ] shahid commented on SPARK-27204: Yes. Actually, to load the UI page, SHS needs to replay the entire

[jira] [Updated] (SPARK-27231) Stack overflow error, when we increase the number of iteration in PIC

2019-03-21 Thread shahid (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27231?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] shahid updated SPARK-27231: --- Description: val dataset = spark.createDataFrame(Seq( (0L, 1L, 1.0), (0L, 2L, 1.0),

[jira] [Commented] (SPARK-27231) Stack overflow error, when we increase the number of iteration in PIC

2019-03-21 Thread shahid (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27231?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16798170#comment-16798170 ] shahid commented on SPARK-27231: I am analyzing the issue, > Stack overflow error, when we increase the

[jira] [Created] (SPARK-27231) Stack overflow error, when we increase the number of iteration in PIC

2019-03-21 Thread shahid (JIRA)
shahid created SPARK-27231: -- Summary: Stack overflow error, when we increase the number of iteration in PIC Key: SPARK-27231 URL: https://issues.apache.org/jira/browse/SPARK-27231 Project: Spark

[jira] [Commented] (SPARK-26894) Fix Alias handling in AggregateEstimation

2019-03-21 Thread Venkata krishnan Sowrirajan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26894?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16798154#comment-16798154 ] Venkata krishnan Sowrirajan commented on SPARK-26894: - Thanks for merging the code,

[jira] [Updated] (SPARK-27230) After creating a table in Hive 2.3.4, calling HiveContext from pyspark cannot use the table data and reports an error

2019-03-21 Thread Vitamin_C (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27230?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vitamin_C updated SPARK-27230: -- Docs Text: CREATE TABLE statement executed in Hive: CREATE TABLE IF NOT EXISTS mdw.t_sd_mobile_user_log( imei string,

[jira] [Updated] (SPARK-27230) After creating a table in Hive 2.3.4, calling HiveContext from pyspark cannot use the table data and reports an error

2019-03-21 Thread Vitamin_C (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27230?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vitamin_C updated SPARK-27230: -- Description: I created the table mdw.t_sd_mobile_user_log in Hive, then ran a query with pyspark: from pyspark.sql import HiveContext

[jira] [Updated] (SPARK-27230) After creating a table in Hive 2.3.4, calling HiveContext from pyspark cannot use the table data and reports an error

2019-03-21 Thread Vitamin_C (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27230?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vitamin_C updated SPARK-27230: -- Docs Text: CREATE TABLE statement executed in Hive: CREATE TABLE IF NOT EXISTS mdw.t_sd_mobile_user_log_2( imei string,

[jira] [Created] (SPARK-27230) After creating a table in Hive 2.3.4, calling HiveContext from pyspark cannot use the table data and reports an error

2019-03-21 Thread Vitamin_C (JIRA)
Vitamin_C created SPARK-27230: - Summary: After creating a table in Hive 2.3.4, calling HiveContext from pyspark cannot use the table data and reports an error Key: SPARK-27230 URL: https://issues.apache.org/jira/browse/SPARK-27230 Project: Spark Issue

[jira] [Updated] (SPARK-27229) GroupBy Placement in Intersect Distinct

2019-03-21 Thread Song Jun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27229?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Song Jun updated SPARK-27229: - Description: The Intersect operator will be replaced by a Left Semi Join in the Optimizer. For example: SELECT

[jira] [Created] (SPARK-27229) GroupBy Placement in Intersect Distinct

2019-03-21 Thread Song Jun (JIRA)
Song Jun created SPARK-27229: Summary: GroupBy Placement in Intersect Distinct Key: SPARK-27229 URL: https://issues.apache.org/jira/browse/SPARK-27229 Project: Spark Issue Type: Improvement

[jira] [Updated] (SPARK-27228) Spark long delay on close, possible problem with killing executors

2019-03-21 Thread Lukas Waldmann (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27228?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lukas Waldmann updated SPARK-27228: --- Description: When using dynamic allocation, after all jobs finish Spark delays for

[jira] [Created] (SPARK-27228) Spark long delay on close, possible problem with killing executors

2019-03-21 Thread Lukas Waldmann (JIRA)
Lukas Waldmann created SPARK-27228: -- Summary: Spark long delay on close, possible problem with killing executors Key: SPARK-27228 URL: https://issues.apache.org/jira/browse/SPARK-27228 Project:

[jira] [Created] (SPARK-27227) Dynamic Partition Prune in Spark

2019-03-21 Thread Song Jun (JIRA)
Song Jun created SPARK-27227: Summary: Dynamic Partition Prune in Spark Key: SPARK-27227 URL: https://issues.apache.org/jira/browse/SPARK-27227 Project: Spark Issue Type: Improvement

[jira] [Updated] (SPARK-27226) Reduce the code duplicate when upgrading built-in Hive

2019-03-21 Thread Yuming Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27226?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang updated SPARK-27226: Summary: Reduce the code duplicate when upgrading built-in Hive (was: Reduce the code duplicate)

[jira] [Created] (SPARK-27226) Reduce the code duplicate

2019-03-21 Thread Yuming Wang (JIRA)
Yuming Wang created SPARK-27226: --- Summary: Reduce the code duplicate Key: SPARK-27226 URL: https://issues.apache.org/jira/browse/SPARK-27226 Project: Spark Issue Type: Sub-task

[jira] [Commented] (SPARK-20712) [SPARK 2.1 REGRESSION][SQL] Spark can't read Hive table when column type has length greater than 4000 bytes

2019-03-21 Thread Kris Geusebroek (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20712?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16797988#comment-16797988 ] Kris Geusebroek commented on SPARK-20712: - [~yumwang] Also in scala it will fail after issuing a

[jira] [Comment Edited] (SPARK-20712) [SPARK 2.1 REGRESSION][SQL] Spark can't read Hive table when column type has length greater than 4000 bytes

2019-03-21 Thread Kris Geusebroek (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20712?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16797988#comment-16797988 ] Kris Geusebroek edited comment on SPARK-20712 at 3/21/19 10:53 AM: ---

[jira] [Comment Edited] (SPARK-20712) [SPARK 2.1 REGRESSION][SQL] Spark can't read Hive table when column type has length greater than 4000 bytes

2019-03-21 Thread Kris Geusebroek (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20712?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16797988#comment-16797988 ] Kris Geusebroek edited comment on SPARK-20712 at 3/21/19 10:51 AM: ---

[jira] [Resolved] (SPARK-27086) DataSourceV2 MicroBatchExecution commits last batch only if new batch is constructed

2019-03-21 Thread Sebastian Herold (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27086?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sebastian Herold resolved SPARK-27086. -- Resolution: Not A Problem I'll ask this as a question in the dev-mailing list again.

[jira] [Assigned] (SPARK-27163) Cleanup and consolidate Pandas UDF functionality

2019-03-21 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27163?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon reassigned SPARK-27163: Assignee: Bryan Cutler > Cleanup and consolidate Pandas UDF functionality >

[jira] [Resolved] (SPARK-27163) Cleanup and consolidate Pandas UDF functionality

2019-03-21 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27163?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-27163. -- Resolution: Fixed Fix Version/s: 3.0.0 Issue resolved by pull request 24095

[jira] [Resolved] (SPARK-27170) Better error message for syntax error with extraneous comma in the SQL parser

2019-03-21 Thread Takeshi Yamamuro (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27170?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Takeshi Yamamuro resolved SPARK-27170. -- Resolution: Not A Problem > Better error message for syntax error with extraneous

[jira] [Assigned] (SPARK-26894) Fix Alias handling in AggregateEstimation

2019-03-21 Thread Takeshi Yamamuro (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-26894?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Takeshi Yamamuro reassigned SPARK-26894: Assignee: Venkata krishnan Sowrirajan > Fix Alias handling in

[jira] [Commented] (SPARK-27204) First time Loading application page from History Server is taking time when event log size is huge

2019-03-21 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27204?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16797912#comment-16797912 ] Hyukjin Kwon commented on SPARK-27204: -- You should manually turn on the configuration to use the

[jira] [Issue Comment Deleted] (SPARK-27186) Optimize SortShuffleWriter writing process

2019-03-21 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27186?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon updated SPARK-27186: - Comment: was deleted (was: User 'AngersZh' has created a pull request for this issue:

[jira] [Commented] (SPARK-27170) Better error message for syntax error with extraneous comma in the SQL parser

2019-03-21 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27170?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16797906#comment-16797906 ] Hyukjin Kwon commented on SPARK-27170: -- Shall we resolve the JIRA then? > Better error message for

[jira] [Commented] (SPARK-27187) What spark jar files serves the following files ..

2019-03-21 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27187?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16797901#comment-16797901 ] Hyukjin Kwon commented on SPARK-27187: -- Sounds like a question. You could have a better answer from

[jira] [Resolved] (SPARK-27185) mapPartition to replace map to speedUp Dataset's toLocalIterator process

2019-03-21 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27185?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-27185. -- Resolution: Invalid Discussed in https://github.com/apache/spark/pull/24124 > mapPartition

[jira] [Resolved] (SPARK-27186) Optimize SortShuffleWriter writing process

2019-03-21 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27186?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-27186. -- Resolution: Won't Fix Sounds too trivial to fix. > Optimize SortShuffleWriter writing

[jira] [Resolved] (SPARK-27187) What spark jar files serves the following files ..

2019-03-21 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27187?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-27187. -- Resolution: Invalid > What spark jar files serves the following files .. >

[jira] [Commented] (SPARK-27204) First time Loading application page from History Server is taking time when event log size is huge

2019-03-21 Thread shahid (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27204?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16797888#comment-16797888 ] shahid commented on SPARK-27204: Hi @HyukjinKwon, this is still an issue after SPARK-18085. I will raise

[jira] [Commented] (SPARK-27204) First time Loading application page from History Server is taking time when event log size is huge

2019-03-21 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27204?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16797886#comment-16797886 ] Hyukjin Kwon commented on SPARK-27204: -- See SPARK-18085 > First time Loading application page from

[jira] [Resolved] (SPARK-27204) First time Loading application page from History Server is taking time when event log size is huge

2019-03-21 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27204?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-27204. -- Resolution: Not A Problem > First time Loading application page from History Server is taking