[jira] [Updated] (SPARK-23627) Provide isEmpty() function in DataSet

2018-03-07 Thread Goun Na (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23627?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Goun Na updated SPARK-23627: Description: Like rdd.isEmpty, adding isEmpty to DataSet would useful.  Some code without isEmpty:

[jira] [Created] (SPARK-23627) Provide isEmpty() function in DataSet

2018-03-07 Thread Goun Na (JIRA)
Goun Na created SPARK-23627: --- Summary: Provide isEmpty() function in DataSet Key: SPARK-23627 URL: https://issues.apache.org/jira/browse/SPARK-23627 Project: Spark Issue Type: Improvement

[jira] [Commented] (SPARK-23626) Spark DAGScheduler scheduling performance hindered on JobSubmitted Event

2018-03-07 Thread Ajith S (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23626?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16390832#comment-16390832 ] Ajith S commented on SPARK-23626: - Cc [~r...@databricks.com] , [~shivaram] [~blue_impala_48d6] > Spark

[jira] [Assigned] (SPARK-23626) Spark DAGScheduler scheduling performance hindered on JobSubmitted Event

2018-03-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23626?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-23626: Assignee: (was: Apache Spark) > Spark DAGScheduler scheduling performance hindered on

[jira] [Assigned] (SPARK-23626) Spark DAGScheduler scheduling performance hindered on JobSubmitted Event

2018-03-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23626?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-23626: Assignee: Apache Spark > Spark DAGScheduler scheduling performance hindered on

[jira] [Commented] (SPARK-23626) Spark DAGScheduler scheduling performance hindered on JobSubmitted Event

2018-03-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23626?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16390825#comment-16390825 ] Apache Spark commented on SPARK-23626: -- User 'AjithShetty2489' has created a pull request for this

[jira] [Commented] (SPARK-23625) spark sql long-running mission will be dead

2018-03-07 Thread Yu Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23625?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16390817#comment-16390817 ] Yu Wang commented on SPARK-23625: - [~joshrosen] please have a look!!! > spark sql long-running mission

[jira] [Created] (SPARK-23626) Spark DAGScheduler scheduling performance hindered on JobSubmitted Event

2018-03-07 Thread Ajith S (JIRA)
Ajith S created SPARK-23626: --- Summary: Spark DAGScheduler scheduling performance hindered on JobSubmitted Event Key: SPARK-23626 URL: https://issues.apache.org/jira/browse/SPARK-23626 Project: Spark

[jira] [Updated] (SPARK-23625) spark sql long-running mission will be dead

2018-03-07 Thread Yu Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23625?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yu Wang updated SPARK-23625: Attachment: 1520489867.png 1520489861.png 1520489854.png

[jira] [Created] (SPARK-23625) spark sql long-running mission will be dead

2018-03-07 Thread Yu Wang (JIRA)
Yu Wang created SPARK-23625: --- Summary: spark sql long-running mission will be dead Key: SPARK-23625 URL: https://issues.apache.org/jira/browse/SPARK-23625 Project: Spark Issue Type: Bug

[jira] [Updated] (SPARK-23525) ALTER TABLE CHANGE COLUMN doesn't work for external hive table

2018-03-07 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23525?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li updated SPARK-23525: Fix Version/s: 2.2.2 > ALTER TABLE CHANGE COLUMN doesn't work for external hive table >

[jira] [Updated] (SPARK-23490) Check storage.locationUri with existing table in CreateTable

2018-03-07 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23490?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li updated SPARK-23490: Fix Version/s: 2.3.1 > Check storage.locationUri with existing table in CreateTable >

[jira] [Commented] (SPARK-17498) StringIndexer.setHandleInvalid should have another option 'new'

2018-03-07 Thread Haopu Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17498?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16390777#comment-16390777 ] Haopu Wang commented on SPARK-17498: After this fix, the reverse transformation "IndexToString" still

[jira] [Assigned] (SPARK-23624) Revise doc of method pushFilters

2018-03-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23624?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-23624: Assignee: Apache Spark > Revise doc of method pushFilters >

[jira] [Assigned] (SPARK-23624) Revise doc of method pushFilters

2018-03-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23624?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-23624: Assignee: (was: Apache Spark) > Revise doc of method pushFilters >

[jira] [Commented] (SPARK-23624) Revise doc of method pushFilters

2018-03-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23624?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16390761#comment-16390761 ] Apache Spark commented on SPARK-23624: -- User 'gengliangwang' has created a pull request for this

[jira] [Created] (SPARK-23624) Revise doc of method pushFilters

2018-03-07 Thread Gengliang Wang (JIRA)
Gengliang Wang created SPARK-23624: -- Summary: Revise doc of method pushFilters Key: SPARK-23624 URL: https://issues.apache.org/jira/browse/SPARK-23624 Project: Spark Issue Type: Improvement

[jira] [Assigned] (SPARK-23524) Big local shuffle blocks should not be checked for corruption.

2018-03-07 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23524?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan reassigned SPARK-23524: --- Assignee: jin xing > Big local shuffle blocks should not be checked for corruption. >

[jira] [Resolved] (SPARK-23524) Big local shuffle blocks should not be checked for corruption.

2018-03-07 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23524?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan resolved SPARK-23524. - Resolution: Fixed Fix Version/s: 2.3.1 2.4.0 Issue resolved by pull

[jira] [Commented] (SPARK-23525) ALTER TABLE CHANGE COLUMN doesn't work for external hive table

2018-03-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23525?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16390649#comment-16390649 ] Apache Spark commented on SPARK-23525: -- User 'jiangxb1987' has created a pull request for this

[jira] [Assigned] (SPARK-23623) Avoid concurrent use of cached KafkaConsumer in CachedKafkaConsumer (kafka-0-10-sql)

2018-03-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23623?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-23623: Assignee: Tathagata Das (was: Apache Spark) > Avoid concurrent use of cached

[jira] [Commented] (SPARK-23623) Avoid concurrent use of cached KafkaConsumer in CachedKafkaConsumer (kafka-0-10-sql)

2018-03-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23623?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16390627#comment-16390627 ] Apache Spark commented on SPARK-23623: -- User 'tdas' has created a pull request for this issue:

[jira] [Assigned] (SPARK-23623) Avoid concurrent use of cached KafkaConsumer in CachedKafkaConsumer (kafka-0-10-sql)

2018-03-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23623?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-23623: Assignee: Apache Spark (was: Tathagata Das) > Avoid concurrent use of cached

[jira] [Updated] (SPARK-23623) Avoid concurrent use of cached KafkaConsumer in CachedKafkaConsumer (kafka-0-10-sql)

2018-03-07 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23623?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das updated SPARK-23623: -- Description: CacheKafkaConsumer in the project `kafka-0-10-sql` is designed to maintain a

[jira] [Updated] (SPARK-23623) Avoid concurrent use of cached KafkaConsumer in CachedKafkaConsumer (kafka-0-10-sql)

2018-03-07 Thread Tathagata Das (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23623?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tathagata Das updated SPARK-23623: -- Description: CacheKafkaConsumer in the project `kafka-0-10-sql` is designed to maintain a

[jira] [Created] (SPARK-23623) Avoid concurrent use of cached KafkaConsumer in CachedKafkaConsumer (kafka-0-10-sql)

2018-03-07 Thread Tathagata Das (JIRA)
Tathagata Das created SPARK-23623: - Summary: Avoid concurrent use of cached KafkaConsumer in CachedKafkaConsumer (kafka-0-10-sql) Key: SPARK-23623 URL: https://issues.apache.org/jira/browse/SPARK-23623

[jira] [Commented] (SPARK-23490) Check storage.locationUri with existing table in CreateTable

2018-03-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23490?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16390601#comment-16390601 ] Apache Spark commented on SPARK-23490: -- User 'gengliangwang' has created a pull request for this

[jira] [Commented] (SPARK-23406) Stream-stream self joins does not work

2018-03-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23406?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16390596#comment-16390596 ] Apache Spark commented on SPARK-23406: -- User 'tdas' has created a pull request for this issue:

[jira] [Updated] (SPARK-23020) Re-enable Flaky Test: org.apache.spark.launcher.SparkLauncherSuite.testInProcessLauncher

2018-03-07 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23020?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan updated SPARK-23020: Fix Version/s: 2.3.1 > Re-enable Flaky Test: >

[jira] [Commented] (SPARK-23436) Incorrect Date column Inference in partition discovery

2018-03-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23436?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16390473#comment-16390473 ] Apache Spark commented on SPARK-23436: -- User 'gatorsmile' has created a pull request for this issue:

[jira] [Commented] (SPARK-23523) Incorrect result caused by the rule OptimizeMetadataOnlyQuery

2018-03-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23523?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16390461#comment-16390461 ] Apache Spark commented on SPARK-23523: -- User 'gatorsmile' has created a pull request for this issue:

[jira] [Comment Edited] (SPARK-23562) RFormula handleInvalid should handle invalid values in non-string columns.

2018-03-07 Thread yogesh garg (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23562?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16390434#comment-16390434 ] yogesh garg edited comment on SPARK-23562 at 3/7/18 11:33 PM: -- Error in

[jira] [Commented] (SPARK-23562) RFormula handleInvalid should handle invalid values in non-string columns.

2018-03-07 Thread yogesh garg (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23562?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16390434#comment-16390434 ] yogesh garg commented on SPARK-23562: - Error in question can be reproduced with the following code in

[jira] [Comment Edited] (SPARK-23562) RFormula handleInvalid should handle invalid values in non-string columns.

2018-03-07 Thread yogesh garg (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23562?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16390434#comment-16390434 ] yogesh garg edited comment on SPARK-23562 at 3/7/18 11:30 PM: -- Error in

[jira] [Updated] (SPARK-23560) A joinWith followed by groupBy requires extra shuffle

2018-03-07 Thread Bruce Robbins (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23560?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Bruce Robbins updated SPARK-23560: -- Description: Depending on the size of the input, a joinWith followed by a groupBy requires

[jira] [Created] (SPARK-23622) HiveClientSuites fails with InvocationTargetException

2018-03-07 Thread Dongjoon Hyun (JIRA)
Dongjoon Hyun created SPARK-23622: - Summary: HiveClientSuites fails with InvocationTargetException Key: SPARK-23622 URL: https://issues.apache.org/jira/browse/SPARK-23622 Project: Spark

[jira] [Comment Edited] (SPARK-21030) extend hint syntax to support any expression for Python and R

2018-03-07 Thread Dylan Guedes (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21030?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16390321#comment-16390321 ] Dylan Guedes edited comment on SPARK-21030 at 3/7/18 10:11 PM: --- Hi, I

[jira] [Comment Edited] (SPARK-21030) extend hint syntax to support any expression for Python and R

2018-03-07 Thread Dylan Guedes (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21030?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16390321#comment-16390321 ] Dylan Guedes edited comment on SPARK-21030 at 3/7/18 10:11 PM: --- Hi, I

[jira] [Commented] (SPARK-21030) extend hint syntax to support any expression for Python and R

2018-03-07 Thread Dylan Guedes (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21030?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16390321#comment-16390321 ] Dylan Guedes commented on SPARK-21030: -- Hi, I would like to try this one. Do you guys think that

[jira] [Assigned] (SPARK-23525) ALTER TABLE CHANGE COLUMN doesn't work for external hive table

2018-03-07 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23525?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan reassigned SPARK-23525: --- Assignee: Jiang Xingbo > ALTER TABLE CHANGE COLUMN doesn't work for external hive table >

[jira] [Resolved] (SPARK-23525) ALTER TABLE CHANGE COLUMN doesn't work for external hive table

2018-03-07 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23525?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan resolved SPARK-23525. - Issue resolved by pull request 20696 [https://github.com/apache/spark/pull/20696] > ALTER TABLE

[jira] [Commented] (SPARK-16630) Blacklist a node if executors won't launch on it.

2018-03-07 Thread Thomas Graves (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16630?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16390232#comment-16390232 ] Thomas Graves commented on SPARK-16630: --- yes yarn tells you the # of nodemanagers. allocateResponse

[jira] [Assigned] (SPARK-23620) Split thread dump lines by using the br tag

2018-03-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23620?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-23620: Assignee: Apache Spark > Split thread dump lines by using the br tag >

[jira] [Commented] (SPARK-23620) Split thread dump lines by using the br tag

2018-03-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23620?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16390228#comment-16390228 ] Apache Spark commented on SPARK-23620: -- User 'MaxGekk' has created a pull request for this issue:

[jira] [Assigned] (SPARK-23620) Split thread dump lines by using the br tag

2018-03-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23620?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-23620: Assignee: (was: Apache Spark) > Split thread dump lines by using the br tag >

[jira] [Updated] (SPARK-23621) DataFrame.insertInto() is persisting all columns for mixed structured data-type

2018-03-07 Thread Ravikumar Ramasamy (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23621?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ravikumar Ramasamy updated SPARK-23621: --- Description: The  configuration data is stored in Cassandra which is unstructured

[jira] [Updated] (SPARK-23621) DataFrame.insertInto() is persisting all columns for mixed structured data-type

2018-03-07 Thread Ravikumar Ramasamy (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23621?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ravikumar Ramasamy updated SPARK-23621: --- Attachment: sample_data.csv > DataFrame.insertInto() is persisting all columns for

[jira] [Created] (SPARK-23621) DataFrame.insertInto() is persisting all columns for mixed structured data-type

2018-03-07 Thread Ravikumar Ramasamy (JIRA)
Ravikumar Ramasamy created SPARK-23621: -- Summary: DataFrame.insertInto() is persisting all columns for mixed structured data-type Key: SPARK-23621 URL: https://issues.apache.org/jira/browse/SPARK-23621

[jira] [Created] (SPARK-23620) Split thread dump lines by using the br tag

2018-03-07 Thread Maxim Gekk (JIRA)
Maxim Gekk created SPARK-23620: -- Summary: Split thread dump lines by using the br tag Key: SPARK-23620 URL: https://issues.apache.org/jira/browse/SPARK-23620 Project: Spark Issue Type: Bug

[jira] [Commented] (SPARK-16630) Blacklist a node if executors won't launch on it.

2018-03-07 Thread Imran Rashid (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16630?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16390159#comment-16390159 ] Imran Rashid commented on SPARK-16630: -- I'd also take {{spark.yarn.max.executor.failures}} into

[jira] [Commented] (SPARK-23560) A joinWith followed by groupBy requires extra shuffle

2018-03-07 Thread Bruce Robbins (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23560?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16390111#comment-16390111 ] Bruce Robbins commented on SPARK-23560: --- The main issue is that an AttributeReference instance

[jira] [Commented] (SPARK-23615) Add maxDF Parameter to Python CountVectorizer

2018-03-07 Thread Huaxin Gao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23615?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16390055#comment-16390055 ] Huaxin Gao commented on SPARK-23615: Hi Bryan, are you working on this youself? If not, may I work on

[jira] [Created] (SPARK-23619) Document the column names created by explode and posexplode functions

2018-03-07 Thread Joe Pallas (JIRA)
Joe Pallas created SPARK-23619: -- Summary: Document the column names created by explode and posexplode functions Key: SPARK-23619 URL: https://issues.apache.org/jira/browse/SPARK-23619 Project: Spark

[jira] [Commented] (SPARK-20922) Unsafe deserialization in Spark LauncherConnection

2018-03-07 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20922?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16389907#comment-16389907 ] Marcelo Vanzin commented on SPARK-20922: You remediate it by upgrading to a version with the fix

[jira] [Assigned] (SPARK-20327) Add CLI support for YARN custom resources, like GPUs

2018-03-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20327?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-20327: Assignee: Apache Spark > Add CLI support for YARN custom resources, like GPUs >

[jira] [Commented] (SPARK-20327) Add CLI support for YARN custom resources, like GPUs

2018-03-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20327?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16389880#comment-16389880 ] Apache Spark commented on SPARK-20327: -- User 'szyszy' has created a pull request for this issue:

[jira] [Assigned] (SPARK-20327) Add CLI support for YARN custom resources, like GPUs

2018-03-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20327?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-20327: Assignee: (was: Apache Spark) > Add CLI support for YARN custom resources, like GPUs

[jira] [Assigned] (SPARK-23592) Add interpreted execution for DecodeUsingSerializer expression

2018-03-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23592?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-23592: Assignee: Apache Spark (was: Marco Gaido) > Add interpreted execution for

[jira] [Commented] (SPARK-23592) Add interpreted execution for DecodeUsingSerializer expression

2018-03-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23592?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16389871#comment-16389871 ] Apache Spark commented on SPARK-23592: -- User 'mgaido91' has created a pull request for this issue:

[jira] [Assigned] (SPARK-23592) Add interpreted execution for DecodeUsingSerializer expression

2018-03-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23592?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-23592: Assignee: Marco Gaido (was: Apache Spark) > Add interpreted execution for

[jira] [Updated] (SPARK-23291) SparkR : substr : In SparkR dataframe , starting and ending position arguments in "substr" is giving wrong result when the position is greater than 1

2018-03-07 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23291?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Felix Cheung updated SPARK-23291: - Affects Version/s: 2.1.2 2.2.0 2.3.0 > SparkR :

[jira] [Resolved] (SPARK-23291) SparkR : substr : In SparkR dataframe , starting and ending position arguments in "substr" is giving wrong result when the position is greater than 1

2018-03-07 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23291?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Felix Cheung resolved SPARK-23291. -- Resolution: Fixed Assignee: Liang-Chi Hsieh Fix Version/s: 2.4.0

[jira] [Resolved] (SPARK-23591) Add interpreted execution for EncodeUsingSerializer expression

2018-03-07 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23591?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Herman van Hovell resolved SPARK-23591. --- Resolution: Fixed Fix Version/s: 2.4.0 > Add interpreted execution for

[jira] [Commented] (SPARK-20922) Unsafe deserialization in Spark LauncherConnection

2018-03-07 Thread Patrick John Esteban (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20922?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16389809#comment-16389809 ] Patrick John Esteban commented on SPARK-20922: -- Hi Guys, I'm new to this vulnerability. Can

[jira] [Comment Edited] (SPARK-23598) WholeStageCodegen can lead to IllegalAccessError calling append for HashAggregateExec

2018-03-07 Thread Kazuaki Ishizaki (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23598?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16389766#comment-16389766 ] Kazuaki Ishizaki edited comment on SPARK-23598 at 3/7/18 4:39 PM: -- I

[jira] [Commented] (SPARK-23598) WholeStageCodegen can lead to IllegalAccessError calling append for HashAggregateExec

2018-03-07 Thread Kazuaki Ishizaki (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23598?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16389766#comment-16389766 ] Kazuaki Ishizaki commented on SPARK-23598: -- I also think that the easiest way is to make

[jira] [Resolved] (SPARK-23535) MinMaxScaler return 0.5 for an all zero column

2018-03-07 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23535?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-23535. --- Resolution: Won't Fix > MinMaxScaler return 0.5 for an all zero column >

[jira] [Commented] (SPARK-18492) GeneratedIterator grows beyond 64 KB

2018-03-07 Thread Kazuaki Ishizaki (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18492?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16389761#comment-16389761 ] Kazuaki Ishizaki commented on SPARK-18492: -- [~imranshaik] I ran the following program with

[jira] [Commented] (SPARK-23598) WholeStageCodegen can lead to IllegalAccessError calling append for HashAggregateExec

2018-03-07 Thread David Vogelbacher (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23598?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16389745#comment-16389745 ] David Vogelbacher commented on SPARK-23598: --- [~hvanhovell] unfortunately, I can't extract any

[jira] [Commented] (SPARK-18492) GeneratedIterator grows beyond 64 KB

2018-03-07 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18492?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16389723#comment-16389723 ] Nicholas Chammas commented on SPARK-18492: -- [~imranshaik] - This is an open source project. You

[jira] [Issue Comment Deleted] (SPARK-17495) Hive hash implementation

2018-03-07 Thread Xiaoju Wu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17495?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiaoju Wu updated SPARK-17495: -- Comment: was deleted (was: [~tejasp] I can see HiveHash merged but never used. Seems the using of

[jira] [Commented] (SPARK-21157) Report Total Memory Used by Spark Executors

2018-03-07 Thread assia ydroudj (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21157?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16389683#comment-16389683 ] assia ydroudj commented on SPARK-21157: --- Is there a PR for this please? > Report Total Memory Used

[jira] [Commented] (SPARK-18492) GeneratedIterator grows beyond 64 KB

2018-03-07 Thread imran shaik (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18492?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16389648#comment-16389648 ] imran shaik commented on SPARK-18492: - @Kazuaki Ishizaki any update on this? > GeneratedIterator

[jira] [Commented] (SPARK-23206) Additional Memory Tuning Metrics

2018-03-07 Thread assia ydroudj (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23206?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16389644#comment-16389644 ] assia ydroudj commented on SPARK-23206: --- [~elu] , thanks I ll be for wait! I have another simple

[jira] [Commented] (SPARK-23534) Spark run on Hadoop 3.0.0

2018-03-07 Thread Darek (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23534?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16389629#comment-16389629 ] Darek commented on SPARK-23534: --- We need a resolution sooner than later. Most vendors have moved on to

[jira] [Resolved] (SPARK-23611) Extend ExpressionEvalHelper harness to also test failures

2018-03-07 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23611?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Herman van Hovell resolved SPARK-23611. --- Resolution: Fixed Fix Version/s: 2.4.0 > Extend ExpressionEvalHelper harness

[jira] [Commented] (SPARK-14681) Provide label/impurity stats for spark.ml decision tree nodes

2018-03-07 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14681?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16389382#comment-16389382 ] Apache Spark commented on SPARK-14681: -- User 'WeichenXu123' has created a pull request for this