[jira] [Commented] (SPARK-650) Add a "setup hook" API for running initialization code on each executor

2017-11-05 Thread quang nguyen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-650?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16239991#comment-16239991 ] quang nguyen commented on SPARK-650: Hi, We had an application run on spark cluster to a secured

[jira] [Assigned] (SPARK-21888) Cannot add stuff to Client Classpath for Yarn Cluster Mode

2017-11-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21888?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-21888: Assignee: Apache Spark > Cannot add stuff to Client Classpath for Yarn Cluster Mode >

[jira] [Assigned] (SPARK-21888) Cannot add stuff to Client Classpath for Yarn Cluster Mode

2017-11-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21888?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-21888: Assignee: (was: Apache Spark) > Cannot add stuff to Client Classpath for Yarn Cluster

[jira] [Commented] (SPARK-21888) Cannot add stuff to Client Classpath for Yarn Cluster Mode

2017-11-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21888?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16239930#comment-16239930 ] Apache Spark commented on SPARK-21888: -- User 'yaooqinn' has created a pull request for this issue:

[jira] [Issue Comment Deleted] (SPARK-11502) Word2VecSuite needs appropriate checks

2017-11-05 Thread Teng Peng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11502?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Teng Peng updated SPARK-11502: -- Comment: was deleted (was: I am interested in this one. My plan is to compare the test against 1.

[jira] [Resolved] (SPARK-7146) Should ML sharedParams be a public API?

2017-11-05 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7146?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] holdenk resolved SPARK-7146. Resolution: Fixed Fix Version/s: 2.3.0 Target Version/s: 2.3.0 Exposed the ML params as a

[jira] [Assigned] (SPARK-7146) Should ML sharedParams be a public API?

2017-11-05 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7146?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] holdenk reassigned SPARK-7146: -- Assignee: holdenk > Should ML sharedParams be a public API? > ---

[jira] [Assigned] (SPARK-22446) Optimizer causing StringIndexerModel's indexer UDF to throw "Unseen label" exception incorrectly for filtered data.

2017-11-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22446?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22446: Assignee: (was: Apache Spark) > Optimizer causing StringIndexerModel's indexer UDF to

[jira] [Assigned] (SPARK-22446) Optimizer causing StringIndexerModel's indexer UDF to throw "Unseen label" exception incorrectly for filtered data.

2017-11-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22446?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22446: Assignee: Apache Spark > Optimizer causing StringIndexerModel's indexer UDF to throw

[jira] [Commented] (SPARK-22446) Optimizer causing StringIndexerModel's indexer UDF to throw "Unseen label" exception incorrectly for filtered data.

2017-11-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22446?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16239905#comment-16239905 ] Apache Spark commented on SPARK-22446: -- User 'viirya' has created a pull request for this issue:

[jira] [Commented] (SPARK-22446) Optimizer causing StringIndexerModel's indexer UDF to throw "Unseen label" exception incorrectly for filtered data.

2017-11-05 Thread Liang-Chi Hsieh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22446?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16239897#comment-16239897 ] Liang-Chi Hsieh commented on SPARK-22446: - For this special case, the simplest workaround is to

[jira] [Resolved] (SPARK-21625) Add incompatible Hive UDF describe to DOC

2017-11-05 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21625?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li resolved SPARK-21625. - Resolution: Fixed Assignee: Yuming Wang Fix Version/s: 2.3.0 > Add incompatible Hive UDF

[jira] [Commented] (SPARK-14228) Lost executor of RPC disassociated, and occurs exception: Could not find CoarseGrainedScheduler or it has been stopped

2017-11-05 Thread Danula Eranjith (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14228?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16239854#comment-16239854 ] Danula Eranjith commented on SPARK-14228: - I encountered the same issue in 1.6.3 {code} 17/11/03

[jira] [Commented] (SPARK-20077) Documentation for ml.stats.Correlation

2017-11-05 Thread Teng Peng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20077?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16239846#comment-16239846 ] Teng Peng commented on SPARK-20077: --- [~srowen] On this

[jira] [Commented] (SPARK-22450) Safely register class for mllib

2017-11-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22450?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16239841#comment-16239841 ] Apache Spark commented on SPARK-22450: -- User 'ConeyLiu' has created a pull request for this issue:

[jira] [Assigned] (SPARK-22450) Safely register class for mllib

2017-11-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22450?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22450: Assignee: (was: Apache Spark) > Safely register class for mllib >

[jira] [Assigned] (SPARK-22450) Safely register class for mllib

2017-11-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22450?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22450: Assignee: Apache Spark > Safely register class for mllib >

[jira] [Resolved] (SPARK-22398) Partition directories with leading 0s cause wrong results

2017-11-05 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22398?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-22398. -- Resolution: Duplicate > Partition directories with leading 0s cause wrong results >

[jira] [Created] (SPARK-22450) Safely register class for mllib

2017-11-05 Thread Xianyang Liu (JIRA)
Xianyang Liu created SPARK-22450: Summary: Safely register class for mllib Key: SPARK-22450 URL: https://issues.apache.org/jira/browse/SPARK-22450 Project: Spark Issue Type: Improvement

[jira] [Commented] (SPARK-22427) StackOverFlowError when using FPGrowth

2017-11-05 Thread Liang-Chi Hsieh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22427?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16239824#comment-16239824 ] Liang-Chi Hsieh commented on SPARK-22427: - >From a rough glance, looks like the error didn't be

[jira] [Commented] (SPARK-22442) Schema generated by Product Encoder doesn't match case class field name when using non-standard characters

2017-11-05 Thread Mikel San Vicente (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22442?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16239821#comment-16239821 ] Mikel San Vicente commented on SPARK-22442: --- yes, that will work but it wont work for the

[jira] [Commented] (SPARK-22442) Schema generated by Product Encoder doesn't match case class field name when using non-standard characters

2017-11-05 Thread Liang-Chi Hsieh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22442?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16239815#comment-16239815 ] Liang-Chi Hsieh commented on SPARK-22442: - I tried on latest master branch. It can work with

[jira] [Commented] (SPARK-22398) Partition directories with leading 0s cause wrong results

2017-11-05 Thread Liang-Chi Hsieh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22398?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16239810#comment-16239810 ] Liang-Chi Hsieh commented on SPARK-22398: - As we can control it with the config

[jira] [Assigned] (SPARK-18755) Add Randomized Grid Search to Spark ML

2017-11-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18755?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18755: Assignee: (was: Apache Spark) > Add Randomized Grid Search to Spark ML >

[jira] [Commented] (SPARK-18755) Add Randomized Grid Search to Spark ML

2017-11-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18755?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16239754#comment-16239754 ] Apache Spark commented on SPARK-18755: -- User 'tengpeng' has created a pull request for this issue:

[jira] [Assigned] (SPARK-18755) Add Randomized Grid Search to Spark ML

2017-11-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18755?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18755: Assignee: Apache Spark > Add Randomized Grid Search to Spark ML >

[jira] [Updated] (SPARK-22442) Schema generated by Product Encoder doesn't match case class field name when using non-standard characters

2017-11-05 Thread Mikel San Vicente (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22442?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Mikel San Vicente updated SPARK-22442: -- Description: Product encoder encodes special characters wrongly when field name

[jira] [Assigned] (SPARK-22411) Heuristic to combine splits in DataSourceScanExec isn't accurate when dynamic allocation is enabled

2017-11-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22411?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22411: Assignee: Vinitha Reddy Gankidi (was: Apache Spark) > Heuristic to combine splits in

[jira] [Assigned] (SPARK-22411) Heuristic to combine splits in DataSourceScanExec isn't accurate when dynamic allocation is enabled

2017-11-05 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22411?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-22411: Assignee: Apache Spark (was: Vinitha Reddy Gankidi) > Heuristic to combine splits in

[jira] [Resolved] (SPARK-22439) Not able to get numeric columns for the file having decimal values

2017-11-05 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22439?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-22439. --- Resolution: Not A Problem > Not able to get numeric columns for the file having decimal values >

[jira] [Resolved] (SPARK-22424) Task not finished for a long time in monitor UI, but I found it finished in logs

2017-11-05 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22424?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-22424. --- Resolution: Not A Problem > Task not finished for a long time in monitor UI, but I found it finished

[jira] [Resolved] (SPARK-22365) Spark UI executors empty list with 500 error

2017-11-05 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22365?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-22365. --- Resolution: Cannot Reproduce This doesn't come up in tests and I have heard no other reports of it.

[jira] [Commented] (SPARK-22449) Add BIC for GLM

2017-11-05 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22449?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16239444#comment-16239444 ] Sean Owen commented on SPARK-22449: --- I personally don't have a strong feeling. It should be similar

[jira] [Commented] (SPARK-22441) JDBC REAL type is mapped to Double instead of Float

2017-11-05 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22441?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16239443#comment-16239443 ] Sean Owen commented on SPARK-22441: --- I think my only strong feeling here is to keep the forward/reverse

[jira] [Assigned] (SPARK-22429) Streaming checkpointing code does not retry after failure due to NullPointerException

2017-11-05 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22429?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen reassigned SPARK-22429: - Assignee: Tristan Stevens Priority: Minor (was: Trivial) > Streaming checkpointing code

[jira] [Resolved] (SPARK-22429) Streaming checkpointing code does not retry after failure due to NullPointerException

2017-11-05 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22429?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-22429. --- Resolution: Fixed Fix Version/s: 2.1.3 2.3.0 2.2.1

[jira] [Assigned] (SPARK-22406) pyspark version tag is wrong on PyPi

2017-11-05 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22406?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] holdenk reassigned SPARK-22406: --- Assignee: holdenk > pyspark version tag is wrong on PyPi > > >

[jira] [Commented] (SPARK-22406) pyspark version tag is wrong on PyPi

2017-11-05 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22406?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16239432#comment-16239432 ] holdenk commented on SPARK-22406: - Due to restrictions on PyPI, no. We can try and fix this in 2.2.1

[jira] [Updated] (SPARK-22406) pyspark version tag is wrong on PyPi

2017-11-05 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22406?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] holdenk updated SPARK-22406: Target Version/s: 2.2.1 > pyspark version tag is wrong on PyPi > > >

[jira] [Commented] (SPARK-22443) AggregatedDialect doesn't override quoteIdentifier and other methods in JdbcDialects

2017-11-05 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22443?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16239418#comment-16239418 ] Xiao Li commented on SPARK-22443: - It sounds your custom dialect is a good solution for your scenario.

[jira] [Resolved] (SPARK-22443) AggregatedDialect doesn't override quoteIdentifier and other methods in JdbcDialects

2017-11-05 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-22443?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li resolved SPARK-22443. - Resolution: Fixed Assignee: Huaxin Gao Fix Version/s: 2.3.0 > AggregatedDialect doesn't