[jira] [Assigned] (SPARK-21658) Adds the default None for value in na.replace in PySpark to match

2017-08-31 Thread Saisai Shao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21658?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Saisai Shao reassigned SPARK-21658: --- Assignee: Chin Han Yu > Adds the default None for value in na.replace in PySpark to match >

[jira] [Commented] (SPARK-21658) Adds the default None for value in na.replace in PySpark to match

2017-08-31 Thread Saisai Shao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21658?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16150059#comment-16150059 ] Saisai Shao commented on SPARK-21658: - Done :). > Adds the default None for value in na.replace in

[jira] [Commented] (SPARK-21658) Adds the default None for value in na.replace in PySpark to match

2017-08-31 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21658?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16150022#comment-16150022 ] Hyukjin Kwon commented on SPARK-21658: -- [~jerryshao], I think you are now able to add a user to

[jira] [Resolved] (SPARK-21789) Remove obsolete codes for parsing abstract schema strings

2017-08-31 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21789?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-21789. -- Resolution: Fixed Fix Version/s: 2.3.0 Issue resolved by pull request 18647

[jira] [Assigned] (SPARK-21789) Remove obsolete codes for parsing abstract schema strings

2017-08-31 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21789?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon reassigned SPARK-21789: Assignee: Hyukjin Kwon > Remove obsolete codes for parsing abstract schema strings >

[jira] [Assigned] (SPARK-21779) Simpler Dataset.sample API in Python

2017-08-31 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21779?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon reassigned SPARK-21779: Assignee: Hyukjin Kwon > Simpler Dataset.sample API in Python >

[jira] [Commented] (SPARK-21780) Simpler Dataset.sample API in R

2017-08-31 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21780?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16150007#comment-16150007 ] Hyukjin Kwon commented on SPARK-21780: -- Let me work on this. > Simpler Dataset.sample API in R >

[jira] [Resolved] (SPARK-21779) Simpler Dataset.sample API in Python

2017-08-31 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21779?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-21779. -- Resolution: Fixed Fix Version/s: 2.3.0 Issue resolved by pull request 18999

[jira] [Commented] (SPARK-21885) HiveMetastoreCatalog.InferIfNeeded too slow when caseSensitiveInference enabled

2017-08-31 Thread liupengcheng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21885?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16149997#comment-16149997 ] liupengcheng commented on SPARK-21885: -- [~viirya] I think it's necessary, consider this senario, you

[jira] [Commented] (SPARK-21885) HiveMetastoreCatalog.InferIfNeeded too slow when caseSensitiveInference enabled

2017-08-31 Thread Liang-Chi Hsieh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21885?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16149973#comment-16149973 ] Liang-Chi Hsieh commented on SPARK-21885: - I tend to agree that when we don't actually need the

[jira] [Commented] (SPARK-21885) HiveMetastoreCatalog.InferIfNeeded too slow when caseSensitiveInference enabled

2017-08-31 Thread liupengcheng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21885?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16149880#comment-16149880 ] liupengcheng commented on SPARK-21885: -- [~srowen] Fixed! thanks! anybody check this problem? >

[jira] [Resolved] (SPARK-21852) Empty Parquet Files created as a result of spark jobs fail when read

2017-08-31 Thread Hyukjin Kwon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21852?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-21852. -- Resolution: Cannot Reproduce [~sdalmia_asf], I think I need some details about "certain spark

[jira] [Commented] (SPARK-21477) Mark LocalTableScanExec's input data transient

2017-08-31 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21477?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16149802#comment-16149802 ] Apache Spark commented on SPARK-21477: -- User 'gatorsmile' has created a pull request for this issue:

[jira] [Commented] (SPARK-21884) Fix StackOverflowError on MetadataOnlyQuery

2017-08-31 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21884?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16149801#comment-16149801 ] Apache Spark commented on SPARK-21884: -- User 'gatorsmile' has created a pull request for this issue:

[jira] [Assigned] (SPARK-21891) Add TBLPROPERTIES to DDL statement: CREATE TABLE USING

2017-08-31 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21891?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-21891: Assignee: Apache Spark (was: Xiao Li) > Add TBLPROPERTIES to DDL statement: CREATE TABLE

[jira] [Assigned] (SPARK-21891) Add TBLPROPERTIES to DDL statement: CREATE TABLE USING

2017-08-31 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21891?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-21891: Assignee: Xiao Li (was: Apache Spark) > Add TBLPROPERTIES to DDL statement: CREATE TABLE

[jira] [Commented] (SPARK-21891) Add TBLPROPERTIES to DDL statement: CREATE TABLE USING

2017-08-31 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21891?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16149781#comment-16149781 ] Apache Spark commented on SPARK-21891: -- User 'gatorsmile' has created a pull request for this issue:

[jira] [Created] (SPARK-21891) Add TBLPROPERTIES to DDL statement: CREATE TABLE USING

2017-08-31 Thread Xiao Li (JIRA)
Xiao Li created SPARK-21891: --- Summary: Add TBLPROPERTIES to DDL statement: CREATE TABLE USING Key: SPARK-21891 URL: https://issues.apache.org/jira/browse/SPARK-21891 Project: Spark Issue Type:

[jira] [Resolved] (SPARK-21862) Add overflow check in PCA

2017-08-31 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21862?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley resolved SPARK-21862. --- Resolution: Fixed Fix Version/s: 2.3.0 Issue resolved by pull request 19078

[jira] [Closed] (SPARK-21884) Fix StackOverflowError on MetadataOnlyQuery

2017-08-31 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21884?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun closed SPARK-21884. - Resolution: Duplicate This is fixed at SPARK-21477 in master branch. > Fix StackOverflowError

[jira] [Commented] (SPARK-21088) CrossValidator, TrainValidationSplit should preserve all models after fitting: Python

2017-08-31 Thread Daniel Imberman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21088?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16149743#comment-16149743 ] Daniel Imberman commented on SPARK-21088: - [~ajaysaini] Are you still working on this one? >

[jira] [Commented] (SPARK-20221) Port pyspark.mllib.linalg tests in pyspark/mllib/tests.py to pyspark.ml.linalg

2017-08-31 Thread Daniel Imberman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20221?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16149742#comment-16149742 ] Daniel Imberman commented on SPARK-20221: - I can do this one > Port pyspark.mllib.linalg tests

[jira] [Resolved] (SPARK-21110) Structs should be usable in inequality filters

2017-08-31 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21110?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li resolved SPARK-21110. - Resolution: Fixed Assignee: Andrew Ray Fix Version/s: 2.3.0 > Structs should be usable

[jira] [Assigned] (SPARK-21652) Optimizer cannot reach a fixed point on certain queries

2017-08-31 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21652?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-21652: Assignee: (was: Apache Spark) > Optimizer cannot reach a fixed point on certain

[jira] [Assigned] (SPARK-21652) Optimizer cannot reach a fixed point on certain queries

2017-08-31 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21652?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-21652: Assignee: Apache Spark > Optimizer cannot reach a fixed point on certain queries >

[jira] [Commented] (SPARK-21652) Optimizer cannot reach a fixed point on certain queries

2017-08-31 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21652?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16149680#comment-16149680 ] Apache Spark commented on SPARK-21652: -- User 'jiangxb1987' has created a pull request for this

[jira] [Resolved] (SPARK-20676) Upload to PyPi

2017-08-31 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20676?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] holdenk resolved SPARK-20676. - Resolution: Fixed Fix Version/s: 2.2.0 > Upload to PyPi > -- > > Key:

[jira] [Commented] (SPARK-20676) Upload to PyPi

2017-08-31 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20676?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16149656#comment-16149656 ] holdenk commented on SPARK-20676: - Yes. > Upload to PyPi > -- > > Key:

[jira] [Updated] (SPARK-21866) SPIP: Image support in Spark

2017-08-31 Thread Timothy Hunter (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21866?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Timothy Hunter updated SPARK-21866: --- Attachment: (was: SPIP - Image support for Apache Spark.pdf) > SPIP: Image support in

[jira] [Updated] (SPARK-21866) SPIP: Image support in Spark

2017-08-31 Thread Timothy Hunter (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21866?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Timothy Hunter updated SPARK-21866: --- Attachment: SPIP - Image support for Apache Spark V1.1.pdf Updated authors' list. > SPIP:

[jira] [Commented] (SPARK-21807) The getAliasedConstraints function in LogicalPlan will take a long time when number of expressions is greater than 100

2017-08-31 Thread Andrew Ash (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21807?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16149600#comment-16149600 ] Andrew Ash commented on SPARK-21807: For reference, here's a stacktrace I'm seeing on a cluster

[jira] [Commented] (SPARK-21276) Update lz4-java to remove custom LZ4BlockInputStream

2017-08-31 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21276?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16149568#comment-16149568 ] Apache Spark commented on SPARK-21276: -- User 'maropu' has created a pull request for this issue:

[jira] [Created] (SPARK-21890) ObtainCredentials does not pass creds to addDelegationTokens

2017-08-31 Thread Sanket Reddy (JIRA)
Sanket Reddy created SPARK-21890: Summary: ObtainCredentials does not pass creds to addDelegationTokens Key: SPARK-21890 URL: https://issues.apache.org/jira/browse/SPARK-21890 Project: Spark

[jira] [Commented] (SPARK-21184) QuantileSummaries implementation is wrong and QuantileSummariesSuite fails with larger n

2017-08-31 Thread Timothy Hunter (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21184?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16149510#comment-16149510 ] Timothy Hunter commented on SPARK-21184: [~a1ray] thank you for the report, someone should

[jira] [Issue Comment Deleted] (SPARK-21652) Optimizer cannot reach a fixed point on certain queries

2017-08-31 Thread Anton Okolnychyi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21652?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Anton Okolnychyi updated SPARK-21652: - Comment: was deleted (was: Is there anything I can help here? I see that some cost-based

[jira] [Commented] (SPARK-21652) Optimizer cannot reach a fixed point on certain queries

2017-08-31 Thread Anton Okolnychyi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21652?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16149462#comment-16149462 ] Anton Okolnychyi commented on SPARK-21652: -- Is there anything I can help here? I see that some

[jira] [Commented] (SPARK-21842) Support Kerberos ticket renewal and creation in Mesos

2017-08-31 Thread Kalvin Chau (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21842?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16149433#comment-16149433 ] Kalvin Chau commented on SPARK-21842: - I'm looking into implementing this feature. Is the

[jira] [Commented] (SPARK-21850) SparkSQL cannot perform LIKE someColumn if someColumn's value contains a backslash \

2017-08-31 Thread Anton Okolnychyi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21850?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16149401#comment-16149401 ] Anton Okolnychyi commented on SPARK-21850: -- [~instanceof me] yeah, you are absolutely correct, I

[jira] [Commented] (SPARK-21583) Create a ColumnarBatch with ArrowColumnVectors for row based iteration

2017-08-31 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21583?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16149397#comment-16149397 ] Apache Spark commented on SPARK-21583: -- User 'BryanCutler' has created a pull request for this

[jira] [Commented] (SPARK-21866) SPIP: Image support in Spark

2017-08-31 Thread Danil Kirsanov (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21866?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16149393#comment-16149393 ] Danil Kirsanov commented on SPARK-21866: Hi Sean, echoing the previous comments: yes, this is a

[jira] [Comment Edited] (SPARK-21888) Cannot add stuff to Client Classpath for Yarn Cluster Mode

2017-08-31 Thread Parth Gandhi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21888?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16149350#comment-16149350 ] Parth Gandhi edited comment on SPARK-21888 at 8/31/17 6:01 PM: --- The spark

[jira] [Resolved] (SPARK-20812) Add Mesos Secrets support to the spark dispatcher

2017-08-31 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20812?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin resolved SPARK-20812. Resolution: Fixed Assignee: Arthur Rand (was: Apache Spark) Fix Version/s:

[jira] [Commented] (SPARK-21888) Cannot add stuff to Client Classpath for Yarn Cluster Mode

2017-08-31 Thread Parth Gandhi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21888?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16149350#comment-16149350 ] Parth Gandhi commented on SPARK-21888: -- The spark job runs successfully only if hbase-site.xml is

[jira] [Commented] (SPARK-17107) Remove redundant pushdown rule for Union

2017-08-31 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17107?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16149322#comment-16149322 ] Apache Spark commented on SPARK-17107: -- User 'gatorsmile' has created a pull request for this issue:

[jira] [Resolved] (SPARK-21889) Web site headers not rendered correctly in some pages

2017-08-31 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21889?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Vanzin resolved SPARK-21889. Resolution: Duplicate > Web site headers not rendered correctly in some pages >

[jira] [Commented] (SPARK-21889) Web site headers not rendered correctly in some pages

2017-08-31 Thread Marcelo Vanzin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21889?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16149318#comment-16149318 ] Marcelo Vanzin commented on SPARK-21889: Ah, I only searched for open bugs, not closed ones. >

[jira] [Commented] (SPARK-21888) Cannot add stuff to Client Classpath for Yarn Cluster Mode

2017-08-31 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21888?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16149315#comment-16149315 ] Sean Owen commented on SPARK-21888: --- For your case specifically, shouldn't hbase-site.xml be available

[jira] [Commented] (SPARK-21889) Web site headers not rendered correctly in some pages

2017-08-31 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21889?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16149308#comment-16149308 ] Sean Owen commented on SPARK-21889: --- Yeah I think this is the stuff fixed in

[jira] [Created] (SPARK-21889) Web site headers not rendered correctly in some pages

2017-08-31 Thread Marcelo Vanzin (JIRA)
Marcelo Vanzin created SPARK-21889: -- Summary: Web site headers not rendered correctly in some pages Key: SPARK-21889 URL: https://issues.apache.org/jira/browse/SPARK-21889 Project: Spark

[jira] [Commented] (SPARK-21888) Cannot add stuff to Client Classpath for Yarn Cluster Mode

2017-08-31 Thread Parth Gandhi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21888?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16149263#comment-16149263 ] Parth Gandhi commented on SPARK-21888: -- Sorry I forgot to mention that, --jars certainly adds jar

[jira] [Resolved] (SPARK-21886) Use SparkSession.internalCreateDataFrame to create Dataset with LogicalRDD logical operator

2017-08-31 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21886?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li resolved SPARK-21886. - Resolution: Fixed Assignee: Jacek Laskowski Fix Version/s: 2.3.0 > Use

[jira] [Commented] (SPARK-21888) Cannot add stuff to Client Classpath for Yarn Cluster Mode

2017-08-31 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21888?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16149217#comment-16149217 ] Sean Owen commented on SPARK-21888: --- You haven't said what you tried. --jars does this. > Cannot add

[jira] [Resolved] (SPARK-21878) Create SQLMetricsTestUtils

2017-08-31 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21878?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li resolved SPARK-21878. - Resolution: Fixed Fix Version/s: 2.3.0 > Create SQLMetricsTestUtils > --

[jira] [Commented] (SPARK-21866) SPIP: Image support in Spark

2017-08-31 Thread Timothy Hunter (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21866?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16149186#comment-16149186 ] Timothy Hunter commented on SPARK-21866: [~srowen] thank you for the comments. Indeed, this

[jira] [Updated] (SPARK-21888) Cannot add stuff to Client Classpath for Yarn Cluster Mode

2017-08-31 Thread Parth Gandhi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21888?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Parth Gandhi updated SPARK-21888: - Description: While running Spark on Yarn in cluster mode, currently there is no way to add any

[jira] [Updated] (SPARK-21888) Cannot add stuff to Client Classpath for Yarn Cluster Mode

2017-08-31 Thread Parth Gandhi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21888?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Parth Gandhi updated SPARK-21888: - Description: While running Spark on Yarn in cluster mode, currently there is no way to add any

[jira] [Created] (SPARK-21888) Cannot add stuff to Client Classpath for Yarn Cluster Mode

2017-08-31 Thread Parth Gandhi (JIRA)
Parth Gandhi created SPARK-21888: Summary: Cannot add stuff to Client Classpath for Yarn Cluster Mode Key: SPARK-21888 URL: https://issues.apache.org/jira/browse/SPARK-21888 Project: Spark

[jira] [Updated] (SPARK-21885) HiveMetastoreCatalog.InferIfNeeded too slow when caseSensitiveInference enabled

2017-08-31 Thread liupengcheng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21885?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] liupengcheng updated SPARK-21885: - Summary: HiveMetastoreCatalog.InferIfNeeded too slow when caseSensitiveInference enabled (was:

[jira] [Updated] (SPARK-21885) HiveMetastoreCatalog#InferIfNeeded too slow when caseSensitiveInference enabled

2017-08-31 Thread liupengcheng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21885?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] liupengcheng updated SPARK-21885: - Summary: HiveMetastoreCatalog#InferIfNeeded too slow when caseSensitiveInference enabled (was:

[jira] [Comment Edited] (SPARK-21802) Make sparkR MLP summary() expose probability column

2017-08-31 Thread Weichen Xu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21802?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16136234#comment-16136234 ] Weichen Xu edited comment on SPARK-21802 at 8/31/17 1:48 PM: - cc

[jira] [Resolved] (SPARK-13258) --conf properties not honored in Mesos cluster mode

2017-08-31 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13258?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-13258. --- Resolution: Not A Problem Great, thanks for the research and follow up. > --conf properties not

[jira] [Commented] (SPARK-13258) --conf properties not honored in Mesos cluster mode

2017-08-31 Thread Stavros Kontopoulos (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13258?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16148982#comment-16148982 ] Stavros Kontopoulos commented on SPARK-13258: - SPARK_JAVA_OPTS has been removed from the code

[jira] [Updated] (SPARK-21882) OutputMetrics doesn't count written bytes correctly in the saveAsHadoopDataset function

2017-08-31 Thread linxiaojun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21882?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] linxiaojun updated SPARK-21882: --- Description: The first job called from saveAsHadoopDataset, running in each executor, does not

[jira] [Resolved] (SPARK-21887) DST on History server

2017-08-31 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21887?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-21887. --- Resolution: Invalid > DST on History server > - > > Key:

[jira] [Created] (SPARK-21887) DST on History server

2017-08-31 Thread Iraitz Montalban (JIRA)
Iraitz Montalban created SPARK-21887: Summary: DST on History server Key: SPARK-21887 URL: https://issues.apache.org/jira/browse/SPARK-21887 Project: Spark Issue Type: Question

[jira] [Commented] (SPARK-21869) A cached Kafka producer should not be closed if any task is using it.

2017-08-31 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21869?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16148878#comment-16148878 ] Apache Spark commented on SPARK-21869: -- User 'ScrapCodes' has created a pull request for this issue:

[jira] [Assigned] (SPARK-21869) A cached Kafka producer should not be closed if any task is using it.

2017-08-31 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21869?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-21869: Assignee: (was: Apache Spark) > A cached Kafka producer should not be closed if any

[jira] [Assigned] (SPARK-21869) A cached Kafka producer should not be closed if any task is using it.

2017-08-31 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21869?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-21869: Assignee: Apache Spark > A cached Kafka producer should not be closed if any task is

[jira] [Commented] (SPARK-21885) SQL inferSchema too slow

2017-08-31 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21885?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16148818#comment-16148818 ] Sean Owen commented on SPARK-21885: --- Please fix the title > SQL inferSchema too slow >

[jira] [Assigned] (SPARK-21886) Use SparkSession.internalCreateDataFrame to create Dataset with LogicalRDD logical operator

2017-08-31 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21886?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-21886: Assignee: Apache Spark > Use SparkSession.internalCreateDataFrame to create Dataset with

[jira] [Commented] (SPARK-21886) Use SparkSession.internalCreateDataFrame to create Dataset with LogicalRDD logical operator

2017-08-31 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21886?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16148817#comment-16148817 ] Apache Spark commented on SPARK-21886: -- User 'jaceklaskowski' has created a pull request for this

[jira] [Assigned] (SPARK-21886) Use SparkSession.internalCreateDataFrame to create Dataset with LogicalRDD logical operator

2017-08-31 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21886?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-21886: Assignee: (was: Apache Spark) > Use SparkSession.internalCreateDataFrame to create

[jira] [Created] (SPARK-21886) Use SparkSession.internalCreateDataFrame to create Dataset with LogicalRDD logical operator

2017-08-31 Thread Jacek Laskowski (JIRA)
Jacek Laskowski created SPARK-21886: --- Summary: Use SparkSession.internalCreateDataFrame to create Dataset with LogicalRDD logical operator Key: SPARK-21886 URL: https://issues.apache.org/jira/browse/SPARK-21886

[jira] [Assigned] (SPARK-21884) Fix StackOverflowError on MetadataOnlyQuery

2017-08-31 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21884?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-21884: Assignee: Apache Spark > Fix StackOverflowError on MetadataOnlyQuery >

[jira] [Assigned] (SPARK-21884) Fix StackOverflowError on MetadataOnlyQuery

2017-08-31 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21884?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-21884: Assignee: (was: Apache Spark) > Fix StackOverflowError on MetadataOnlyQuery >

[jira] [Commented] (SPARK-21884) Fix StackOverflowError on MetadataOnlyQuery

2017-08-31 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21884?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16148794#comment-16148794 ] Apache Spark commented on SPARK-21884: -- User 'dongjoon-hyun' has created a pull request for this

[jira] [Commented] (SPARK-21885) SQL inferSchema too slow

2017-08-31 Thread liupengcheng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21885?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16148793#comment-16148793 ] liupengcheng commented on SPARK-21885: -- [~smilegator] [~dongjoon] [~viirya] > SQL inferSchema too

[jira] [Created] (SPARK-21885) SQL inferSchema too slow

2017-08-31 Thread liupengcheng (JIRA)
liupengcheng created SPARK-21885: Summary: SQL inferSchema too slow Key: SPARK-21885 URL: https://issues.apache.org/jira/browse/SPARK-21885 Project: Spark Issue Type: Improvement

[jira] [Updated] (SPARK-21884) Fix StackOverflowError on MetadataOnlyQuery

2017-08-31 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21884?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun updated SPARK-21884: -- Description: This issue aims to fix StackOverflowError in branch-2.2. In Apache master

[jira] [Created] (SPARK-21884) Fix StackOverflowError on MetadataOnlyQuery

2017-08-31 Thread Dongjoon Hyun (JIRA)
Dongjoon Hyun created SPARK-21884: - Summary: Fix StackOverflowError on MetadataOnlyQuery Key: SPARK-21884 URL: https://issues.apache.org/jira/browse/SPARK-21884 Project: Spark Issue Type:

[jira] [Commented] (SPARK-21850) SparkSQL cannot perform LIKE someColumn if someColumn's value contains a backslash \

2017-08-31 Thread Adrien Lavoillotte (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21850?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16148746#comment-16148746 ] Adrien Lavoillotte commented on SPARK-21850: [~aokolnychyi] aren't you supposed to double the

[jira] [Commented] (SPARK-11574) Spark should support StatsD sink out of box

2017-08-31 Thread Saisai Shao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11574?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16148713#comment-16148713 ] Saisai Shao commented on SPARK-11574: - Thanks a lot [~srowen]! > Spark should support StatsD sink

[jira] [Assigned] (SPARK-11574) Spark should support StatsD sink out of box

2017-08-31 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11574?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen reassigned SPARK-11574: - Assignee: Xiaofeng Lin > Spark should support StatsD sink out of box >

[jira] [Commented] (SPARK-11574) Spark should support StatsD sink out of box

2017-08-31 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11574?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16148707#comment-16148707 ] Sean Owen commented on SPARK-11574: --- [~jerryshao] I'll make you a JIRA admin so you can add the user to

[jira] [Commented] (SPARK-21850) SparkSQL cannot perform LIKE someColumn if someColumn's value contains a backslash \

2017-08-31 Thread Anton Okolnychyi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21850?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16148705#comment-16148705 ] Anton Okolnychyi commented on SPARK-21850: -- Then we should not be bound to the LIKE case only. I

[jira] [Commented] (SPARK-11574) Spark should support StatsD sink out of box

2017-08-31 Thread Saisai Shao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11574?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16148701#comment-16148701 ] Saisai Shao commented on SPARK-11574: - Ping [~srowen]. Hi Sean I cannot assign this JIRA to Xiaofeng,

[jira] [Commented] (SPARK-18523) OOM killer may leave SparkContext in broken state causing Connection Refused errors

2017-08-31 Thread Alexander Shorin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18523?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16148698#comment-16148698 ] Alexander Shorin commented on SPARK-18523: -- [~kadeng] I don't have a 2.2.0 in production for now

[jira] [Resolved] (SPARK-21883) when a stage failed,should remove it's child stages's pending state on spark UI,and mark jobs which this stage belongs to failed

2017-08-31 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21883?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-21883. --- Resolution: Invalid I'm going to preemptively close this because it's poorly described, is reported

[jira] [Created] (SPARK-21883) when a stage failed,should remove it's child stages's pending state on spark UI,and mark jobs which this stage belongs to failed

2017-08-31 Thread xianlongZhang (JIRA)
xianlongZhang created SPARK-21883: - Summary: when a stage failed,should remove it's child stages's pending state on spark UI,and mark jobs which this stage belongs to failed Key: SPARK-21883 URL:

[jira] [Commented] (SPARK-21866) SPIP: Image support in Spark

2017-08-31 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21866?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16148643#comment-16148643 ] Sean Owen commented on SPARK-21866: --- It makes some sense. I guess I'm mostly trying to match up the

[jira] [Resolved] (SPARK-21881) Again: OOM killer may leave SparkContext in broken state causing Connection Refused errors

2017-08-31 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21881?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-21881. --- Resolution: Duplicate Let's not fork the discussion. Post on the original issue and someone can

[jira] [Commented] (SPARK-14327) Scheduler holds locks which cause huge scheulder delays and executor timeouts

2017-08-31 Thread Fei Chen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14327?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16148622#comment-16148622 ] Fei Chen commented on SPARK-14327: -- I also came across this problem. In TaskScheduler, a thread which

[jira] [Updated] (SPARK-21882) OutputMetrics doesn't count written bytes correctly in the saveAsHadoopDataset function

2017-08-31 Thread linxiaojun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21882?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] linxiaojun updated SPARK-21882: --- Attachment: SPARK-21882.patch SPARK-21882.patch for Spark-1.6.1 > OutputMetrics doesn't count

[jira] [Created] (SPARK-21882) OutputMetrics doesn't count written bytes correctly in the saveAsHadoopDataset function

2017-08-31 Thread linxiaojun (JIRA)
linxiaojun created SPARK-21882: -- Summary: OutputMetrics doesn't count written bytes correctly in the saveAsHadoopDataset function Key: SPARK-21882 URL: https://issues.apache.org/jira/browse/SPARK-21882

[jira] [Updated] (SPARK-21881) Again: OOM killer may leave SparkContext in broken state causing Connection Refused errors

2017-08-31 Thread Kai Londenberg (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21881?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kai Londenberg updated SPARK-21881: --- Description: This is a duplicate of SPARK-18523, which was not really fixed for me (PySpark

[jira] [Updated] (SPARK-21881) Again: OOM killer may leave SparkContext in broken state causing Connection Refused errors

2017-08-31 Thread Kai Londenberg (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21881?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kai Londenberg updated SPARK-21881: --- Affects Version/s: (was: 1.6.1) (was: 2.0.0) > Again: OOM

[jira] [Updated] (SPARK-21881) Again: OOM killer may leave SparkContext in broken state causing Connection Refused errors

2017-08-31 Thread Kai Londenberg (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21881?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kai Londenberg updated SPARK-21881: --- Description: This is a duplicate of SPARK-18523, which was not really fixed for me (PySpark

[jira] [Updated] (SPARK-21881) Again: OOM killer may leave SparkContext in broken state causing Connection Refused errors

2017-08-31 Thread Kai Londenberg (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21881?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kai Londenberg updated SPARK-21881: --- Fix Version/s: (was: 2.1.0) > Again: OOM killer may leave SparkContext in broken state

[jira] [Updated] (SPARK-21881) Again: OOM killer may leave SparkContext in broken state causing Connection Refused errors

2017-08-31 Thread Kai Londenberg (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21881?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kai Londenberg updated SPARK-21881: --- Affects Version/s: 2.2.0 > Again: OOM killer may leave SparkContext in broken state causing

[jira] [Created] (SPARK-21881) Again: OOM killer may leave SparkContext in broken state causing Connection Refused errors

2017-08-31 Thread Kai Londenberg (JIRA)
Kai Londenberg created SPARK-21881: -- Summary: Again: OOM killer may leave SparkContext in broken state causing Connection Refused errors Key: SPARK-21881 URL: https://issues.apache.org/jira/browse/SPARK-21881

  1   2   >