[jira] [Comment Edited] (SPARK-19012) CreateOrReplaceTempView throws org.apache.spark.sql.catalyst.parser.ParseException when viewName first char is numerical

2016-12-28 Thread Jork Zijlstra (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19012?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15784796#comment-15784796 ] Jork Zijlstra edited comment on SPARK-19012 at 12/29/16 7:56 AM: - Good to

[jira] [Commented] (SPARK-19012) CreateOrReplaceTempView throws org.apache.spark.sql.catalyst.parser.ParseException when viewName first char is numerical

2016-12-28 Thread Jork Zijlstra (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19012?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15784796#comment-15784796 ] Jork Zijlstra commented on SPARK-19012: --- Good to see that its already being discussed. MSSQL also

[jira] [Assigned] (SPARK-19021) Generailize HDFSCredentialProvider to support non HDFS security FS

2016-12-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19021?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19021: Assignee: (was: Apache Spark) > Generailize HDFSCredentialProvider to support non

[jira] [Assigned] (SPARK-19021) Generailize HDFSCredentialProvider to support non HDFS security FS

2016-12-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19021?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19021: Assignee: Apache Spark > Generailize HDFSCredentialProvider to support non HDFS security

[jira] [Commented] (SPARK-19021) Generailize HDFSCredentialProvider to support non HDFS security FS

2016-12-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19021?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15784772#comment-15784772 ] Apache Spark commented on SPARK-19021: -- User 'jerryshao' has created a pull request for this issue:

[jira] [Created] (SPARK-19021) Generailize HDFSCredentialProvider to support non HDFS security FS

2016-12-28 Thread Saisai Shao (JIRA)
Saisai Shao created SPARK-19021: --- Summary: Generailize HDFSCredentialProvider to support non HDFS security FS Key: SPARK-19021 URL: https://issues.apache.org/jira/browse/SPARK-19021 Project: Spark

[jira] [Updated] (SPARK-19021) Generailize HDFSCredentialProvider to support non HDFS security FS

2016-12-28 Thread Saisai Shao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19021?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Saisai Shao updated SPARK-19021: Priority: Minor (was: Major) > Generailize HDFSCredentialProvider to support non HDFS security FS

[jira] [Comment Edited] (SPARK-18930) Inserting in partitioned table - partitioned field should be last in select statement.

2016-12-28 Thread Song Jun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18930?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15784678#comment-15784678 ] Song Jun edited comment on SPARK-18930 at 12/29/16 6:44 AM: from hive

[jira] [Commented] (SPARK-18930) Inserting in partitioned table - partitioned field should be last in select statement.

2016-12-28 Thread Song Jun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18930?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15784678#comment-15784678 ] Song Jun commented on SPARK-18930: -- from hive document,

[jira] [Assigned] (SPARK-19020) Cardinality estimation of aggregate operator

2016-12-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19020?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19020: Assignee: Apache Spark > Cardinality estimation of aggregate operator >

[jira] [Assigned] (SPARK-19020) Cardinality estimation of aggregate operator

2016-12-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19020?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19020: Assignee: (was: Apache Spark) > Cardinality estimation of aggregate operator >

[jira] [Commented] (SPARK-19020) Cardinality estimation of aggregate operator

2016-12-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19020?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15784674#comment-15784674 ] Apache Spark commented on SPARK-19020: -- User 'wzhfy' has created a pull request for this issue:

[jira] [Resolved] (SPARK-18567) Simplify CreateDataSourceTableAsSelectCommand

2016-12-28 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18567?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yin Huai resolved SPARK-18567. -- Resolution: Fixed Fix Version/s: 2.2.0 Issue resolved by pull request 15996

[jira] [Issue Comment Deleted] (SPARK-16849) Improve subquery execution by deduplicating the subqueries with the same results

2016-12-28 Thread Liang-Chi Hsieh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16849?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Liang-Chi Hsieh updated SPARK-16849: Comment: was deleted (was: Design doc v1) > Improve subquery execution by deduplicating

[jira] [Updated] (SPARK-16849) Improve subquery execution by deduplicating the subqueries with the same results

2016-12-28 Thread Liang-Chi Hsieh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16849?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Liang-Chi Hsieh updated SPARK-16849: Attachment: de-duplicating subqueries.pdf > Improve subquery execution by deduplicating

[jira] [Updated] (SPARK-16849) Improve subquery execution by deduplicating the subqueries with the same results

2016-12-28 Thread Liang-Chi Hsieh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16849?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Liang-Chi Hsieh updated SPARK-16849: Attachment: (was: de-duplicating subqueries.pdf) > Improve subquery execution by

[jira] [Created] (SPARK-19020) Cardinality estimation of aggregate operator

2016-12-28 Thread Zhenhua Wang (JIRA)
Zhenhua Wang created SPARK-19020: Summary: Cardinality estimation of aggregate operator Key: SPARK-19020 URL: https://issues.apache.org/jira/browse/SPARK-19020 Project: Spark Issue Type:

[jira] [Updated] (SPARK-17077) Cardinality estimation of project operator

2016-12-28 Thread Zhenhua Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17077?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhenhua Wang updated SPARK-17077: - Summary: Cardinality estimation of project operator (was: Cardinality estimation for project

[jira] [Updated] (SPARK-17077) Cardinality estimation for project operator

2016-12-28 Thread Zhenhua Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17077?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhenhua Wang updated SPARK-17077: - Summary: Cardinality estimation for project operator (was: Cardinality estimation project

[jira] [Assigned] (SPARK-17077) Cardinality estimation project operator

2016-12-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17077?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-17077: Assignee: Apache Spark > Cardinality estimation project operator >

[jira] [Assigned] (SPARK-17077) Cardinality estimation project operator

2016-12-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17077?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-17077: Assignee: (was: Apache Spark) > Cardinality estimation project operator >

[jira] [Commented] (SPARK-17077) Cardinality estimation project operator

2016-12-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17077?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15784384#comment-15784384 ] Apache Spark commented on SPARK-17077: -- User 'wzhfy' has created a pull request for this issue:

[jira] [Updated] (SPARK-17077) Cardinality estimation project operator

2016-12-28 Thread Zhenhua Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17077?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhenhua Wang updated SPARK-17077: - Summary: Cardinality estimation project operator (was: Cardinality estimation of group-by,

[jira] [Resolved] (SPARK-16213) Reduce runtime overhead of a program that creates an primitive array in DataFrame

2016-12-28 Thread Wenchen Fan (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16213?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan resolved SPARK-16213. - Resolution: Fixed Assignee: Kazuaki Ishizaki Fix Version/s: 2.2.0 > Reduce

[jira] [Assigned] (SPARK-19019) PySpark does not work with Python 3.6.0

2016-12-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19019?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19019: Assignee: Apache Spark > PySpark does not work with Python 3.6.0 >

[jira] [Assigned] (SPARK-19019) PySpark does not work with Python 3.6.0

2016-12-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19019?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19019: Assignee: (was: Apache Spark) > PySpark does not work with Python 3.6.0 >

[jira] [Commented] (SPARK-19019) PySpark does not work with Python 3.6.0

2016-12-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19019?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15784329#comment-15784329 ] Apache Spark commented on SPARK-19019: -- User 'HyukjinKwon' has created a pull request for this

[jira] [Created] (SPARK-19019) PySpark does not work with Python 3.6.0

2016-12-28 Thread Hyukjin Kwon (JIRA)
Hyukjin Kwon created SPARK-19019: Summary: PySpark does not work with Python 3.6.0 Key: SPARK-19019 URL: https://issues.apache.org/jira/browse/SPARK-19019 Project: Spark Issue Type: Bug

[jira] [Assigned] (SPARK-19018) spark csv writer charset support

2016-12-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19018?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19018: Assignee: (was: Apache Spark) > spark csv writer charset support >

[jira] [Assigned] (SPARK-19018) spark csv writer charset support

2016-12-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19018?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19018: Assignee: Apache Spark > spark csv writer charset support >

[jira] [Commented] (SPARK-19018) spark csv writer charset support

2016-12-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19018?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15784278#comment-15784278 ] Apache Spark commented on SPARK-19018: -- User 'cjuexuan' has created a pull request for this issue:

[jira] [Created] (SPARK-19018) spark csv writer charset support

2016-12-28 Thread todd.chen (JIRA)
todd.chen created SPARK-19018: - Summary: spark csv writer charset support Key: SPARK-19018 URL: https://issues.apache.org/jira/browse/SPARK-19018 Project: Spark Issue Type: Bug

[jira] [Updated] (SPARK-19007) Speedup and optimize the GradientBoostedTrees in the "data>memory" scene

2016-12-28 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19007?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-19007: -- Component/s: (was: MLlib) > Speedup and optimize the GradientBoostedTrees in the

[jira] [Commented] (SPARK-18948) Add Mean Percentile Rank metric for ranking algorithms

2016-12-28 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18948?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15784216#comment-15784216 ] Joseph K. Bradley commented on SPARK-18948: --- Thanks [~danilo.ascione] for suggesting this. A

[jira] [Updated] (SPARK-18948) Add Mean Percentile Rank metric for ranking algorithms

2016-12-28 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18948?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-18948: -- Shepherd: (was: Xiangrui Meng) > Add Mean Percentile Rank metric for ranking

[jira] [Updated] (SPARK-18929) Add Tweedie distribution in GLM

2016-12-28 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18929?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-18929: -- Affects Version/s: (was: 2.0.2) > Add Tweedie distribution in GLM >

[jira] [Commented] (SPARK-18862) Split SparkR mllib.R into multiple files

2016-12-28 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18862?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15784190#comment-15784190 ] Joseph K. Bradley commented on SPARK-18862: --- I like the chosen organization too! > Split

[jira] [Commented] (SPARK-16552) Store the Inferred Schemas into External Catalog Tables when Creating Tables

2016-12-28 Thread Xiao Li (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16552?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15784183#comment-15784183 ] Xiao Li commented on SPARK-16552: - [~yhuai] Yeah, see the discussion in

[jira] [Commented] (SPARK-19012) CreateOrReplaceTempView throws org.apache.spark.sql.catalyst.parser.ParseException when viewName first char is numerical

2016-12-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19012?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15784174#comment-15784174 ] Apache Spark commented on SPARK-19012: -- User 'dongjoon-hyun' has created a pull request for this

[jira] [Assigned] (SPARK-19012) CreateOrReplaceTempView throws org.apache.spark.sql.catalyst.parser.ParseException when viewName first char is numerical

2016-12-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19012?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19012: Assignee: Apache Spark > CreateOrReplaceTempView throws >

[jira] [Assigned] (SPARK-19012) CreateOrReplaceTempView throws org.apache.spark.sql.catalyst.parser.ParseException when viewName first char is numerical

2016-12-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19012?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19012: Assignee: (was: Apache Spark) > CreateOrReplaceTempView throws >

[jira] [Commented] (SPARK-19012) CreateOrReplaceTempView throws org.apache.spark.sql.catalyst.parser.ParseException when viewName first char is numerical

2016-12-28 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19012?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15784169#comment-15784169 ] Dongjoon Hyun commented on SPARK-19012: --- In API docs and many places, `createOrReplaceTempView` was

[jira] [Commented] (SPARK-19012) CreateOrReplaceTempView throws org.apache.spark.sql.catalyst.parser.ParseException when viewName first char is numerical

2016-12-28 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19012?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15784083#comment-15784083 ] Dongjoon Hyun commented on SPARK-19012: --- Thank you for decision. Yep. I'll make the PR like that.

[jira] [Comment Edited] (SPARK-19012) CreateOrReplaceTempView throws org.apache.spark.sql.catalyst.parser.ParseException when viewName first char is numerical

2016-12-28 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19012?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15784074#comment-15784074 ] Herman van Hovell edited comment on SPARK-19012 at 12/29/16 12:21 AM:

[jira] [Commented] (SPARK-19012) CreateOrReplaceTempView throws org.apache.spark.sql.catalyst.parser.ParseException when viewName first char is numerical

2016-12-28 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19012?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15784074#comment-15784074 ] Herman van Hovell commented on SPARK-19012: --- Yeah, you have a point there. I was wondering if

[jira] [Commented] (SPARK-19012) CreateOrReplaceTempView throws org.apache.spark.sql.catalyst.parser.ParseException when viewName first char is numerical

2016-12-28 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19012?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15784062#comment-15784062 ] Dongjoon Hyun commented on SPARK-19012: --- Ur, actually, we already support

[jira] [Commented] (SPARK-19012) CreateOrReplaceTempView throws org.apache.spark.sql.catalyst.parser.ParseException when viewName first char is numerical

2016-12-28 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19012?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15784056#comment-15784056 ] Dongjoon Hyun commented on SPARK-19012: --- BTW, [~hvanhovell]. I found the existing related issue and

[jira] [Commented] (SPARK-19012) CreateOrReplaceTempView throws org.apache.spark.sql.catalyst.parser.ParseException when viewName first char is numerical

2016-12-28 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19012?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15783990#comment-15783990 ] Dongjoon Hyun commented on SPARK-19012: --- No problem. However, we need to raise AnalysisException on

[jira] [Commented] (SPARK-19012) CreateOrReplaceTempView throws org.apache.spark.sql.catalyst.parser.ParseException when viewName first char is numerical

2016-12-28 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19012?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15783992#comment-15783992 ] Dongjoon Hyun commented on SPARK-19012: --- +1 > CreateOrReplaceTempView throws >

[jira] [Commented] (SPARK-19012) CreateOrReplaceTempView throws org.apache.spark.sql.catalyst.parser.ParseException when viewName first char is numerical

2016-12-28 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19012?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15783988#comment-15783988 ] Herman van Hovell commented on SPARK-19012: --- Yeah, maybe a bit more subtle than that (we need

[jira] [Commented] (SPARK-19012) CreateOrReplaceTempView throws org.apache.spark.sql.catalyst.parser.ParseException when viewName first char is numerical

2016-12-28 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19012?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15783983#comment-15783983 ] Dongjoon Hyun commented on SPARK-19012: --- Oh, you mean always wrap the name with backticks right? >

[jira] [Comment Edited] (SPARK-19012) CreateOrReplaceTempView throws org.apache.spark.sql.catalyst.parser.ParseException when viewName first char is numerical

2016-12-28 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19012?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15783966#comment-15783966 ] Herman van Hovell edited comment on SPARK-19012 at 12/28/16 11:23 PM:

[jira] [Commented] (SPARK-19012) CreateOrReplaceTempView throws org.apache.spark.sql.catalyst.parser.ParseException when viewName first char is numerical

2016-12-28 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19012?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15783966#comment-15783966 ] Herman van Hovell commented on SPARK-19012: --- [~dongjoon] Could make a PR that puts the code in

[jira] [Commented] (SPARK-19012) CreateOrReplaceTempView throws org.apache.spark.sql.catalyst.parser.ParseException when viewName first char is numerical

2016-12-28 Thread Dongjoon Hyun (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19012?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15783958#comment-15783958 ] Dongjoon Hyun commented on SPARK-19012: --- Hi, [~hvanhovell] and [~jzijlstra]. I'll make a PR to

[jira] [Commented] (SPARK-19017) NOT IN subquery with more than one column may return incorrect results

2016-12-28 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19017?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15783937#comment-15783937 ] Herman van Hovell commented on SPARK-19017: --- [~nsyca] Why is this incorrect? If I rewrite the

[jira] [Updated] (SPARK-17847) Reduce shuffled data size of GaussianMixture & copy the implementation from mllib to ml

2016-12-28 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17847?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-17847: -- Target Version/s: 2.2.0 > Reduce shuffled data size of GaussianMixture & copy the

[jira] [Commented] (SPARK-15359) Mesos dispatcher should handle DRIVER_ABORTED status from mesosDriver.run()

2016-12-28 Thread Devaraj K (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15359?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15783821#comment-15783821 ] Devaraj K commented on SPARK-15359: --- [~yu2003w], seems you are also facing the same issue which I

[jira] [Commented] (SPARK-16552) Store the Inferred Schemas into External Catalog Tables when Creating Tables

2016-12-28 Thread Yin Huai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16552?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15783705#comment-15783705 ] Yin Huai commented on SPARK-16552: -- [~smilegator] [~cloud_fan] i think we will not do partitioning

[jira] [Commented] (SPARK-19017) NOT IN subquery with more than one column may return incorrect results

2016-12-28 Thread Nattavut Sutyanyong (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19017?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15783653#comment-15783653 ] Nattavut Sutyanyong commented on SPARK-19017: - The semantics of the NOT IN for multiple

[jira] [Resolved] (SPARK-18958) SparkR should support toJSON on DataFrame

2016-12-28 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18958?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Felix Cheung resolved SPARK-18958. -- Resolution: Fixed Target Version/s: 2.2.0 > SparkR should support toJSON on

[jira] [Created] (SPARK-19017) NOT IN subquery with more than one column may return incorrect results

2016-12-28 Thread Nattavut Sutyanyong (JIRA)
Nattavut Sutyanyong created SPARK-19017: --- Summary: NOT IN subquery with more than one column may return incorrect results Key: SPARK-19017 URL: https://issues.apache.org/jira/browse/SPARK-19017

[jira] [Commented] (SPARK-18669) Update Apache docs regard watermarking in Structured Streaming

2016-12-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18669?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15783633#comment-15783633 ] Apache Spark commented on SPARK-18669: -- User 'zsxwing' has created a pull request for this issue:

[jira] [Comment Edited] (SPARK-18737) Serialization setting "spark.serializer" ignored in Spark 2.x

2016-12-28 Thread Josh Bacon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18737?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15783543#comment-15783543 ] Josh Bacon edited comment on SPARK-18737 at 12/28/16 7:39 PM: -- Hi Sean,

[jira] [Commented] (SPARK-18737) Serialization setting "spark.serializer" ignored in Spark 2.x

2016-12-28 Thread Josh Bacon (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18737?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15783543#comment-15783543 ] Josh Bacon commented on SPARK-18737: Hi Sean, We've perform a more tests and are experiencing the

[jira] [Commented] (SPARK-19016) Document scalable partition handling feature in the programming guide

2016-12-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19016?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15783538#comment-15783538 ] Apache Spark commented on SPARK-19016: -- User 'liancheng' has created a pull request for this issue:

[jira] [Assigned] (SPARK-19016) Document scalable partition handling feature in the programming guide

2016-12-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19016?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19016: Assignee: Cheng Lian (was: Apache Spark) > Document scalable partition handling feature

[jira] [Assigned] (SPARK-19016) Document scalable partition handling feature in the programming guide

2016-12-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19016?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19016: Assignee: Apache Spark (was: Cheng Lian) > Document scalable partition handling feature

[jira] [Commented] (SPARK-10878) Race condition when resolving Maven coordinates via Ivy

2016-12-28 Thread Andrew Snare (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10878?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15783531#comment-15783531 ] Andrew Snare commented on SPARK-10878: -- I see this with Spark 2.0 as well. There doesn't appear to

[jira] [Created] (SPARK-19016) Document scalable partition handling feature in the programming guide

2016-12-28 Thread Cheng Lian (JIRA)
Cheng Lian created SPARK-19016: -- Summary: Document scalable partition handling feature in the programming guide Key: SPARK-19016 URL: https://issues.apache.org/jira/browse/SPARK-19016 Project: Spark

[jira] [Comment Edited] (SPARK-18966) NOT IN subquery with correlated expressions may return incorrect result

2016-12-28 Thread Nattavut Sutyanyong (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18966?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15783457#comment-15783457 ] Nattavut Sutyanyong edited comment on SPARK-18966 at 12/28/16 6:40 PM:

[jira] [Commented] (SPARK-18966) NOT IN subquery with correlated expressions may return incorrect result

2016-12-28 Thread Nattavut Sutyanyong (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18966?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15783457#comment-15783457 ] Nattavut Sutyanyong commented on SPARK-18966: - Considering the following subquery: {code}

[jira] [Commented] (SPARK-3246) Support weighted SVMWithSGD for classification of unbalanced dataset

2016-12-28 Thread Sheridan Rawlins (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3246?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15783435#comment-15783435 ] Sheridan Rawlins commented on SPARK-3246: - Hey, I have a solution that just uses liblinear to do

[jira] [Updated] (SPARK-17999) Add getPreferredLocations for KafkaSourceRDD

2016-12-28 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17999?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shixiong Zhu updated SPARK-17999: - Component/s: (was: DStreams) (was: SQL) Structured

[jira] [Updated] (SPARK-15698) Ability to remove old metadata for structure streaming MetadataLog

2016-12-28 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15698?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shixiong Zhu updated SPARK-15698: - Component/s: (was: DStreams) (was: SQL) Structured

[jira] [Updated] (SPARK-16963) Change Source API so that sources do not need to keep unbounded state

2016-12-28 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16963?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shixiong Zhu updated SPARK-16963: - Component/s: (was: DStreams) Structured Streaming > Change Source API so

[jira] [Updated] (SPARK-17153) [Structured streams] readStream ignores partition columns

2016-12-28 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17153?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shixiong Zhu updated SPARK-17153: - Component/s: (was: DStreams) Structured Streaming > [Structured streams]

[jira] [Updated] (SPARK-17085) Documentation and actual code differs - Unsupported Operations

2016-12-28 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17085?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shixiong Zhu updated SPARK-17085: - Component/s: (was: DStreams) Structured Streaming > Documentation and

[jira] [Updated] (SPARK-17475) HDFSMetadataLog should not leak CRC files

2016-12-28 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17475?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shixiong Zhu updated SPARK-17475: - Component/s: (was: DStreams) Structured Streaming > HDFSMetadataLog should

[jira] [Updated] (SPARK-17513) StreamExecution should discard unneeded metadata

2016-12-28 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17513?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shixiong Zhu updated SPARK-17513: - Component/s: (was: DStreams) Structured Streaming > StreamExecution should

[jira] [Updated] (SPARK-18152) CLONE - FileStreamSource should not track the list of seen files indefinitely

2016-12-28 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18152?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shixiong Zhu updated SPARK-18152: - Component/s: (was: DStreams) (was: SQL) Structured

[jira] [Updated] (SPARK-18030) Flaky test: org.apache.spark.sql.streaming.FileStreamSourceSuite

2016-12-28 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18030?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shixiong Zhu updated SPARK-18030: - Component/s: (was: DStreams) Structured Streaming > Flaky test:

[jira] [Updated] (SPARK-18151) CLONE - MetadataLog should support purging old logs

2016-12-28 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18151?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shixiong Zhu updated SPARK-18151: - Component/s: (was: DStreams) (was: SQL) Structured

[jira] [Updated] (SPARK-18153) CLONE - Ability to remove old metadata for structure streaming MetadataLog

2016-12-28 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18153?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shixiong Zhu updated SPARK-18153: - Component/s: (was: DStreams) (was: SQL) Structured

[jira] [Updated] (SPARK-18156) CLONE - StreamExecution should discard unneeded metadata

2016-12-28 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18156?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shixiong Zhu updated SPARK-18156: - Component/s: (was: DStreams) Structured Streaming > CLONE - StreamExecution

[jira] [Updated] (SPARK-18154) CLONE - Change Source API so that sources do not need to keep unbounded state

2016-12-28 Thread Shixiong Zhu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18154?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shixiong Zhu updated SPARK-18154: - Component/s: (was: DStreams) Structured Streaming > CLONE - Change Source

[jira] [Updated] (SPARK-16849) Improve subquery execution by deduplicating the subqueries with the same results

2016-12-28 Thread Liang-Chi Hsieh (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16849?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Liang-Chi Hsieh updated SPARK-16849: Attachment: de-duplicating subqueries.pdf Design doc v1 > Improve subquery execution by

[jira] [Resolved] (SPARK-17772) Add helper testing methods for instance weighting

2016-12-28 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17772?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang resolved SPARK-17772. - Resolution: Fixed Fix Version/s: 2.2.0 > Add helper testing methods for instance

[jira] [Resolved] (SPARK-17645) Add feature selector methods based on: False Discovery Rate (FDR) and Family Wise Error rate (FWE)

2016-12-28 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17645?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang resolved SPARK-17645. - Resolution: Fixed Fix Version/s: 2.2.0 > Add feature selector methods based on: False

[jira] [Assigned] (SPARK-17642) support DESC FORMATTED TABLE COLUMN command to show column-level statistics

2016-12-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17642?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-17642: Assignee: Apache Spark > support DESC FORMATTED TABLE COLUMN command to show column-level

[jira] [Assigned] (SPARK-17642) support DESC FORMATTED TABLE COLUMN command to show column-level statistics

2016-12-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17642?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-17642: Assignee: (was: Apache Spark) > support DESC FORMATTED TABLE COLUMN command to show

[jira] [Commented] (SPARK-17642) support DESC FORMATTED TABLE COLUMN command to show column-level statistics

2016-12-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17642?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15782943#comment-15782943 ] Apache Spark commented on SPARK-17642: -- User 'wzhfy' has created a pull request for this issue:

[jira] [Created] (SPARK-19015) SQL request with transformation cannot be eecuted if not run first a scan table

2016-12-28 Thread lakhdar adil (JIRA)
lakhdar adil created SPARK-19015: Summary: SQL request with transformation cannot be eecuted if not run first a scan table Key: SPARK-19015 URL: https://issues.apache.org/jira/browse/SPARK-19015

[jira] [Commented] (SPARK-19014) support complex aggregate buffer in HashAggregateExec

2016-12-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19014?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15782927#comment-15782927 ] Apache Spark commented on SPARK-19014: -- User 'cloud-fan' has created a pull request for this issue:

[jira] [Assigned] (SPARK-19014) support complex aggregate buffer in HashAggregateExec

2016-12-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19014?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19014: Assignee: Wenchen Fan (was: Apache Spark) > support complex aggregate buffer in

[jira] [Assigned] (SPARK-19014) support complex aggregate buffer in HashAggregateExec

2016-12-28 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19014?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-19014: Assignee: Apache Spark (was: Wenchen Fan) > support complex aggregate buffer in

[jira] [Updated] (SPARK-17642) support DESC FORMATTED TABLE COLUMN command to show column-level statistics

2016-12-28 Thread Zhenhua Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17642?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhenhua Wang updated SPARK-17642: - Description: Support DESC (EXTENDED | FORMATTED) ? TABLE COLUMN command. Support DESC FORMATTED

[jira] [Updated] (SPARK-17642) support DESC FORMATTED TABLE COLUMN command to show column-level statistics

2016-12-28 Thread Zhenhua Wang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17642?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhenhua Wang updated SPARK-17642: - Description: Support DESC FORMATTED TABLE COLUMN command to show column-level statistics. We

[jira] [Assigned] (SPARK-18993) Unable to build/compile Spark in IntelliJ due to missing Scala deps in spark-tags

2016-12-28 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18993?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen reassigned SPARK-18993: - Assignee: Sean Owen > Unable to build/compile Spark in IntelliJ due to missing Scala deps in >

[jira] [Resolved] (SPARK-18993) Unable to build/compile Spark in IntelliJ due to missing Scala deps in spark-tags

2016-12-28 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18993?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-18993. --- Resolution: Fixed Fix Version/s: 2.2.0 2.0.3 2.1.1

[jira] [Commented] (SPARK-9686) Spark Thrift server doesn't return correct JDBC metadata

2016-12-28 Thread karthik G S (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9686?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15782768#comment-15782768 ] karthik G S commented on SPARK-9686: - Didn't help > Spark Thrift server doesn't return correct JDBC

  1   2   >