[jira] [Commented] (SPARK-18584) multiple Spark Thrift Servers running in the same machine throws org.apache.hadoop.security.AccessControlException

2016-11-25 Thread tanxinz (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18584?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15697237#comment-15697237 ] tanxinz commented on SPARK-18584: - Two STS ran diffent Queue on yarn etl Spark Thrift Server ran root.etl

[jira] [Resolved] (SPARK-18583) Fix nullability of InputFileName.

2016-11-25 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18583?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-18583. - Resolution: Fixed Assignee: Takuya Ueshin Fix Version/s: 2.1.0 > Fix nullability

[jira] [Commented] (SPARK-18584) multiple Spark Thrift Servers running in the same machine throws org.apache.hadoop.security.AccessControlException

2016-11-25 Thread tanxinz (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18584?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15697216#comment-15697216 ] tanxinz commented on SPARK-18584: - Different users have different authorizations to access different hdfs

[jira] [Commented] (SPARK-18502) Spark does not handle columns that contain backquote (`)

2016-11-25 Thread Takeshi Yamamuro (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18502?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15697210#comment-15697210 ] Takeshi Yamamuro commented on SPARK-18502: -- Please give us a simple query to reproduce this? I

[jira] [Commented] (SPARK-17251) "ClassCastException: OuterReference cannot be cast to NamedExpression" for correlated subquery on the RHS of an IN operator

2016-11-25 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17251?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15697176#comment-15697176 ] Apache Spark commented on SPARK-17251: -- User 'dongjoon-hyun' has created a pull request for this

[jira] [Commented] (SPARK-18591) Replace hash-based aggregates with sort-based ones if inputs already sorted

2016-11-25 Thread Takeshi Yamamuro (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18591?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15697111#comment-15697111 ] Takeshi Yamamuro commented on SPARK-18591: -- If it's worth trying this, I'll do. I just made a

[jira] [Created] (SPARK-18591) Replace hash-based aggregates with sort-based ones if inputs already sorted

2016-11-25 Thread Takeshi Yamamuro (JIRA)
Takeshi Yamamuro created SPARK-18591: Summary: Replace hash-based aggregates with sort-based ones if inputs already sorted Key: SPARK-18591 URL: https://issues.apache.org/jira/browse/SPARK-18591

[jira] [Comment Edited] (SPARK-18405) Add yarn-cluster mode support to Spark Thrift Server

2016-11-25 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18405?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15696978#comment-15696978 ] Jeff Zhang edited comment on SPARK-18405 at 11/26/16 1:01 AM: -- I think he

[jira] [Commented] (SPARK-18405) Add yarn-cluster mode support to Spark Thrift Server

2016-11-25 Thread Jeff Zhang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18405?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15696978#comment-15696978 ] Jeff Zhang commented on SPARK-18405: I think he mean to launch multiple spark thrift server in

[jira] [Assigned] (SPARK-18590) R - Include package vignettes and help pages, build source package in Spark distribution

2016-11-25 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18590?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18590: Assignee: Felix Cheung (was: Apache Spark) > R - Include package vignettes and help

[jira] [Commented] (SPARK-18590) R - Include package vignettes and help pages, build source package in Spark distribution

2016-11-25 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18590?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15696848#comment-15696848 ] Apache Spark commented on SPARK-18590: -- User 'felixcheung' has created a pull request for this

[jira] [Assigned] (SPARK-18590) R - Include package vignettes and help pages, build source package in Spark distribution

2016-11-25 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18590?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18590: Assignee: Apache Spark (was: Felix Cheung) > R - Include package vignettes and help

[jira] [Updated] (SPARK-18590) R - Include package vignettes and help pages, build source package in Spark distribution

2016-11-25 Thread Felix Cheung (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18590?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Felix Cheung updated SPARK-18590: - Description: We should include in Spark distribution the built source package for SparkR. This

[jira] [Created] (SPARK-18590) R - Include package vignettes and help pages, build source package in Spark distribution

2016-11-25 Thread Felix Cheung (JIRA)
Felix Cheung created SPARK-18590: Summary: R - Include package vignettes and help pages, build source package in Spark distribution Key: SPARK-18590 URL: https://issues.apache.org/jira/browse/SPARK-18590

[jira] [Commented] (SPARK-18589) persist() resolves "java.lang.RuntimeException: Invalid PythonUDF (...), requires attributes from more than one child"

2016-11-25 Thread Nicholas Chammas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18589?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15696717#comment-15696717 ] Nicholas Chammas commented on SPARK-18589: -- cc [~davies] [~hvanhovell] > persist() resolves

[jira] [Created] (SPARK-18589) persist() resolves "java.lang.RuntimeException: Invalid PythonUDF (...), requires attributes from more than one child"

2016-11-25 Thread Nicholas Chammas (JIRA)
Nicholas Chammas created SPARK-18589: Summary: persist() resolves "java.lang.RuntimeException: Invalid PythonUDF (...), requires attributes from more than one child" Key: SPARK-18589 URL:

[jira] [Resolved] (SPARK-18436) isin causing SQL syntax error with JDBC

2016-11-25 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18436?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Herman van Hovell resolved SPARK-18436. --- Resolution: Fixed Assignee: Jiang Xingbo Fix Version/s: 2.1.0

[jira] [Commented] (SPARK-18527) UDAFPercentile (bigint, array) needs explicity cast to double

2016-11-25 Thread Thomas Sebastian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18527?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15696598#comment-15696598 ] Thomas Sebastian commented on SPARK-18527: -- I am interested to work on this. > UDAFPercentile

[jira] [Commented] (SPARK-18543) SaveAsTable(CTAS) using overwrite could change table definition

2016-11-25 Thread Thomas Sebastian (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18543?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15696586#comment-15696586 ] Thomas Sebastian commented on SPARK-18543: -- I would like to take a look at this fix, if you have

[jira] [Commented] (SPARK-18220) ClassCastException occurs when using select query on ORC file

2016-11-25 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18220?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15696541#comment-15696541 ] Herman van Hovell commented on SPARK-18220: --- I tried reproducing this but to no avail.

[jira] [Commented] (SPARK-18487) Add task completion listener to HashAggregate to avoid memory leak

2016-11-25 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18487?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15696458#comment-15696458 ] Reynold Xin commented on SPARK-18487: - As discussed on the pull request, this is not an issue. >

[jira] [Closed] (SPARK-18487) Add task completion listener to HashAggregate to avoid memory leak

2016-11-25 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18487?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin closed SPARK-18487. --- Resolution: Not A Problem > Add task completion listener to HashAggregate to avoid memory leak >

[jira] [Commented] (SPARK-3359) `sbt/sbt unidoc` doesn't work with Java 8

2016-11-25 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3359?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15696373#comment-15696373 ] Apache Spark commented on SPARK-3359: - User 'HyukjinKwon' has created a pull request for this issue:

[jira] [Commented] (SPARK-18407) Inferred partition columns cause assertion error

2016-11-25 Thread Burak Yavuz (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18407?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15696337#comment-15696337 ] Burak Yavuz commented on SPARK-18407: - This is also resolved as part of

[jira] [Comment Edited] (SPARK-17788) RangePartitioner results in few very large tasks and many small to empty tasks

2016-11-25 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17788?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15696321#comment-15696321 ] Herman van Hovell edited comment on SPARK-17788 at 11/25/16 5:09 PM: -

[jira] [Commented] (SPARK-17788) RangePartitioner results in few very large tasks and many small to empty tasks

2016-11-25 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17788?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15696321#comment-15696321 ] Herman van Hovell commented on SPARK-17788: --- That is fair. The solution is not that

[jira] [Comment Edited] (SPARK-17788) RangePartitioner results in few very large tasks and many small to empty tasks

2016-11-25 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17788?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15696134#comment-15696134 ] Herman van Hovell edited comment on SPARK-17788 at 11/25/16 4:56 PM: -

[jira] [Comment Edited] (SPARK-17788) RangePartitioner results in few very large tasks and many small to empty tasks

2016-11-25 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17788?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15696134#comment-15696134 ] Herman van Hovell edited comment on SPARK-17788 at 11/25/16 4:10 PM: -

[jira] [Reopened] (SPARK-17788) RangePartitioner results in few very large tasks and many small to empty tasks

2016-11-25 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17788?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] holdenk reopened SPARK-17788: - This is somewhat distinct from the join case, but certainly related. > RangePartitioner results in few very

[jira] [Commented] (SPARK-17788) RangePartitioner results in few very large tasks and many small to empty tasks

2016-11-25 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17788?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15696188#comment-15696188 ] holdenk commented on SPARK-17788: - I don't think this is a duplicate - its related but a join doesn't

[jira] [Updated] (SPARK-18220) ClassCastException occurs when using select query on ORC file

2016-11-25 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18220?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Herman van Hovell updated SPARK-18220: -- Description: Error message is below. {noformat}

[jira] [Commented] (SPARK-17788) RangePartitioner results in few very large tasks and many small to empty tasks

2016-11-25 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17788?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15696154#comment-15696154 ] Herman van Hovell commented on SPARK-17788: --- I am closing this one as a duplicate. Feel free to

[jira] [Closed] (SPARK-17788) RangePartitioner results in few very large tasks and many small to empty tasks

2016-11-25 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17788?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Herman van Hovell closed SPARK-17788. - Resolution: Duplicate > RangePartitioner results in few very large tasks and many small

[jira] [Commented] (SPARK-17788) RangePartitioner results in few very large tasks and many small to empty tasks

2016-11-25 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17788?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15696134#comment-15696134 ] Herman van Hovell commented on SPARK-17788: --- Spark makes a sketch of your data as soon when you

[jira] [Commented] (SPARK-18581) MultivariateGaussian does not check if covariance matrix is invertible

2016-11-25 Thread Hao Ren (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18581?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15696056#comment-15696056 ] Hao Ren commented on SPARK-18581: - Thank you for the clarification. I totally missed that part. I will

[jira] [Commented] (SPARK-18581) MultivariateGaussian does not check if covariance matrix is invertible

2016-11-25 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18581?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15696030#comment-15696030 ] Sean Owen commented on SPARK-18581: --- The PDF largest at the mean, and it can be > 1 if the determinant

[jira] [Commented] (SPARK-6522) Standardize Random Number Generation

2016-11-25 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6522?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15696024#comment-15696024 ] holdenk commented on SPARK-6522: We have a standardized RDD generator in MLlib (see the RandomRDDs

[jira] [Closed] (SPARK-6522) Standardize Random Number Generation

2016-11-25 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6522?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] holdenk closed SPARK-6522. -- Resolution: Fixed Fix Version/s: 1.1.0 > Standardize Random Number Generation >

[jira] [Commented] (SPARK-5997) Increase partition count without performing a shuffle

2016-11-25 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5997?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15696016#comment-15696016 ] holdenk commented on SPARK-5997: That could work, although we'd probably want a different API and we'd

[jira] [Resolved] (SPARK-3348) Support user-defined SparkListeners properly

2016-11-25 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3348?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] holdenk resolved SPARK-3348. Resolution: Duplicate > Support user-defined SparkListeners properly >

[jira] [Commented] (SPARK-5190) Allow spark listeners to be added before spark context gets initialized.

2016-11-25 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5190?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15696003#comment-15696003 ] holdenk commented on SPARK-5190: This seems to be fixed, but we forgot to close (cc [~joshrosen]) > Allow

[jira] [Created] (SPARK-18588) KafkaSourceStressForDontFailOnDataLossSuite is flaky

2016-11-25 Thread Herman van Hovell (JIRA)
Herman van Hovell created SPARK-18588: - Summary: KafkaSourceStressForDontFailOnDataLossSuite is flaky Key: SPARK-18588 URL: https://issues.apache.org/jira/browse/SPARK-18588 Project: Spark

[jira] [Commented] (SPARK-18588) KafkaSourceStressForDontFailOnDataLossSuite is flaky

2016-11-25 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18588?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15695975#comment-15695975 ] Herman van Hovell commented on SPARK-18588: --- cc [~zsxwing] >

[jira] [Commented] (SPARK-636) Add mechanism to run system management/configuration tasks on all workers

2016-11-25 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-636?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15695967#comment-15695967 ] holdenk commented on SPARK-636: --- If you have a logging system you want to initialize wouldn't using an object

[jira] [Commented] (SPARK-17788) RangePartitioner results in few very large tasks and many small to empty tasks

2016-11-25 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17788?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15695956#comment-15695956 ] holdenk commented on SPARK-17788: - This is semi-expected behaviour of the range partitioner (and really

[jira] [Updated] (SPARK-17788) RangePartitioner results in few very large tasks and many small to empty tasks

2016-11-25 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17788?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] holdenk updated SPARK-17788: Target Version/s: (was: 2.1.0) > RangePartitioner results in few very large tasks and many small to

[jira] [Comment Edited] (SPARK-18581) MultivariateGaussian does not check if covariance matrix is invertible

2016-11-25 Thread Hao Ren (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18581?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15695942#comment-15695942 ] Hao Ren edited comment on SPARK-18581 at 11/25/16 2:14 PM: --- After reading the

[jira] [Comment Edited] (SPARK-18581) MultivariateGaussian does not check if covariance matrix is invertible

2016-11-25 Thread Hao Ren (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18581?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15695942#comment-15695942 ] Hao Ren edited comment on SPARK-18581 at 11/25/16 2:09 PM: --- After reading the

[jira] [Commented] (SPARK-18554) leader master lost the leadership, when the slave become master, the perivious app's state display as waitting

2016-11-25 Thread Saisai Shao (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18554?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15695947#comment-15695947 ] Saisai Shao commented on SPARK-18554: - Nothing blocked actually, just no one reviewed that PR, also

[jira] [Comment Edited] (SPARK-18581) MultivariateGaussian does not check if covariance matrix is invertible

2016-11-25 Thread Hao Ren (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18581?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15695942#comment-15695942 ] Hao Ren edited comment on SPARK-18581 at 11/25/16 2:08 PM: --- After reading the

[jira] [Comment Edited] (SPARK-18581) MultivariateGaussian does not check if covariance matrix is invertible

2016-11-25 Thread Hao Ren (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18581?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15695942#comment-15695942 ] Hao Ren edited comment on SPARK-18581 at 11/25/16 2:07 PM: --- After reading the

[jira] [Comment Edited] (SPARK-18581) MultivariateGaussian does not check if covariance matrix is invertible

2016-11-25 Thread Hao Ren (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18581?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15695942#comment-15695942 ] Hao Ren edited comment on SPARK-18581 at 11/25/16 2:06 PM: --- After reading the

[jira] [Updated] (SPARK-18108) Partition discovery fails with explicitly written long partitions

2016-11-25 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18108?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] holdenk updated SPARK-18108: Component/s: (was: Spark Core) SQL > Partition discovery fails with explicitly

[jira] [Comment Edited] (SPARK-18581) MultivariateGaussian does not check if covariance matrix is invertible

2016-11-25 Thread Hao Ren (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18581?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15695942#comment-15695942 ] Hao Ren edited comment on SPARK-18581 at 11/25/16 2:05 PM: --- After reading the

[jira] [Comment Edited] (SPARK-18581) MultivariateGaussian does not check if covariance matrix is invertible

2016-11-25 Thread Hao Ren (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18581?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15695942#comment-15695942 ] Hao Ren edited comment on SPARK-18581 at 11/25/16 2:04 PM: --- After reading the

[jira] [Commented] (SPARK-18581) MultivariateGaussian does not check if covariance matrix is invertible

2016-11-25 Thread Hao Ren (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18581?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15695942#comment-15695942 ] Hao Ren commented on SPARK-18581: - After reading the code comments, I find it takes into consideration on

[jira] [Commented] (SPARK-18128) Add support for publishing to PyPI

2016-11-25 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18128?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15695941#comment-15695941 ] holdenk commented on SPARK-18128: - Thanks! :) I'll start working on this issue once we start work on 2.2

[jira] [Commented] (SPARK-18128) Add support for publishing to PyPI

2016-11-25 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18128?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15695939#comment-15695939 ] holdenk commented on SPARK-18128: - Thanks! :) I'll start working on this issue once we start work on 2.2

[jira] [Commented] (SPARK-18405) Add yarn-cluster mode support to Spark Thrift Server

2016-11-25 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18405?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15695936#comment-15695936 ] holdenk commented on SPARK-18405: - Even in cluster mode you could overwhelm the node running the

[jira] [Updated] (SPARK-18502) Spark does not handle columns that contain backquote (`)

2016-11-25 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18502?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] holdenk updated SPARK-18502: Component/s: (was: Spark Core) SQL > Spark does not handle columns that contain

[jira] [Updated] (SPARK-18532) Code generation memory issue

2016-11-25 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18532?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] holdenk updated SPARK-18532: Component/s: (was: Spark Core) SQL > Code generation memory issue >

[jira] [Commented] (SPARK-18541) Add pyspark.sql.Column.aliasWithMetadata to allow dynamic metadata management in pyspark SQL API

2016-11-25 Thread holdenk (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18541?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15695922#comment-15695922 ] holdenk commented on SPARK-18541: - Making it easier for PySpark SQL users to specify metadata sounds

[jira] [Updated] (SPARK-18538) Concurrent Fetching DataFrameReader JDBC APIs Do Not Work

2016-11-25 Thread Herman van Hovell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18538?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Herman van Hovell updated SPARK-18538: -- Priority: Blocker (was: Critical) > Concurrent Fetching DataFrameReader JDBC APIs Do

[jira] [Updated] (SPARK-18356) Issue + Resolution: Kmeans Spark Performances (ML package)

2016-11-25 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18356?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-18356: -- Assignee: zakaria hili > Issue + Resolution: Kmeans Spark Performances (ML package) >

[jira] [Resolved] (SPARK-18356) Issue + Resolution: Kmeans Spark Performances (ML package)

2016-11-25 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18356?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-18356. --- Resolution: Fixed Fix Version/s: 2.2.0 Issue resolved by pull request 15965

[jira] [Resolved] (SPARK-18565) subtractByKey modifes values in the source RDD

2016-11-25 Thread Dmitry Dzhus (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18565?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dmitry Dzhus resolved SPARK-18565. -- Resolution: Invalid > subtractByKey modifes values in the source RDD >

[jira] [Commented] (SPARK-18565) subtractByKey modifes values in the source RDD

2016-11-25 Thread Dmitry Dzhus (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18565?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15695809#comment-15695809 ] Dmitry Dzhus commented on SPARK-18565: -- The problem was that I assumed that caching and forcing an

[jira] [Commented] (SPARK-18512) FileNotFoundException on _temporary directory with Spark Streaming 2.0.1 and S3A

2016-11-25 Thread Genmao Yu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18512?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15695800#comment-15695800 ] Genmao Yu commented on SPARK-18512: --- [~giuseppe.bonaccorso] How can i reproduce this failure? Can u

[jira] [Commented] (SPARK-17251) "ClassCastException: OuterReference cannot be cast to NamedExpression" for correlated subquery on the RHS of an IN operator

2016-11-25 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17251?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15695645#comment-15695645 ] Apache Spark commented on SPARK-17251: -- User 'dongjoon-hyun' has created a pull request for this

[jira] [Assigned] (SPARK-17251) "ClassCastException: OuterReference cannot be cast to NamedExpression" for correlated subquery on the RHS of an IN operator

2016-11-25 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17251?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-17251: Assignee: Apache Spark > "ClassCastException: OuterReference cannot be cast to

[jira] [Assigned] (SPARK-17251) "ClassCastException: OuterReference cannot be cast to NamedExpression" for correlated subquery on the RHS of an IN operator

2016-11-25 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17251?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-17251: Assignee: (was: Apache Spark) > "ClassCastException: OuterReference cannot be cast to

[jira] [Updated] (SPARK-18584) multiple Spark Thrift Servers running in the same machine throws org.apache.hadoop.security.AccessControlException

2016-11-25 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18584?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-18584: -- Target Version/s: (was: 2.0.2) Fix Version/s: (was: 2.0.2) > multiple Spark Thrift

[jira] [Updated] (SPARK-3359) `sbt/sbt unidoc` doesn't work with Java 8

2016-11-25 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3359?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-3359: - Assignee: Hyukjin Kwon > `sbt/sbt unidoc` doesn't work with Java 8 >

[jira] [Commented] (SPARK-16151) Make generated params non-final

2016-11-25 Thread zhengruifeng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16151?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15695614#comment-15695614 ] zhengruifeng commented on SPARK-16151: -- Some param shoud be made non-final : such as {{setSolver}}

[jira] [Commented] (SPARK-18581) MultivariateGaussian does not check if covariance matrix is invertible

2016-11-25 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18581?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15695552#comment-15695552 ] Sean Owen commented on SPARK-18581: --- Yes, but it need not be invertible, for the reason you give. It

[jira] [Commented] (SPARK-18581) MultivariateGaussian does not check if covariance matrix is invertible

2016-11-25 Thread Hao Ren (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18581?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15695481#comment-15695481 ] Hao Ren commented on SPARK-18581: - [~srowen] I have updated the description. The problem is that my

[jira] [Updated] (SPARK-18581) MultivariateGaussian does not check if covariance matrix is invertible

2016-11-25 Thread Hao Ren (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18581?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hao Ren updated SPARK-18581: Description: When training GaussianMixtureModel, I found some probability much larger than 1. That leads

[jira] [Commented] (SPARK-18318) ML, Graph 2.1 QA: API: New Scala APIs, docs

2016-11-25 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18318?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15695473#comment-15695473 ] Yanbo Liang commented on SPARK-18318: - Finished reviewing for all classes which were added/changed

[jira] [Assigned] (SPARK-18587) Remove handleInvalid from QuantileDiscretizer

2016-11-25 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18587?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang reassigned SPARK-18587: --- Assignee: Yanbo Liang > Remove handleInvalid from QuantileDiscretizer >

[jira] [Updated] (SPARK-18481) ML 2.1 QA: Remove deprecated methods for ML

2016-11-25 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18481?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-18481: Description: Remove deprecated methods for ML. We removed the following public APIs in this JIRA:

[jira] [Updated] (SPARK-18481) ML 2.1 QA: Remove deprecated methods for ML

2016-11-25 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18481?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-18481: Description: Remove deprecated methods for ML. We removed the following public APIs:

[jira] [Updated] (SPARK-18481) ML 2.1 QA: Remove deprecated methods for ML

2016-11-25 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18481?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-18481: Priority: Major (was: Minor) > ML 2.1 QA: Remove deprecated methods for ML >

[jira] [Updated] (SPARK-18481) ML 2.1 QA: Remove deprecated methods for ML

2016-11-25 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18481?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-18481: Priority: Minor (was: Major) > ML 2.1 QA: Remove deprecated methods for ML >

[jira] [Updated] (SPARK-18481) ML 2.1 QA: Remove deprecated methods for ML

2016-11-25 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18481?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang updated SPARK-18481: Description: Remove deprecated methods for ML. We removed was:Remove deprecated methods for

[jira] [Resolved] (SPARK-18548) OnlineLDAOptimizer reads the same broadcast data after deletion

2016-11-25 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18548?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-18548. --- Resolution: Duplicate > OnlineLDAOptimizer reads the same broadcast data after deletion >

[jira] [Updated] (SPARK-18586) netty-3.8.0.Final.jar has vulnerability CVE-2014-3488 and CVE-2014-0193

2016-11-25 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18586?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-18586: -- Priority: Major (was: Critical) Spark doesn't use netty 3, but it is pulled in as a transitive

[jira] [Updated] (SPARK-18119) Namenode safemode check is only performed on one namenode which can stuck the startup of SparkHistory server

2016-11-25 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18119?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-18119: -- Assignee: Nicolas Fraison > Namenode safemode check is only performed on one namenode which can stuck

[jira] [Resolved] (SPARK-18119) Namenode safemode check is only performed on one namenode which can stuck the startup of SparkHistory server

2016-11-25 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18119?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-18119. --- Resolution: Fixed Fix Version/s: 2.1.0 Issue resolved by pull request 15648

[jira] [Assigned] (SPARK-18324) ML, Graph 2.1 QA: Programming guide update and migration guide

2016-11-25 Thread Yanbo Liang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18324?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yanbo Liang reassigned SPARK-18324: --- Assignee: Yanbo Liang > ML, Graph 2.1 QA: Programming guide update and migration guide >

[jira] [Resolved] (SPARK-18554) leader master lost the leadership, when the slave become master, the perivious app's state display as waitting

2016-11-25 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18554?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-18554. --- Resolution: Duplicate [~jerryshao] is anything blocking you from proceeding on that PR? > leader

[jira] [Commented] (SPARK-18581) MultivariateGaussian does not check if covariance matrix is invertible

2016-11-25 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18581?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15695359#comment-15695359 ] Sean Owen commented on SPARK-18581: --- It sounds like there's definitely a problem here but why is the

[jira] [Updated] (SPARK-18575) Keep same style: adjust the position of driver log links

2016-11-25 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18575?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated SPARK-18575: -- Assignee: Genmao Yu Target Version/s: (was: 2.0.3, 2.1.0) Priority: Trivial

[jira] [Resolved] (SPARK-18575) Keep same style: adjust the position of driver log links

2016-11-25 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18575?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved SPARK-18575. --- Resolution: Fixed Fix Version/s: 2.1.0 Issue resolved by pull request 16001

[jira] [Updated] (SPARK-18581) MultivariateGaussian does not check if covariance matrix is invertible

2016-11-25 Thread Hao Ren (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18581?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hao Ren updated SPARK-18581: Description: When training GaussianMixtureModel, I found some probability much larger than 1. That leads

[jira] [Updated] (SPARK-18581) MultivariateGaussian does not check if covariance matrix is invertible

2016-11-25 Thread Hao Ren (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18581?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hao Ren updated SPARK-18581: Description: When training GaussianMixtureModel, I found some probability much larger than 1. That leads

[jira] [Updated] (SPARK-18581) MultivariateGaussian does not check if covariance matrix is invertible

2016-11-25 Thread Hao Ren (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18581?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hao Ren updated SPARK-18581: Description: When training GaussianMixtureModel, I found some probability much larger than 1. That leads

[jira] [Updated] (SPARK-18581) MultivariateGaussian does not check if covariance matrix is invertible

2016-11-25 Thread Hao Ren (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18581?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hao Ren updated SPARK-18581: Summary: MultivariateGaussian does not check if covariance matrix is invertible (was:

[jira] [Assigned] (SPARK-18587) Remove handleInvalid from QuantileDiscretizer

2016-11-25 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18587?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18587: Assignee: Apache Spark > Remove handleInvalid from QuantileDiscretizer >

[jira] [Commented] (SPARK-18587) Remove handleInvalid from QuantileDiscretizer

2016-11-25 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18587?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15695291#comment-15695291 ] Apache Spark commented on SPARK-18587: -- User 'yanboliang' has created a pull request for this issue:

[jira] [Assigned] (SPARK-18587) Remove handleInvalid from QuantileDiscretizer

2016-11-25 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18587?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-18587: Assignee: (was: Apache Spark) > Remove handleInvalid from QuantileDiscretizer >

  1   2   >