[jira] [Updated] (SPARK-7564) performance bottleneck in SparkSQL using columnar storage

2015-05-12 Thread Noam Barkai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7564?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Noam Barkai updated SPARK-7564: --- Summary: performance bottleneck in SparkSQL using columnar storage (was: possible performance bottlen

[jira] [Resolved] (SPARK-7526) Specify ip of RBackend, MonitorServer and RRDD Socket server

2015-05-12 Thread Shivaram Venkataraman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7526?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shivaram Venkataraman resolved SPARK-7526. -- Resolution: Fixed Issue resolved by pull request 6053 [https://github.com/apache

[jira] [Updated] (SPARK-7564) possible performance bottleneck in SparkSQL's SparkSqlSerializer class

2015-05-12 Thread Noam Barkai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7564?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Noam Barkai updated SPARK-7564: --- Affects Version/s: 1.3.1 > possible performance bottleneck in SparkSQL's SparkSqlSerializer class > --

[jira] [Created] (SPARK-7595) Window will cause resolve failed with self join

2015-05-12 Thread Weizhong (JIRA)
Weizhong created SPARK-7595: --- Summary: Window will cause resolve failed with self join Key: SPARK-7595 URL: https://issues.apache.org/jira/browse/SPARK-7595 Project: Spark Issue Type: Bug

[jira] [Resolved] (SPARK-7482) Rename some DataFrame API methods in SparkR to match their counterparts in Scala

2015-05-12 Thread Shivaram Venkataraman (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7482?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shivaram Venkataraman resolved SPARK-7482. -- Resolution: Fixed Issue resolved by pull request 6007 [https://github.com/apache

[jira] [Resolved] (SPARK-7566) HiveContext.analyzer cannot be overriden

2015-05-12 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7566?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-7566. Resolution: Fixed Fix Version/s: 1.4.0 Assignee: Santiago M. Mola > HiveContext.anal

[jira] [Created] (SPARK-7594) Increase maximum amount of columns for covariance matrix for principal components

2015-05-12 Thread Sebastian Alfers (JIRA)
Sebastian Alfers created SPARK-7594: --- Summary: Increase maximum amount of columns for covariance matrix for principal components Key: SPARK-7594 URL: https://issues.apache.org/jira/browse/SPARK-7594

[jira] [Closed] (SPARK-7496) User guide update for Online LDA

2015-05-12 Thread yuhao yang (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7496?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] yuhao yang closed SPARK-7496. - Doc updated. Thanks for review. > User guide update for Online LDA > > >

[jira] [Assigned] (SPARK-7422) Add argmax to Vector, SparseVector

2015-05-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7422?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7422: --- Assignee: (was: Apache Spark) > Add argmax to Vector, SparseVector >

[jira] [Assigned] (SPARK-7422) Add argmax to Vector, SparseVector

2015-05-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7422?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7422: --- Assignee: Apache Spark > Add argmax to Vector, SparseVector > ---

[jira] [Commented] (SPARK-7422) Add argmax to Vector, SparseVector

2015-05-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7422?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14541450#comment-14541450 ] Apache Spark commented on SPARK-7422: - User 'GeorgeDittmar' has created a pull request

[jira] [Commented] (SPARK-7579) User guide update for OneHotEncoder

2015-05-12 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7579?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14541436#comment-14541436 ] Joseph K. Bradley commented on SPARK-7579: -- For this JIRA, I really meant a secti

[jira] [Commented] (SPARK-7127) Broadcast spark.ml tree ensemble models for predict

2015-05-12 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7127?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14541433#comment-14541433 ] Joseph K. Bradley commented on SPARK-7127: -- The mapPartitions function is really

[jira] [Commented] (SPARK-5888) Add OneHotEncoder as a Transformer

2015-05-12 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5888?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14541434#comment-14541434 ] Xiangrui Meng commented on SPARK-5888: -- [~sandyr] We can change the behavior in this

[jira] [Commented] (SPARK-7568) ml.LogisticRegression doesn't output the right prediction

2015-05-12 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7568?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14541423#comment-14541423 ] Xiangrui Meng commented on SPARK-7568: -- That is expected because intercept adds one d

[jira] [Updated] (SPARK-7564) possible performance bottleneck in SparkSQL's SparkSqlSerializer class

2015-05-12 Thread Noam Barkai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7564?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Noam Barkai updated SPARK-7564: --- Attachment: worker profiling showing the bottle-neck.png > possible performance bottleneck in SparkSQL

[jira] [Comment Edited] (SPARK-7568) ml.LogisticRegression doesn't output the right prediction

2015-05-12 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7568?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14541416#comment-14541416 ] DB Tsai edited comment on SPARK-7568 at 5/13/15 6:06 AM: - `fitInte

[jira] [Comment Edited] (SPARK-7568) ml.LogisticRegression doesn't output the right prediction

2015-05-12 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7568?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14541416#comment-14541416 ] DB Tsai edited comment on SPARK-7568 at 5/13/15 6:05 AM: - `fitInte

[jira] [Commented] (SPARK-7568) ml.LogisticRegression doesn't output the right prediction

2015-05-12 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7568?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14541416#comment-14541416 ] DB Tsai commented on SPARK-7568: `fitIntercept = false`, or in Spark 1.3, the training acc

[jira] [Commented] (SPARK-7522) ML Examples option for dataFormat should not be enclosed in angle brackets

2015-05-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7522?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14541404#comment-14541404 ] Apache Spark commented on SPARK-7522: - User 'BryanCutler' has created a pull request f

[jira] [Commented] (SPARK-7269) Incorrect aggregation analysis

2015-05-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7269?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14541393#comment-14541393 ] Apache Spark commented on SPARK-7269: - User 'chenghao-intel' has created a pull reques

[jira] [Comment Edited] (SPARK-7568) ml.LogisticRegression doesn't output the right prediction

2015-05-12 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7568?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14541385#comment-14541385 ] DB Tsai edited comment on SPARK-7568 at 5/13/15 5:43 AM: - Default

[jira] [Commented] (SPARK-7568) ml.LogisticRegression doesn't output the right prediction

2015-05-12 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7568?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14541385#comment-14541385 ] DB Tsai commented on SPARK-7568: Default for R is true. > ml.LogisticRegression doesn't

[jira] [Commented] (SPARK-7568) ml.LogisticRegression doesn't output the right prediction

2015-05-12 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7568?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14541375#comment-14541375 ] Xiangrui Meng commented on SPARK-7568: -- What is the default value in R for fitInterce

[jira] [Comment Edited] (SPARK-7568) ml.LogisticRegression doesn't output the right prediction

2015-05-12 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7568?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14541374#comment-14541374 ] DB Tsai edited comment on SPARK-7568 at 5/13/15 5:37 AM: - In 1.3,

[jira] [Commented] (SPARK-7568) ml.LogisticRegression doesn't output the right prediction

2015-05-12 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7568?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14541374#comment-14541374 ] DB Tsai commented on SPARK-7568: In 1.3, https://github.com/apache/spark/blob/branch-1.3/

[jira] [Comment Edited] (SPARK-7568) ml.LogisticRegression doesn't output the right prediction

2015-05-12 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7568?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14541363#comment-14541363 ] Xiangrui Meng edited comment on SPARK-7568 at 5/13/15 5:31 AM: -

[jira] [Commented] (SPARK-5888) Add OneHotEncoder as a Transformer

2015-05-12 Thread Sandy Ryza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5888?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14541367#comment-14541367 ] Sandy Ryza commented on SPARK-5888: --- Right, but while the values are unknown at first, t

[jira] [Commented] (SPARK-7568) ml.LogisticRegression doesn't output the right prediction

2015-05-12 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7568?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14541363#comment-14541363 ] Xiangrui Meng commented on SPARK-7568: -- Instance 6 still has prediction 0.0, which wa

[jira] [Assigned] (SPARK-7568) ml.LogisticRegression doesn't output the right prediction

2015-05-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7568?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7568: --- Assignee: DB Tsai (was: Apache Spark) > ml.LogisticRegression doesn't output the right predi

[jira] [Commented] (SPARK-7568) ml.LogisticRegression doesn't output the right prediction

2015-05-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7568?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14541358#comment-14541358 ] Apache Spark commented on SPARK-7568: - User 'dbtsai' has created a pull request for th

[jira] [Assigned] (SPARK-7568) ml.LogisticRegression doesn't output the right prediction

2015-05-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7568?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7568: --- Assignee: Apache Spark (was: DB Tsai) > ml.LogisticRegression doesn't output the right predi

[jira] [Commented] (SPARK-7579) User guide update for OneHotEncoder

2015-05-12 Thread Sandy Ryza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7579?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14541356#comment-14541356 ] Sandy Ryza commented on SPARK-7579: --- I can take this up. Any thoughts on how it should

[jira] [Commented] (SPARK-5888) Add OneHotEncoder as a Transformer

2015-05-12 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5888?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14541354#comment-14541354 ] Xiangrui Meng commented on SPARK-5888: -- The values of a nominal attribute is an Opti

[jira] [Commented] (SPARK-7568) ml.LogisticRegression doesn't output the right prediction

2015-05-12 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7568?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14541352#comment-14541352 ] DB Tsai commented on SPARK-7568: Actually, with lambda = 0.001, the training accuracy is p

[jira] [Issue Comment Deleted] (SPARK-7568) ml.LogisticRegression doesn't output the right prediction

2015-05-12 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7568?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] DB Tsai updated SPARK-7568: --- Comment: was deleted (was: Oh... my bad. I guess you are referring to the third example in the training set. Oka

[jira] [Commented] (SPARK-5888) Add OneHotEncoder as a Transformer

2015-05-12 Thread Sandy Ryza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5888?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14541351#comment-14541351 ] Sandy Ryza commented on SPARK-5888: --- The values of the nominal output attribute should b

[jira] [Commented] (SPARK-5888) Add OneHotEncoder as a Transformer

2015-05-12 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5888?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14541344#comment-14541344 ] Xiangrui Meng commented on SPARK-5888: -- `transformSchema` should be optimistic about

[jira] [Commented] (SPARK-5888) Add OneHotEncoder as a Transformer

2015-05-12 Thread Sandy Ryza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5888?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14541339#comment-14541339 ] Sandy Ryza commented on SPARK-5888: --- Hi [~hvanhovell], I agree that this should work. [

[jira] [Commented] (SPARK-7568) ml.LogisticRegression doesn't output the right prediction

2015-05-12 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7568?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14541334#comment-14541334 ] DB Tsai commented on SPARK-7568: Oh... my bad. I guess you are referring to the third example

[jira] [Commented] (SPARK-7568) ml.LogisticRegression doesn't output the right prediction

2015-05-12 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7568?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14541332#comment-14541332 ] DB Tsai commented on SPARK-7568: Well, the third example is 0.0 in the old code. ``` (4, s

[jira] [Commented] (SPARK-7548) Add explode expression

2015-05-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7548?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14541331#comment-14541331 ] Apache Spark commented on SPARK-7548: - User 'marmbrus' has created a pull request for

[jira] [Assigned] (SPARK-7548) Add explode expression

2015-05-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7548?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7548: --- Assignee: Michael Armbrust (was: Apache Spark) > Add explode expression > --

[jira] [Assigned] (SPARK-7548) Add explode expression

2015-05-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7548?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7548: --- Assignee: Apache Spark (was: Michael Armbrust) > Add explode expression > --

[jira] [Resolved] (SPARK-7321) Add Column expression for conditional statements (if, case)

2015-05-12 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7321?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-7321. Resolution: Fixed > Add Column expression for conditional statements (if, case) > --

[jira] [Reopened] (SPARK-7321) Add Column expression for conditional statements (if, case)

2015-05-12 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7321?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin reopened SPARK-7321: > Add Column expression for conditional statements (if, case) >

[jira] [Commented] (SPARK-7127) Broadcast spark.ml tree ensemble models for predict

2015-05-12 Thread Bryan Cutler (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7127?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14541326#comment-14541326 ] Bryan Cutler commented on SPARK-7127: - Hi [~josephkb], I've been working with to inco

[jira] [Comment Edited] (SPARK-7568) ml.LogisticRegression doesn't output the right prediction

2015-05-12 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7568?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14541316#comment-14541316 ] Xiangrui Meng edited comment on SPARK-7568 at 5/13/15 4:50 AM: -

[jira] [Commented] (SPARK-7568) ml.LogisticRegression doesn't output the right prediction

2015-05-12 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7568?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14541316#comment-14541316 ] Xiangrui Meng commented on SPARK-7568: -- The 3rd example still has prediction 0.0. >

[jira] [Commented] (SPARK-7382) Python API for ml.classification

2015-05-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7382?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14541315#comment-14541315 ] Apache Spark commented on SPARK-7382: - User 'brkyvz' has created a pull request for th

[jira] [Assigned] (SPARK-7382) Python API for ml.classification

2015-05-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7382?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7382: --- Assignee: Burak Yavuz (was: Apache Spark) > Python API for ml.classification > -

[jira] [Assigned] (SPARK-7382) Python API for ml.classification

2015-05-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7382?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7382: --- Assignee: Apache Spark (was: Burak Yavuz) > Python API for ml.classification > -

[jira] [Assigned] (SPARK-7381) Missing Python API for o.a.s.ml

2015-05-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7381?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7381: --- Assignee: Apache Spark (was: Burak Yavuz) > Missing Python API for o.a.s.ml > --

[jira] [Assigned] (SPARK-7381) Missing Python API for o.a.s.ml

2015-05-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7381?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7381: --- Assignee: Burak Yavuz (was: Apache Spark) > Missing Python API for o.a.s.ml > --

[jira] [Commented] (SPARK-7381) Missing Python API for o.a.s.ml

2015-05-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7381?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14541314#comment-14541314 ] Apache Spark commented on SPARK-7381: - User 'brkyvz' has created a pull request for th

[jira] [Commented] (SPARK-7581) User guide update for PolynomialExpansion

2015-05-12 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7581?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14541313#comment-14541313 ] Joseph K. Bradley commented on SPARK-7581: -- You can go to the docs/ folder and ru

[jira] [Resolved] (SPARK-7321) Add Column expression for conditional statements (if, case)

2015-05-12 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7321?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-7321. Resolution: Pending Closed Fix Version/s: 1.4.0 > Add Column expression for conditional state

[jira] [Commented] (SPARK-7568) ml.LogisticRegression doesn't output the right prediction

2015-05-12 Thread DB Tsai (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7568?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14541301#comment-14541301 ] DB Tsai commented on SPARK-7568: This is because we regularize the intercept before which

[jira] [Commented] (SPARK-7297) Make timeline more discoverable

2015-05-12 Thread Kousuke Saruta (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7297?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14541297#comment-14541297 ] Kousuke Saruta commented on SPARK-7297: --- This issue was resolved by SPARK-7298 right

[jira] [Commented] (SPARK-6980) Akka timeout exceptions indicate which conf controls them

2015-05-12 Thread Harsh Gupta (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6980?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14541270#comment-14541270 ] Harsh Gupta commented on SPARK-6980: [~bryanc] Sure thing. I am already keeping a wa

[jira] [Commented] (SPARK-7581) User guide update for PolynomialExpansion

2015-05-12 Thread Xusen Yin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7581?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14541256#comment-14541256 ] Xusen Yin commented on SPARK-7581: -- How to preview the docs that I wrote when I finished

[jira] [Assigned] (SPARK-7322) Add DataFrame DSL for window function support

2015-05-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7322?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7322: --- Assignee: Cheng Hao (was: Apache Spark) > Add DataFrame DSL for window function support > --

[jira] [Commented] (SPARK-7322) Add DataFrame DSL for window function support

2015-05-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7322?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14541245#comment-14541245 ] Apache Spark commented on SPARK-7322: - User 'chenghao-intel' has created a pull reques

[jira] [Assigned] (SPARK-7322) Add DataFrame DSL for window function support

2015-05-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7322?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7322: --- Assignee: Apache Spark (was: Cheng Hao) > Add DataFrame DSL for window function support > --

[jira] [Commented] (SPARK-6289) PySpark doesn't maintain SQL date Types

2015-05-12 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6289?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14541235#comment-14541235 ] Davies Liu commented on SPARK-6289: --- [~mnazario] Is this still a problem after we upgrad

[jira] [Issue Comment Deleted] (SPARK-7581) User guide update for PolynomialExpansion

2015-05-12 Thread Xusen Yin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7581?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xusen Yin updated SPARK-7581: - Comment: was deleted (was: Yes, I will do it right now. If I miss anything else, pls let me know.) > User

[jira] [Commented] (SPARK-7581) User guide update for PolynomialExpansion

2015-05-12 Thread Xusen Yin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7581?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14541202#comment-14541202 ] Xusen Yin commented on SPARK-7581: -- Yes, I will do it right now. If I miss anything else,

[jira] [Commented] (SPARK-7581) User guide update for PolynomialExpansion

2015-05-12 Thread Xusen Yin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7581?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14541203#comment-14541203 ] Xusen Yin commented on SPARK-7581: -- Yes, I will do it right now. If I miss anything else,

[jira] [Resolved] (SPARK-7588) Document all SQL/DataFrame public methods with @since tag

2015-05-12 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7588?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-7588. Resolution: Fixed Fix Version/s: 1.4.0 > Document all SQL/DataFrame public methods with @sinc

[jira] [Updated] (SPARK-7578) User guide update for spark.ml IDF, Normalizer, StandardScaler

2015-05-12 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7578?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-7578: - Assignee: Joseph K. Bradley > User guide update for spark.ml IDF, Normalizer, StandardScal

[jira] [Resolved] (SPARK-7592) Resolution set to "Pending Closed" when using PR merge script

2015-05-12 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7592?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell resolved SPARK-7592. Resolution: Fixed Fix Version/s: 1.4.0 > Resolution set to "Pending Closed" when usin

[jira] [Resolved] (SPARK-6876) DataFrame.na.replace value support for Python

2015-05-12 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6876?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell resolved SPARK-6876. Resolution: Fixed > DataFrame.na.replace value support for Python >

[jira] [Reopened] (SPARK-5182) Partitioning support for tables created by the data source API

2015-05-12 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5182?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell reopened SPARK-5182: > Partitioning support for tables created by the data source API > -

[jira] [Reopened] (SPARK-7435) Make DataFrame.show() consistent with that of Scala and pySpark

2015-05-12 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7435?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell reopened SPARK-7435: > Make DataFrame.show() consistent with that of Scala and pySpark >

[jira] [Resolved] (SPARK-7435) Make DataFrame.show() consistent with that of Scala and pySpark

2015-05-12 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7435?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell resolved SPARK-7435. Resolution: Fixed > Make DataFrame.show() consistent with that of Scala and pySpark > --

[jira] [Resolved] (SPARK-5182) Partitioning support for tables created by the data source API

2015-05-12 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5182?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell resolved SPARK-5182. Resolution: Fixed > Partitioning support for tables created by the data source API > ---

[jira] [Reopened] (SPARK-7534) Fix the Stage table when a stage is missing

2015-05-12 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7534?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell reopened SPARK-7534: > Fix the Stage table when a stage is missing > --- > >

[jira] [Reopened] (SPARK-6876) DataFrame.na.replace value support for Python

2015-05-12 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-6876?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell reopened SPARK-6876: > DataFrame.na.replace value support for Python > -

[jira] [Resolved] (SPARK-7534) Fix the Stage table when a stage is missing

2015-05-12 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7534?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell resolved SPARK-7534. Resolution: Fixed > Fix the Stage table when a stage is missing > --

[jira] [Resolved] (SPARK-7276) withColumn is very slow on dataframe with large number of columns

2015-05-12 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7276?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell resolved SPARK-7276. Resolution: Fixed > withColumn is very slow on dataframe with large number of columns >

[jira] [Reopened] (SPARK-2018) Big-Endian (IBM Power7) Spark Serialization issue

2015-05-12 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2018?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell reopened SPARK-2018: > Big-Endian (IBM Power7) Spark Serialization issue > -

[jira] [Resolved] (SPARK-7487) Python API for ml.regression

2015-05-12 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7487?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell resolved SPARK-7487. Resolution: Fixed > Python API for ml.regression > > >

[jira] [Reopened] (SPARK-7487) Python API for ml.regression

2015-05-12 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7487?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell reopened SPARK-7487: > Python API for ml.regression > > > Key: SPARK-748

[jira] [Resolved] (SPARK-7531) Install GPG on Jenkins machines

2015-05-12 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7531?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell resolved SPARK-7531. Resolution: Fixed > Install GPG on Jenkins machines > --- > >

[jira] [Reopened] (SPARK-7531) Install GPG on Jenkins machines

2015-05-12 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7531?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell reopened SPARK-7531: > Install GPG on Jenkins machines > --- > > Key: SPA

[jira] [Closed] (SPARK-7552) Close files correctly when iteration is finished in WAL recovery

2015-05-12 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7552?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell closed SPARK-7552. -- Resolution: Fixed > Close files correctly when iteration is finished in WAL recovery > -

[jira] [Reopened] (SPARK-7276) withColumn is very slow on dataframe with large number of columns

2015-05-12 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7276?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell reopened SPARK-7276: > withColumn is very slow on dataframe with large number of columns > --

[jira] [Resolved] (SPARK-2018) Big-Endian (IBM Power7) Spark Serialization issue

2015-05-12 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2018?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell resolved SPARK-2018. Resolution: Fixed > Big-Endian (IBM Power7) Spark Serialization issue > ---

[jira] [Reopened] (SPARK-7552) Close files correctly when iteration is finished in WAL recovery

2015-05-12 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7552?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell reopened SPARK-7552: > Close files correctly when iteration is finished in WAL recovery > ---

[jira] [Resolved] (SPARK-7015) Multiclass to Binary Reduction

2015-05-12 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7015?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell resolved SPARK-7015. Resolution: Fixed > Multiclass to Binary Reduction > -- > >

[jira] [Resolved] (SPARK-7528) Java compatibility of RankingMetrics

2015-05-12 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7528?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell resolved SPARK-7528. Resolution: Fixed > Java compatibility of RankingMetrics > -

[jira] [Reopened] (SPARK-7528) Java compatibility of RankingMetrics

2015-05-12 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7528?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell reopened SPARK-7528: > Java compatibility of RankingMetrics > > >

[jira] [Reopened] (SPARK-7015) Multiclass to Binary Reduction

2015-05-12 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7015?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell reopened SPARK-7015: > Multiclass to Binary Reduction > -- > > Key: SPARK

[jira] [Resolved] (SPARK-7406) Add tooltips for "Scheduling Delay", "Processing Time" and "Total Delay" in Streaming WebUI

2015-05-12 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7406?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell resolved SPARK-7406. Resolution: Fixed > Add tooltips for "Scheduling Delay", "Processing Time" and "Total Delay"

[jira] [Reopened] (SPARK-7406) Add tooltips for "Scheduling Delay", "Processing Time" and "Total Delay" in Streaming WebUI

2015-05-12 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7406?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell reopened SPARK-7406: > Add tooltips for "Scheduling Delay", "Processing Time" and "Total Delay" in > Streaming WebUI

[jira] [Created] (SPARK-7593) Python API for Bucketizer

2015-05-12 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-7593: Summary: Python API for Bucketizer Key: SPARK-7593 URL: https://issues.apache.org/jira/browse/SPARK-7593 Project: Spark Issue Type: New Feature Com

[jira] [Assigned] (SPARK-7592) Resolution set to "Pending Closed" when using PR merge script

2015-05-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7592?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7592: --- Assignee: Patrick Wendell (was: Apache Spark) > Resolution set to "Pending Closed" when usin

[jira] [Commented] (SPARK-7592) Resolution set to "Pending Closed" when using PR merge script

2015-05-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7592?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14541131#comment-14541131 ] Apache Spark commented on SPARK-7592: - User 'pwendell' has created a pull request for

[jira] [Assigned] (SPARK-7592) Resolution set to "Pending Closed" when using PR merge script

2015-05-12 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7592?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-7592: --- Assignee: Apache Spark (was: Patrick Wendell) > Resolution set to "Pending Closed" when usin

[jira] [Created] (SPARK-7592) Resolution set to "Pending Closed" when using PR merge script

2015-05-12 Thread Patrick Wendell (JIRA)
Patrick Wendell created SPARK-7592: -- Summary: Resolution set to "Pending Closed" when using PR merge script Key: SPARK-7592 URL: https://issues.apache.org/jira/browse/SPARK-7592 Project: Spark
