[jira] [Resolved] (SPARK-3673) Move IndexedRDD from a pull request into a separate repository

2015-01-29 Thread Ankur Dave (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3673?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ankur Dave resolved SPARK-3673. --- Resolution: Fixed > Move IndexedRDD from a pull request into a separate repository > -

[jira] [Commented] (SPARK-3673) Move IndexedRDD from a pull request into a separate repository

2015-01-29 Thread Alexander Bezzubov (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3673?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14298314#comment-14298314 ] Alexander Bezzubov commented on SPARK-3673: --- Looks like this was resolved by htt

[jira] [Created] (SPARK-5494) SparkSqlSerializer Ignores KryoRegistrators

2015-01-29 Thread Hamel Ajay Kothari (JIRA)
Hamel Ajay Kothari created SPARK-5494: - Summary: SparkSqlSerializer Ignores KryoRegistrators Key: SPARK-5494 URL: https://issues.apache.org/jira/browse/SPARK-5494 Project: Spark Issue Typ

[jira] [Resolved] (SPARK-5322) Add transpose() to BlockMatrix

2015-01-29 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5322?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-5322. -- Resolution: Fixed Fix Version/s: 1.3.0 Issue resolved by pull request 4275 [https://githu

[jira] [Updated] (SPARK-5322) Add transpose() to BlockMatrix

2015-01-29 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5322?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-5322: - Assignee: Burak Yavuz > Add transpose() to BlockMatrix > -- > >

[jira] [Commented] (SPARK-3080) ArrayIndexOutOfBoundsException in ALS for Large datasets

2015-01-29 Thread Michael Hynes (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3080?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14298192#comment-14298192 ] Michael Hynes commented on SPARK-3080: -- What is the status of this SimpleALS.scala re

[jira] [Commented] (SPARK-5492) Thread statistics can break with older Hadoop versions

2015-01-29 Thread Sandy Ryza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5492?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14298190#comment-14298190 ] Sandy Ryza commented on SPARK-5492: --- Are you able to provide any more detail on the envi

[jira] [Created] (SPARK-5493) Support proxy users under kerberos

2015-01-29 Thread Brock Noland (JIRA)
Brock Noland created SPARK-5493: --- Summary: Support proxy users under kerberos Key: SPARK-5493 URL: https://issues.apache.org/jira/browse/SPARK-5493 Project: Spark Issue Type: Improvement

[jira] [Commented] (SPARK-5492) Thread statistics can break with older Hadoop versions

2015-01-29 Thread Sandy Ryza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5492?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14298138#comment-14298138 ] Sandy Ryza commented on SPARK-5492: --- Very weird. I'll look into it. Did that come up d

[jira] [Assigned] (SPARK-5492) Thread statistics can break with older Hadoop versions

2015-01-29 Thread Sandy Ryza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5492?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sandy Ryza reassigned SPARK-5492: - Assignee: Sandy Ryza > Thread statistics can break with older Hadoop versions > -

[jira] [Commented] (SPARK-3976) Detect block matrix partitioning schemes

2015-01-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3976?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14298132#comment-14298132 ] Apache Spark commented on SPARK-3976: - User 'brkyvz' has created a pull request for th

[jira] [Commented] (SPARK-3996) Shade Jetty in Spark deliverables

2015-01-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3996?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14298128#comment-14298128 ] Apache Spark commented on SPARK-3996: - User 'pwendell' has created a pull request for

[jira] [Resolved] (SPARK-5462) Catalyst UnresolvedException "Invalid call to qualifiers on unresolved object" error when accessing fields in DataFrames returned from sqlCtx.sql()

2015-01-29 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5462?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-5462. Resolution: Fixed Fix Version/s: 1.3.0 Assignee: Josh Rosen > Catalyst UnresolvedExc

[jira] [Commented] (SPARK-5492) Thread statistics can break with older Hadoop versions

2015-01-29 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5492?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14298075#comment-14298075 ] Patrick Wendell commented on SPARK-5492: /cc [~sandyr] > Thread statistics can br

[jira] [Updated] (SPARK-5492) Thread statistics can break with older Hadoop versions

2015-01-29 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5492?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-5492: --- Priority: Blocker (was: Major) > Thread statistics can break with older Hadoop versions > ---

[jira] [Created] (SPARK-5492) Thread statistics can break with older Hadoop versions

2015-01-29 Thread Patrick Wendell (JIRA)
Patrick Wendell created SPARK-5492: -- Summary: Thread statistics can break with older Hadoop versions Key: SPARK-5492 URL: https://issues.apache.org/jira/browse/SPARK-5492 Project: Spark Issu

[jira] [Commented] (SPARK-5489) KMeans clustering java.lang.NoSuchMethodError: scala.runtime.IntRef.create (I)Lscala/runtime/IntRef;

2015-01-29 Thread DeepakVohra (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5489?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14298063#comment-14298063 ] DeepakVohra commented on SPARK-5489: If Scala 2.11.1 is used the scala.Cloneable is no

[jira] [Commented] (SPARK-5454) [SQL] Self join with ArrayType columns problems

2015-01-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5454?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14298048#comment-14298048 ] Apache Spark commented on SPARK-5454: - User 'chenghao-intel' has created a pull reques

[jira] [Updated] (SPARK-1473) Feature selection for high dimensional datasets

2015-01-29 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1473?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-1473: - Assignee: (was: Alexander Ulanov) > Feature selection for high dimensional datasets >

[jira] [Updated] (SPARK-1473) Feature selection for high dimensional datasets

2015-01-29 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1473?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-1473: - Target Version/s: (was: 1.3.0) > Feature selection for high dimensional datasets > -

[jira] [Created] (SPARK-5491) Chi-square feature selection

2015-01-29 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-5491: Summary: Chi-square feature selection Key: SPARK-5491 URL: https://issues.apache.org/jira/browse/SPARK-5491 Project: Spark Issue Type: New Feature

[jira] [Updated] (SPARK-5395) Large number of Python workers causing resource depletion

2015-01-29 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5395?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-5395: -- Target Version/s: 1.3.0, 1.2.2 Fix Version/s: 1.3.0 Assignee: Davies Liu

[jira] [Updated] (SPARK-2199) Distributed probabilistic latent semantic analysis in MLlib

2015-01-29 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2199?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-2199: - Target Version/s: 1.4.0 (was: 1.3.0) > Distributed probabilistic latent semantic analysis in MLli

[jira] [Updated] (SPARK-3080) ArrayIndexOutOfBoundsException in ALS for Large datasets

2015-01-29 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3080?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-3080: - Target Version/s: (was: 1.3.0) > ArrayIndexOutOfBoundsException in ALS for Large datasets >

[jira] [Updated] (SPARK-3147) Implement A/B testing

2015-01-29 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3147?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-3147: - Target Version/s: (was: 1.3.0) > Implement A/B testing > - > >

[jira] [Commented] (SPARK-4259) Add Power Iteration Clustering Algorithm with Gaussian Similarity Function

2015-01-29 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4259?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14298030#comment-14298030 ] Xiangrui Meng commented on SPARK-4259: -- [~andrew.musselman] PIC is more or less a spe

[jira] [Updated] (SPARK-3147) Implement A/B testing

2015-01-29 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3147?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-3147: - Target Version/s: 1.4.0 > Implement A/B testing > - > > Key: S

[jira] [Updated] (SPARK-4259) Add Power Iteration Clustering Algorithm with Gaussian Similarity Function

2015-01-29 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4259?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-4259: - Target Version/s: 1.3.0 > Add Power Iteration Clustering Algorithm with Gaussian Similarity Functi

[jira] [Updated] (SPARK-1405) parallel Latent Dirichlet Allocation (LDA) atop of spark in MLlib

2015-01-29 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1405?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-1405: - Assignee: Joseph K. Bradley (was: Guoqiang Li) > parallel Latent Dirichlet Allocation (LDA) atop

[jira] [Reopened] (SPARK-3996) Shade Jetty in Spark deliverables

2015-01-29 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3996?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell reopened SPARK-3996: This was causing compiler failures in the master build, so I reverted it. I think it's the same

[jira] [Comment Edited] (SPARK-4349) Spark driver hangs on sc.parallelize() if exception is thrown during serialization

2015-01-29 Thread Matt Cheah (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4349?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14298017#comment-14298017 ] Matt Cheah edited comment on SPARK-4349 at 1/30/15 1:12 AM: Wh

[jira] [Updated] (SPARK-5399) tree Losses strings should match loss names

2015-01-29 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5399?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-5399: - Assignee: Kai Sasaki > tree Losses strings should match loss names > -

[jira] [Closed] (SPARK-4349) Spark driver hangs on sc.parallelize() if exception is thrown during serialization

2015-01-29 Thread Matt Cheah (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4349?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matt Cheah closed SPARK-4349. - Resolution: Fixed > Spark driver hangs on sc.parallelize() if exception is thrown during > serialization

[jira] [Commented] (SPARK-4349) Spark driver hangs on sc.parallelize() if exception is thrown during serialization

2015-01-29 Thread Matt Cheah (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4349?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14298017#comment-14298017 ] Matt Cheah commented on SPARK-4349: --- Whoops, this was fixed by SPARK-4737. Someone want

[jira] [Updated] (SPARK-4118) Create python bindings for Streaming KMeans

2015-01-29 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4118?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-4118: - Target Version/s: (was: 1.3.0) > Create python bindings for Streaming KMeans > -

[jira] [Updated] (SPARK-5101) Add common ML math functions

2015-01-29 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5101?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-5101: - Target Version/s: (was: 1.3.0) > Add common ML math functions > > >

[jira] [Updated] (SPARK-3188) Add Robust Regression Algorithm with Tukey bisquare weight function (Biweight Estimates)

2015-01-29 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3188?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-3188: - Target Version/s: 1.4.0 (was: 1.3.0) > Add Robust Regression Algorithm with Tukey bisquare weight

[jira] [Updated] (SPARK-5012) Python API for Gaussian Mixture Model

2015-01-29 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5012?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-5012: - Priority: Critical (was: Major) > Python API for Gaussian Mixture Model > ---

[jira] [Updated] (SPARK-5094) Python API for gradient-boosted trees

2015-01-29 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5094?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-5094: - Priority: Critical (was: Major) > Python API for gradient-boosted trees > ---

[jira] [Updated] (SPARK-4240) Refine Tree Predictions in Gradient Boosting to Improve Prediction Accuracy.

2015-01-29 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4240?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-4240: - Target Version/s: (was: 1.3.0) > Refine Tree Predictions in Gradient Boosting to Improve Predict

[jira] [Updated] (SPARK-4036) Add Conditional Random Fields (CRF) algorithm to Spark MLlib

2015-01-29 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4036?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-4036: - Assignee: Kai Sasaki > Add Conditional Random Fields (CRF) algorithm to Spark MLlib >

[jira] [Commented] (SPARK-4036) Add Conditional Random Fields (CRF) algorithm to Spark MLlib

2015-01-29 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4036?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14298012#comment-14298012 ] Xiangrui Meng commented on SPARK-4036: -- [~lewuathe] I've assigned this ticket to you.

[jira] [Updated] (SPARK-4036) Add Conditional Random Fields (CRF) algorithm to Spark MLlib

2015-01-29 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4036?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-4036: - Target Version/s: (was: 1.3.0) > Add Conditional Random Fields (CRF) algorithm to Spark MLlib >

[jira] [Updated] (SPARK-3181) Add Robust Regression Algorithm with Huber Estimator

2015-01-29 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3181?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-3181: - Target Version/s: 1.4.0 (was: 1.3.0) > Add Robust Regression Algorithm with Huber Estimator > ---

[jira] [Updated] (SPARK-5486) Add validate function for BlockMatrix

2015-01-29 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5486?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-5486: - Priority: Major (was: Critical) > Add validate function for BlockMatrix > ---

[jira] [Commented] (SPARK-5420) Cross-langauge load/store functions for creating and saving DataFrames

2015-01-29 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5420?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14297987#comment-14297987 ] Michael Armbrust commented on SPARK-5420: - Here are the dimensions that I think we

[jira] [Updated] (SPARK-5472) Add support for reading from and writing to a JDBC database

2015-01-29 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5472?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-5472: Priority: Blocker (was: Minor) > Add support for reading from and writing to a JDBC databas

[jira] [Updated] (SPARK-5472) Add support for reading from and writing to a JDBC database

2015-01-29 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5472?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-5472: Assignee: Tor Myklebust > Add support for reading from and writing to a JDBC database >

[jira] [Updated] (SPARK-5472) Add support for reading from and writing to a JDBC database

2015-01-29 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5472?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust updated SPARK-5472: Target Version/s: 1.3.0 > Add support for reading from and writing to a JDBC database >

[jira] [Resolved] (SPARK-4959) Attributes are case sensitive when using a select query from a projection

2015-01-29 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4959?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-4959. - Resolution: Fixed > Attributes are case sensitive when using a select query from a project

[jira] [Updated] (SPARK-3778) newAPIHadoopRDD doesn't properly pass credentials for secure hdfs on yarn

2015-01-29 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3778?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-3778: --- Priority: Critical (was: Major) > newAPIHadoopRDD doesn't properly pass credentials for secur

[jira] [Updated] (SPARK-3778) newAPIHadoopRDD doesn't properly pass credentials for secure hdfs on yarn

2015-01-29 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3778?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell updated SPARK-3778: --- Target Version/s: 1.3.0 (was: 1.1.1, 1.2.0) > newAPIHadoopRDD doesn't properly pass credentia

[jira] [Resolved] (SPARK-3996) Shade Jetty in Spark deliverables

2015-01-29 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3996?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell resolved SPARK-3996. Resolution: Fixed Fix Version/s: 1.3.0 Assignee: Patrick Wendell (was: Matth

[jira] [Commented] (SPARK-5462) Catalyst UnresolvedException "Invalid call to qualifiers on unresolved object" error when accessing fields in DataFrames returned from sqlCtx.sql()

2015-01-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5462?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14297956#comment-14297956 ] Apache Spark commented on SPARK-5462: - User 'JoshRosen' has created a pull request for

[jira] [Commented] (SPARK-5489) KMeans clustering java.lang.NoSuchMethodError: scala.runtime.IntRef.create (I)Lscala/runtime/IntRef;

2015-01-29 Thread DeepakVohra (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5489?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14297951#comment-14297951 ] DeepakVohra commented on SPARK-5489: Sean, Some dependency is making use of scala.run

[jira] [Commented] (SPARK-5424) Make the new ALS implementation take generic ID types

2015-01-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5424?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14297949#comment-14297949 ] Apache Spark commented on SPARK-5424: - User 'mengxr' has created a pull request for th

[jira] [Resolved] (SPARK-5464) Calling help() on a Python DataFrame fails with "cannot resolve column name __name__" error

2015-01-29 Thread Reynold Xin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5464?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Reynold Xin resolved SPARK-5464. Resolution: Fixed Fix Version/s: 1.3.0 > Calling help() on a Python DataFrame fails with "can

[jira] [Commented] (SPARK-5489) KMeans clustering java.lang.NoSuchMethodError: scala.runtime.IntRef.create (I)Lscala/runtime/IntRef;

2015-01-29 Thread DeepakVohra (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5489?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14297937#comment-14297937 ] DeepakVohra commented on SPARK-5489: Sean, Made the Scala version the same, but still

[jira] [Commented] (SPARK-5483) java.lang.NoSuchMethodError: scala.Predef$.ArrowAssoc(Ljava/lang/Object;)Ljava/lang/Object;

2015-01-29 Thread DeepakVohra (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5483?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14297936#comment-14297936 ] DeepakVohra commented on SPARK-5483: Sean, Made the Scala version the same, but still

[jira] [Commented] (SPARK-5462) Catalyst UnresolvedException "Invalid call to qualifiers on unresolved object" error when accessing fields in DataFrames returned from sqlCtx.sql()

2015-01-29 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5462?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14297935#comment-14297935 ] Josh Rosen commented on SPARK-5462: --- [~liancheng] [~marmbrus] Is this possibly related t

[jira] [Updated] (SPARK-5462) Catalyst UnresolvedException "Invalid call to qualifiers on unresolved object" error when accessing fields in DataFrames returned from sqlCtx.sql()

2015-01-29 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5462?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-5462: -- Component/s: (was: PySpark) > Catalyst UnresolvedException "Invalid call to qualifiers on unresolved

[jira] [Updated] (SPARK-5462) Catalyst UnresolvedException "Invalid call to qualifiers on unresolved object" error when accessing fields in DataFrames returned from sqlCtx.sql()

2015-01-29 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5462?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-5462: -- Assignee: (was: Josh Rosen) > Catalyst UnresolvedException "Invalid call to qualifiers on unresolved

[jira] [Commented] (SPARK-5462) Catalyst UnresolvedException "Invalid call to qualifiers on unresolved object" error when accessing fields in DataFrames returned from sqlCtx.sql()

2015-01-29 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5462?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14297934#comment-14297934 ] Josh Rosen commented on SPARK-5462: --- Actually, this issue isn't Python-specific: it also

[jira] [Updated] (SPARK-5462) Catalyst UnresolvedException "Invalid call to qualifiers on unresolved object" error when accessing fields in DataFrames returned from sqlCtx.sql()

2015-01-29 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5462?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen updated SPARK-5462: -- Summary: Catalyst UnresolvedException "Invalid call to qualifiers on unresolved object" error when acces

[jira] [Resolved] (SPARK-5373) literal in agg grouping expressioons leads to incorrect result

2015-01-29 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5373?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-5373. - Resolution: Fixed Fix Version/s: 1.3.0 Issue resolved by pull request 4169 [https:/

[jira] [Commented] (SPARK-4133) PARSING_ERROR(2) when upgrading issues from 1.0.2 to 1.1.0

2015-01-29 Thread Derrick Burns (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4133?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14297919#comment-14297919 ] Derrick Burns commented on SPARK-4133: -- I worked around it, so feel free On Thu,

[jira] [Resolved] (SPARK-5367) support star expression in udf

2015-01-29 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5367?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-5367. - Resolution: Fixed Fix Version/s: 1.3.0 Issue resolved by pull request 4163 [https:/

[jira] [Commented] (SPARK-5462) Catalyst UnresolvedException "Invalid call to qualifiers on unresolved object" error when accessing fields in Python DataFrame

2015-01-29 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5462?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14297913#comment-14297913 ] Josh Rosen commented on SPARK-5462: --- I'm working on a patch for this now. It looks like

[jira] [Resolved] (SPARK-4786) Parquet filter pushdown for BYTE and SHORT types

2015-01-29 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4786?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-4786. - Resolution: Fixed Fix Version/s: 1.3.0 Issue resolved by pull request 4156 [https:/

[jira] [Resolved] (SPARK-5309) Reduce Binary/String conversion overhead when reading/writing Parquet files

2015-01-29 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5309?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust resolved SPARK-5309. - Resolution: Fixed Fix Version/s: 1.3.0 Issue resolved by pull request 4187 [https:/

[jira] [Assigned] (SPARK-5462) Catalyst UnresolvedException "Invalid call to qualifiers on unresolved object" error when accessing fields in Python DataFrame

2015-01-29 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5462?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen reassigned SPARK-5462: - Assignee: Josh Rosen > Catalyst UnresolvedException "Invalid call to qualifiers on unresolved >

[jira] [Closed] (SPARK-5429) Can't generate Hive golden answer on Hive 0.13.1

2015-01-29 Thread Michael Armbrust (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5429?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael Armbrust closed SPARK-5429. --- Resolution: Fixed Fix Version/s: 1.3.0 Assignee: Liang-Chi Hsieh > Can't gener

[jira] [Created] (SPARK-5490) KMeans costs can be incorrect if tasks need to be rerun

2015-01-29 Thread Sandy Ryza (JIRA)
Sandy Ryza created SPARK-5490: - Summary: KMeans costs can be incorrect if tasks need to be rerun Key: SPARK-5490 URL: https://issues.apache.org/jira/browse/SPARK-5490 Project: Spark Issue Type: B

[jira] [Created] (SPARK-5489) KMeans clustering java.lang.NoSuchMethodError: scala.runtime.IntRef.create (I)Lscala/runtime/IntRef;

2015-01-29 Thread DeepakVohra (JIRA)
DeepakVohra created SPARK-5489: -- Summary: KMeans clustering java.lang.NoSuchMethodError: scala.runtime.IntRef.create (I)Lscala/runtime/IntRef; Key: SPARK-5489 URL: https://issues.apache.org/jira/browse/SPARK-5489

[jira] [Commented] (SPARK-5483) java.lang.NoSuchMethodError: scala.Predef$.ArrowAssoc(Ljava/lang/Object;)Ljava/lang/Object;

2015-01-29 Thread DeepakVohra (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5483?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14297886#comment-14297886 ] DeepakVohra commented on SPARK-5483: Sean, As indicated Spark is compiled with Scala

[jira] [Commented] (SPARK-5486) Add validate function for BlockMatrix

2015-01-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5486?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14297859#comment-14297859 ] Apache Spark commented on SPARK-5486: - User 'brkyvz' has created a pull request for th

[jira] [Reopened] (SPARK-603) add simple Counter API

2015-01-29 Thread Sandy Ryza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-603?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sandy Ryza reopened SPARK-603: -- > add simple Counter API > -- > > Key: SPARK-603 > URL: h

[jira] [Commented] (SPARK-5464) Calling help() on a Python DataFrame fails with "cannot resolve column name __name__" error

2015-01-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5464?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14297842#comment-14297842 ] Apache Spark commented on SPARK-5464: - User 'JoshRosen' has created a pull request for

[jira] [Closed] (SPARK-3888) Limit the memory used by python worker

2015-01-29 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3888?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu closed SPARK-3888. - Resolution: Won't Fix > Limit the memory used by python worker > -- >

[jira] [Updated] (SPARK-4939) Python updateStateByKey example hang in local mode

2015-01-29 Thread Davies Liu (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4939?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Davies Liu updated SPARK-4939: -- Affects Version/s: (was: 1.2.0) > Python updateStateByKey example hang in local mode > -

[jira] [Updated] (SPARK-5151) Parquet Predicate Pushdown Does Not Work with Nested Structures.

2015-01-29 Thread Sandy Ryza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5151?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sandy Ryza updated SPARK-5151: -- Component/s: (was: Spark Core) > Parquet Predicate Pushdown Does Not Work with Nested Structures. >

[jira] [Updated] (SPARK-5151) Parquet Predicate Pushdown Does Not Work with Nested Structures.

2015-01-29 Thread Sandy Ryza (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5151?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sandy Ryza updated SPARK-5151: -- Component/s: SQL > Parquet Predicate Pushdown Does Not Work with Nested Structures. > --

[jira] [Assigned] (SPARK-5464) Calling help() on a Python DataFrame fails with "cannot resolve column name __name__" error

2015-01-29 Thread Josh Rosen (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5464?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Rosen reassigned SPARK-5464: - Assignee: Josh Rosen > Calling help() on a Python DataFrame fails with "cannot resolve column nam

[jira] [Commented] (SPARK-5445) Make sure DataFrame expressions are usable in Java

2015-01-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5445?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14297714#comment-14297714 ] Apache Spark commented on SPARK-5445: - User 'rxin' has created a pull request for this

[jira] [Commented] (SPARK-5192) Parquet fails to parse schema contains '\r'

2015-01-29 Thread Rekha Joshi (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5192?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14297710#comment-14297710 ] Rekha Joshi commented on SPARK-5192: I have made a parquet patch on it.thanks > Parqu

[jira] [Created] (SPARK-5488) SPARK_LOCAL_IP not read by mesos scheduler

2015-01-29 Thread Martin Tapp (JIRA)
Martin Tapp created SPARK-5488: -- Summary: SPARK_LOCAL_IP not read by mesos scheduler Key: SPARK-5488 URL: https://issues.apache.org/jira/browse/SPARK-5488 Project: Spark Issue Type: Bug

[jira] [Commented] (SPARK-5461) Graph should have isCheckpointed, getCheckpointFiles methods

2015-01-29 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5461?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14297663#comment-14297663 ] Joseph K. Bradley commented on SPARK-5461: -- That sounds great if partitionsRDD ca

[jira] [Resolved] (SPARK-5466) Build Error caused by Guava shading in Spark

2015-01-29 Thread Patrick Wendell (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5466?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Wendell resolved SPARK-5466. Resolution: Fixed Fix Version/s: 1.3.0 Assignee: Marcelo Vanzin Thanks [~van

[jira] [Commented] (SPARK-5021) GaussianMixtureEM should be faster for SparseVector input

2015-01-29 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5021?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14297652#comment-14297652 ] Joseph K. Bradley commented on SPARK-5021: -- You can also generate the documentati

[jira] [Updated] (SPARK-5021) GaussianMixtureEM should be faster for SparseVector input

2015-01-29 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5021?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-5021: - Affects Version/s: (was: 1.2.0) 1.3.0 > GaussianMixtureEM shoul

[jira] [Updated] (SPARK-5400) Rename GaussianMixtureEM to GaussianMixture

2015-01-29 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5400?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-5400: - Assignee: Travis Galoppo > Rename GaussianMixtureEM to GaussianMixture > -

[jira] [Commented] (SPARK-5400) Rename GaussianMixtureEM to GaussianMixture

2015-01-29 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5400?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14297648#comment-14297648 ] Joseph K. Bradley commented on SPARK-5400: -- Thanks! Could you also please change

[jira] [Commented] (SPARK-5021) GaussianMixtureEM should be faster for SparseVector input

2015-01-29 Thread Travis Galoppo (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5021?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14297634#comment-14297634 ] Travis Galoppo commented on SPARK-5021: --- [~josephkb] This ticket is marked as affect

[jira] [Commented] (SPARK-5021) GaussianMixtureEM should be faster for SparseVector input

2015-01-29 Thread Travis Galoppo (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5021?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14297622#comment-14297622 ] Travis Galoppo commented on SPARK-5021: --- [~MechCoder] The documentation for GMM is n

[jira] [Commented] (SPARK-5400) Rename GaussianMixtureEM to GaussianMixture

2015-01-29 Thread Travis Galoppo (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5400?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14297613#comment-14297613 ] Travis Galoppo commented on SPARK-5400: --- Please assign to me and I will make the nam

[jira] [Commented] (SPARK-5322) Add transpose() to BlockMatrix

2015-01-29 Thread Apache Spark (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5322?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14297486#comment-14297486 ] Apache Spark commented on SPARK-5322: - User 'brkyvz' has created a pull request for th

[jira] [Commented] (SPARK-4768) Add Support For Impala Encoded Timestamp (INT96)

2015-01-29 Thread Taiji Okada (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4768?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14297475#comment-14297475 ] Taiji Okada commented on SPARK-4768: [~yhuai], I've uploaded the string_timestamp tarb

[jira] [Updated] (SPARK-4768) Add Support For Impala Encoded Timestamp (INT96)

2015-01-29 Thread Taiji Okada (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4768?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Taiji Okada updated SPARK-4768: --- Attachment: string_timestamp.gz > Add Support For Impala Encoded Timestamp (INT96) > -

[jira] [Comment Edited] (SPARK-5487) Dockerfile to build spark's custom akka.

2015-01-29 Thread jay vyas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5487?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14297458#comment-14297458 ] jay vyas edited comment on SPARK-5487 at 1/29/15 7:57 PM: -- To rep

[jira] [Commented] (SPARK-5487) Dockerfile to build spark's custom akka.

2015-01-29 Thread jay vyas (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-5487?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14297458#comment-14297458 ] jay vyas commented on SPARK-5487: - To reproduce this, you can use the following dockerfile

  1   2   >