[jira] [Created] (SPARK-4396) Support lookup by index in Rating

2014-11-13 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-4396: Summary: Support lookup by index in Rating Key: SPARK-4396 URL: https://issues.apache.org/jira/browse/SPARK-4396 Project: Spark Issue Type: New Feature

[jira] [Commented] (SPARK-4348) pyspark.mllib.random conflicts with random module

2014-11-13 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4348?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14211768#comment-14211768 ] Xiangrui Meng commented on SPARK-4348: -- Note that after this fix, it is very likely

[jira] [Commented] (SPARK-3080) ArrayIndexOutOfBoundsException in ALS for Large datasets

2014-11-13 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3080?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14211898#comment-14211898 ] Xiangrui Meng commented on SPARK-3080: -- I see. If the procedure of sample negatives

[jira] [Created] (SPARK-4398) Specialize rdd.parallelize for xrange

2014-11-14 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-4398: Summary: Specialize rdd.parallelize for xrange Key: SPARK-4398 URL: https://issues.apache.org/jira/browse/SPARK-4398 Project: Spark Issue Type: Improvement

[jira] [Resolved] (SPARK-4398) Specialize rdd.parallelize for xrange

2014-11-14 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4398?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-4398. -- Resolution: Fixed Fix Version/s: 1.2.0 Issue resolved by pull request 3264

[jira] [Comment Edited] (SPARK-3080) ArrayIndexOutOfBoundsException in ALS for Large datasets

2014-11-14 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3080?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14212815#comment-14212815 ] Xiangrui Meng edited comment on SPARK-3080 at 11/14/14 8:56 PM:

[jira] [Commented] (SPARK-3080) ArrayIndexOutOfBoundsException in ALS for Large datasets

2014-11-14 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3080?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14212815#comment-14212815 ] Xiangrui Meng commented on SPARK-3080: -- Thanks for the confirmation! If [~ilganeli]

[jira] [Updated] (SPARK-3080) ArrayIndexOutOfBoundsException in ALS for Large datasets

2014-11-14 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3080?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-3080: - Assignee: Xiangrui Meng ArrayIndexOutOfBoundsException in ALS for Large datasets

[jira] [Created] (SPARK-4433) Racing condition in zipWithIndex

2014-11-15 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-4433: Summary: Racing condition in zipWithIndex Key: SPARK-4433 URL: https://issues.apache.org/jira/browse/SPARK-4433 Project: Spark Issue Type: Bug

[jira] [Updated] (SPARK-4422) In some cases, Vectors.fromBreeze get wrong results.

2014-11-16 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4422?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-4422: - Assignee: Guoqiang Li In some cases, Vectors.fromBreeze get wrong results.

[jira] [Resolved] (SPARK-4422) In some cases, Vectors.fromBreeze get wrong results.

2014-11-16 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4422?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-4422. -- Resolution: Fixed Fix Version/s: 1.2.0 Issue resolved by pull request 3281

[jira] [Updated] (SPARK-4422) In some cases, Vectors.fromBreeze get wrong results.

2014-11-16 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4422?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-4422: - Priority: Minor (was: Critical) In some cases, Vectors.fromBreeze get wrong results.

[jira] [Updated] (SPARK-4422) In some cases, Vectors.fromBreeze get wrong results.

2014-11-16 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4422?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-4422: - Target Version/s: 1.2.0, 1.0.3, 1.1.2 (was: 1.1.0, 1.2.0, 1.3.0) In some cases,

[jira] [Updated] (SPARK-4422) In some cases, Vectors.fromBreeze get wrong results.

2014-11-16 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4422?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-4422: - Affects Version/s: (was: 1.3.0) (was: 1.2.0)

[jira] [Reopened] (SPARK-4422) In some cases, Vectors.fromBreeze get wrong results.

2014-11-16 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4422?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng reopened SPARK-4422: -- Reopened for branch-1.0 and branch-1.1. Changed the priority to minor because `fromBreeze` is

[jira] [Updated] (SPARK-4422) In some cases, Vectors.fromBreeze get wrong results.

2014-11-16 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4422?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-4422: - Affects Version/s: 1.2.0 In some cases, Vectors.fromBreeze get wrong results.

[jira] [Updated] (SPARK-4435) Add setThreshold in Python LogisticRegressionModel and SVMModel

2014-11-17 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4435?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-4435: - Assignee: Davies Liu Add setThreshold in Python LogisticRegressionModel and SVMModel

[jira] [Updated] (SPARK-4439) Expose RandomForest in Python

2014-11-17 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4439?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-4439: - Assignee: Davies Liu Expose RandomForest in Python -

[jira] [Updated] (SPARK-4306) LogisticRegressionWithLBFGS support for PySpark MLlib

2014-11-17 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4306?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-4306: - Assignee: Davies Liu (was: Varadharajan) LogisticRegressionWithLBFGS support for PySpark MLlib

[jira] [Updated] (SPARK-4406) SVD should check for k 1

2014-11-17 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4406?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-4406: - Target Version/s: 1.2.0 SVD should check for k 1 --

[jira] [Updated] (SPARK-4431) Implement efficient activeIterator for dense and sparse vector

2014-11-17 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4431?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-4431: - Target Version/s: 1.2.0 Implement efficient activeIterator for dense and sparse vector

[jira] [Updated] (SPARK-4431) Implement efficient activeIterator for dense and sparse vector

2014-11-17 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4431?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-4431: - Assignee: DB Tsai Implement efficient activeIterator for dense and sparse vector

[jira] [Updated] (SPARK-4405) Matrices.* construction methods should check for rows x cols overflow

2014-11-17 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4405?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-4405: - Target Version/s: 1.2.0 Matrices.* construction methods should check for rows x cols overflow

[jira] [Commented] (SPARK-4288) Add Sparse Autoencoder algorithm to MLlib

2014-11-17 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4288?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14215479#comment-14215479 ] Xiangrui Meng commented on SPARK-4288: -- The implementation of neural network requires

[jira] [Commented] (SPARK-4127) Streaming Linear Regression- Python bindings

2014-11-17 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4127?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14215489#comment-14215489 ] Xiangrui Meng commented on SPARK-4127: -- [~slcclimber] I think you need to call

[jira] [Comment Edited] (SPARK-4127) Streaming Linear Regression- Python bindings

2014-11-17 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4127?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14215489#comment-14215489 ] Xiangrui Meng edited comment on SPARK-4127 at 11/18/14 12:38 AM:

[jira] [Resolved] (SPARK-4435) Add setThreshold in Python LogisticRegressionModel and SVMModel

2014-11-18 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4435?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-4435. -- Issue resolved by pull request 3305 [https://github.com/apache/spark/pull/3305] Add setThreshold

[jira] [Resolved] (SPARK-4396) Support lookup by index in Rating

2014-11-18 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4396?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-4396. -- Resolution: Fixed Fix Version/s: 1.2.0 Issue resolved by pull request 3261

[jira] [Resolved] (SPARK-4306) LogisticRegressionWithLBFGS support for PySpark MLlib

2014-11-18 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4306?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-4306. -- Resolution: Fixed Fix Version/s: 1.2.0 Issue resolved by pull request 3307

[jira] [Resolved] (SPARK-4433) Racing condition in zipWithIndex

2014-11-18 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4433?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-4433. -- Resolution: Fixed Fix Version/s: 1.0.3 1.1.1 1.2.0

[jira] [Updated] (SPARK-4433) Racing condition in zipWithIndex

2014-11-18 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4433?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-4433: - Target Version/s: 1.1.1, 1.2.0, 1.0.3 (was: 1.2.0, 1.0.3, 1.1.2) Racing condition in

[jira] [Updated] (SPARK-4327) Python API for RDD.randomSplit()

2014-11-18 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4327?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-4327: - Assignee: Davies Liu Python API for RDD.randomSplit()

[jira] [Resolved] (SPARK-4327) Python API for RDD.randomSplit()

2014-11-18 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4327?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-4327. -- Resolution: Fixed Fix Version/s: 1.2.0 Issue resolved by pull request 3193

[jira] [Updated] (SPARK-1856) Standardize MLlib interfaces

2014-11-18 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1856?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-1856: - Target Version/s: 1.3.0 (was: 1.2.0) Standardize MLlib interfaces

[jira] [Updated] (SPARK-3702) Standardize MLlib classes for learners, models

2014-11-18 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3702?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-3702: - Target Version/s: 1.3.0 (was: 1.2.0) Standardize MLlib classes for learners, models

[jira] [Created] (SPARK-4486) Improve GradientBoosting APIs and doc

2014-11-19 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-4486: Summary: Improve GradientBoosting APIs and doc Key: SPARK-4486 URL: https://issues.apache.org/jira/browse/SPARK-4486 Project: Spark Issue Type: Improvement

[jira] [Updated] (SPARK-4355) OnlineSummarizer doesn't merge mean correctly

2014-11-19 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4355?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-4355: - Fix Version/s: 1.2.0 OnlineSummarizer doesn't merge mean correctly

[jira] [Resolved] (SPARK-4486) Improve GradientBoosting APIs and doc

2014-11-20 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4486?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-4486. -- Resolution: Fixed Fix Version/s: 1.2.0 Issue resolved by pull request 3374

[jira] [Updated] (SPARK-4486) Improve GradientBoosting APIs and doc

2014-11-20 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4486?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-4486: - Assignee: Xiangrui Meng Improve GradientBoosting APIs and doc

[jira] [Commented] (SPARK-4510) Add k-medoids Partitioning Around Medoids (PAM) algorithm

2014-11-20 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4510?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14219736#comment-14219736 ] Xiangrui Meng commented on SPARK-4510: -- [~fjiang6] Could you explain the complexity

[jira] [Resolved] (SPARK-4439) Expose RandomForest in Python

2014-11-20 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4439?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-4439. -- Resolution: Fixed Fix Version/s: 1.2.0 Issue resolved by pull request 3320

[jira] [Resolved] (SPARK-4477) remove numpy from RDDSampler of PySpark

2014-11-20 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4477?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-4477. -- Resolution: Fixed Fix Version/s: 1.2.0 Issue resolved by pull request 3351

[jira] [Updated] (SPARK-4477) remove numpy from RDDSampler of PySpark

2014-11-20 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4477?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-4477: - Assignee: Davies Liu remove numpy from RDDSampler of PySpark

[jira] [Closed] (SPARK-4531) Cache serialized java objects instead of serialized python objects in MLlib

2014-11-21 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4531?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng closed SPARK-4531. Resolution: Fixed Fix Version/s: 1.2.0 Assignee: Davies Liu Cache serialized java

[jira] [Resolved] (SPARK-4431) Implement efficient activeIterator for dense and sparse vector

2014-11-21 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4431?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-4431. -- Resolution: Fixed Fix Version/s: 1.2.0 Issue resolved by pull request 3288

[jira] [Created] (SPARK-4575) Documentation for the pipeline features

2014-11-24 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-4575: Summary: Documentation for the pipeline features Key: SPARK-4575 URL: https://issues.apache.org/jira/browse/SPARK-4575 Project: Spark Issue Type:

[jira] [Updated] (SPARK-4562) GLM testing time regressions from Spark 1.1

2014-11-24 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4562?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-4562: - Assignee: Davies Liu GLM testing time regressions from Spark 1.1

[jira] [Updated] (SPARK-4121) Master build failures after shading commons-math3

2014-11-24 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4121?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-4121: - Fix Version/s: 1.2.0 Master build failures after shading commons-math3

[jira] [Updated] (SPARK-3189) Add Robust Regression Algorithm with Turkey bisquare weight function (Biweight Estimates)

2014-11-24 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3189?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-3189: - Fix Version/s: (was: 1.2.0) (was: 1.1.1) Add Robust Regression

[jira] [Closed] (SPARK-3820) Specialize columnSimilarity() without any threshold

2014-11-24 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3820?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng closed SPARK-3820. Resolution: Won't Fix Specialize columnSimilarity() without any threshold

[jira] [Reopened] (SPARK-3820) Specialize columnSimilarity() without any threshold

2014-11-24 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3820?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng reopened SPARK-3820: -- Specialize columnSimilarity() without any threshold

[jira] [Updated] (SPARK-3396) Change LogistricRegressionWithSGD's default regType to L2

2014-11-24 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3396?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-3396: - Fix Version/s: 1.2.0 Change LogistricRegressionWithSGD's default regType to L2

[jira] [Resolved] (SPARK-4562) GLM testing time regressions from Spark 1.1

2014-11-24 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4562?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-4562. -- Resolution: Fixed Fix Version/s: 1.2.0 Issue resolved by pull request 3420

[jira] [Updated] (SPARK-4580) Document random forests and boosting in programming guide

2014-11-24 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4580?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-4580: - Assignee: Joseph K. Bradley Document random forests and boosting in programming guide

[jira] [Created] (SPARK-4582) Add getVectors to Word2VecModel

2014-11-24 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-4582: Summary: Add getVectors to Word2VecModel Key: SPARK-4582 URL: https://issues.apache.org/jira/browse/SPARK-4582 Project: Spark Issue Type: New Feature

[jira] [Updated] (SPARK-4582) Add getVectors to Word2VecModel

2014-11-24 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4582?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-4582: - Fix Version/s: 1.2.0 Add getVectors to Word2VecModel ---

[jira] [Updated] (SPARK-3080) ArrayIndexOutOfBoundsException in ALS for Large datasets

2014-11-24 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3080?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-3080: - Target Version/s: 1.3.0 (was: 1.2.0) ArrayIndexOutOfBoundsException in ALS for Large datasets

[jira] [Updated] (SPARK-2206) Automatically infer the number of classification classes in multiclass classification

2014-11-24 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2206?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-2206: - Target Version/s: 1.3.0 (was: 1.2.0) Automatically infer the number of classification classes

[jira] [Updated] (SPARK-3080) ArrayIndexOutOfBoundsException in ALS for Large datasets

2014-11-24 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3080?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-3080: - Affects Version/s: 1.2.0 ArrayIndexOutOfBoundsException in ALS for Large datasets

[jira] [Updated] (SPARK-4577) Python example of LBFGS for MLlib guide

2014-11-24 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4577?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-4577: - Target Version/s: 1.2.0 Python example of LBFGS for MLlib guide

[jira] [Updated] (SPARK-4577) Python example of LBFGS for MLlib guide

2014-11-24 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4577?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-4577: - Issue Type: Improvement (was: Bug) Python example of LBFGS for MLlib guide

[jira] [Updated] (SPARK-4547) OOM when making bins in BinaryClassificationMetrics

2014-11-24 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4547?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-4547: - Assignee: Sean Owen OOM when making bins in BinaryClassificationMetrics

[jira] [Updated] (SPARK-4581) Refactorize StandardScaler to improve the transformation performance

2014-11-24 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4581?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-4581: - Target Version/s: 1.2.0 Assignee: DB Tsai Refactorize StandardScaler to improve the

[jira] [Updated] (SPARK-4547) OOM when making bins in BinaryClassificationMetrics

2014-11-24 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4547?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-4547: - Target Version/s: 1.3.0 OOM when making bins in BinaryClassificationMetrics

[jira] [Updated] (SPARK-4530) GradientDescent get a wrong gradient value according to the gradient formula, which is caused by the miniBatchSize parameter.

2014-11-24 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4530?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-4530: - Assignee: Guoqiang Li GradientDescent get a wrong gradient value according to the gradient

[jira] [Updated] (SPARK-4530) GradientDescent get a wrong gradient value according to the gradient formula, which is caused by the miniBatchSize parameter.

2014-11-24 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4530?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-4530: - Priority: Minor (was: Major) GradientDescent get a wrong gradient value according to the

[jira] [Updated] (SPARK-4530) GradientDescent get a wrong gradient value according to the gradient formula, which is caused by the miniBatchSize parameter.

2014-11-24 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4530?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-4530: - Target Version/s: 1.0.2, 1.2.0, 1.1.2 GradientDescent get a wrong gradient value according to

[jira] [Updated] (SPARK-4510) Add k-medoids Partitioning Around Medoids (PAM) algorithm

2014-11-24 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4510?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-4510: - Assignee: Fan Jiang Add k-medoids Partitioning Around Medoids (PAM) algorithm

[jira] [Updated] (SPARK-4494) IDFModel.transform() add support for single vector

2014-11-24 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4494?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-4494: - Affects Version/s: (was: 1.1.0) 1.1.1 IDFModel.transform() add

[jira] [Updated] (SPARK-4494) IDFModel.transform() add support for single vector

2014-11-24 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4494?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-4494: - Affects Version/s: 1.2.0 IDFModel.transform() add support for single vector

[jira] [Updated] (SPARK-4494) IDFModel.transform() add support for single vector

2014-11-24 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4494?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-4494: - Target Version/s: 1.3.0 (was: 1.1.1) IDFModel.transform() add support for single vector

[jira] [Updated] (SPARK-4409) Additional (but limited) Linear Algebra Utils

2014-11-24 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4409?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-4409: - Priority: Major (was: Minor) Additional (but limited) Linear Algebra Utils

[jira] [Updated] (SPARK-4409) Additional (but limited) Linear Algebra Utils

2014-11-24 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4409?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-4409: - Assignee: Burak Yavuz Additional (but limited) Linear Algebra Utils

[jira] [Updated] (SPARK-4583) GradientBoostedTrees error logging should use loss being minimized

2014-11-24 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4583?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-4583: - Assignee: Joseph K. Bradley GradientBoostedTrees error logging should use loss being minimized

[jira] [Resolved] (SPARK-4582) Add getVectors to Word2VecModel

2014-11-24 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4582?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-4582. -- Resolution: Fixed Issue resolved by pull request 3437

[jira] [Updated] (SPARK-4582) Add getVectors to Word2VecModel

2014-11-24 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4582?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-4582: - Assignee: Tobias Kässmann Add getVectors to Word2VecModel ---

[jira] [Updated] (SPARK-3188) Add Robust Regression Algorithm with Tukey bisquare weight function (Biweight Estimates)

2014-11-24 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3188?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-3188: - Assignee: Fan Jiang Add Robust Regression Algorithm with Tukey bisquare weight function

[jira] [Updated] (SPARK-4494) IDFModel.transform() add support for single vector

2014-11-24 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4494?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-4494: - Priority: Minor (was: Major) IDFModel.transform() add support for single vector

[jira] [Updated] (SPARK-4156) Add expectation maximization for Gaussian mixture models to MLLib clustering

2014-11-24 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4156?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-4156: - Priority: Major (was: Minor) Add expectation maximization for Gaussian mixture models to MLLib

[jira] [Updated] (SPARK-4251) Add Restricted Boltzmann machine(RBM) algorithm to MLlib

2014-11-24 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4251?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-4251: - Target Version/s: (was: 1.3.0) Add Restricted Boltzmann machine(RBM) algorithm to MLlib

[jira] [Created] (SPARK-4586) Python API for ML Pipeline

2014-11-24 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-4586: Summary: Python API for ML Pipeline Key: SPARK-4586 URL: https://issues.apache.org/jira/browse/SPARK-4586 Project: Spark Issue Type: New Feature

[jira] [Updated] (SPARK-1406) PMML model evaluation support via MLib

2014-11-24 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-1406?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-1406: - Assignee: Vincenzo Selvaggio PMML model evaluation support via MLib

[jira] [Created] (SPARK-4587) Model export/import

2014-11-24 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-4587: Summary: Model export/import Key: SPARK-4587 URL: https://issues.apache.org/jira/browse/SPARK-4587 Project: Spark Issue Type: New Feature

[jira] [Updated] (SPARK-3717) DecisionTree, RandomForest: Partition by feature

2014-11-24 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3717?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-3717: - Assignee: Joseph K. Bradley DecisionTree, RandomForest: Partition by feature

[jira] [Updated] (SPARK-3717) DecisionTree, RandomForest: Partition by feature

2014-11-24 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3717?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-3717: - Target Version/s: 1.3.0 DecisionTree, RandomForest: Partition by feature

[jira] [Commented] (SPARK-3588) Gaussian Mixture Model clustering

2014-11-24 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3588?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14224042#comment-14224042 ] Xiangrui Meng commented on SPARK-3588: -- [~MeethuMathew] Just want to check with you

[jira] [Created] (SPARK-4588) Add API for feature attributes

2014-11-24 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-4588: Summary: Add API for feature attributes Key: SPARK-4588 URL: https://issues.apache.org/jira/browse/SPARK-4588 Project: Spark Issue Type: Sub-task

[jira] [Created] (SPARK-4589) ML add-ons to SchemaRDD

2014-11-24 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-4589: Summary: ML add-ons to SchemaRDD Key: SPARK-4589 URL: https://issues.apache.org/jira/browse/SPARK-4589 Project: Spark Issue Type: New Feature

[jira] [Created] (SPARK-4590) Early investigation of parameter server

2014-11-24 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-4590: Summary: Early investigation of parameter server Key: SPARK-4590 URL: https://issues.apache.org/jira/browse/SPARK-4590 Project: Spark Issue Type:

[jira] [Created] (SPARK-4591) Add algorithm/model wrappers in spark.ml to adapt the new API

2014-11-24 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-4591: Summary: Add algorithm/model wrappers in spark.ml to adapt the new API Key: SPARK-4591 URL: https://issues.apache.org/jira/browse/SPARK-4591 Project: Spark

[jira] [Commented] (SPARK-3588) Gaussian Mixture Model clustering

2014-11-25 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-3588?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14224188#comment-14224188 ] Xiangrui Meng commented on SPARK-3588: -- Since [~tgaloppo] already submitted a PR, we

[jira] [Assigned] (SPARK-4509) Revert EC2 tag-based cluster membership patch in branch-1.2

2014-11-25 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4509?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng reassigned SPARK-4509: Assignee: Xiangrui Meng Revert EC2 tag-based cluster membership patch in branch-1.2

[jira] [Updated] (SPARK-4596) Refactorize Normalizer to make code cleaner

2014-11-25 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4596?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-4596: - Assignee: DB Tsai Refactorize Normalizer to make code cleaner

[jira] [Resolved] (SPARK-4596) Refactorize Normalizer to make code cleaner

2014-11-25 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4596?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng resolved SPARK-4596. -- Resolution: Fixed Fix Version/s: 1.2.0 Issue resolved by pull request 3446

[jira] [Updated] (SPARK-4530) GradientDescent get a wrong gradient value according to the gradient formula, which is caused by the miniBatchSize parameter.

2014-11-25 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4530?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-4530: - Fix Version/s: 1.2.0 GradientDescent get a wrong gradient value according to the gradient

[jira] [Commented] (SPARK-4530) GradientDescent get a wrong gradient value according to the gradient formula, which is caused by the miniBatchSize parameter.

2014-11-25 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4530?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14224312#comment-14224312 ] Xiangrui Meng commented on SPARK-4530: -- PR: https://github.com/apache/spark/pull/3399

[jira] [Created] (SPARK-4604) Make MatrixFactorizationModel constructor public

2014-11-25 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-4604: Summary: Make MatrixFactorizationModel constructor public Key: SPARK-4604 URL: https://issues.apache.org/jira/browse/SPARK-4604 Project: Spark Issue Type:

[jira] [Commented] (SPARK-2495) Ability to re-create ML models

2014-11-25 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-2495?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14224970#comment-14224970 ] Xiangrui Meng commented on SPARK-2495: -- I created SPARK-4604 for

[jira] [Updated] (SPARK-4611) Implement the efficient vector norm

2014-11-25 Thread Xiangrui Meng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4611?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiangrui Meng updated SPARK-4611: - Assignee: DB Tsai Implement the efficient vector norm ---

[jira] [Created] (SPARK-4614) Slight API changes in Matrix and Matrices

2014-11-25 Thread Xiangrui Meng (JIRA)
Xiangrui Meng created SPARK-4614: Summary: Slight API changes in Matrix and Matrices Key: SPARK-4614 URL: https://issues.apache.org/jira/browse/SPARK-4614 Project: Spark Issue Type:

<    4   5   6   7   8   9   10   11   12   13   >