[jira] [Commented] (SPARK-7008) An Implement of Factorization Machine (LibFM)

2015-04-20 Thread zhengruifeng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7008?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14504114#comment-14504114 ] zhengruifeng commented on SPARK-7008: - thanks for this information! An Implement of

[jira] [Updated] (SPARK-7008) An implementation of Factorization Machine (LibFM)

2015-04-21 Thread zhengruifeng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7008?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengruifeng updated SPARK-7008: Summary: An implementation of Factorization Machine (LibFM) (was: An Implement of Factorization

[jira] [Updated] (SPARK-7008) An Implement of Factorization Machine (LibFM)

2015-04-20 Thread zhengruifeng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7008?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengruifeng updated SPARK-7008: Description: An implement of Factorization Machines based on Scala and Spark MLlib. Factorization

[jira] [Updated] (SPARK-7008) Implement of Factorization Machine (LibFM)

2015-04-20 Thread zhengruifeng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7008?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengruifeng updated SPARK-7008: Description: An implementation of Factorization Machines based on Scala and Spark MLlib.

[jira] [Updated] (SPARK-7008) Implement of Factorization Machine (LibFM)

2015-04-20 Thread zhengruifeng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7008?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengruifeng updated SPARK-7008: Labels: features patch (was: features) Implement of Factorization Machine (LibFM)

[jira] [Updated] (SPARK-7008) Implement of Factorization Machine (LibFM)

2015-04-20 Thread zhengruifeng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7008?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengruifeng updated SPARK-7008: Affects Version/s: 1.3.2 1.3.1 Implement of Factorization Machine (LibFM)

[jira] [Updated] (SPARK-7008) Implement of Factorization Machine (LibFM)

2015-04-20 Thread zhengruifeng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7008?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengruifeng updated SPARK-7008: Target Version/s: 1.3.0, 1.3.1, 1.3.2 (was: 1.3.0) Implement of Factorization Machine (LibFM)

[jira] [Created] (SPARK-7008) Implement of Factorization Machine (LibFM)

2015-04-20 Thread zhengruifeng (JIRA)
zhengruifeng created SPARK-7008: --- Summary: Implement of Factorization Machine (LibFM) Key: SPARK-7008 URL: https://issues.apache.org/jira/browse/SPARK-7008 Project: Spark Issue Type: New

[jira] [Commented] (SPARK-7008) An implementation of Factorization Machine (LibFM)

2015-04-21 Thread zhengruifeng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7008?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14504596#comment-14504596 ] zhengruifeng commented on SPARK-7008: - I had not considered of the size of model,

[jira] [Updated] (SPARK-7008) An implementation of Factorization Machine (LibFM)

2015-04-24 Thread zhengruifeng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7008?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengruifeng updated SPARK-7008: Description: An implementation of Factorization Machines based on Scala and Spark MLlib. FM is a

[jira] [Comment Edited] (SPARK-7008) An implementation of Factorization Machine (LibFM)

2015-04-24 Thread zhengruifeng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7008?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14512110#comment-14512110 ] zhengruifeng edited comment on SPARK-7008 at 4/25/15 12:46 AM:

[jira] [Comment Edited] (SPARK-7008) An implementation of Factorization Machine (LibFM)

2015-04-24 Thread zhengruifeng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7008?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14512110#comment-14512110 ] zhengruifeng edited comment on SPARK-7008 at 4/25/15 12:44 AM:

[jira] [Commented] (SPARK-7008) An implementation of Factorization Machine (LibFM)

2015-04-24 Thread zhengruifeng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7008?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14512110#comment-14512110 ] zhengruifeng commented on SPARK-7008: - The convergence curves of Binary Classification

[jira] [Updated] (SPARK-7008) An implementation of Factorization Machine (LibFM)

2015-04-24 Thread zhengruifeng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7008?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengruifeng updated SPARK-7008: Attachment: FM_CR.xlsx An implementation of Factorization Machine (LibFM)

[jira] [Commented] (SPARK-7008) An implementation of Factorization Machine (LibFM)

2015-04-27 Thread zhengruifeng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7008?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14513780#comment-14513780 ] zhengruifeng commented on SPARK-7008: - AdaGrad works pretty well in practice, but I

[jira] [Closed] (SPARK-7008) An implementation of Factorization Machine (LibFM)

2015-05-06 Thread zhengruifeng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7008?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengruifeng closed SPARK-7008. --- Resolution: Fixed An implementation of Factorization Machine (LibFM)

[jira] [Commented] (SPARK-11585) AssociationRules should generates all association rules with consequents of arbitrary length

2015-11-08 Thread zhengruifeng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11585?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14996174#comment-14996174 ] zhengruifeng commented on SPARK-11585: -- I have implemented it based on Apriori's Rule-Generation

[jira] [Comment Edited] (SPARK-11585) AssociationRules should generates all association rules with consequents of arbitrary length

2015-11-09 Thread zhengruifeng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11585?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14996174#comment-14996174 ] zhengruifeng edited comment on SPARK-11585 at 11/9/15 8:11 AM: --- I have

[jira] [Created] (SPARK-11585) AssociationRules should generates all association rules with consequents of arbitrary length

2015-11-08 Thread zhengruifeng (JIRA)
zhengruifeng created SPARK-11585: Summary: AssociationRules should generates all association rules with consequents of arbitrary length Key: SPARK-11585 URL: https://issues.apache.org/jira/browse/SPARK-11585

[jira] [Updated] (SPARK-11585) AssociationRules should generates all association rules with consequents of arbitrary length

2015-11-09 Thread zhengruifeng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11585?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengruifeng updated SPARK-11585: - Attachment: rule-generation.pdf Apriori's Rule Generation Algorithm > AssociationRules should

[jira] [Commented] (SPARK-7008) An implementation of Factorization Machine (LibFM)

2015-07-10 Thread zhengruifeng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-7008?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14621830#comment-14621830 ] zhengruifeng commented on SPARK-7008: - Yes, LBFGS provide a faster convergence rate.

[jira] [Created] (SPARK-15770) 'Experimental' annotation audit

2016-06-04 Thread zhengruifeng (JIRA)
zhengruifeng created SPARK-15770: Summary: 'Experimental' annotation audit Key: SPARK-15770 URL: https://issues.apache.org/jira/browse/SPARK-15770 Project: Spark Issue Type: Improvement

[jira] [Updated] (SPARK-15770) 'Experimental' annotation audit

2016-06-04 Thread zhengruifeng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15770?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengruifeng updated SPARK-15770: - Description: 1, remove comments {{:: Experimental ::}} for non-experimental API 2, add comments

[jira] [Updated] (SPARK-15770) annotation audit for Experimental and DeveloperApi

2016-06-04 Thread zhengruifeng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15770?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengruifeng updated SPARK-15770: - Description: 1, remove comments {{:: Experimental ::}} for non-experimental API 2, add comments

[jira] [Updated] (SPARK-15770) annotation audit for Experimental and DeveloperApi

2016-06-04 Thread zhengruifeng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15770?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengruifeng updated SPARK-15770: - Description: 1, remove comments {{:: Experimental ::}} for non-experimental API 2, add comments

[jira] [Updated] (SPARK-15770) annotation audit for Experimental and DeveloperApi

2016-06-04 Thread zhengruifeng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15770?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengruifeng updated SPARK-15770: - Summary: annotation audit for Experimental and DeveloperApi (was: 'Experimental' annotation

[jira] [Comment Edited] (SPARK-15823) Add @property for 'accuracy' in MulticlassMetrics

2016-06-09 Thread zhengruifeng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15823?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15322308#comment-15322308 ] zhengruifeng edited comment on SPARK-15823 at 6/9/16 10:20 AM: ---

[jira] [Commented] (SPARK-15823) Add @property for 'accuracy' in MulticlassMetrics

2016-06-09 Thread zhengruifeng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15823?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15322309#comment-15322309 ] zhengruifeng commented on SPARK-15823: -- {MulticlassMetrics.confusionMatrix} may need {@property}

[jira] [Issue Comment Deleted] (SPARK-15823) Add @property for 'accuracy' in MulticlassMetrics

2016-06-09 Thread zhengruifeng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15823?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengruifeng updated SPARK-15823: - Comment: was deleted (was: {MulticlassMetrics.confusionMatrix} may need {@property} too, but I

[jira] [Commented] (SPARK-15823) Add @property for 'accuracy' in MulticlassMetrics

2016-06-09 Thread zhengruifeng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15823?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15322308#comment-15322308 ] zhengruifeng commented on SPARK-15823: -- {MulticlassMetrics.confusionMatrix} may need {@property}

[jira] [Updated] (SPARK-15823) Add @property for 'accuracy' in MulticlassMetrics

2016-06-09 Thread zhengruifeng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15823?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengruifeng updated SPARK-15823: - Summary: Add @property for 'accuracy' in MulticlassMetrics (was: Add @property for 'property'

[jira] [Created] (SPARK-15823) Add @property for 'property' in MulticlassMetrics

2016-06-08 Thread zhengruifeng (JIRA)
zhengruifeng created SPARK-15823: Summary: Add @property for 'property' in MulticlassMetrics Key: SPARK-15823 URL: https://issues.apache.org/jira/browse/SPARK-15823 Project: Spark Issue

[jira] [Commented] (SPARK-15617) Clarify that fMeasure in MulticlassMetrics and MulticlassClassificationEvaluator is "micro" f1_score

2016-05-27 Thread zhengruifeng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15617?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15305089#comment-15305089 ] zhengruifeng commented on SPARK-15617: -- I can work on this > Clarify that fMeasure in

[jira] [Commented] (SPARK-15617) Clarify that fMeasure in MulticlassMetrics and MulticlassClassificationEvaluator is "micro" f1_score

2016-05-27 Thread zhengruifeng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15617?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15305086#comment-15305086 ] zhengruifeng commented on SPARK-15617: --

[jira] [Commented] (SPARK-15581) MLlib 2.1 Roadmap

2016-05-28 Thread zhengruifeng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15581?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15305239#comment-15305239 ] zhengruifeng commented on SPARK-15581: -- In regard to gbt, xgboost4j may be involved > MLlib 2.1

[jira] [Created] (SPARK-15939) Clarify ml.linalg usage

2016-06-14 Thread zhengruifeng (JIRA)
zhengruifeng created SPARK-15939: Summary: Clarify ml.linalg usage Key: SPARK-15939 URL: https://issues.apache.org/jira/browse/SPARK-15939 Project: Spark Issue Type: Improvement

[jira] [Updated] (SPARK-15939) Clarify ml.linalg usage

2016-06-14 Thread zhengruifeng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15939?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengruifeng updated SPARK-15939: - Description: 1, update comments in {{pyspark.ml}} that it use {{ml.linalg}} not

[jira] [Updated] (SPARK-15939) Clarify ml.linalg usage

2016-06-14 Thread zhengruifeng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15939?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengruifeng updated SPARK-15939: - Description: 1, update comments in {{pyspark.ml}} that it use {ml.linalg} not {mllib.linalg} 2,

[jira] [Created] (SPARK-15650) Add correctness test for MulticlassClassification

2016-05-30 Thread zhengruifeng (JIRA)
zhengruifeng created SPARK-15650: Summary: Add correctness test for MulticlassClassification Key: SPARK-15650 URL: https://issues.apache.org/jira/browse/SPARK-15650 Project: Spark Issue

[jira] [Commented] (SPARK-15614) ml.feature should support default value of input column

2016-05-30 Thread zhengruifeng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15614?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15306419#comment-15306419 ] zhengruifeng commented on SPARK-15614: -- Agreed. What about setting the default value of

[jira] [Updated] (SPARK-15650) Add correctness test for MulticlassClassificationEvaluator

2016-05-30 Thread zhengruifeng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15650?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengruifeng updated SPARK-15650: - Summary: Add correctness test for MulticlassClassificationEvaluator (was: Add correctness test

[jira] [Closed] (SPARK-15291) Remove redundant codes in SVD++

2016-05-27 Thread zhengruifeng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15291?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengruifeng closed SPARK-15291. Resolution: Won't Fix > Remove redundant codes in SVD++ > --- > >

[jira] [Closed] (SPARK-15607) Remove redundant toArray in ml.linalg

2016-05-27 Thread zhengruifeng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15607?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengruifeng closed SPARK-15607. Resolution: Won't Fix > Remove redundant toArray in ml.linalg >

[jira] [Updated] (SPARK-15610) update error message for k in pca

2016-05-27 Thread zhengruifeng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15610?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengruifeng updated SPARK-15610: - Summary: update error message for k in pca (was: PCA should not support k == numFeatures) >

[jira] [Updated] (SPARK-15610) PCA should not support k == numFeatures

2016-05-27 Thread zhengruifeng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15610?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengruifeng updated SPARK-15610: - Priority: Minor (was: Major) > PCA should not support k == numFeatures >

[jira] [Updated] (SPARK-15610) update error message for k in pca

2016-05-27 Thread zhengruifeng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15610?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengruifeng updated SPARK-15610: - Description: error message for {{k}} should match the bound (was: Vector size must be greater

[jira] [Created] (SPARK-15607) Remove redundant toArray in ml.linalg

2016-05-27 Thread zhengruifeng (JIRA)
zhengruifeng created SPARK-15607: Summary: Remove redundant toArray in ml.linalg Key: SPARK-15607 URL: https://issues.apache.org/jira/browse/SPARK-15607 Project: Spark Issue Type:

[jira] [Updated] (SPARK-15614) ml.feature should support default value of input column

2016-05-27 Thread zhengruifeng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15614?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengruifeng updated SPARK-15614: - Priority: Minor (was: Major) > ml.feature should support default value of input column >

[jira] [Created] (SPARK-15614) ml.feature should support default value of input column

2016-05-27 Thread zhengruifeng (JIRA)
zhengruifeng created SPARK-15614: Summary: ml.feature should support default value of input column Key: SPARK-15614 URL: https://issues.apache.org/jira/browse/SPARK-15614 Project: Spark

[jira] [Commented] (SPARK-15614) ml.feature should support default value of input column

2016-05-27 Thread zhengruifeng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15614?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15303971#comment-15303971 ] zhengruifeng commented on SPARK-15614: -- [~josephkb] [~mengxr] [~yanboliang] any thoughts? >

[jira] [Comment Edited] (SPARK-15614) ml.feature should support default value of input column

2016-05-27 Thread zhengruifeng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15614?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15303971#comment-15303971 ] zhengruifeng edited comment on SPARK-15614 at 5/27/16 11:54 AM:

[jira] [Created] (SPARK-15610) PCA should not support k == numFeatures

2016-05-27 Thread zhengruifeng (JIRA)
zhengruifeng created SPARK-15610: Summary: PCA should not support k == numFeatures Key: SPARK-15610 URL: https://issues.apache.org/jira/browse/SPARK-15610 Project: Spark Issue Type: Bug

[jira] [Commented] (SPARK-15617) Clarify that fMeasure in MulticlassMetrics and MulticlassClassificationEvaluator is "micro" f1_score

2016-06-01 Thread zhengruifeng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15617?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15311752#comment-15311752 ] zhengruifeng commented on SPARK-15617: -- Agreed. In {{MulticlassClassificationEvaluator}}, I will

[jira] [Created] (SPARK-13435) Add Weighted Cohen's kappa to MulticlassMetrics

2016-02-22 Thread zhengruifeng (JIRA)
zhengruifeng created SPARK-13435: Summary: Add Weighted Cohen's kappa to MulticlassMetrics Key: SPARK-13435 URL: https://issues.apache.org/jira/browse/SPARK-13435 Project: Spark Issue Type:

[jira] [Created] (SPARK-13506) Fix the wrong parameter in R code comment in AssociationRulesSuite

2016-02-26 Thread zhengruifeng (JIRA)
zhengruifeng created SPARK-13506: Summary: Fix the wrong parameter in R code comment in AssociationRulesSuite Key: SPARK-13506 URL: https://issues.apache.org/jira/browse/SPARK-13506 Project: Spark

[jira] [Created] (SPARK-13538) Add GaussianMixture to ML

2016-02-28 Thread zhengruifeng (JIRA)
zhengruifeng created SPARK-13538: Summary: Add GaussianMixture to ML Key: SPARK-13538 URL: https://issues.apache.org/jira/browse/SPARK-13538 Project: Spark Issue Type: Improvement

[jira] [Created] (SPARK-13550) Add java example for ml.clustering.BisectingKMeans

2016-02-29 Thread zhengruifeng (JIRA)
zhengruifeng created SPARK-13550: Summary: Add java example for ml.clustering.BisectingKMeans Key: SPARK-13550 URL: https://issues.apache.org/jira/browse/SPARK-13550 Project: Spark Issue

[jira] [Created] (SPARK-13551) Fix fix wrong comment and remove meanless lines in mllib.JavaBisectingKMeansExample

2016-02-29 Thread zhengruifeng (JIRA)
zhengruifeng created SPARK-13551: Summary: Fix fix wrong comment and remove meanless lines in mllib.JavaBisectingKMeansExample Key: SPARK-13551 URL: https://issues.apache.org/jira/browse/SPARK-13551

[jira] [Commented] (SPARK-13435) Add Weighted Cohen's kappa to MulticlassMetrics

2016-02-22 Thread zhengruifeng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13435?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15157015#comment-15157015 ] zhengruifeng commented on SPARK-13435: -- I dont think so. Recently, many Competitions use quadratic

[jira] [Created] (SPARK-13385) Enable AssociationRules to generate consequents with user-defined lengths

2016-02-18 Thread zhengruifeng (JIRA)
zhengruifeng created SPARK-13385: Summary: Enable AssociationRules to generate consequents with user-defined lengths Key: SPARK-13385 URL: https://issues.apache.org/jira/browse/SPARK-13385 Project:

[jira] [Updated] (SPARK-13385) Enable AssociationRules to generate consequents with user-defined lengths

2016-02-18 Thread zhengruifeng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13385?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengruifeng updated SPARK-13385: - Attachment: rule-generation.pdf rule-generation algorithm > Enable AssociationRules to generate

[jira] [Created] (SPARK-13386) ConnectedComponents should support maxIteration option

2016-02-19 Thread zhengruifeng (JIRA)
zhengruifeng created SPARK-13386: Summary: ConnectedComponents should support maxIteration option Key: SPARK-13386 URL: https://issues.apache.org/jira/browse/SPARK-13386 Project: Spark Issue

[jira] [Created] (SPARK-13416) Add positive check for option 'numIter' in StronglyConnectedComponents

2016-02-20 Thread zhengruifeng (JIRA)
zhengruifeng created SPARK-13416: Summary: Add positive check for option 'numIter' in StronglyConnectedComponents Key: SPARK-13416 URL: https://issues.apache.org/jira/browse/SPARK-13416 Project:

[jira] [Created] (SPARK-13814) Delete unnecessary imports in python examples files

2016-03-10 Thread zhengruifeng (JIRA)
zhengruifeng created SPARK-13814: Summary: Delete unnecessary imports in python examples files Key: SPARK-13814 URL: https://issues.apache.org/jira/browse/SPARK-13814 Project: Spark Issue

[jira] [Created] (SPARK-13816) Add parameter checks for algorithms in Graphx

2016-03-11 Thread zhengruifeng (JIRA)
zhengruifeng created SPARK-13816: Summary: Add parameter checks for algorithms in Graphx Key: SPARK-13816 URL: https://issues.apache.org/jira/browse/SPARK-13816 Project: Spark Issue Type:

[jira] [Commented] (SPARK-14005) Make RDD more compatible with Scala's collection

2016-03-19 Thread zhengruifeng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14005?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15202658#comment-15202658 ] zhengruifeng commented on SPARK-14005: -- I think easiness to implement should not be the reason to

[jira] [Created] (SPARK-13970) Add Non-Negative Matrix Factorization to MLlib

2016-03-19 Thread zhengruifeng (JIRA)
zhengruifeng created SPARK-13970: Summary: Add Non-Negative Matrix Factorization to MLlib Key: SPARK-13970 URL: https://issues.apache.org/jira/browse/SPARK-13970 Project: Spark Issue Type:

[jira] [Created] (SPARK-14022) What about adding RandomProjection to ML/MLLIB as a new dimensionality reduction algorithm?

2016-03-19 Thread zhengruifeng (JIRA)
zhengruifeng created SPARK-14022: Summary: What about adding RandomProjection to ML/MLLIB as a new dimensionality reduction algorithm? Key: SPARK-14022 URL: https://issues.apache.org/jira/browse/SPARK-14022

[jira] [Issue Comment Deleted] (SPARK-13712) Add OneVsOne to ML

2016-03-14 Thread zhengruifeng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13712?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengruifeng updated SPARK-13712: - Comment: was deleted (was: OK, I have closed the PR. I had also planned to implement ECC after

[jira] [Commented] (SPARK-13712) Add OneVsOne to ML

2016-03-14 Thread zhengruifeng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13712?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15194554#comment-15194554 ] zhengruifeng commented on SPARK-13712: -- OK, I have closed the PR. I had also planned to implement

[jira] [Commented] (SPARK-13712) Add OneVsOne to ML

2016-03-14 Thread zhengruifeng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13712?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15194555#comment-15194555 ] zhengruifeng commented on SPARK-13712: -- OK, I have closed the PR. I had also planned to implement

[jira] [Commented] (SPARK-14516) Clustering evaluator

2016-04-13 Thread zhengruifeng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14516?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15239140#comment-15239140 ] zhengruifeng commented on SPARK-14516: -- ok, I will work on clarify this API. > Clustering evaluator

[jira] [Comment Edited] (SPARK-14516) Clustering evaluator

2016-04-13 Thread zhengruifeng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14516?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15239140#comment-15239140 ] zhengruifeng edited comment on SPARK-14516 at 4/13/16 12:22 PM: ok, I

[jira] [Created] (SPARK-14510) Add args-checking for LDA and StreamingKMeans

2016-04-09 Thread zhengruifeng (JIRA)
zhengruifeng created SPARK-14510: Summary: Add args-checking for LDA and StreamingKMeans Key: SPARK-14510 URL: https://issues.apache.org/jira/browse/SPARK-14510 Project: Spark Issue Type:

[jira] [Created] (SPARK-14509) Add python CountVectorizerExample

2016-04-09 Thread zhengruifeng (JIRA)
zhengruifeng created SPARK-14509: Summary: Add python CountVectorizerExample Key: SPARK-14509 URL: https://issues.apache.org/jira/browse/SPARK-14509 Project: Spark Issue Type: Improvement

[jira] [Updated] (SPARK-14022) What about adding RandomProjection to ML/MLLIB as a new dimensionality reduction algorithm?

2016-04-10 Thread zhengruifeng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14022?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengruifeng updated SPARK-14022: - Issue Type: Brainstorming (was: Question) > What about adding RandomProjection to ML/MLLIB as a

[jira] [Reopened] (SPARK-14022) What about adding RandomProjection to ML/MLLIB as a new dimensionality reduction algorithm?

2016-04-10 Thread zhengruifeng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14022?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengruifeng reopened SPARK-14022: -- There may need some discuss on whether to add RandomProjection or Not. > What about adding

[jira] [Created] (SPARK-14514) Add python example for VectorSlicer

2016-04-09 Thread zhengruifeng (JIRA)
zhengruifeng created SPARK-14514: Summary: Add python example for VectorSlicer Key: SPARK-14514 URL: https://issues.apache.org/jira/browse/SPARK-14514 Project: Spark Issue Type: Improvement

[jira] [Created] (SPARK-14515) Add python example for ChiSqSelector

2016-04-09 Thread zhengruifeng (JIRA)
zhengruifeng created SPARK-14515: Summary: Add python example for ChiSqSelector Key: SPARK-14515 URL: https://issues.apache.org/jira/browse/SPARK-14515 Project: Spark Issue Type: Improvement

[jira] [Updated] (SPARK-14514) Add python example for VectorSlicer

2016-04-09 Thread zhengruifeng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14514?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengruifeng updated SPARK-14514: - Component/s: Documentation > Add python example for VectorSlicer >

[jira] [Commented] (SPARK-14022) What about adding RandomProjection to ML/MLLIB as a new dimensionality reduction algorithm?

2016-04-10 Thread zhengruifeng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14022?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15233937#comment-15233937 ] zhengruifeng commented on SPARK-14022: -- Ok, I change the Type from Question to Brainstroming. I

[jira] [Updated] (SPARK-13385) Enable AssociationRules to generate consequents with user-defined lengths

2016-04-09 Thread zhengruifeng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13385?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengruifeng updated SPARK-13385: - Priority: Major (was: Minor) > Enable AssociationRules to generate consequents with

[jira] [Created] (SPARK-14512) Add python example for QuantileDiscretizer

2016-04-09 Thread zhengruifeng (JIRA)
zhengruifeng created SPARK-14512: Summary: Add python example for QuantileDiscretizer Key: SPARK-14512 URL: https://issues.apache.org/jira/browse/SPARK-14512 Project: Spark Issue Type:

[jira] [Created] (SPARK-14516) What about adding general clustering metrics?

2016-04-10 Thread zhengruifeng (JIRA)
zhengruifeng created SPARK-14516: Summary: What about adding general clustering metrics? Key: SPARK-14516 URL: https://issues.apache.org/jira/browse/SPARK-14516 Project: Spark Issue Type:

[jira] [Commented] (SPARK-14022) What about adding RandomProjection to ML/MLLIB as a new dimensionality reduction algorithm?

2016-04-10 Thread zhengruifeng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14022?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15233941#comment-15233941 ] zhengruifeng commented on SPARK-14022: -- cc [~yanboliang] [~mengxr] [~josephkb] > What about adding

[jira] [Commented] (SPARK-14516) What about adding general clustering metrics?

2016-04-10 Thread zhengruifeng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14516?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15233946#comment-15233946 ] zhengruifeng commented on SPARK-14516: -- cc [~mengxr] [~josephkb] [~yanboliang] > What about adding

[jira] [Created] (SPARK-14027) Add parameter check to GradientDescent

2016-03-20 Thread zhengruifeng (JIRA)
zhengruifeng created SPARK-14027: Summary: Add parameter check to GradientDescent Key: SPARK-14027 URL: https://issues.apache.org/jira/browse/SPARK-14027 Project: Spark Issue Type:

[jira] [Created] (SPARK-14030) Add parameter check to LBFGS

2016-03-20 Thread zhengruifeng (JIRA)
zhengruifeng created SPARK-14030: Summary: Add parameter check to LBFGS Key: SPARK-14030 URL: https://issues.apache.org/jira/browse/SPARK-14030 Project: Spark Issue Type: Improvement

[jira] [Commented] (SPARK-14005) Make RDD more compatible with Scala's collection

2016-03-19 Thread zhengruifeng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14005?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15203056#comment-15203056 ] zhengruifeng commented on SPARK-14005: -- ok, plz close this jira. > Make RDD more compatible with

[jira] [Commented] (SPARK-14174) Accelerate KMeans via Mini-Batch EM

2016-03-25 Thread zhengruifeng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14174?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15212776#comment-15212776 ] zhengruifeng commented on SPARK-14174: -- There is another sklean example for MiniBatch KMeans:

[jira] [Created] (SPARK-14174) Accelerate KMeans via Mini-Batch EM

2016-03-25 Thread zhengruifeng (JIRA)
zhengruifeng created SPARK-14174: Summary: Accelerate KMeans via Mini-Batch EM Key: SPARK-14174 URL: https://issues.apache.org/jira/browse/SPARK-14174 Project: Spark Issue Type: Improvement

[jira] [Created] (SPARK-14005) Make RDD more compatible with Scala's collection

2016-03-19 Thread zhengruifeng (JIRA)
zhengruifeng created SPARK-14005: Summary: Make RDD more compatible with Scala's collection Key: SPARK-14005 URL: https://issues.apache.org/jira/browse/SPARK-14005 Project: Spark Issue

[jira] [Created] (SPARK-13677) Support Tree-Based Feature Transformation for mllib

2016-03-04 Thread zhengruifeng (JIRA)
zhengruifeng created SPARK-13677: Summary: Support Tree-Based Feature Transformation for mllib Key: SPARK-13677 URL: https://issues.apache.org/jira/browse/SPARK-13677 Project: Spark Issue

[jira] [Created] (SPARK-13672) Add python examples of BisectingKMeans in ML and MLLIB

2016-03-04 Thread zhengruifeng (JIRA)
zhengruifeng created SPARK-13672: Summary: Add python examples of BisectingKMeans in ML and MLLIB Key: SPARK-13672 URL: https://issues.apache.org/jira/browse/SPARK-13672 Project: Spark Issue

[jira] [Created] (SPARK-13714) Another ConnectedComponents based on Max-Degree Propagation

2016-03-07 Thread zhengruifeng (JIRA)
zhengruifeng created SPARK-13714: Summary: Another ConnectedComponents based on Max-Degree Propagation Key: SPARK-13714 URL: https://issues.apache.org/jira/browse/SPARK-13714 Project: Spark

[jira] [Updated] (SPARK-13714) Another ConnectedComponents based on Max-Degree Propagation

2016-03-07 Thread zhengruifeng (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13714?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengruifeng updated SPARK-13714: - Description: Current ConnectedComponents algorithm was based on Min-VertexId Propagation, which

[jira] [Created] (SPARK-13712) Add OneVsOne to ML

2016-03-06 Thread zhengruifeng (JIRA)
zhengruifeng created SPARK-13712: Summary: Add OneVsOne to ML Key: SPARK-13712 URL: https://issues.apache.org/jira/browse/SPARK-13712 Project: Spark Issue Type: New Feature

[jira] [Created] (SPARK-14352) approxQuantile should support multi columns

2016-04-03 Thread zhengruifeng (JIRA)
zhengruifeng created SPARK-14352: Summary: approxQuantile should support multi columns Key: SPARK-14352 URL: https://issues.apache.org/jira/browse/SPARK-14352 Project: Spark Issue Type:

[jira] [Created] (SPARK-14272) Evaluate GaussianMixtureModel with LogLooklihood

2016-03-30 Thread zhengruifeng (JIRA)
zhengruifeng created SPARK-14272: Summary: Evaluate GaussianMixtureModel with LogLooklihood Key: SPARK-14272 URL: https://issues.apache.org/jira/browse/SPARK-14272 Project: Spark Issue Type:

[jira] [Created] (SPARK-14339) Add python examples for DCT,MinMaxScaler,MaxAbsScaler

2016-04-01 Thread zhengruifeng (JIRA)
zhengruifeng created SPARK-14339: Summary: Add python examples for DCT,MinMaxScaler,MaxAbsScaler Key: SPARK-14339 URL: https://issues.apache.org/jira/browse/SPARK-14339 Project: Spark Issue
