[jira] [Updated] (SPARK-20043) Decision Tree loader does not handle uppercase impurity param values

2017-03-25 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20043?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-20043: -- Affects Version/s: 2.2.0 > Decision Tree loader does not handle uppercase impurity

[jira] [Updated] (SPARK-20043) Decision Tree loader does not handle uppercase impurity param values

2017-03-25 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20043?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-20043: -- Target Version/s: 2.1.1, 2.2.0 > Decision Tree loader does not handle uppercase

[jira] [Updated] (SPARK-20043) Decision Tree loader does not handle uppercase impurity param values

2017-03-25 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20043?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-20043: -- Shepherd: Joseph K. Bradley > Decision Tree loader does not handle uppercase impurity

[jira] [Updated] (SPARK-20043) Decision Tree loader does not handle uppercase impurity param values

2017-03-25 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20043?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-20043: -- Summary: Decision Tree loader does not handle uppercase impurity param values (was:

[jira] [Updated] (SPARK-20043) Decision Tree loader does not recognize impurity "Gini" and "Entropy" on ML random forest and decision. Only "gini" and "entropy" (in lower case) are accepted

2017-03-25 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20043?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-20043: -- Summary: Decision Tree loader does not recognize impurity "Gini" and "Entropy" on ML

[jira] [Commented] (SPARK-20099) Add transformSchema to pyspark.ml

2017-03-25 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20099?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15942082#comment-15942082 ] Joseph K. Bradley commented on SPARK-20099: --- Linking [SPARK-15574] since it brought up a need

[jira] [Created] (SPARK-20099) Add transformSchema to pyspark.ml

2017-03-25 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-20099: - Summary: Add transformSchema to pyspark.ml Key: SPARK-20099 URL: https://issues.apache.org/jira/browse/SPARK-20099 Project: Spark Issue Type:

[jira] [Created] (SPARK-20090) Add StructType.fieldNames to Python API

2017-03-24 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-20090: - Summary: Add StructType.fieldNames to Python API Key: SPARK-20090 URL: https://issues.apache.org/jira/browse/SPARK-20090 Project: Spark Issue

[jira] [Updated] (SPARK-20082) Incremental update of LDA model, by adding initialModel as start point

2017-03-24 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20082?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-20082: -- Component/s: (was: MLlib) ML > Incremental update of LDA model,

[jira] [Updated] (SPARK-20082) Incremental update of LDA model, by adding initialModel as start point

2017-03-24 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20082?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-20082: -- Issue Type: New Feature (was: Wish) > Incremental update of LDA model, by adding

[jira] [Resolved] (SPARK-19636) Feature parity for correlation statistics in MLlib

2017-03-23 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19636?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley resolved SPARK-19636. --- Resolution: Fixed Fix Version/s: 2.2.0 Issue resolved by pull request 17108

[jira] [Commented] (SPARK-13333) DataFrame filter + randn + unionAll has bad interaction

2017-03-23 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13333?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15939323#comment-15939323 ] Joseph K. Bradley commented on SPARK-13333: --- [~smilegator] I wouldn't call that result "right."

[jira] [Updated] (SPARK-19791) Add doc and example for fpgrowth

2017-03-23 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19791?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-19791: -- Target Version/s: 2.2.0 > Add doc and example for fpgrowth >

[jira] [Updated] (SPARK-19791) Add doc and example for fpgrowth

2017-03-23 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19791?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-19791: -- Shepherd: Joseph K. Bradley > Add doc and example for fpgrowth >

[jira] [Assigned] (SPARK-19791) Add doc and example for fpgrowth

2017-03-23 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19791?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley reassigned SPARK-19791: - Assignee: yuhao yang > Add doc and example for fpgrowth >

[jira] [Updated] (SPARK-19591) Add sample weights to decision trees

2017-03-23 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19591?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-19591: -- Shepherd: Joseph K. Bradley > Add sample weights to decision trees >

[jira] [Assigned] (SPARK-19591) Add sample weights to decision trees

2017-03-23 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19591?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley reassigned SPARK-19591: - Assignee: Seth Hendrickson > Add sample weights to decision trees >

[jira] [Commented] (SPARK-20040) Python API for ml.stat.ChiSquareTest

2017-03-22 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20040?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15936905#comment-15936905 ] Joseph K. Bradley commented on SPARK-20040: --- Sure, go ahead, thanks! > Python API for

[jira] [Updated] (SPARK-20039) Rename ml.stat.ChiSquare to ml.stat.ChiSquareTest

2017-03-21 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20039?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-20039: -- Priority: Minor (was: Major) > Rename ml.stat.ChiSquare to ml.stat.ChiSquareTest >

[jira] [Resolved] (SPARK-20039) Rename ml.stat.ChiSquare to ml.stat.ChiSquareTest

2017-03-21 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20039?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley resolved SPARK-20039. --- Resolution: Fixed Fix Version/s: 2.2.0 Issue resolved by pull request 17368

[jira] [Created] (SPARK-20040) Python API for ml.stat.ChiSquareTest

2017-03-20 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-20040: - Summary: Python API for ml.stat.ChiSquareTest Key: SPARK-20040 URL: https://issues.apache.org/jira/browse/SPARK-20040 Project: Spark Issue Type:

[jira] [Updated] (SPARK-20040) Python API for ml.stat.ChiSquareTest

2017-03-20 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20040?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-20040: -- Description: Add PySpark wrapper for ChiSquareTest. Note that it's currently called

[jira] [Created] (SPARK-20039) Rename ml.stat.ChiSquare to ml.stat.ChiSquareTest

2017-03-20 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-20039: - Summary: Rename ml.stat.ChiSquare to ml.stat.ChiSquareTest Key: SPARK-20039 URL: https://issues.apache.org/jira/browse/SPARK-20039 Project: Spark

[jira] [Updated] (SPARK-19636) Feature parity for correlation statistics in MLlib

2017-03-20 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19636?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-19636: -- Shepherd: Joseph K. Bradley > Feature parity for correlation statistics in MLlib >

[jira] [Assigned] (SPARK-19636) Feature parity for correlation statistics in MLlib

2017-03-20 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19636?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley reassigned SPARK-19636: - Assignee: Timothy Hunter (was: Tim Hunter) > Feature parity for correlation

[jira] [Resolved] (SPARK-19899) FPGrowth input column naming

2017-03-20 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19899?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley resolved SPARK-19899. --- Resolution: Fixed Fix Version/s: 2.2.0 Issue resolved by pull request 17321

[jira] [Assigned] (SPARK-19899) FPGrowth input column naming

2017-03-20 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19899?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley reassigned SPARK-19899: - Assignee: Maciej Szymkiewicz > FPGrowth input column naming >

[jira] [Resolved] (SPARK-19635) Feature parity for Chi-square hypothesis testing in MLlib

2017-03-16 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19635?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley resolved SPARK-19635. --- Resolution: Fixed Fix Version/s: 2.2.0 Issue resolved by pull request 17110

[jira] [Updated] (SPARK-19899) FPGrowth input column naming

2017-03-14 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19899?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-19899: -- Target Version/s: 2.2.0 > FPGrowth input column naming >

[jira] [Updated] (SPARK-19899) FPGrowth input column naming

2017-03-14 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19899?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-19899: -- Shepherd: Joseph K. Bradley > FPGrowth input column naming >

[jira] [Commented] (SPARK-19899) FPGrowth input column naming

2017-03-14 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19899?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15924362#comment-15924362 ] Joseph K. Bradley commented on SPARK-19899: --- Thanks for bringing this up. I'm pretty convinced

[jira] [Commented] (SPARK-11569) StringIndexer transform fails when column contains nulls

2017-03-14 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11569?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15924355#comment-15924355 ] Joseph K. Bradley commented on SPARK-11569: --- Linking [SPARK-19852], which can update the Python

[jira] [Assigned] (SPARK-11569) StringIndexer transform fails when column contains nulls

2017-03-14 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11569?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley reassigned SPARK-11569: - Assignee: Menglong TAN > StringIndexer transform fails when column contains

[jira] [Resolved] (SPARK-11569) StringIndexer transform fails when column contains nulls

2017-03-14 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11569?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley resolved SPARK-11569. --- Resolution: Fixed Fix Version/s: 2.2.0 Issue resolved by pull request 17233

[jira] [Updated] (SPARK-11569) StringIndexer transform fails when column contains nulls

2017-03-14 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11569?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-11569: -- Issue Type: Improvement (was: Bug) > StringIndexer transform fails when column

[jira] [Resolved] (SPARK-19940) FPGrowthModel.transform should skip duplicated items

2017-03-14 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19940?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley resolved SPARK-19940. --- Resolution: Fixed Fix Version/s: 2.2.0 Issue resolved by pull request 17283

[jira] [Assigned] (SPARK-19940) FPGrowthModel.transform should skip duplicated items

2017-03-14 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19940?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley reassigned SPARK-19940: - Assignee: Maciej Szymkiewicz > FPGrowthModel.transform should skip duplicated

[jira] [Commented] (SPARK-14174) Accelerate KMeans via Mini-Batch EM

2017-03-13 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14174?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15923314#comment-15923314 ] Joseph K. Bradley commented on SPARK-14174: --- I'm fine with improving KMeans, but I'm still not

[jira] [Commented] (SPARK-14682) Provide evaluateEachIteration method or equivalent for spark.ml GBTs

2017-03-13 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14682?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15923309#comment-15923309 ] Joseph K. Bradley commented on SPARK-14682: --- [~podongfeng] Sorry for the slow response. To

[jira] [Commented] (SPARK-19653) `Vector` Type Should Be A First-Class Citizen In Spark SQL

2017-03-13 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19653?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15923307#comment-15923307 ] Joseph K. Bradley commented on SPARK-19653: --- I agree it'd be nice to make it easier to work

[jira] [Commented] (SPARK-4591) Algorithm/model parity for spark.ml (Scala)

2017-03-13 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-4591?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15923294#comment-15923294 ] Joseph K. Bradley commented on SPARK-4591: -- For the record: * Kernel Density: later, I'd say *

[jira] [Commented] (SPARK-19416) Dataset.schema is inconsistent with Dataset in handling columns with periods

2017-03-13 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19416?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15923286#comment-15923286 ] Joseph K. Bradley commented on SPARK-19416: --- Hm, I'd call my synopsis above a "complaint" but

[jira] [Commented] (SPARK-10413) ML models should support prediction on single instances

2017-03-13 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10413?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15923258#comment-15923258 ] Joseph K. Bradley commented on SPARK-10413: --- [~akrim] I agree this would be useful, but it will

[jira] [Updated] (SPARK-11569) StringIndexer transform fails when column contains nulls

2017-03-13 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11569?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-11569: -- Shepherd: Joseph K. Bradley > StringIndexer transform fails when column contains nulls

[jira] [Comment Edited] (SPARK-11569) StringIndexer transform fails when column contains nulls

2017-03-13 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-11569?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15813712#comment-15813712 ] Joseph K. Bradley edited comment on SPARK-11569 at 3/13/17 4:31 PM:

[jira] [Resolved] (SPARK-19348) pyspark.ml.Pipeline gets corrupted under multi threaded use

2017-03-07 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19348?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley resolved SPARK-19348. --- Resolution: Fixed Fix Version/s: 2.0.3 2.1.1 Issue

[jira] [Updated] (SPARK-19866) Add local version of Word2Vec findSynonyms for spark.ml: Python API

2017-03-07 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19866?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-19866: -- Shepherd: Joseph K. Bradley > Add local version of Word2Vec findSynonyms for spark.ml:

[jira] [Created] (SPARK-19866) Add local version of Word2Vec findSynonyms for spark.ml: Python API

2017-03-07 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-19866: - Summary: Add local version of Word2Vec findSynonyms for spark.ml: Python API Key: SPARK-19866 URL: https://issues.apache.org/jira/browse/SPARK-19866

[jira] [Resolved] (SPARK-17629) Add local version of Word2Vec findSynonyms for spark.ml

2017-03-07 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17629?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley resolved SPARK-17629. --- Resolution: Fixed Fix Version/s: 2.2.0 Issue resolved by pull request 16811

[jira] [Commented] (SPARK-13969) Extend input format that feature hashing can handle

2017-03-07 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13969?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15900662#comment-15900662 ] Joseph K. Bradley commented on SPARK-13969: --- Noticing this JIRA again. I feel like this is

[jira] [Created] (SPARK-19852) StringIndexer.setHandleInvalid should have another option 'new': Python API and docs

2017-03-07 Thread Joseph K. Bradley (JIRA)
Joseph K. Bradley created SPARK-19852: - Summary: StringIndexer.setHandleInvalid should have another option 'new': Python API and docs Key: SPARK-19852 URL: https://issues.apache.org/jira/browse/SPARK-19852

[jira] [Resolved] (SPARK-17498) StringIndexer.setHandleInvalid should have another option 'new'

2017-03-07 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17498?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley resolved SPARK-17498. --- Resolution: Fixed Fix Version/s: 2.2.0 Issue resolved by pull request 16883

[jira] [Commented] (SPARK-14409) Investigate adding a RankingEvaluator to ML

2017-03-06 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14409?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15898823#comment-15898823 ] Joseph K. Bradley commented on SPARK-14409: --- Thanks [~nick.pentre...@gmail.com]! I like this

[jira] [Resolved] (SPARK-19382) Test sparse vectors in LinearSVCSuite

2017-03-06 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19382?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley resolved SPARK-19382. --- Resolution: Fixed Fix Version/s: 2.2.0 Issue resolved by pull request 16784

[jira] [Assigned] (SPARK-19382) Test sparse vectors in LinearSVCSuite

2017-03-06 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19382?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley reassigned SPARK-19382: - Assignee: Miao Wang > Test sparse vectors in LinearSVCSuite >

[jira] [Resolved] (SPARK-19535) ALSModel recommendAll analogs

2017-03-05 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19535?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley resolved SPARK-19535. --- Resolution: Fixed Fix Version/s: 2.2.0 Issue resolved by pull request 17090

[jira] [Updated] (SPARK-19348) pyspark.ml.Pipeline gets corrupted under multi threaded use

2017-03-03 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19348?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-19348: -- Fix Version/s: 2.2.0 > pyspark.ml.Pipeline gets corrupted under multi threaded use >

[jira] [Assigned] (SPARK-19635) Feature parity for Chi-square hypothesis testing in MLlib

2017-02-28 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19635?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley reassigned SPARK-19635: - Assignee: Joseph K. Bradley > Feature parity for Chi-square hypothesis testing

[jira] [Commented] (SPARK-19635) Feature parity for Chi-square hypothesis testing in MLlib

2017-02-28 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19635?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15889190#comment-15889190 ] Joseph K. Bradley commented on SPARK-19635: --- That PR for trees looks pretty different. This

[jira] [Commented] (SPARK-19634) Feature parity for descriptive statistics in MLlib

2017-02-28 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19634?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15889181#comment-15889181 ] Joseph K. Bradley commented on SPARK-19634: --- I'll assign this to [~timhunter] given the time

[jira] [Assigned] (SPARK-19634) Feature parity for descriptive statistics in MLlib

2017-02-28 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19634?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley reassigned SPARK-19634: - Assignee: Timothy Hunter > Feature parity for descriptive statistics in MLlib >

[jira] [Updated] (SPARK-19382) Test sparse vectors in LinearSVCSuite

2017-02-28 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19382?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-19382: -- Shepherd: Joseph K. Bradley > Test sparse vectors in LinearSVCSuite >

[jira] [Resolved] (SPARK-14503) spark.ml Scala API for FPGrowth

2017-02-28 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14503?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley resolved SPARK-14503. --- Resolution: Fixed Fix Version/s: 2.2.0 Issue resolved by pull request 15415

[jira] [Commented] (SPARK-14503) spark.ml Scala API for FPGrowth

2017-02-28 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14503?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15889085#comment-15889085 ] Joseph K. Bradley commented on SPARK-14503: --- Sorry for the slow reply. I actually haven't read

[jira] [Issue Comment Deleted] (SPARK-19636) Feature parity for correlation statistics in MLlib

2017-02-28 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19636?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-19636: -- Comment: was deleted (was: I'm going to work on this.) > Feature parity for

[jira] [Assigned] (SPARK-19636) Feature parity for correlation statistics in MLlib

2017-02-28 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19636?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley reassigned SPARK-19636: - Assignee: Tim Hunter (was: Joseph K. Bradley) > Feature parity for correlation

[jira] [Commented] (SPARK-19636) Feature parity for correlation statistics in MLlib

2017-02-27 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19636?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15887315#comment-15887315 ] Joseph K. Bradley commented on SPARK-19636: --- I'm going to work on this. > Feature parity for

[jira] [Assigned] (SPARK-19636) Feature parity for correlation statistics in MLlib

2017-02-27 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19636?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley reassigned SPARK-19636: - Assignee: Joseph K. Bradley > Feature parity for correlation statistics in

[jira] [Commented] (SPARK-18080) Locality Sensitive Hashing (LSH) Python API

2017-02-27 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18080?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15887000#comment-15887000 ] Joseph K. Bradley commented on SPARK-18080: --- No, no problem. Thanks for committing it! I saw

[jira] [Updated] (SPARK-9140) Replace TimeTracker by Stopwatch

2017-02-27 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9140?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-9140: - Shepherd: Joseph K. Bradley Target Version/s: (was: 2.2.0) > Replace

[jira] [Commented] (SPARK-9140) Replace TimeTracker by Stopwatch

2017-02-27 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9140?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15886749#comment-15886749 ] Joseph K. Bradley commented on SPARK-9140: -- I'll unset the target version but assign myself as

[jira] [Updated] (SPARK-18903) uiWebUrl is not accessible to SparkR

2017-02-27 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18903?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-18903: -- Fix Version/s: 2.2.0 2.1.1 > uiWebUrl is not accessible to SparkR >

[jira] [Assigned] (SPARK-17498) StringIndexer.setHandleInvalid should have another option 'new'

2017-02-27 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17498?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley reassigned SPARK-17498: - Assignee: Vincent > StringIndexer.setHandleInvalid should have another option

[jira] [Updated] (SPARK-17498) StringIndexer.setHandleInvalid should have another option 'new'

2017-02-27 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17498?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-17498: -- Priority: Minor (was: Major) > StringIndexer.setHandleInvalid should have another

[jira] [Updated] (SPARK-17498) StringIndexer.setHandleInvalid should have another option 'new'

2017-02-27 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17498?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-17498: -- Shepherd: Joseph K. Bradley > StringIndexer.setHandleInvalid should have another

[jira] [Commented] (SPARK-17265) EdgeRDD Difference throws an exception

2017-02-26 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-17265?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15884936#comment-15884936 ] Joseph K. Bradley commented on SPARK-17265: --- Are you able to post code (and data or generated

[jira] [Assigned] (SPARK-19348) pyspark.ml.Pipeline gets corrupted under multi threaded use

2017-02-26 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-19348?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley reassigned SPARK-19348: - Shepherd: Joseph K. Bradley Assignee: Bryan Cutler

[jira] [Updated] (SPARK-14772) Python ML Params.copy treats uid, paramMaps differently than Scala

2017-02-25 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14772?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-14772: -- Fix Version/s: 2.2.0 > Python ML Params.copy treats uid, paramMaps differently than

[jira] [Resolved] (SPARK-14772) Python ML Params.copy treats uid, paramMaps differently than Scala

2017-02-25 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14772?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley resolved SPARK-14772. --- Resolution: Fixed Fix Version/s: (was: 2.2.0) 2.1.1

[jira] [Commented] (SPARK-14501) spark.ml parity for fpm - frequent items

2017-02-24 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14501?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15883542#comment-15883542 ] Joseph K. Bradley commented on SPARK-14501: --- I set the target for Scala to 2.2. Not sure if

[jira] [Updated] (SPARK-14501) spark.ml parity for fpm - frequent items

2017-02-24 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14501?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-14501: -- Target Version/s: (was: 2.2.0) > spark.ml parity for fpm - frequent items >

[jira] [Assigned] (SPARK-14503) spark.ml Scala API for FPGrowth

2017-02-24 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14503?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley reassigned SPARK-14503: - Assignee: yuhao yang > spark.ml Scala API for FPGrowth >

[jira] [Updated] (SPARK-14503) spark.ml Scala API for FPGrowth

2017-02-24 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14503?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-14503: -- Target Version/s: 2.2.0 > spark.ml Scala API for FPGrowth >

[jira] [Updated] (SPARK-14503) spark.ml Scala API for FPGrowth

2017-02-24 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14503?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-14503: -- Shepherd: Joseph K. Bradley (was: Nick Pentreath) > spark.ml Scala API for FPGrowth >

[jira] [Updated] (SPARK-14772) Python ML Params.copy treats uid, paramMaps differently than Scala

2017-02-23 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14772?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-14772: -- Fix Version/s: 2.2.0 > Python ML Params.copy treats uid, paramMaps differently than

[jira] [Updated] (SPARK-14772) Python ML Params.copy treats uid, paramMaps differently than Scala

2017-02-23 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14772?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-14772: -- Shepherd: Joseph K. Bradley Affects Version/s: 2.1.0 Target

[jira] [Assigned] (SPARK-14772) Python ML Params.copy treats uid, paramMaps differently than Scala

2017-02-23 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14772?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley reassigned SPARK-14772: - Assignee: Bryan Cutler > Python ML Params.copy treats uid, paramMaps

[jira] [Closed] (SPARK-14523) Feature parity for Statistics ML with MLlib

2017-02-23 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14523?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley closed SPARK-14523. - Resolution: Done > Feature parity for Statistics ML with MLlib >

[jira] [Commented] (SPARK-14523) Feature parity for Statistics ML with MLlib

2017-02-23 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-14523?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15881567#comment-15881567 ] Joseph K. Bradley commented on SPARK-14523: --- Alright, given that there are now 3 more subtasks

[jira] [Commented] (SPARK-16920) Investigate and fix issues introduced in SPARK-15858

2017-02-23 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16920?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15881537#comment-15881537 ] Joseph K. Bradley commented on SPARK-16920: --- Thanks for adding that gist! I agree with your

[jira] [Resolved] (SPARK-16920) Investigate and fix issues introduced in SPARK-15858

2017-02-23 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16920?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley resolved SPARK-16920. --- Resolution: Done Fix Version/s: 2.2.0 Target Version/s: 2.2.0 >

[jira] [Assigned] (SPARK-16920) Investigate and fix issues introduced in SPARK-15858

2017-02-23 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16920?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley reassigned SPARK-16920: - Assignee: Mahmoud Rawas > Investigate and fix issues introduced in SPARK-15858

[jira] [Updated] (SPARK-16920) Investigate and fix issues introduced in SPARK-15858

2017-02-23 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16920?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-16920: -- Target Version/s: (was: 2.2.0) > Investigate and fix issues introduced in

[jira] [Updated] (SPARK-18618) SparkR GLM model predict should support type as a argument

2017-02-23 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18618?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-18618: -- Labels: (was: 2.2.0) > SparkR GLM model predict should support type as a argument >

[jira] [Updated] (SPARK-18592) Move DT/RF/GBT Param setter methods to subclasses

2017-02-23 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18592?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-18592: -- Target Version/s: 2.1.0 (was: 2.1.0, 2.2.0) > Move DT/RF/GBT Param setter methods to

[jira] [Updated] (SPARK-15571) Pipeline unit test improvements for 2.3

2017-02-23 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15571?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-15571: -- Summary: Pipeline unit test improvements for 2.3 (was: Pipeline unit test

[jira] [Updated] (SPARK-15571) Pipeline unit test improvements for 2.3

2017-02-23 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15571?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-15571: -- Target Version/s: 2.3.0 (was: 2.2.0) > Pipeline unit test improvements for 2.3 >

[jira] [Commented] (SPARK-15571) Pipeline unit test improvements for 2.2

2017-02-23 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-15571?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15881102#comment-15881102 ] Joseph K. Bradley commented on SPARK-15571: --- [~rowanv] Thanks, and sorry for the long delay!

[jira] [Commented] (SPARK-18822) Support ML Pipeline in SparkR

2017-02-23 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18822?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15881098#comment-15881098 ] Joseph K. Bradley commented on SPARK-18822: --- How's this going? Just checking in; I know

[jira] [Updated] (SPARK-13786) Pyspark ml.tuning support export/import

2017-02-23 Thread Joseph K. Bradley (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13786?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joseph K. Bradley updated SPARK-13786: -- Target Version/s: 2.3.0 (was: 2.2.0) > Pyspark ml.tuning support export/import >
