[jira] [Created] (SPARK-32298) tree models prediction optimization

2020-07-13 Thread zhengruifeng (Jira)
zhengruifeng created SPARK-32298: Summary: tree models prediction optimization Key: SPARK-32298 URL: https://issues.apache.org/jira/browse/SPARK-32298 Project: Spark Issue Type: Improvement

[jira] [Created] (SPARK-32384) repartitionAndSortWithinPartitions avoid shuffle with same partitioner

2020-07-22 Thread zhengruifeng (Jira)
zhengruifeng created SPARK-32384: Summary: repartitionAndSortWithinPartitions avoid shuffle with same partitioner Key: SPARK-32384 URL: https://issues.apache.org/jira/browse/SPARK-32384 Project: Spark

[jira] [Created] (SPARK-32455) LogisticRegressionModel prediction optimization

2020-07-27 Thread zhengruifeng (Jira)
zhengruifeng created SPARK-32455: Summary: LogisticRegressionModel prediction optimization Key: SPARK-32455 URL: https://issues.apache.org/jira/browse/SPARK-32455 Project: Spark Issue Type: I

[jira] [Created] (SPARK-32457) logParam thresholds in DT/GBT/FM/LR/MLP

2020-07-27 Thread zhengruifeng (Jira)
zhengruifeng created SPARK-32457: Summary: logParam thresholds in DT/GBT/FM/LR/MLP Key: SPARK-32457 URL: https://issues.apache.org/jira/browse/SPARK-32457 Project: Spark Issue Type: Improvem

[jira] [Assigned] (SPARK-29116) Refactor py classes related to DecisionTree

2019-10-12 Thread zhengruifeng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29116?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengruifeng reassigned SPARK-29116: Assignee: Huaxin Gao > Refactor py classes related to DecisionTree >

[jira] [Resolved] (SPARK-29116) Refactor py classes related to DecisionTree

2019-10-12 Thread zhengruifeng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29116?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengruifeng resolved SPARK-29116. -- Fix Version/s: 3.0.0 Resolution: Fixed Issue resolved by pull request 25929 [https://gi

[jira] [Assigned] (SPARK-29380) RFormula avoid repeated 'first' jobs to get vector size

2019-10-12 Thread zhengruifeng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29380?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengruifeng reassigned SPARK-29380: Assignee: zhengruifeng > RFormula avoid repeated 'first' jobs to get vector size > --

[jira] [Resolved] (SPARK-29380) RFormula avoid repeated 'first' jobs to get vector size

2019-10-12 Thread zhengruifeng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29380?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengruifeng resolved SPARK-29380. -- Fix Version/s: 3.0.0 Resolution: Fixed Issue resolved by pull request 26052 [https://gi

[jira] [Resolved] (SPARK-29377) parity between scala ml tuning and python ml tuning

2019-10-13 Thread zhengruifeng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29377?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengruifeng resolved SPARK-29377. -- Fix Version/s: 3.0.0 Resolution: Fixed Issue resolved by pull request 26057 [https://gi

[jira] [Assigned] (SPARK-29377) parity between scala ml tuning and python ml tuning

2019-10-13 Thread zhengruifeng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29377?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengruifeng reassigned SPARK-29377: Assignee: Huaxin Gao > parity between scala ml tuning and python ml tuning >

[jira] [Commented] (SPARK-29381) Add 'private' _XXXParams classes for classification & regression

2019-10-14 Thread zhengruifeng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29381?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16951657#comment-16951657 ] zhengruifeng commented on SPARK-29381: -- [~huaxingao]  Hi, I think we need another P

[jira] [Created] (SPARK-29489) ml.evaluation support log-loss

2019-10-16 Thread zhengruifeng (Jira)
zhengruifeng created SPARK-29489: Summary: ml.evaluation support log-loss Key: SPARK-29489 URL: https://issues.apache.org/jira/browse/SPARK-29489 Project: Spark Issue Type: New Feature

[jira] [Resolved] (SPARK-23578) Add multicolumn support for Binarizer

2019-10-16 Thread zhengruifeng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-23578?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengruifeng resolved SPARK-23578. -- Fix Version/s: 3.0.0 Resolution: Fixed Issue resolved by pull request 26064 [https://gi

[jira] [Assigned] (SPARK-23578) Add multicolumn support for Binarizer

2019-10-16 Thread zhengruifeng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-23578?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengruifeng reassigned SPARK-23578: Assignee: zhengruifeng > Add multicolumn support for Binarizer >

[jira] [Resolved] (SPARK-29489) ml.evaluation support log-loss

2019-10-18 Thread zhengruifeng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29489?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengruifeng resolved SPARK-29489. -- Fix Version/s: 3.0.0 Resolution: Fixed Issue resolved by pull request 26135 [https://gi

[jira] [Assigned] (SPARK-29489) ml.evaluation support log-loss

2019-10-18 Thread zhengruifeng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29489?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengruifeng reassigned SPARK-29489: Assignee: zhengruifeng > ml.evaluation support log-loss > --

[jira] [Assigned] (SPARK-29232) RandomForestRegressionModel does not update the parameter maps of the DecisionTreeRegressionModels underneath

2019-10-22 Thread zhengruifeng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29232?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengruifeng reassigned SPARK-29232: Assignee: Huaxin Gao > RandomForestRegressionModel does not update the parameter maps of

[jira] [Resolved] (SPARK-29232) RandomForestRegressionModel does not update the parameter maps of the DecisionTreeRegressionModels underneath

2019-10-22 Thread zhengruifeng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29232?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengruifeng resolved SPARK-29232. -- Fix Version/s: 3.0.0 Resolution: Fixed Issue resolved by pull request 26154 [https://gi

[jira] [Commented] (SPARK-29093) Remove automatically generated param setters in _shared_params_code_gen.py

2019-10-22 Thread zhengruifeng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29093?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16957601#comment-16957601 ] zhengruifeng commented on SPARK-29093: -- [~huaxingao] Thanks! > Remove automaticall

[jira] [Assigned] (SPARK-29093) Remove automatically generated param setters in _shared_params_code_gen.py

2019-10-22 Thread zhengruifeng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29093?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengruifeng reassigned SPARK-29093: Assignee: Huaxin Gao > Remove automatically generated param setters in _shared_params_cod

[jira] [Created] (SPARK-29565) OneHotEncoder should support single-column input/ouput

2019-10-23 Thread zhengruifeng (Jira)
zhengruifeng created SPARK-29565: Summary: OneHotEncoder should support single-column input/ouput Key: SPARK-29565 URL: https://issues.apache.org/jira/browse/SPARK-29565 Project: Spark Issue

[jira] [Created] (SPARK-29566) Imputer should support single-column input/ouput

2019-10-23 Thread zhengruifeng (Jira)
zhengruifeng created SPARK-29566: Summary: Imputer should support single-column input/ouput Key: SPARK-29566 URL: https://issues.apache.org/jira/browse/SPARK-29566 Project: Spark Issue Type:

[jira] [Updated] (SPARK-29566) Imputer should support single-column input/ouput

2019-10-23 Thread zhengruifeng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29566?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengruifeng updated SPARK-29566: - Description: Imputer should support single-column input/ouput refer to https://issues.apache.or

[jira] [Commented] (SPARK-29565) OneHotEncoder should support single-column input/ouput

2019-10-23 Thread zhengruifeng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29565?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16957664#comment-16957664 ] zhengruifeng commented on SPARK-29565: -- [~huaxingao]  In  [https://github.com/apach

[jira] [Resolved] (SPARK-9612) Add instance weight support for GBTs

2019-10-25 Thread zhengruifeng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-9612?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengruifeng resolved SPARK-9612. - Resolution: Fixed > Add instance weight support for GBTs > >

[jira] [Reopened] (SPARK-9612) Add instance weight support for GBTs

2019-10-25 Thread zhengruifeng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-9612?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengruifeng reopened SPARK-9612: - Assignee: zhengruifeng (was: DB Tsai) > Add instance weight support for GBTs > ---

[jira] [Resolved] (SPARK-29093) Remove automatically generated param setters in _shared_params_code_gen.py

2019-10-27 Thread zhengruifeng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29093?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengruifeng resolved SPARK-29093. -- Fix Version/s: 3.0.0 Resolution: Fixed Issue resolved by pull request 26232 [https://gi

[jira] [Assigned] (SPARK-29566) Imputer should support single-column input/ouput

2019-10-28 Thread zhengruifeng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29566?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengruifeng reassigned SPARK-29566: Assignee: Huaxin Gao > Imputer should support single-column input/ouput > ---

[jira] [Resolved] (SPARK-29566) Imputer should support single-column input/ouput

2019-10-28 Thread zhengruifeng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29566?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengruifeng resolved SPARK-29566. -- Fix Version/s: 3.0.0 Resolution: Fixed Issue resolved by pull request 26247 [https://gi

[jira] [Created] (SPARK-29645) ML add param RelativeError

2019-10-29 Thread zhengruifeng (Jira)
zhengruifeng created SPARK-29645: Summary: ML add param RelativeError Key: SPARK-29645 URL: https://issues.apache.org/jira/browse/SPARK-29645 Project: Spark Issue Type: Improvement

[jira] [Created] (SPARK-29656) ML algs expose aggregationDepth

2019-10-30 Thread zhengruifeng (Jira)
zhengruifeng created SPARK-29656: Summary: ML algs expose aggregationDepth Key: SPARK-29656 URL: https://issues.apache.org/jira/browse/SPARK-29656 Project: Spark Issue Type: Improvement

[jira] [Assigned] (SPARK-29645) ML add param RelativeError

2019-10-30 Thread zhengruifeng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29645?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengruifeng reassigned SPARK-29645: Assignee: zhengruifeng > ML add param RelativeError > -- > >

[jira] [Resolved] (SPARK-29645) ML add param RelativeError

2019-10-30 Thread zhengruifeng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29645?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengruifeng resolved SPARK-29645. -- Fix Version/s: 3.0.0 Resolution: Fixed Issue resolved by pull request 26305 [https://gi

[jira] [Assigned] (SPARK-29686) LinearSVC should persist instances if needed

2019-10-31 Thread zhengruifeng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29686?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengruifeng reassigned SPARK-29686: Assignee: zhengruifeng > LinearSVC should persist instances if needed > -

[jira] [Created] (SPARK-29686) LinearSVC should persist instances if needed

2019-10-31 Thread zhengruifeng (Jira)
zhengruifeng created SPARK-29686: Summary: LinearSVC should persist instances if needed Key: SPARK-29686 URL: https://issues.apache.org/jira/browse/SPARK-29686 Project: Spark Issue Type: Impr

[jira] [Resolved] (SPARK-29686) LinearSVC should persist instances if needed

2019-10-31 Thread zhengruifeng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29686?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengruifeng resolved SPARK-29686. -- Fix Version/s: 3.0.0 Resolution: Fixed Issue resolved by pull request 26344 [https://gi

[jira] [Created] (SPARK-29751) Scalers use Summarizer instead of MultivariateOnlineSummarizer

2019-11-04 Thread zhengruifeng (Jira)
zhengruifeng created SPARK-29751: Summary: Scalers use Summarizer instead of MultivariateOnlineSummarizer Key: SPARK-29751 URL: https://issues.apache.org/jira/browse/SPARK-29751 Project: Spark

[jira] [Created] (SPARK-29754) LoR/AFT/LiR/SVC use Summarizer instead of MultivariateOnlineSummarizer

2019-11-05 Thread zhengruifeng (Jira)
zhengruifeng created SPARK-29754: Summary: LoR/AFT/LiR/SVC use Summarizer instead of MultivariateOnlineSummarizer Key: SPARK-29754 URL: https://issues.apache.org/jira/browse/SPARK-29754 Project: Spark

[jira] [Created] (SPARK-29756) CountVectorizer forget to unpersist intermediate rdd

2019-11-05 Thread zhengruifeng (Jira)
zhengruifeng created SPARK-29756: Summary: CountVectorizer forget to unpersist intermediate rdd Key: SPARK-29756 URL: https://issues.apache.org/jira/browse/SPARK-29756 Project: Spark Issue Ty

[jira] [Updated] (SPARK-29756) CountVectorizer forget to unpersist intermediate rdd

2019-11-05 Thread zhengruifeng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29756?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengruifeng updated SPARK-29756: - Description: {code:java} scala> val df = spark.createDataFrame(Seq( | (0, Array("a",

[jira] [Resolved] (SPARK-29656) ML algs expose aggregationDepth

2019-11-05 Thread zhengruifeng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29656?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengruifeng resolved SPARK-29656. -- Fix Version/s: 3.0.0 Resolution: Fixed Issue resolved by pull request 26322 [https://gi

[jira] [Assigned] (SPARK-29656) ML algs expose aggregationDepth

2019-11-05 Thread zhengruifeng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29656?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengruifeng reassigned SPARK-29656: Assignee: zhengruifeng > ML algs expose aggregationDepth > --

[jira] [Reopened] (SPARK-16872) Include Gaussian Naive Bayes Classifier

2019-11-06 Thread zhengruifeng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-16872?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengruifeng reopened SPARK-16872: -- > Include Gaussian Naive Bayes Classifier > --- > >

[jira] [Resolved] (SPARK-29754) LoR/AFT/LiR/SVC use Summarizer instead of MultivariateOnlineSummarizer

2019-11-06 Thread zhengruifeng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29754?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengruifeng resolved SPARK-29754. -- Fix Version/s: 3.0.0 Resolution: Fixed Issue resolved by pull request 26396 [https://gi

[jira] [Assigned] (SPARK-29754) LoR/AFT/LiR/SVC use Summarizer instead of MultivariateOnlineSummarizer

2019-11-06 Thread zhengruifeng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29754?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengruifeng reassigned SPARK-29754: Assignee: zhengruifeng > LoR/AFT/LiR/SVC use Summarizer instead of MultivariateOnlineSumm

[jira] [Updated] (SPARK-16872) Include Gaussian Naive Bayes Classifier

2019-11-06 Thread zhengruifeng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-16872?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengruifeng updated SPARK-16872: - Component/s: PySpark > Include Gaussian Naive Bayes Classifier > ---

[jira] [Updated] (SPARK-16872) Impl Gaussian Naive Bayes Classifier

2019-11-06 Thread zhengruifeng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-16872?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengruifeng updated SPARK-16872: - Summary: Impl Gaussian Naive Bayes Classifier (was: Include Gaussian Naive Bayes Classifier) >

[jira] [Assigned] (SPARK-29756) CountVectorizer forget to unpersist intermediate rdd

2019-11-08 Thread zhengruifeng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29756?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengruifeng reassigned SPARK-29756: Assignee: zhengruifeng > CountVectorizer forget to unpersist intermediate rdd > -

[jira] [Resolved] (SPARK-29756) CountVectorizer forget to unpersist intermediate rdd

2019-11-08 Thread zhengruifeng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29756?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengruifeng resolved SPARK-29756. -- Fix Version/s: 3.0.0 Resolution: Fixed Issue resolved by pull request 26398 [https://gi

[jira] [Created] (SPARK-29801) ML models unify toString method

2019-11-08 Thread zhengruifeng (Jira)
zhengruifeng created SPARK-29801: Summary: ML models unify toString method Key: SPARK-29801 URL: https://issues.apache.org/jira/browse/SPARK-29801 Project: Spark Issue Type: Improvement

[jira] [Created] (SPARK-29808) StopWordsRemover should support multi-cols

2019-11-08 Thread zhengruifeng (Jira)
zhengruifeng created SPARK-29808: Summary: StopWordsRemover should support multi-cols Key: SPARK-29808 URL: https://issues.apache.org/jira/browse/SPARK-29808 Project: Spark Issue Type: Improv

[jira] [Assigned] (SPARK-29808) StopWordsRemover should support multi-cols

2019-11-11 Thread zhengruifeng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29808?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengruifeng reassigned SPARK-29808: Assignee: Huaxin Gao > StopWordsRemover should support multi-cols > -

[jira] [Created] (SPARK-29914) ML models append metadata in `transform`/`transformSchema`

2019-11-15 Thread zhengruifeng (Jira)
zhengruifeng created SPARK-29914: Summary: ML models append metadata in `transform`/`transformSchema` Key: SPARK-29914 URL: https://issues.apache.org/jira/browse/SPARK-29914 Project: Spark Is

[jira] [Resolved] (SPARK-16872) Impl Gaussian Naive Bayes Classifier

2019-11-17 Thread zhengruifeng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-16872?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengruifeng resolved SPARK-16872. -- Fix Version/s: 3.1.0 Resolution: Fixed Issue resolved by pull request 26413 [https://gi

[jira] [Created] (SPARK-29942) Impl Complement Naive Bayes Classifier

2019-11-18 Thread zhengruifeng (Jira)
zhengruifeng created SPARK-29942: Summary: Impl Complement Naive Bayes Classifier Key: SPARK-29942 URL: https://issues.apache.org/jira/browse/SPARK-29942 Project: Spark Issue Type: Improvemen

[jira] [Created] (SPARK-29959) Summarizer support more metrics

2019-11-19 Thread zhengruifeng (Jira)
zhengruifeng created SPARK-29959: Summary: Summarizer support more metrics Key: SPARK-29959 URL: https://issues.apache.org/jira/browse/SPARK-29959 Project: Spark Issue Type: Improvement

[jira] [Created] (SPARK-29960) MulticlassClassificationEvaluator support hammingLoss

2019-11-19 Thread zhengruifeng (Jira)
zhengruifeng created SPARK-29960: Summary: MulticlassClassificationEvaluator support hammingLoss Key: SPARK-29960 URL: https://issues.apache.org/jira/browse/SPARK-29960 Project: Spark Issue T

[jira] [Created] (SPARK-29967) KMeans support instance weighting

2019-11-19 Thread zhengruifeng (Jira)
zhengruifeng created SPARK-29967: Summary: KMeans support instance weighting Key: SPARK-29967 URL: https://issues.apache.org/jira/browse/SPARK-29967 Project: Spark Issue Type: Improvement

[jira] [Commented] (SPARK-29967) KMeans support instance weighting

2019-11-20 Thread zhengruifeng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29967?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16978897#comment-16978897 ] zhengruifeng commented on SPARK-29967: -- [~srowen]   Hi, Owen how would you think of

[jira] [Commented] (SPARK-29967) KMeans support instance weighting

2019-11-20 Thread zhengruifeng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29967?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16978942#comment-16978942 ] zhengruifeng commented on SPARK-29967: -- [~srowen] I suggested move the impl, since

[jira] [Resolved] (SPARK-29942) Impl Complement Naive Bayes Classifier

2019-11-21 Thread zhengruifeng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29942?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengruifeng resolved SPARK-29942. -- Fix Version/s: 3.1.0 Resolution: Fixed Issue resolved by pull request 26575 [https://gi

[jira] [Assigned] (SPARK-29942) Impl Complement Naive Bayes Classifier

2019-11-21 Thread zhengruifeng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29942?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengruifeng reassigned SPARK-29942: Assignee: zhengruifeng > Impl Complement Naive Bayes Classifier > ---

[jira] [Updated] (SPARK-29942) Impl Complement Naive Bayes Classifier

2019-11-21 Thread zhengruifeng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29942?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengruifeng updated SPARK-29942: - Fix Version/s: (was: 3.1.0) 3.0.0 > Impl Complement Naive Bayes Classifie

[jira] [Resolved] (SPARK-29960) MulticlassClassificationEvaluator support hammingLoss

2019-11-21 Thread zhengruifeng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29960?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengruifeng resolved SPARK-29960. -- Fix Version/s: 3.0.0 Resolution: Fixed Issue resolved by pull request 26597 [https://gi

[jira] [Assigned] (SPARK-29960) MulticlassClassificationEvaluator support hammingLoss

2019-11-21 Thread zhengruifeng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29960?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengruifeng reassigned SPARK-29960: Assignee: zhengruifeng > MulticlassClassificationEvaluator support hammingLoss >

[jira] [Created] (SPARK-30044) MNB/CNB/BNB use empty matrix instead of null

2019-11-26 Thread zhengruifeng (Jira)
zhengruifeng created SPARK-30044: Summary: MNB/CNB/BNB use empty matrix instead of null Key: SPARK-30044 URL: https://issues.apache.org/jira/browse/SPARK-30044 Project: Spark Issue Type: Impr

[jira] [Created] (SPARK-30046) linalg parity between scala and py sides

2019-11-26 Thread zhengruifeng (Jira)
zhengruifeng created SPARK-30046: Summary: linalg parity between scala and py sides Key: SPARK-30046 URL: https://issues.apache.org/jira/browse/SPARK-30046 Project: Spark Issue Type: Improve

[jira] [Assigned] (SPARK-29959) Summarizer support more metrics

2019-12-01 Thread zhengruifeng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29959?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengruifeng reassigned SPARK-29959: Assignee: zhengruifeng > Summarizer support more metrics > --

[jira] [Resolved] (SPARK-29959) Summarizer support more metrics

2019-12-01 Thread zhengruifeng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29959?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengruifeng resolved SPARK-29959. -- Fix Version/s: 3.1.0 Resolution: Fixed Issue resolved by pull request 26596 [https://gi

[jira] [Created] (SPARK-30102) GMM supports instance weighting

2019-12-02 Thread zhengruifeng (Jira)
zhengruifeng created SPARK-30102: Summary: GMM supports instance weighting Key: SPARK-30102 URL: https://issues.apache.org/jira/browse/SPARK-30102 Project: Spark Issue Type: Improvement

[jira] [Assigned] (SPARK-30044) MNB/CNB/BNB use empty matrix instead of null

2019-12-02 Thread zhengruifeng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-30044?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengruifeng reassigned SPARK-30044: Assignee: zhengruifeng > MNB/CNB/BNB use empty matrix instead of null > -

[jira] [Resolved] (SPARK-30044) MNB/CNB/BNB use empty matrix instead of null

2019-12-02 Thread zhengruifeng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-30044?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengruifeng resolved SPARK-30044. -- Fix Version/s: 3.0.0 Resolution: Fixed Issue resolved by pull request 26679 [https://gi

[jira] [Created] (SPARK-30109) PCA use BLAS.gemv with sparse vector

2019-12-03 Thread zhengruifeng (Jira)
zhengruifeng created SPARK-30109: Summary: PCA use BLAS.gemv with sparse vector Key: SPARK-30109 URL: https://issues.apache.org/jira/browse/SPARK-30109 Project: Spark Issue Type: Improvement

[jira] [Assigned] (SPARK-30109) PCA use BLAS.gemv with sparse vector

2019-12-03 Thread zhengruifeng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-30109?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengruifeng reassigned SPARK-30109: Assignee: zhengruifeng > PCA use BLAS.gemv with sparse vector > -

[jira] [Resolved] (SPARK-30109) PCA use BLAS.gemv with sparse vector

2019-12-03 Thread zhengruifeng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-30109?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengruifeng resolved SPARK-30109. -- Fix Version/s: 3.0.0 Resolution: Fixed Issue resolved by pull request 26745 [https://gi

[jira] [Created] (SPARK-30120) LSH approxNearestNeighbors should use TopByKeyAggregator when numNearestNeighbors is small

2019-12-03 Thread zhengruifeng (Jira)
zhengruifeng created SPARK-30120: Summary: LSH approxNearestNeighbors should use TopByKeyAggregator when numNearestNeighbors is small Key: SPARK-30120 URL: https://issues.apache.org/jira/browse/SPARK-30120

[jira] [Updated] (SPARK-30120) LSH approxNearestNeighbors should use TopByKeyAggregator when numNearestNeighbors is small

2019-12-03 Thread zhengruifeng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-30120?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengruifeng updated SPARK-30120: - Description: ping [~huaxingao] > LSH approxNearestNeighbors should use TopByKeyAggregator when

[jira] [Assigned] (SPARK-29914) ML models append metadata in `transform`/`transformSchema`

2019-12-04 Thread zhengruifeng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29914?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengruifeng reassigned SPARK-29914: Assignee: zhengruifeng > ML models append metadata in `transform`/`transformSchema` > ---

[jira] [Resolved] (SPARK-29914) ML models append metadata in `transform`/`transformSchema`

2019-12-04 Thread zhengruifeng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29914?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengruifeng resolved SPARK-29914. -- Fix Version/s: 3.1.0 Resolution: Fixed Issue resolved by pull request 26547 [https://gi

[jira] [Comment Edited] (SPARK-30144) MLP param map missing

2019-12-08 Thread zhengruifeng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-30144?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16991050#comment-16991050 ] zhengruifeng edited comment on SPARK-30144 at 12/9/19 1:40 AM: ---

[jira] [Commented] (SPARK-30144) MLP param map missing

2019-12-08 Thread zhengruifeng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-30144?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16991050#comment-16991050 ] zhengruifeng commented on SPARK-30144: -- [~huaxingao]  It seems like that Multilayer

[jira] [Created] (SPARK-30178) RobustScaler support bigger numFeatures

2019-12-08 Thread zhengruifeng (Jira)
zhengruifeng created SPARK-30178: Summary: RobustScaler support bigger numFeatures Key: SPARK-30178 URL: https://issues.apache.org/jira/browse/SPARK-30178 Project: Spark Issue Type: Improveme

[jira] [Created] (SPARK-30202) impl QuantileTransform

2019-12-10 Thread zhengruifeng (Jira)
zhengruifeng created SPARK-30202: Summary: impl QuantileTransform Key: SPARK-30202 URL: https://issues.apache.org/jira/browse/SPARK-30202 Project: Spark Issue Type: Improvement Comp

[jira] [Created] (SPARK-30247) GaussianMixtureModel in py side should expose gaussian

2019-12-12 Thread zhengruifeng (Jira)
zhengruifeng created SPARK-30247: Summary: GaussianMixtureModel in py side should expose gaussian Key: SPARK-30247 URL: https://issues.apache.org/jira/browse/SPARK-30247 Project: Spark Issue

[jira] [Created] (SPARK-30286) Some thoughts on new features for MLLIB

2019-12-17 Thread zhengruifeng (Jira)
zhengruifeng created SPARK-30286: Summary: Some thoughts on new features for MLLIB Key: SPARK-30286 URL: https://issues.apache.org/jira/browse/SPARK-30286 Project: Spark Issue Type: Wish

[jira] [Updated] (SPARK-30286) Some thoughts on new features for MLLIB

2019-12-17 Thread zhengruifeng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-30286?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengruifeng updated SPARK-30286: - Description: Some thoughts on new features for ML: 1, clustering: *mini-batch KMeans*: KMeans m

[jira] [Commented] (SPARK-30286) Some thoughts on new features for MLLIB

2019-12-17 Thread zhengruifeng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-30286?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16998096#comment-16998096 ] zhengruifeng commented on SPARK-30286: --  It seem that the last roadmap for mllib is

[jira] [Updated] (SPARK-30286) Some thoughts on new features for MLLIB

2019-12-17 Thread zhengruifeng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-30286?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengruifeng updated SPARK-30286: - Description: Some thoughts on new features for ML: 1, clustering: *mini-batch KMeans*: KMeans m

[jira] [Commented] (SPARK-30286) Some thoughts on new features for MLLIB

2019-12-17 Thread zhengruifeng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-30286?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16998792#comment-16998792 ] zhengruifeng commented on SPARK-30286: -- [~srowen] Thanks for the reply. I will make

[jira] [Updated] (SPARK-30120) LSH approxNearestNeighbors should use BoundedPriorityQueue when numNearestNeighbors is small

2019-12-19 Thread zhengruifeng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-30120?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengruifeng updated SPARK-30120: - Summary: LSH approxNearestNeighbors should use BoundedPriorityQueue when numNearestNeighbors is

[jira] [Created] (SPARK-30329) add iterator/foreach methods for Vectors

2019-12-22 Thread zhengruifeng (Jira)
zhengruifeng created SPARK-30329: Summary: add iterator/foreach methods for Vectors Key: SPARK-30329 URL: https://issues.apache.org/jira/browse/SPARK-30329 Project: Spark Issue Type: Wish

[jira] [Assigned] (SPARK-30329) add iterator/foreach methods for Vectors

2019-12-22 Thread zhengruifeng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-30329?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengruifeng reassigned SPARK-30329: Assignee: zhengruifeng > add iterator/foreach methods for Vectors > -

[jira] [Resolved] (SPARK-30120) LSH approxNearestNeighbors should use BoundedPriorityQueue when numNearestNeighbors is small

2019-12-23 Thread zhengruifeng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-30120?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengruifeng resolved SPARK-30120. -- Resolution: Not A Problem > LSH approxNearestNeighbors should use BoundedPriorityQueue when >

[jira] [Created] (SPARK-30347) LibSVMDataSource attach AttributeGroup

2019-12-24 Thread zhengruifeng (Jira)
zhengruifeng created SPARK-30347: Summary: LibSVMDataSource attach AttributeGroup Key: SPARK-30347 URL: https://issues.apache.org/jira/browse/SPARK-30347 Project: Spark Issue Type: Improvemen

[jira] [Assigned] (SPARK-30178) RobustScaler support bigger numFeatures

2019-12-24 Thread zhengruifeng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-30178?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengruifeng reassigned SPARK-30178: Assignee: zhengruifeng > RobustScaler support bigger numFeatures > --

[jira] [Resolved] (SPARK-30178) RobustScaler support bigger numFeatures

2019-12-24 Thread zhengruifeng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-30178?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengruifeng resolved SPARK-30178. -- Fix Version/s: 3.0.0 Resolution: Fixed Issue resolved by pull request 26803 [https://gi

[jira] [Created] (SPARK-30351) BisectingKMeans support instance weighting

2019-12-24 Thread zhengruifeng (Jira)
zhengruifeng created SPARK-30351: Summary: BisectingKMeans support instance weighting Key: SPARK-30351 URL: https://issues.apache.org/jira/browse/SPARK-30351 Project: Spark Issue Type: Improv

[jira] [Assigned] (SPARK-30347) LibSVMDataSource attach AttributeGroup

2019-12-25 Thread zhengruifeng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-30347?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengruifeng reassigned SPARK-30347: Assignee: zhengruifeng > LibSVMDataSource attach AttributeGroup > ---

[jira] [Resolved] (SPARK-30347) LibSVMDataSource attach AttributeGroup

2019-12-25 Thread zhengruifeng (Jira)
[ https://issues.apache.org/jira/browse/SPARK-30347?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] zhengruifeng resolved SPARK-30347. -- Fix Version/s: 3.0.0 Resolution: Fixed Issue resolved by pull request 27003 [https://gi

[jira] [Created] (SPARK-30354) GBT reuse DecisionTreeMetadata among iterations

2019-12-25 Thread zhengruifeng (Jira)
zhengruifeng created SPARK-30354: Summary: GBT reuse DecisionTreeMetadata among iterations Key: SPARK-30354 URL: https://issues.apache.org/jira/browse/SPARK-30354 Project: Spark Issue Type: I

<    2   3   4   5   6   7   8   9   10   11   >