zhengruifeng created SPARK-32298:
Summary: tree models prediction optimization
Key: SPARK-32298
URL: https://issues.apache.org/jira/browse/SPARK-32298
Project: Spark
Issue Type: Improvement
zhengruifeng created SPARK-32384:
Summary: repartitionAndSortWithinPartitions avoid shuffle with
same partitioner
Key: SPARK-32384
URL: https://issues.apache.org/jira/browse/SPARK-32384
Project: Spark
zhengruifeng created SPARK-32455:
Summary: LogisticRegressionModel prediction optimization
Key: SPARK-32455
URL: https://issues.apache.org/jira/browse/SPARK-32455
Project: Spark
Issue Type: I
zhengruifeng created SPARK-32457:
Summary: logParam thresholds in DT/GBT/FM/LR/MLP
Key: SPARK-32457
URL: https://issues.apache.org/jira/browse/SPARK-32457
Project: Spark
Issue Type: Improvem
[
https://issues.apache.org/jira/browse/SPARK-29116?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
zhengruifeng reassigned SPARK-29116:
Assignee: Huaxin Gao
> Refactor py classes related to DecisionTree
>
[
https://issues.apache.org/jira/browse/SPARK-29116?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
zhengruifeng resolved SPARK-29116.
--
Fix Version/s: 3.0.0
Resolution: Fixed
Issue resolved by pull request 25929
[https://gi
[
https://issues.apache.org/jira/browse/SPARK-29380?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
zhengruifeng reassigned SPARK-29380:
Assignee: zhengruifeng
> RFormula avoid repeated 'first' jobs to get vector size
> --
[
https://issues.apache.org/jira/browse/SPARK-29380?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
zhengruifeng resolved SPARK-29380.
--
Fix Version/s: 3.0.0
Resolution: Fixed
Issue resolved by pull request 26052
[https://gi
[
https://issues.apache.org/jira/browse/SPARK-29377?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
zhengruifeng resolved SPARK-29377.
--
Fix Version/s: 3.0.0
Resolution: Fixed
Issue resolved by pull request 26057
[https://gi
[
https://issues.apache.org/jira/browse/SPARK-29377?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
zhengruifeng reassigned SPARK-29377:
Assignee: Huaxin Gao
> parity between scala ml tuning and python ml tuning
>
[
https://issues.apache.org/jira/browse/SPARK-29381?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16951657#comment-16951657
]
zhengruifeng commented on SPARK-29381:
--
[~huaxingao] Hi, I think we need another P
zhengruifeng created SPARK-29489:
Summary: ml.evaluation support log-loss
Key: SPARK-29489
URL: https://issues.apache.org/jira/browse/SPARK-29489
Project: Spark
Issue Type: New Feature
[
https://issues.apache.org/jira/browse/SPARK-23578?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
zhengruifeng resolved SPARK-23578.
--
Fix Version/s: 3.0.0
Resolution: Fixed
Issue resolved by pull request 26064
[https://gi
[
https://issues.apache.org/jira/browse/SPARK-23578?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
zhengruifeng reassigned SPARK-23578:
Assignee: zhengruifeng
> Add multicolumn support for Binarizer
>
[
https://issues.apache.org/jira/browse/SPARK-29489?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
zhengruifeng resolved SPARK-29489.
--
Fix Version/s: 3.0.0
Resolution: Fixed
Issue resolved by pull request 26135
[https://gi
[
https://issues.apache.org/jira/browse/SPARK-29489?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
zhengruifeng reassigned SPARK-29489:
Assignee: zhengruifeng
> ml.evaluation support log-loss
> --
[
https://issues.apache.org/jira/browse/SPARK-29232?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
zhengruifeng reassigned SPARK-29232:
Assignee: Huaxin Gao
> RandomForestRegressionModel does not update the parameter maps of
[
https://issues.apache.org/jira/browse/SPARK-29232?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
zhengruifeng resolved SPARK-29232.
--
Fix Version/s: 3.0.0
Resolution: Fixed
Issue resolved by pull request 26154
[https://gi
[
https://issues.apache.org/jira/browse/SPARK-29093?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16957601#comment-16957601
]
zhengruifeng commented on SPARK-29093:
--
[~huaxingao] Thanks!
> Remove automaticall
[
https://issues.apache.org/jira/browse/SPARK-29093?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
zhengruifeng reassigned SPARK-29093:
Assignee: Huaxin Gao
> Remove automatically generated param setters in _shared_params_cod
zhengruifeng created SPARK-29565:
Summary: OneHotEncoder should support single-column input/ouput
Key: SPARK-29565
URL: https://issues.apache.org/jira/browse/SPARK-29565
Project: Spark
Issue
zhengruifeng created SPARK-29566:
Summary: Imputer should support single-column input/ouput
Key: SPARK-29566
URL: https://issues.apache.org/jira/browse/SPARK-29566
Project: Spark
Issue Type:
[
https://issues.apache.org/jira/browse/SPARK-29566?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
zhengruifeng updated SPARK-29566:
-
Description:
Imputer should support single-column input/ouput
refer to https://issues.apache.or
[
https://issues.apache.org/jira/browse/SPARK-29565?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16957664#comment-16957664
]
zhengruifeng commented on SPARK-29565:
--
[~huaxingao] In [https://github.com/apach
[
https://issues.apache.org/jira/browse/SPARK-9612?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
zhengruifeng resolved SPARK-9612.
-
Resolution: Fixed
> Add instance weight support for GBTs
>
>
[
https://issues.apache.org/jira/browse/SPARK-9612?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
zhengruifeng reopened SPARK-9612:
-
Assignee: zhengruifeng (was: DB Tsai)
> Add instance weight support for GBTs
> ---
[
https://issues.apache.org/jira/browse/SPARK-29093?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
zhengruifeng resolved SPARK-29093.
--
Fix Version/s: 3.0.0
Resolution: Fixed
Issue resolved by pull request 26232
[https://gi
[
https://issues.apache.org/jira/browse/SPARK-29566?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
zhengruifeng reassigned SPARK-29566:
Assignee: Huaxin Gao
> Imputer should support single-column input/ouput
> ---
[
https://issues.apache.org/jira/browse/SPARK-29566?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
zhengruifeng resolved SPARK-29566.
--
Fix Version/s: 3.0.0
Resolution: Fixed
Issue resolved by pull request 26247
[https://gi
zhengruifeng created SPARK-29645:
Summary: ML add param RelativeError
Key: SPARK-29645
URL: https://issues.apache.org/jira/browse/SPARK-29645
Project: Spark
Issue Type: Improvement
zhengruifeng created SPARK-29656:
Summary: ML algs expose aggregationDepth
Key: SPARK-29656
URL: https://issues.apache.org/jira/browse/SPARK-29656
Project: Spark
Issue Type: Improvement
[
https://issues.apache.org/jira/browse/SPARK-29645?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
zhengruifeng reassigned SPARK-29645:
Assignee: zhengruifeng
> ML add param RelativeError
> --
>
>
[
https://issues.apache.org/jira/browse/SPARK-29645?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
zhengruifeng resolved SPARK-29645.
--
Fix Version/s: 3.0.0
Resolution: Fixed
Issue resolved by pull request 26305
[https://gi
[
https://issues.apache.org/jira/browse/SPARK-29686?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
zhengruifeng reassigned SPARK-29686:
Assignee: zhengruifeng
> LinearSVC should persist instances if needed
> -
zhengruifeng created SPARK-29686:
Summary: LinearSVC should persist instances if needed
Key: SPARK-29686
URL: https://issues.apache.org/jira/browse/SPARK-29686
Project: Spark
Issue Type: Impr
[
https://issues.apache.org/jira/browse/SPARK-29686?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
zhengruifeng resolved SPARK-29686.
--
Fix Version/s: 3.0.0
Resolution: Fixed
Issue resolved by pull request 26344
[https://gi
zhengruifeng created SPARK-29751:
Summary: Scalers use Summarizer instead of
MultivariateOnlineSummarizer
Key: SPARK-29751
URL: https://issues.apache.org/jira/browse/SPARK-29751
Project: Spark
zhengruifeng created SPARK-29754:
Summary: LoR/AFT/LiR/SVC use Summarizer instead of
MultivariateOnlineSummarizer
Key: SPARK-29754
URL: https://issues.apache.org/jira/browse/SPARK-29754
Project: Spark
zhengruifeng created SPARK-29756:
Summary: CountVectorizer forget to unpersist intermediate rdd
Key: SPARK-29756
URL: https://issues.apache.org/jira/browse/SPARK-29756
Project: Spark
Issue Ty
[
https://issues.apache.org/jira/browse/SPARK-29756?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
zhengruifeng updated SPARK-29756:
-
Description:
{code:java}
scala> val df = spark.createDataFrame(Seq(
| (0, Array("a",
[
https://issues.apache.org/jira/browse/SPARK-29656?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
zhengruifeng resolved SPARK-29656.
--
Fix Version/s: 3.0.0
Resolution: Fixed
Issue resolved by pull request 26322
[https://gi
[
https://issues.apache.org/jira/browse/SPARK-29656?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
zhengruifeng reassigned SPARK-29656:
Assignee: zhengruifeng
> ML algs expose aggregationDepth
> --
[
https://issues.apache.org/jira/browse/SPARK-16872?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
zhengruifeng reopened SPARK-16872:
--
> Include Gaussian Naive Bayes Classifier
> ---
>
>
[
https://issues.apache.org/jira/browse/SPARK-29754?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
zhengruifeng resolved SPARK-29754.
--
Fix Version/s: 3.0.0
Resolution: Fixed
Issue resolved by pull request 26396
[https://gi
[
https://issues.apache.org/jira/browse/SPARK-29754?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
zhengruifeng reassigned SPARK-29754:
Assignee: zhengruifeng
> LoR/AFT/LiR/SVC use Summarizer instead of MultivariateOnlineSumm
[
https://issues.apache.org/jira/browse/SPARK-16872?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
zhengruifeng updated SPARK-16872:
-
Component/s: PySpark
> Include Gaussian Naive Bayes Classifier
> ---
[
https://issues.apache.org/jira/browse/SPARK-16872?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
zhengruifeng updated SPARK-16872:
-
Summary: Impl Gaussian Naive Bayes Classifier (was: Include Gaussian Naive
Bayes Classifier)
>
[
https://issues.apache.org/jira/browse/SPARK-29756?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
zhengruifeng reassigned SPARK-29756:
Assignee: zhengruifeng
> CountVectorizer forget to unpersist intermediate rdd
> -
[
https://issues.apache.org/jira/browse/SPARK-29756?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
zhengruifeng resolved SPARK-29756.
--
Fix Version/s: 3.0.0
Resolution: Fixed
Issue resolved by pull request 26398
[https://gi
zhengruifeng created SPARK-29801:
Summary: ML models unify toString method
Key: SPARK-29801
URL: https://issues.apache.org/jira/browse/SPARK-29801
Project: Spark
Issue Type: Improvement
zhengruifeng created SPARK-29808:
Summary: StopWordsRemover should support multi-cols
Key: SPARK-29808
URL: https://issues.apache.org/jira/browse/SPARK-29808
Project: Spark
Issue Type: Improv
[
https://issues.apache.org/jira/browse/SPARK-29808?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
zhengruifeng reassigned SPARK-29808:
Assignee: Huaxin Gao
> StopWordsRemover should support multi-cols
> -
zhengruifeng created SPARK-29914:
Summary: ML models append metadata in `transform`/`transformSchema`
Key: SPARK-29914
URL: https://issues.apache.org/jira/browse/SPARK-29914
Project: Spark
Is
[
https://issues.apache.org/jira/browse/SPARK-16872?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
zhengruifeng resolved SPARK-16872.
--
Fix Version/s: 3.1.0
Resolution: Fixed
Issue resolved by pull request 26413
[https://gi
zhengruifeng created SPARK-29942:
Summary: Impl Complement Naive Bayes Classifier
Key: SPARK-29942
URL: https://issues.apache.org/jira/browse/SPARK-29942
Project: Spark
Issue Type: Improvemen
zhengruifeng created SPARK-29959:
Summary: Summarizer support more metrics
Key: SPARK-29959
URL: https://issues.apache.org/jira/browse/SPARK-29959
Project: Spark
Issue Type: Improvement
zhengruifeng created SPARK-29960:
Summary: MulticlassClassificationEvaluator support hammingLoss
Key: SPARK-29960
URL: https://issues.apache.org/jira/browse/SPARK-29960
Project: Spark
Issue T
zhengruifeng created SPARK-29967:
Summary: KMeans support instance weighting
Key: SPARK-29967
URL: https://issues.apache.org/jira/browse/SPARK-29967
Project: Spark
Issue Type: Improvement
[
https://issues.apache.org/jira/browse/SPARK-29967?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16978897#comment-16978897
]
zhengruifeng commented on SPARK-29967:
--
[~srowen] Hi, Owen how would you think of
[
https://issues.apache.org/jira/browse/SPARK-29967?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16978942#comment-16978942
]
zhengruifeng commented on SPARK-29967:
--
[~srowen] I suggested move the impl, since
[
https://issues.apache.org/jira/browse/SPARK-29942?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
zhengruifeng resolved SPARK-29942.
--
Fix Version/s: 3.1.0
Resolution: Fixed
Issue resolved by pull request 26575
[https://gi
[
https://issues.apache.org/jira/browse/SPARK-29942?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
zhengruifeng reassigned SPARK-29942:
Assignee: zhengruifeng
> Impl Complement Naive Bayes Classifier
> ---
[
https://issues.apache.org/jira/browse/SPARK-29942?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
zhengruifeng updated SPARK-29942:
-
Fix Version/s: (was: 3.1.0)
3.0.0
> Impl Complement Naive Bayes Classifie
[
https://issues.apache.org/jira/browse/SPARK-29960?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
zhengruifeng resolved SPARK-29960.
--
Fix Version/s: 3.0.0
Resolution: Fixed
Issue resolved by pull request 26597
[https://gi
[
https://issues.apache.org/jira/browse/SPARK-29960?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
zhengruifeng reassigned SPARK-29960:
Assignee: zhengruifeng
> MulticlassClassificationEvaluator support hammingLoss
>
zhengruifeng created SPARK-30044:
Summary: MNB/CNB/BNB use empty matrix instead of null
Key: SPARK-30044
URL: https://issues.apache.org/jira/browse/SPARK-30044
Project: Spark
Issue Type: Impr
zhengruifeng created SPARK-30046:
Summary: linalg parity between scala and py sides
Key: SPARK-30046
URL: https://issues.apache.org/jira/browse/SPARK-30046
Project: Spark
Issue Type: Improve
[
https://issues.apache.org/jira/browse/SPARK-29959?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
zhengruifeng reassigned SPARK-29959:
Assignee: zhengruifeng
> Summarizer support more metrics
> --
[
https://issues.apache.org/jira/browse/SPARK-29959?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
zhengruifeng resolved SPARK-29959.
--
Fix Version/s: 3.1.0
Resolution: Fixed
Issue resolved by pull request 26596
[https://gi
zhengruifeng created SPARK-30102:
Summary: GMM supports instance weighting
Key: SPARK-30102
URL: https://issues.apache.org/jira/browse/SPARK-30102
Project: Spark
Issue Type: Improvement
[
https://issues.apache.org/jira/browse/SPARK-30044?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
zhengruifeng reassigned SPARK-30044:
Assignee: zhengruifeng
> MNB/CNB/BNB use empty matrix instead of null
> -
[
https://issues.apache.org/jira/browse/SPARK-30044?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
zhengruifeng resolved SPARK-30044.
--
Fix Version/s: 3.0.0
Resolution: Fixed
Issue resolved by pull request 26679
[https://gi
zhengruifeng created SPARK-30109:
Summary: PCA use BLAS.gemv with sparse vector
Key: SPARK-30109
URL: https://issues.apache.org/jira/browse/SPARK-30109
Project: Spark
Issue Type: Improvement
[
https://issues.apache.org/jira/browse/SPARK-30109?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
zhengruifeng reassigned SPARK-30109:
Assignee: zhengruifeng
> PCA use BLAS.gemv with sparse vector
> -
[
https://issues.apache.org/jira/browse/SPARK-30109?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
zhengruifeng resolved SPARK-30109.
--
Fix Version/s: 3.0.0
Resolution: Fixed
Issue resolved by pull request 26745
[https://gi
zhengruifeng created SPARK-30120:
Summary: LSH approxNearestNeighbors should use TopByKeyAggregator
when numNearestNeighbors is small
Key: SPARK-30120
URL: https://issues.apache.org/jira/browse/SPARK-30120
[
https://issues.apache.org/jira/browse/SPARK-30120?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
zhengruifeng updated SPARK-30120:
-
Description: ping [~huaxingao]
> LSH approxNearestNeighbors should use TopByKeyAggregator when
[
https://issues.apache.org/jira/browse/SPARK-29914?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
zhengruifeng reassigned SPARK-29914:
Assignee: zhengruifeng
> ML models append metadata in `transform`/`transformSchema`
> ---
[
https://issues.apache.org/jira/browse/SPARK-29914?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
zhengruifeng resolved SPARK-29914.
--
Fix Version/s: 3.1.0
Resolution: Fixed
Issue resolved by pull request 26547
[https://gi
[
https://issues.apache.org/jira/browse/SPARK-30144?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16991050#comment-16991050
]
zhengruifeng edited comment on SPARK-30144 at 12/9/19 1:40 AM:
---
[
https://issues.apache.org/jira/browse/SPARK-30144?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16991050#comment-16991050
]
zhengruifeng commented on SPARK-30144:
--
[~huaxingao] It seems like that Multilayer
zhengruifeng created SPARK-30178:
Summary: RobustScaler support bigger numFeatures
Key: SPARK-30178
URL: https://issues.apache.org/jira/browse/SPARK-30178
Project: Spark
Issue Type: Improveme
zhengruifeng created SPARK-30202:
Summary: impl QuantileTransform
Key: SPARK-30202
URL: https://issues.apache.org/jira/browse/SPARK-30202
Project: Spark
Issue Type: Improvement
Comp
zhengruifeng created SPARK-30247:
Summary: GaussianMixtureModel in py side should expose gaussian
Key: SPARK-30247
URL: https://issues.apache.org/jira/browse/SPARK-30247
Project: Spark
Issue
zhengruifeng created SPARK-30286:
Summary: Some thoughts on new features for MLLIB
Key: SPARK-30286
URL: https://issues.apache.org/jira/browse/SPARK-30286
Project: Spark
Issue Type: Wish
[
https://issues.apache.org/jira/browse/SPARK-30286?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
zhengruifeng updated SPARK-30286:
-
Description:
Some thoughts on new features for ML:
1, clustering: *mini-batch KMeans*: KMeans m
[
https://issues.apache.org/jira/browse/SPARK-30286?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16998096#comment-16998096
]
zhengruifeng commented on SPARK-30286:
--
It seem that the last roadmap for mllib is
[
https://issues.apache.org/jira/browse/SPARK-30286?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
zhengruifeng updated SPARK-30286:
-
Description:
Some thoughts on new features for ML:
1, clustering: *mini-batch KMeans*: KMeans m
[
https://issues.apache.org/jira/browse/SPARK-30286?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16998792#comment-16998792
]
zhengruifeng commented on SPARK-30286:
--
[~srowen] Thanks for the reply. I will make
[
https://issues.apache.org/jira/browse/SPARK-30120?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
zhengruifeng updated SPARK-30120:
-
Summary: LSH approxNearestNeighbors should use BoundedPriorityQueue when
numNearestNeighbors is
zhengruifeng created SPARK-30329:
Summary: add iterator/foreach methods for Vectors
Key: SPARK-30329
URL: https://issues.apache.org/jira/browse/SPARK-30329
Project: Spark
Issue Type: Wish
[
https://issues.apache.org/jira/browse/SPARK-30329?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
zhengruifeng reassigned SPARK-30329:
Assignee: zhengruifeng
> add iterator/foreach methods for Vectors
> -
[
https://issues.apache.org/jira/browse/SPARK-30120?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
zhengruifeng resolved SPARK-30120.
--
Resolution: Not A Problem
> LSH approxNearestNeighbors should use BoundedPriorityQueue when
>
zhengruifeng created SPARK-30347:
Summary: LibSVMDataSource attach AttributeGroup
Key: SPARK-30347
URL: https://issues.apache.org/jira/browse/SPARK-30347
Project: Spark
Issue Type: Improvemen
[
https://issues.apache.org/jira/browse/SPARK-30178?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
zhengruifeng reassigned SPARK-30178:
Assignee: zhengruifeng
> RobustScaler support bigger numFeatures
> --
[
https://issues.apache.org/jira/browse/SPARK-30178?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
zhengruifeng resolved SPARK-30178.
--
Fix Version/s: 3.0.0
Resolution: Fixed
Issue resolved by pull request 26803
[https://gi
zhengruifeng created SPARK-30351:
Summary: BisectingKMeans support instance weighting
Key: SPARK-30351
URL: https://issues.apache.org/jira/browse/SPARK-30351
Project: Spark
Issue Type: Improv
[
https://issues.apache.org/jira/browse/SPARK-30347?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
zhengruifeng reassigned SPARK-30347:
Assignee: zhengruifeng
> LibSVMDataSource attach AttributeGroup
> ---
[
https://issues.apache.org/jira/browse/SPARK-30347?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
zhengruifeng resolved SPARK-30347.
--
Fix Version/s: 3.0.0
Resolution: Fixed
Issue resolved by pull request 27003
[https://gi
zhengruifeng created SPARK-30354:
Summary: GBT reuse DecisionTreeMetadata among iterations
Key: SPARK-30354
URL: https://issues.apache.org/jira/browse/SPARK-30354
Project: Spark
Issue Type: I
601 - 700 of 1191 matches
Mail list logo