Mao Li created SPARK-57860:
------------------------------

             Summary: Add HasIntermediateStorageLevel shared param trait and 
apply to classification/clustering/regression estimators
                 Key: SPARK-57860
                 URL: https://issues.apache.org/jira/browse/SPARK-57860
             Project: Spark
          Issue Type: Sub-task
          Components: MLlib
    Affects Versions: 4.0.0
            Reporter: Mao Li


Add a HasIntermediateStorageLevel trait to sharedParams.scala, following the 
same pattern
as other Has* param traits. Apply it to the following estimators that currently 
hardcode
StorageLevel.MEMORY_AND_DISK for intermediate dataset persistence:


Classification: LogisticRegression, LinearSVC, FMClassifier, OneVsRest

Clustering: KMeans, GaussianMixture, LDA

Regression: LinearRegression, AFTSurvivalRegression, FMRegressor, 
IsotonicRegression
Other: FPGrowth


Also update the corresponding PySpark classes.

Part of SPARK-47103.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to