[ 
https://issues.apache.org/jira/browse/SPARK-23455?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16368116#comment-16368116
 ] 

Liang-Chi Hsieh commented on SPARK-23455:
-----------------------------------------

Currently, {{DefaultParamsWriter}} saves the following metadata + params:

 - class
 - timestamp
 - sparkVersion
 - uid
 - paramMap
 - (optionally, extra metadata)

User-supplied params and default params are all saved together in the 
{{paramMap}} field of the JSON metadata. We could add a separate 
{{defaultParamMap}} field for saving default params.
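
As a rough illustration of the proposed layout (not the actual 
{{DefaultParamsWriter}} code), the metadata could be assembled like the 
sketch below. The helper name {{build_metadata}}, the uid, and the example 
param values are assumptions made for illustration; only the field names 
come from the list above plus the proposed {{defaultParamMap}}.

```python
import json
import time


def build_metadata(cls_name, uid, spark_version, user_params, default_params):
    """Return metadata JSON that keeps user-supplied and default params apart.

    Hypothetical helper: it mirrors the fields listed above and adds the
    proposed defaultParamMap field. It is not Spark's real writer.
    """
    metadata = {
        "class": cls_name,
        "timestamp": int(time.time() * 1000),
        "sparkVersion": spark_version,
        "uid": uid,
        "paramMap": user_params,            # only params the user set explicitly
        "defaultParamMap": default_params,  # proposed new field for defaults
    }
    return json.dumps(metadata)


# Illustrative values only; the uid is made up.
metadata_json = build_metadata(
    "org.apache.spark.ml.feature.Binarizer",
    "binarizer_0001",
    "2.4.0",
    user_params={"inputCol": "feature"},
    default_params={"threshold": 0.0},
)
print(metadata_json)
```

With this split, a reader can tell from the file alone which values the user 
chose and which were merely defaults at save time.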

For backward compatibility, when loading a metadata file written by a version 
prior to Spark 2.4, we shouldn't raise an error if the {{defaultParamMap}} 
field is missing from the file.
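
The loading rule could be sketched roughly as follows. This is a minimal 
sketch under assumptions, not the real reader: {{load_params}} and 
{{parse_major_minor}} are hypothetical names, and raising an error for a 
2.4+ file that lacks the field is one possible choice rather than decided 
behavior.

```python
import json


def parse_major_minor(version):
    """Extract (major, minor) from a version string such as '2.3.0'."""
    major, minor = version.split(".")[:2]
    return int(major), int(minor)


def load_params(metadata_json):
    """Return (user_params, default_params) from a metadata JSON string.

    Hypothetical loader: files written before Spark 2.4 have no
    defaultParamMap, so a missing field is tolerated for them.
    """
    metadata = json.loads(metadata_json)
    user_params = metadata["paramMap"]

    if "defaultParamMap" in metadata:
        return user_params, metadata["defaultParamMap"]

    if parse_major_minor(metadata["sparkVersion"]) < (2, 4):
        # Old format: defaults were mixed into paramMap, so there is
        # nothing separate to restore and no error should be raised.
        return user_params, {}

    # Assumption: a 2.4+ file without the field is treated as malformed.
    raise ValueError("defaultParamMap is missing from the metadata")


old_file = json.dumps({"sparkVersion": "2.3.0", "paramMap": {"threshold": 0.0}})
new_file = json.dumps({
    "sparkVersion": "2.4.0",
    "paramMap": {"inputCol": "feature"},
    "defaultParamMap": {"threshold": 0.0},
})
print(load_params(old_file))
print(load_params(new_file))
```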


> Default Params in ML should be saved separately
> -----------------------------------------------
>
>                 Key: SPARK-23455
>                 URL: https://issues.apache.org/jira/browse/SPARK-23455
>             Project: Spark
>          Issue Type: Improvement
>          Components: ML
>    Affects Versions: 2.4.0
>            Reporter: Liang-Chi Hsieh
>            Priority: Major
>
> We save ML's user-supplied params and default params as one entity in JSON. 
> When loading saved models, we set all the loaded params on the created ML 
> model instances as user-supplied params.
> This causes problems: e.g., if we strictly disallow certain params from 
> being set at the same time, a default param can fail the param check because 
> it is treated as a user-supplied param after loading.
> The loaded default params should not be set as user-supplied params. We 
> should save ML default params separately in JSON.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

