[ 
https://issues.apache.org/jira/browse/SINGA-107?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

wangwei updated SINGA-107:
--------------------------
    Description: 
When params are loaded from checkpoint files, their version numbers will be 
reset to 0 for fine-tuning as explained in the comments of SINGA-42.
Then if this param is used again in another model, then the version number will 
become 0 and this param is not regarded as pre-training param thus will require 
initialization, which will incur problems.

Present solution is to load this param more than one time, so that the latter 
loading can override the first loading, and the version number will not be 0 
and this param is still regarded as pre-training param.

For example, in rbm3.conf, we write like:
checkpoint_path: "examples/rbm/rbm2/checkpoint/step6000-worker0"
checkpoint_path: "examples/rbm/rbm1/checkpoint/step6000-worker0"
in order to load w1 and b12 twice.

  was:
When loading pre-training params, this version of code will reset the param 
version to 0. Then if this param is used again in another model, then the 
version number will become 0 and this param is not regarded as pre-training 
param thus will require initialization, which will incur problems.

Present solution is to load this param more than one time, so that the latter 
loading can override the first loading, and the version number will not be 0 
and this param is still regarded as pre-training param.

For example, in rbm3.conf, we write like:
checkpoint_path: "examples/rbm/rbm2/checkpoint/step6000-worker0"
checkpoint_path: "examples/rbm/rbm1/checkpoint/step6000-worker0"
in order to load w1 and b12 twice.


> pre-trained param checkpoint version reset problem 
> ---------------------------------------------------
>
>                 Key: SINGA-107
>                 URL: https://issues.apache.org/jira/browse/SINGA-107
>             Project: Singa
>          Issue Type: Bug
>            Reporter: ZHAOJING
>
> When params are loaded from checkpoint files, their version numbers will be 
> reset to 0 for fine-tuning as explained in the comments of SINGA-42.
> Then if this param is used again in another model, then the version number 
> will become 0 and this param is not regarded as pre-training param thus will 
> require initialization, which will incur problems.
> Present solution is to load this param more than one time, so that the latter 
> loading can override the first loading, and the version number will not be 0 
> and this param is still regarded as pre-training param.
> For example, in rbm3.conf, we write like:
> checkpoint_path: "examples/rbm/rbm2/checkpoint/step6000-worker0"
> checkpoint_path: "examples/rbm/rbm1/checkpoint/step6000-worker0"
> in order to load w1 and b12 twice.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to